Skip to content

feat(anthropic): update model YAMLs [bot]#1046

Closed
models-bot[bot] wants to merge 1 commit into
mainfrom
bot/update-anthropic-20260515-052024
Closed

feat(anthropic): update model YAMLs [bot]#1046
models-bot[bot] wants to merge 1 commit into
mainfrom
bot/update-anthropic-20260515-052024

Conversation

@models-bot
Copy link
Copy Markdown
Contributor

@models-bot models-bot Bot commented May 15, 2026

Auto-generated by poc-agent for provider anthropic.


Note

Low Risk
Low risk config-only change that adds a new pricing field; main risk is mispricing if downstream parsers don’t recognize the new key.

Overview
Updates the Anthropic model config for claude-opus-4-7 to include cache_creation_input_token_cost_per_hour, extending the pricing metadata used for prompt caching cost calculations.

Reviewed by Cursor Bugbot for commit 0d679b6. Bugbot is set up for automated code reviews on this repo. Configure here.

@github-actions
Copy link
Copy Markdown
Contributor

/test-models

@harshiv-26
Copy link
Copy Markdown
Collaborator

Gateway test results

  • Total: 20
  • Passed: 10
  • Failed: 9
  • Validation failed: 1
  • Errored: 0
  • Skipped: 0
  • Success rate: 50.0%
Provider Model Scenarios
anthropic claude-opus-4-7 success: tool-call, tool-call:stream, params, params:stream:anthropic, structured-output:stream:anthropic, tool-call:stream:anthropic, parallel-tool-call:stream:anthropic, structured-output:stream, reasoning:anthropic, reasoning:stream:anthropic

failure: tool-call:anthropic, params:anthropic, reasoning, parallel-tool-call, parallel-tool-call:anthropic, parallel-tool-call:stream, structured-output:anthropic, structured-output, params:stream

validation_failure: reasoning:stream
Failures (10)

anthropic/claude-opus-4-7 — tool-call:anthropic (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmpm_chrn4c/snippet.py", line 34, in <module>
    response = client.messages.create(
               ^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_utils/_utils.py", line 294, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/resources/messages/messages.py", line 1003, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1374, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1147, in request
    raise self._make_status_error_from_response(err.response) from None
anthropic.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
import anthropic

_api_key = "***"
_model = "test-v2-anthropic/claude-opus-4-7"

client = anthropic.Anthropic(
    api_key=_api_key,
    base_url="https://internal.devtest.truefoundry.tech/api/llm",
)

tools = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a location.",
        "input_schema": {
            "type": "object",
            "properties": {
                "location": {
                    "type": "string",
                    "description": "The city name, e.g. London",
                },
            },
            "required": ["location"],
        },
    },
]

messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hi, how can I help you"},
    {"role": "user", "content": "Use the get_weather tool to check the weather in London. You must call the tool, do not respond with plain text."},
]

response = client.messages.create(
    model=_model,
    system="You are a helpful assistant with access to tools. You MUST strictly use the provided tools to answer. Never respond with plain text when a tool is available.",
    messages=messages,
    tools=tools,
    tool_choice={"type": "auto"},
    max_tokens=1024,
)

for block in response.content:
    if block.type == "tool_use":
        print(f"Tool: {block.name}")
        print(f"Input: {block.input}")
    elif block.type == "text":
        print(block.text)

_tool_uses = [block for block in response.content if block.type == "tool_use"]

if _tool_uses:
    for _tu in _tool_uses:
        print(f"Tool: {_tu.name}")
        print(f"Input: {_tu.input}")
else:
    _text_blocks = [block.text for block in response.content if block.type == "text"]
    print("\n".join(_text_blocks))

if not _tool_uses:
    raise Exception("VALIDATION FAILED: tool-call - no tool uses in Anthropic response")
print("VALIDATION: tool-call SUCCESS")

anthropic/claude-opus-4-7 — params:anthropic (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmpgt2f29yk/snippet.py", line 17, in <module>
    response = client.messages.create(
               ^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_utils/_utils.py", line 294, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/resources/messages/messages.py", line 1003, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1374, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1147, in request
    raise self._make_status_error_from_response(err.response) from None
anthropic.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
import anthropic

_api_key = "***"
_model = "test-v2-anthropic/claude-opus-4-7"

client = anthropic.Anthropic(
    api_key=_api_key,
    base_url="https://internal.devtest.truefoundry.tech/api/llm",
)

messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hi, how can I help you"},
    {"role": "user", "content": "What is the capital of France?"},
]

response = client.messages.create(
    model=_model,
    system="You are a helpful assistant.",
    messages=messages,
    max_tokens=256,
)

for block in response.content:
    if block.type == "text":
        print(block.text)

anthropic/claude-opus-4-7 — reasoning (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmppa32x50d/snippet.py", line 5, in <module>
    response = client.chat.completions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/chat/completions/completions.py", line 1147, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "How to calculate 3^3^3^3? Think step by step and show all reasoning."},
    ],
    reasoning_effort="medium",
    stream=False,
)

_usage = getattr(response, "usage", None)
_reasoning_detected = False

_choices = getattr(response, "choices", None)
if _choices and len(_choices) > 0:
    _message = getattr(_choices[0], "message", None)
else:
    _message = None

if _message and getattr(_message, "content", None) is not None:
    print(_message.content)

if _usage is not None:
    _output_token_details = getattr(_usage, "completion_tokens_details", None)
    if _output_token_details and getattr(_output_token_details, "reasoning_tokens", 0) > 0:
        _reasoning_detected = True
    elif getattr(_usage, "reasoning", None) is not None:
        _reasoning_detected = True

if getattr(_message, "reasoning_content", None) is not None:
    _reasoning_detected = True
elif getattr(_message, "reasoning", None) is not None:
    _reasoning_detected = True

if not _reasoning_detected:
    print("Response: ", response)
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in response")
print("VALIDATION: reasoning SUCCESS")

anthropic/claude-opus-4-7 — parallel-tool-call (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmpqx6vel5x/snippet.py", line 27, in <module>
    response = client.chat.completions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/chat/completions/completions.py", line 1147, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a location.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city name, e.g. London",
                    },
                },
                "required": ["location"],
                "additionalProperties": False,
            },
            "strict": True,
        },
    },
]

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "You are a helpful assistant with access to tools. You MUST strictly call multiple tools in parallel whenever possible. Never call them sequentially."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "Use the get_weather tool to check the weather in London and Paris. You MUST make both tool calls strictly in parallel, not sequentially."},
    ],
    tools=tools,
    tool_choice="auto",
    parallel_tool_calls=True,
    stream=False,
)

_message = response.choices[0].message
if _message.tool_calls:
    print(f"Number of parallel tool calls: {len(_message.tool_calls)}")
    for _tc in _message.tool_calls:
        print(f"Function: {_tc.function.name}")
        print(f"Arguments: {_tc.function.arguments}")
else:
    print(_message.content)

if not _message.tool_calls or len(_message.tool_calls) < 1:
    raise Exception(
        f"VALIDATION FAILED: parallel-tool-call - expected at least 1 tool call, "
        f"got {len(_message.tool_calls) if _message.tool_calls else 0}"
    )
print("VALIDATION: parallel-tool-call SUCCESS")

anthropic/claude-opus-4-7 — parallel-tool-call:anthropic (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmppooyxfrd/snippet.py", line 34, in <module>
    response = client.messages.create(
               ^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_utils/_utils.py", line 294, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/resources/messages/messages.py", line 1003, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1374, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1147, in request
    raise self._make_status_error_from_response(err.response) from None
anthropic.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
import anthropic

_api_key = "***"
_model = "test-v2-anthropic/claude-opus-4-7"

client = anthropic.Anthropic(
    api_key=_api_key,
    base_url="https://internal.devtest.truefoundry.tech/api/llm",
)

tools = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a location.",
        "input_schema": {
            "type": "object",
            "properties": {
                "location": {
                    "type": "string",
                    "description": "The city name, e.g. London",
                },
            },
            "required": ["location"],
        },
    },
]

messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hi, how can I help you"},
    {"role": "user", "content": "Use the get_weather tool to check the weather in London and Paris. You MUST make both tool calls strictly in parallel, not sequentially."},
]

response = client.messages.create(
    model=_model,
    system="You are a helpful assistant with access to tools. You MUST strictly call multiple tools in parallel whenever possible. Never call them sequentially.",
    messages=messages,
    tools=tools,
    tool_choice={"type": "auto"},
    max_tokens=1024,
)

for block in response.content:
    if block.type == "tool_use":
        print(f"Tool: {block.name}")
        print(f"Input: {block.input}")
    elif block.type == "text":
        print(block.text)

_tool_uses = [block for block in response.content if block.type == "tool_use"]

if _tool_uses:
    print(f"Number of parallel tool calls: {len(_tool_uses)}")
    for _tu in _tool_uses:
        print(f"Tool: {_tu.name}")
        print(f"Input: {_tu.input}")
else:
    _text_blocks = [block.text for block in response.content if block.type == "text"]
    print("\n".join(_text_blocks))

if not _tool_uses or len(_tool_uses) < 1:
    raise Exception(
        f"VALIDATION FAILED: parallel-tool-call - expected at least 1 tool call, "
        f"got {len(_tool_uses) if _tool_uses else 0}"
    )
print("VALIDATION: parallel-tool-call SUCCESS")

anthropic/claude-opus-4-7 — parallel-tool-call:stream (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmpds7dwzy1/snippet.py", line 27, in <module>
    response = client.chat.completions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/chat/completions/completions.py", line 1147, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a location.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city name, e.g. London",
                    },
                },
                "required": ["location"],
                "additionalProperties": False,
            },
            "strict": True,
        },
    },
]

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "You are a helpful assistant with access to tools. You MUST strictly call multiple tools in parallel whenever possible. Never call them sequentially."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "Use the get_weather tool to check the weather in London and Paris. You MUST make both tool calls strictly in parallel, not sequentially."},
    ],
    tools=tools,
    tool_choice="auto",
    parallel_tool_calls=True,
    stream=True,
)

_tool_call_indices = set()
for chunk in response:
    if chunk.choices and len(chunk.choices) > 0:
        delta = chunk.choices[0].delta
        if delta.content is not None:
            print(delta.content, end="", flush=True)
        if delta.tool_calls:
            for _tc in delta.tool_calls:
                _tool_call_indices.add(_tc.index)
                if _tc.function:
                    print(_tc.function.arguments or "", end="", flush=True)

if len(_tool_call_indices) < 1:
    raise Exception(
        f"VALIDATION FAILED: parallel-tool-call stream - expected at least 1 tool call, "
        f"got {len(_tool_call_indices)}"
    )
print(f"\nNumber of parallel tool calls: {len(_tool_call_indices)}")
print("VALIDATION: parallel-tool-call stream SUCCESS")

anthropic/claude-opus-4-7 — structured-output:anthropic (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmpsfjhcrx2/snippet.py", line 39, in <module>
    response = client.messages.create(
               ^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_utils/_utils.py", line 294, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/resources/messages/messages.py", line 1003, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1374, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/anthropic/_base_client.py", line 1147, in request
    raise self._make_status_error_from_response(err.response) from None
anthropic.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
import anthropic
import json

_api_key = "***"
_model = "test-v2-anthropic/claude-opus-4-7"

client = anthropic.Anthropic(
    api_key=_api_key,
    base_url="https://internal.devtest.truefoundry.tech/api/llm",
)

response_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "date": {"type": "string"},
        "participants": {
            "type": "array",
            "items": {"type": "string"},
        },
    },
    "required": ["name", "date", "participants"],
}

tools = [
    {
        "name": "CalendarEvent",
        "description": "Extract event information as a structured CalendarEvent.",
        "input_schema": response_schema,
    },
]

messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hi, how can I help you"},
    {"role": "user", "content": "Alice and Bob are going to a science fair on Friday."},
]

response = client.messages.create(
    model=_model,
    system="Extract the event information using the CalendarEvent tool.",
    messages=messages,
    tools=tools,
    tool_choice={"type": "tool", "name": "CalendarEvent"},
    max_tokens=1024,
)

for block in response.content:
    if block.type == "tool_use":
        print(json.dumps(block.input, indent=2))
    elif block.type == "text":
        print(block.text)

import json as _json

_tool_uses = [block for block in response.content if block.type == "tool_use"]

if _tool_uses:
    _parsed = _tool_uses[0].input
else:
    _text_blocks = [block.text for block in response.content if block.type == "text"]
    _text = "".join(_text_blocks)
    _parsed = _json.loads(_text)

print(_json.dumps(_parsed, indent=2))

if "name" not in _parsed or "date" not in _parsed or "participants" not in _parsed:
    raise Exception("VALIDATION FAILED: structured-output - missing expected fields (name, date, participants)")

if not isinstance(_parsed.get("participants"), list):
    raise Exception("VALIDATION FAILED: structured-output - 'participants' is not a list, schema not enforced")

print("VALIDATION: structured-output SUCCESS")

anthropic/claude-opus-4-7 — structured-output (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmptic_ym3v/snippet.py", line 21, in <module>
    response = client.chat.completions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/chat/completions/completions.py", line 1147, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
from openai import OpenAI
import json

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

response_schema = json.loads('''{
  "title": "CalendarEvent",
  "type": "object",
  "properties": {
    "name": { "type": "string" },
    "date": { "type": "string" },
    "participants": {
      "type": "array",
      "items": { "type": "string" }
    }
  },
  "required": ["name", "date", "participants"],
  "additionalProperties": false
}''')

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "Extract the event information as JSON."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "Alice and Bob are going to a science fair on Friday. Extract the event details as JSON."},
    ],
    response_format={"type": "json_schema", "json_schema": {"name": "CalendarEvent", "schema": response_schema}},
    stream=False,
)

import json as _json

_content = response.choices[0].message.content
print(_content)

if not _content:
    raise Exception("VALIDATION FAILED: structured-output - response content is empty")

_parsed = _json.loads(_content)

if "name" not in _parsed or "date" not in _parsed or "participants" not in _parsed:
    raise Exception("VALIDATION FAILED: structured-output - missing expected fields (name, date, participants)")

if not isinstance(_parsed.get("participants"), list):
    raise Exception("VALIDATION FAILED: structured-output - 'participants' is not a list, schema not enforced")

if set(_parsed.keys()) != {"name", "date", "participants"}:
    raise Exception(
        f"VALIDATION FAILED: structured-output - unexpected keys present: {set(_parsed.keys())}"
    )

print("VALIDATION: structured-output SUCCESS")

anthropic/claude-opus-4-7 — params:stream (failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmprv843wcj/snippet.py", line 5, in <module>
    response = client.chat.completions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/chat/completions/completions.py", line 1147, in create
    return self._post(
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.AuthenticationError: Error code: 401 - {'status': 'failure', 'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'error': {'message': 'Unauthorized: Service account gateway-tester-v2-92c11b4b-2 does not exist', 'type': 'Error', 'code': '401'}, 'error_origin_level': 'authentication'}
Code snippet
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "What is the capital of France?"},
    ],
    max_tokens=256,
    stream=True,
)

for chunk in response:
    if chunk.choices and len(chunk.choices) > 0:
        delta = chunk.choices[0].delta
        if delta.content is not None:
            print(delta.content, end="", flush=True)

anthropic/claude-opus-4-7 — reasoning:stream (validation_failure)

Error:

Traceback (most recent call last):
  File "/tmp/tmp9mjbq1ud/snippet.py", line 35, in <module>
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in stream")
Exception: VALIDATION FAILED: reasoning stream - no reasoning information in stream
Code snippet
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm")

response = client.chat.completions.create(
    model="test-v2-anthropic/claude-opus-4-7",
    messages=[
        {"role": "system", "content": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."},
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hi, how can I help you"},
        {"role": "user", "content": "How to calculate 3^3^3^3? Think step by step and show all reasoning."},
    ],
    reasoning_effort="medium",
    stream=True,
)

_reasoning_detected = False
for chunk in response:
    if chunk.choices and len(chunk.choices) > 0:
        delta = chunk.choices[0].delta
        if delta.content is not None:
            print(delta.content, end="", flush=True)
        if getattr(delta, "reasoning_content", None) is not None:
            _reasoning_detected = True
        if getattr(delta, "reasoning", None) is not None:
            _reasoning_detected = True

    _usage = getattr(chunk, "usage", None)
    if _usage is not None:
        _details = getattr(_usage, "completion_tokens_details", None)
        if _details and getattr(_details, "reasoning_tokens", 0) > 0:
            _reasoning_detected = True

if not _reasoning_detected:
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in stream")
print("\nVALIDATION: reasoning stream SUCCESS")

@harshiv-26 harshiv-26 closed this May 15, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant