feat(openai): route reasoning + tools to the Responses API (refs #785) by andrew-woblavobla · Pull Request #786 · crmne/ruby_llm

andrew-woblavobla · 2026-05-29T15:22:18Z

What

Transparently routes with_thinking(effort:) + tools for OpenAI to /v1/responses — the only endpoint that accepts reasoning together with function tools for gpt-5.x / o-series. The default /v1/chat/completions path is unchanged (gated by @openai_responses_mode).

Why

OpenAI reasoning models 400 on reasoning_effort + function tools via chat/completions ("use /v1/responses instead"), so chat.with_thinking(effort:).with_tools(...) is impossible for the whole gpt-5 reasoning family. Details + repro in #785.

How (auto-route within the OpenAI provider)

OpenAI#render_payload sets @openai_responses_mode = instance_of?(OpenAI) && responses_api?(tools:, thinking:) (true only when both are present) and renders a Responses payload; completion_url / parse_completion_response branch on it. The Chat module's render_payload stays pure chat/completions, and the instance_of? guard keeps subclasses (Azure/OpenRouter/Mistral/Perplexity/xAI/GPUStack) on chat/completions — they have no /v1/responses.
New OpenAI::Responses module: request translation — input items (incl. function_call / function_call_output round-trip), flat {type:"function",…} tools, top-level reasoning:{effort:}, text.format for structured output, store:false; response parsing — output[] → Message / ToolCall / Thinking + usage (incl. reasoning_tokens).
stream_response raises a clear error in responses mode (Responses SSE streaming not implemented yet).

Verified

Live: gpt-5.5 + with_thinking(:high) + a function tool completes the tool loop with reasoning tokens (previously a 400).
18 keyless unit specs for the request/response translation + routing; the provider suite passes; RuboCop clean.

Known follow-ups

Responses streaming (different SSE event set)
input_image multimodal
Reasoning-item round-trip across turns (include: ["reasoning.encrypted_content"])

I'm open to design changes — e.g. extracting this into a dedicated :openai_responses provider rather than auto-routing within OpenAI, or any other shape you'd prefer.

Refs #785.

OpenAI reasoning models (gpt-5.x, o-series) reject `reasoning_effort` together with function tools on /v1/chat/completions: "Function tools with reasoning_effort are not supported for gpt-5.5 in /v1/chat/completions. Please use /v1/responses instead." So `chat.with_thinking(effort:).with_tools(...)` is impossible for the entire gpt-5 reasoning family today. This transparently routes that combo to /v1/responses inside the OpenAI provider: render_payload sets @openai_responses_mode when thinking && tools, and completion_url / parse_completion_response branch on it. The default chat/completions path is unchanged (gated). Translates request (input items, flat tools, reasoning:{effort:}, text.format) and response (output[] -> Message/ToolCall/Thinking + usage). Verified live against gpt-5.5: reasoning (88 reasoning tokens) + a function tool complete in one turn. Prototype scope — not yet implemented: Responses streaming (guarded), image input, reasoning-item round-trip across turns, cassette tests.

Chat#render_payload is also called directly as a module function in specs (RubyLLM::Providers::OpenAI::Chat.render_payload). Calling responses_api? from there raised NoMethodError because that helper lives in OpenAI::Responses, which is mixed into the provider *instance*, not the Chat module — breaking 3 schema render_payload specs. Move the thinking+tools -> Responses routing into an OpenAI#render_payload override (instance context, both modules mixed in); the Chat module's render_payload is pure chat/completions again. Gate on instance_of?(OpenAI) so the OpenAI subclasses (Azure/OpenRouter/Mistral/Perplexity/xAI/GPUStack) keep chat/completions — they have no /v1/responses endpoint. Re-verified live: gpt-5.5 + with_thinking + a function tool completes the tool loop with reasoning tokens.

codecov · 2026-05-29T15:38:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 87.43%. Comparing base (5bdda1a) to head (135489a).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #786      +/-   ##
==========================================
+ Coverage   87.21%   87.43%   +0.21%     
==========================================
  Files         121      122       +1     
  Lines        5703     5802      +99     
  Branches     1442     1478      +36     
==========================================
+ Hits         4974     5073      +99     
  Misses        729      729

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…ion + routing) Adds keyless unit specs for OpenAI::Responses: responses_api? gating, the render_payload routing (thinking+tools -> /v1/responses; subclasses + no-thinking stay on chat/completions), render_responses_payload request shape (input items, flat tools, reasoning effort, instructions, text.format), function_call / function_call_output round-trip, parse_responses_response (message/tool_call/ reasoning/usage + output_text fallback + error body), tool_choice, and the streaming guard. Raises patch coverage flagged by Codecov.

Adds cases for responses_tool_for provider_params deep-merge and the responses_text_content Content/.text and to_s fallbacks — the 4 lines Codecov flagged. responses.rb is now fully covered.

…ches Exercises the last partial branches Codecov flagged: an assistant message with text content rendering an output_text input item, and parse_responses_response returning nil for an empty body.

Covers the 8 partial branches Codecov folded into patch %: render without effort (no :reasoning), tool_prefs choice/parallel_calls, unknown output items, non-output_text / non-summary_text content blocks, empty tool-call arguments, and empty-content message build. responses.rb now 100% line + branch coverage.

reeganviljoen · 2026-06-09T12:26:27Z

Hi does anyone know how close this is to being landed as this is an issue that breaks this libraries amazing interface with openai models unless it is fixed

zavan · 2026-06-09T12:28:54Z

@reeganviljoen My guess is it won't be merged until streaming is supported (in my case that is essential).

crmne · 2026-06-09T12:34:53Z

Responses API requires us to make a new layer in RubyLLM's architecture: protocols. It will be done that way.

#213 (comment)

crmne · 2026-06-09T12:35:31Z

@zavan streaming is supported since 1.0

zavan · 2026-06-09T12:36:48Z

@crmne I was talking specifically about streaming support in this pull request:

Known follow-ups

Responses streaming (different SSE event set)

reeganviljoen · 2026-06-09T12:56:10Z

@crmne thanks, I didn't see that comment, is there any assistance I can provide to help get that working ?

crmne · 2026-06-12T10:59:11Z

Following up here too: Responses support landed on main via the new protocols layer (0875ce2), including the semantic streaming events, so the streaming follow-up discussed in this thread is covered. Thanks for pushing on this @andrew-woblavobla.

andrew-woblavobla added 2 commits May 29, 2026 18:20

andrew-woblavobla changed the title ~~feat(openai): route reasoning + tools to the Responses API (prototype, refs #785)~~ feat(openai): route reasoning + tools to the Responses API (refs #785) May 29, 2026

andrew-woblavobla marked this pull request as ready for review May 29, 2026 15:57

andrew-woblavobla mentioned this pull request May 29, 2026

OpenAI Responses API: enable reasoning_effort + function tools (gpt-5.x / o-series) #785

Closed

andrew-woblavobla added 3 commits May 29, 2026 19:01

test(openai): cover remaining Responses branches (100% patch)

54a7303

Adds cases for responses_tool_for provider_params deep-merge and the responses_text_content Content/.text and to_s fallbacks — the 4 lines Codecov flagged. responses.rb is now fully covered.

test(openai): cover assistant output_text + empty-body Responses bran…

eeaf418

…ches Exercises the last partial branches Codecov flagged: an assistant message with text content rendering an output_text input item, and parse_responses_response returning nil for an empty body.

crmne closed this Jun 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(openai): route reasoning + tools to the Responses API (refs #785)#786

feat(openai): route reasoning + tools to the Responses API (refs #785)#786
andrew-woblavobla wants to merge 6 commits into
crmne:mainfrom
andrew-woblavobla:feat/openai-responses-reasoning-tools

andrew-woblavobla commented May 29, 2026 •

edited

Loading

Uh oh!

codecov Bot commented May 29, 2026 •

edited

Loading

Uh oh!

reeganviljoen commented Jun 9, 2026

Uh oh!

zavan commented Jun 9, 2026

Uh oh!

crmne commented Jun 9, 2026

Uh oh!

crmne commented Jun 9, 2026

Uh oh!

zavan commented Jun 9, 2026 •

edited

Loading

Uh oh!

reeganviljoen commented Jun 9, 2026

Uh oh!

crmne commented Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

andrew-woblavobla commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

How (auto-route within the OpenAI provider)

Verified

Known follow-ups

Uh oh!

codecov Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

reeganviljoen commented Jun 9, 2026

Uh oh!

zavan commented Jun 9, 2026

Uh oh!

crmne commented Jun 9, 2026

Uh oh!

crmne commented Jun 9, 2026

Uh oh!

zavan commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

reeganviljoen commented Jun 9, 2026

Uh oh!

crmne commented Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

andrew-woblavobla commented May 29, 2026 •

edited

Loading

codecov Bot commented May 29, 2026 •

edited

Loading

zavan commented Jun 9, 2026 •

edited

Loading