feat(agent): add stream_final_turn_only parameter to stream_async by zhifanl · Pull Request #2104 · strands-agents/sdk-python

zhifanl · 2026-04-09T23:39:44Z

Motivation

When using stream_async with tool-using agents, text events from every model turn are yielded to the caller — including intermediate reasoning before tool calls. For production chat UIs and SSE endpoints, this is noise. The only workaround today requires consumers to implement fragile buffering logic that depends on SDK internals like start_event_loop, raw messageStop events, and the end_turn → tool_use override.

This adds a first-class SDK option to stream only the final answer, eliminating the need for consumer-side buffering.

Resolves: #2055

Public API Changes

Agent.stream_async accepts a new stream_final_turn_only keyword argument:

# Before: consumers receive text from ALL model turns
async for event in agent.stream_async("Analyze this data"):
    if "data" in event:
        yield event["data"]  # Includes intermediate "Let me look that up..." text

# After: consumers receive text only from the final turn
async for event in agent.stream_async("Analyze this data", stream_final_turn_only=True):
    if "data" in event:
        yield event["data"]  # Only final answer tokens

When stream_final_turn_only=True, intermediate turn text events are buffered internally and discarded when the turn ends with tool use. Text from the final turn (where stop_reason == "end_turn") is flushed to both the caller and callback handler. Non-text events (lifecycle, tool use, reasoning, citations, model stream chunks) pass through unchanged regardless of this setting.

Default is False — fully backward compatible, no behavior change unless opted in.

Use Cases

Chat applications streaming via SSE where users should only see the final answer
API endpoints wrapping agents where downstream consumers expect a single coherent streamed response
Any production deployment where intermediate model reasoning is noise for the end user

Related Issues

#2055

Type of Change

New feature

Testing

8 unit tests covering backward compatibility, single/multi-turn scenarios, callback handler behavior, empty final turns, and non-text event passthrough
All 408 agent tests pass
I ran hatch run prepare

All test passed

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly - link
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed - Will update once gather positive feedback
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

zhifanl · 2026-04-14T00:06:42Z

can anyone help take a look at this?

zhifanl · 2026-05-12T19:58:18Z

Can anyone help me approve this? https://github.com/strands-agents/sdk-python/actions/runs/24648225417/job/72065246092?pr=2104

Add a stream_final_turn_only parameter to Agent.stream_async that buffers intermediate turn text events and only yields text from the final model turn. Non-text events (lifecycle, tool use, reasoning, citations) pass through unchanged. Closes strands-agents#2055

codecov · 2026-05-27T16:01:18Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

github-actions · 2026-05-27T16:14:57Z

+                                continue
+                            elif isinstance(event, EventLoopStopEvent):
+                                stop_reason = event["stop"][0]
+                                if stop_reason == "end_turn":


Issue: When stream_final_turn_only=True and the final turn ends with a non-end_turn stop reason (e.g., max_tokens, cancelled, content_filtered), all buffered text from that turn is silently discarded. In production, this means if a model hits its token limit on the final turn, the user receives zero text output with no indication of what happened.

Suggestion: Consider flushing buffered text for any stop reason that is not tool_use (since tool_use is the only reason that indicates "this isn't the final turn"). For example:

elif isinstance(event, EventLoopStopEvent): stop_reason = event["stop"][0] if stop_reason != "tool_use": for buffered in text_event_buffer: callback_handler(**buffered) yield buffered text_event_buffer.clear()

This way, if the agent is cancelled or hits max_tokens on the final turn, the partial text is still delivered to the caller. If you decide to keep the current behavior, please document explicitly in the docstring that text is only delivered for end_turn stop reasons (not just "final turn").

github-actions · 2026-05-27T16:15:00Z

+                text events from the final turn (where stop_reason is "end_turn"). Non-text events such as
+                lifecycle, tool use, reasoning, and citation events are yielded normally regardless of this
+                setting. When False (default), all events are yielded as they are produced with no change
+                in behavior.


Issue: The docstring says "Non-text events such as lifecycle, tool use, reasoning, and citation events are yielded normally regardless of this setting." While accurate, this creates an asymmetry that may confuse users: reasoning text from intermediate turns passes through (it's a ReasoningTextStreamEvent, not a TextStreamEvent), but regular text from those same turns does not. For agents using extended thinking, users would see intermediate reasoning but not intermediate text.

Suggestion: Consider calling this out explicitly in the docstring with a brief note, e.g.:

Note: Reasoning events from intermediate turns are still yielded since they are distinct from text stream events. Only {"data": ...} text events are buffered/filtered.

github-actions · 2026-05-27T16:15:02Z

Assessment: Comment

Clean implementation of a useful feature that addresses a real pain point for production streaming use cases. The approach of buffering at the stream_async level using existing typed events is well-designed and minimally invasive.

Review Categories

Edge case handling: The current implementation only flushes buffered text for stop_reason == "end_turn", which means max_tokens, cancelled, and other terminal stop reasons silently discard text. This is the primary concern.
Testing: Good coverage of the happy path and multi-turn scenarios. Missing tests for non-end_turn final stop reasons.
Documentation: The docstring could better clarify the reasoning/text asymmetry for intermediate turns.

The overall design is solid and the test suite is thorough for the core scenarios.

JackYPCOnline · 2026-05-27T16:18:27Z

Hi @zhifanl,

Thank you for submitting this PR! I’ve gone ahead and rebased it to align with the latest changes in the main branch

Could you please review the feedback/comments left on the PR and consider addressing the suggested fixes?

zastrowm · 2026-05-27T16:34:30Z

This adds a first-class SDK option to stream only the final answer, eliminating the need for consumer-side buffering.

What's the use case for this versus agent.invoke? The events are buffered as is so I'm not clear why you would use this instead of agent.invoke which provides the completed message as well

github-actions Bot added the size/m label Apr 9, 2026

zhifanl had a problem deploying to manual-approval April 9, 2026 23:39 — with GitHub Actions Failure

zhifanl had a problem deploying to manual-approval April 9, 2026 23:40 — with GitHub Actions Failure

zhifanl mentioned this pull request Apr 9, 2026

[FEATURE] Make agent only yield final reponse #2055

Open

github-actions Bot added size/m and removed size/m labels Apr 20, 2026

zhifanl had a problem deploying to manual-approval April 20, 2026 04:19 — with GitHub Actions Failure

zhifanl mentioned this pull request Apr 20, 2026

docs(streaming): add stream_final_turn_only documentation strands-agents/docs#768

Open

4 tasks

yonib05 added area-async Related to asynchronous flows or multi-threading area-devx Developer experience improvements labels May 27, 2026

Tom Li added 2 commits May 27, 2026 11:56

Update doc string

5815a8b

JackYPCOnline force-pushed the feat/stream-final-turn-only branch from fc7c4d1 to 5815a8b Compare May 27, 2026 15:58

github-actions Bot added size/m and removed size/m labels May 27, 2026

JackYPCOnline temporarily deployed to manual-approval May 27, 2026 15:58 — with GitHub Actions Inactive

JackYPCOnline requested a deployment to manual-approval May 27, 2026 15:58 — with GitHub Actions Waiting

JackYPCOnline self-assigned this May 27, 2026

github-actions Bot added the strands-running label May 27, 2026

github-actions Bot reviewed May 27, 2026

View reviewed changes

github-actions Bot removed the strands-running label May 27, 2026

zastrowm reviewed May 27, 2026

View reviewed changes

yonib05 added the python Pull requests that update python code label May 29, 2026

yonib05 added the enhancement New feature or request label May 29, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(agent): add stream_final_turn_only parameter to stream_async#2104

feat(agent): add stream_final_turn_only parameter to stream_async#2104
zhifanl wants to merge 2 commits into
strands-agents:mainfrom
zhifanl:feat/stream-final-turn-only

zhifanl commented Apr 9, 2026 •

edited

Loading

Uh oh!

zhifanl commented Apr 14, 2026

Uh oh!

zhifanl commented May 12, 2026

Uh oh!

codecov Bot commented May 27, 2026

Uh oh!

github-actions Bot May 27, 2026

Uh oh!

github-actions Bot May 27, 2026

Uh oh!

github-actions Bot commented May 27, 2026

Uh oh!

JackYPCOnline commented May 27, 2026

Uh oh!

zastrowm May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

zhifanl commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Public API Changes

Use Cases

Related Issues

Type of Change

Testing

Checklist

Uh oh!

zhifanl commented Apr 14, 2026

Uh oh!

zhifanl commented May 12, 2026

Uh oh!

codecov Bot commented May 27, 2026

Codecov Report

Uh oh!

github-actions Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 27, 2026

Uh oh!

JackYPCOnline commented May 27, 2026

Uh oh!

zastrowm May 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zhifanl commented Apr 9, 2026 •

edited

Loading