Bidirectional Streaming Agent #1276

mehtarac · 2025-12-02T17:00:19Z

Description

This PR introduces bidirectional streaming capabilities to Strands SDK, enabling real-time voice and audio conversations with AI models through persistent streaming connections.

Overview

Bidirectional streaming moves beyond traditional request-response patterns by maintaining long-running conversations where users can interrupt, provide continuous input, and receive real-time audio responses. This implementation is marked as experimental as we refine the API based on user feedback and evolving model capabilities.

Key Features:

Real-time audio I/O streaming with PyAudio integration
Automatic interruption detection that clears audio buffers when users speak
Concurrent tool execution during active conversations
Multi-modal input support for text, audio, and images
Provider-agnostic event system with strongly-typed, JSON-serializable events

Implementation Details

Core Components:

BidiAgent - Main agent class with start(), send(), receive(), stop() lifecycle methods
_BidiAgentLoop - Event processing engine handling model events and tool execution with connection restart logic
BidiModel - Model interface for bidirectional model providers
BidiInput/BidiOutput - Pluggable I/O channel abstractions

Model Providers:

BidiNovaSonicModel - AWS Bedrock Nova Sonic with complex event sequencing
BidiGeminiLiveModel - Google Gemini Live using official SDK
BidiOpenAIRealtimeModel - OpenAI Realtime API via WebSocket

I/O Handlers:

BidiAudioIO - PyAudio-based microphone/speaker handling with buffering
BidiTextIO - Terminal-based text input/output

Usage Example

import asyncio from strands.experimental.bidi import BidiAgent from strands.experimental.bidi.models import BidiNovaSonicModel from strands.experimental.bidi.io import BidiAudioIO, BidiTextIO from strands_tools import calculator async def main(): model = BidiNovaSonicModel() agent = BidiAgent(model=model, tools=[calculator]) audio_io = BidiAudioIO() text_io = BidiTextIO() await agent.run( inputs=[audio_io.input()], outputs=[audio_io.output(), text_io.output()] ) asyncio.run(main())

This is a new experimental feature under strands.experimental.bidi.

Related Issues

#217

Documentation PR

Type of Change

New feature

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

[] I ran hatch run prepare
I ran hatch run bidi:prepare: This is done to isolate the bidirectional streaming environment which needs Python 3.12+

Checklist

[ x] I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Co-authored-by: Nick Clegg <nac542@gmail.com>

- Remove adapter from constructor - Implement BidirectionlIO interface - Add adapter the run() method

…cies

…ssive logging

feat: (Agent): Finalize Bidirectional Agent class

…s unit tests and integ tests

Move test scripts into dedicated directory so tests directory only has unit tests and integ tests

Rename bidirectional components

Fix main branch. Temporarily rename loop to original name

Changes: - Keep main's architecture: BidirectionalConnection + start/stop functions - Apply our event renames: BidiTextInputEvent, BidiAudioInputEvent, etc. - Update agent to use BidiAgent, BidiModel, BidiNovaSonicModel - Update tests to use new class names - Fix imports across codebase Tests status: - ✅ 14/14 type tests passing - ⚠️ Integration tests running but failing (models need update to check 'type' field instead of isinstance) Known issue: Models use isinstance() checks which don't work with TypedDict. Need to update models to check content.get('type') field instead.

The agent's send() method was passing plain dicts directly to models, but models expect TypedEvent instances for isinstance() checks to work. Added dict-to-TypedEvent conversion logic that was lost in merge: - Checks event 'type' field in dict - Reconstructs appropriate TypedEvent (BidiTextInputEvent, BidiAudioInputEvent, etc.) - Maintains backward compatibility with WebSocket/dict-based clients Tests: - ✅ 14/14 type tests passing - ✅ 2/2 integration tests passing (nova_sonic, openai)

Updated test imports and usages: - GeminiLiveModel → BidiGeminiLiveModel - NovaSonicModel → BidiNovaSonicModel - OpenAIRealtimeModel → BidiOpenAIRealtimeModel Note: 21 model tests still failing because they call .connect() but models now use .start(). This is a pre-existing issue that needs separate fix - tests need API update.

Updated all test calls from old API to new API: - .connect() → .start() - .close() → .stop() - Updated error message expectations to match actual errors All tests now passing: - ✅ 47/47 bidirectional streaming tests passing - ✅ 14/14 type tests - ✅ 33/33 model tests - ✅ 2/2 integration tests

rename modules

remove scripts directory before merging with sdk-python/main

add bidi to README

codecov · 2025-12-02T17:02:27Z

Codecov Report

❌ Patch coverage is 90.00000% with 14 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/strands/types/session.py	54.54%	2 Missing and 3 partials ⚠️
src/strands/session/repository_session_manager.py	90.62%	1 Missing and 2 partials ⚠️
src/strands/session/session_manager.py	72.72%	3 Missing ⚠️
src/strands/tools/_caller.py	66.66%	0 Missing and 1 partial ⚠️
src/strands/tools/executors/_executor.py	95.23%	0 Missing and 1 partial ⚠️
src/strands/types/_events.py	85.71%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

fix minor linting and integ test failure errors

mehtarac and others added 30 commits November 6, 2025 06:23

Merge branch 'main' into bar_raise_agent

aaa9471

Update src/strands/experimental/bidirectional_streaming/agent/agent.py

9240bad

Co-authored-by: Nick Clegg <nac542@gmail.com>

Update imports

2a2861b

fix: remove turn id

30c0a5d

fix: fix nova completion id tracking

6ad0120

fix: remove unnecessary if condition

774ab86

Update implementation based on bar-raising

0a63829

- Remove adapter from constructor - Implement BidirectionlIO interface - Add adapter the run() method

temp commit message, review the changes

a9784f0

Updates: make ToolCaller private, minor updates based on PR comments

8d9a298

Update: file names, locations, and ToolCaller class name

73416d7

Update method names imports for io.py and audio.py and their dependen…

a49273b

…cies

use input event in method signatures and update outdated comments

986fc45

Merge branch 'bidi-event-types' into bidi-gemini-improvements

5ace082

fix(openai): Improve interruption handling

69965d2

fix: improve gemini test script to display interrupts and remove exce…

f7c18d4

…ssive logging

Merge pull request #12 from mehtarac/bar_raise_agent

fd11282

feat: (Agent): Finalize Bidirectional Agent class

Move test scripts into dedicated directory so tests directory only ha…

843133b

…s unit tests and integ tests

Merge pull request #27 from mehtarac/move_tests

3f0a527

Move test scripts into dedicated directory so tests directory only has unit tests and integ tests

refactor: rename events and files

f8ab2a0

Rename bidirectional components

b815706

Merge pull request #28 from mehtarac/rename_agent

5c30596

Rename bidirectional components

Fix main branch. Temporarily rename loop to original name

805aa3a

Merge pull request #29 from mehtarac/fix_main

5a22ad9

Fix main branch. Temporarily rename loop to original name

fix: fix bidi tests

8918757

refactor: rename to bidi input and output events

3a9f944

refactor: change event type prefix to bidi

5eca8f9

mehtarac and others added 19 commits November 30, 2025 16:09

rename files

0dd05fe

fix formatting on docstrings (#98)

78b3ebc

Merge branch 'main' into rename_modules

75d865c

addressed comments

e472b92

addrsssed comments

75a7775

address comments

69d8f09

minor update

9829494

Merge pull request #99 from mehtarac/rename_modules

094a64e

rename modules

Merge branch 'main' into del_scripts

dadda61

Merge pull request #90 from mehtarac/del_scripts

55fb736

remove scripts directory before merging with sdk-python/main

minor update

4c58c43

isolate model inference configs (#100)

a46828d

fix agent send dict to event construction (#101)

78423eb

add bidi to README

75b91aa

address comments

185db15

address comments

f9f0e2d

address comments

6cbac51

address comments

873da65

Merge pull request #102 from mehtarac/update_rm

29e0989

add bidi to README

github-actions bot added the size/xl label Dec 2, 2025

mehtarac had a problem deploying to auto-approve December 2, 2025 17:00 — with GitHub Actions Failure

mehtarac added 2 commits December 2, 2025 12:33

fix minor linting and integ test failure errors

4ce00d8

Merge pull request #103 from mehtarac/bidi_agent

799fbd5

fix minor linting and integ test failure errors

github-actions bot removed the size/xl label Dec 2, 2025

mehtarac had a problem deploying to auto-approve December 2, 2025 17:46 — with GitHub Actions Failure

github-actions bot added the size/xl label Dec 2, 2025

mehtarac marked this pull request as ready for review December 2, 2025 17:52

pgrayy enabled auto-merge (squash) December 3, 2025 04:45

pgrayy disabled auto-merge December 3, 2025 04:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bidirectional Streaming Agent #1276

Bidirectional Streaming Agent #1276

Uh oh!

mehtarac commented Dec 2, 2025 •

edited

Loading

codecov bot commented Dec 2, 2025 •

edited

Loading

Labels

2 participants

Bidirectional Streaming Agent #1276

Are you sure you want to change the base?

Bidirectional Streaming Agent #1276

Uh oh!

Conversation

mehtarac commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Overview

Key Features:

Implementation Details

Usage Example

Related Issues

Documentation PR

Type of Change

Testing

Checklist

codecov bot commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Labels

2 participants

mehtarac commented Dec 2, 2025 •

edited

Loading

codecov bot commented Dec 2, 2025 •

edited

Loading