Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

pipecat-ai / pipecat Public

Notifications You must be signed in to change notification settings
Fork 439
Star 4.2k

Code
Issues 52
Pull requests 34
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Releases: pipecat-ai/pipecat

Releases · pipecat-ai/pipecat

v0.0.22

23 May 21:04

aconchillo

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.0.22

Added

Added Daily transport start_dialout() to be able to make phone or SIP calls.
See https://reference-python.daily.co/api_reference.html#daily.CallClient.start_dialout
Added Daily transport support for dial-in use cases.
Added Daily transport events: on_dialout_connected, on_dialout_stopped, on_dialout_error and on_dialout_warning. See
https://reference-python.daily.co/api_reference.html#daily.EventHandler

Assets 2

Loading

All reactions

v0.0.21

23 May 04:45

aconchillo

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.0.21

Added

Added vision support to Anthropic service.
Added WakeCheckFilter which allows you to pass information downstream only if you say a certain phrase/word.

Changed

Filter has been renamed to FrameFilter and it's now under processors/filters.

Fixed

Fixed Anthropic service to use new frame types.
Fixed an issue in LLMUserResponseAggregator and UserResponseAggregator that would cause frames after a brief pause to not be pushed to the LLM.
Clear the audio output buffer if we are interrupted.
Re-add exponential smoothing after volume calculation. This makes sure the volume value being used doesn't fluctuate so much.

Assets 2

Loading

All reactions

v0.0.20

22 May 21:29

aconchillo

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.0.20

Added

In order to improve interruptions we now compute a loudness level using pyloudnorm. The audio coming WebRTC transports (e.g. Daily) have an Automatic Gain Control (AGC) algorithm applied to the signal, however we don't do that on our local PyAudio signals. This means that currently incoming audio from PyAudio is kind of broken. We will fix it in future releases.

Fixed

Fixed an issue where StartInterruptionFrame would cause LLMUserResponseAggregator to push the accumulated text causing the LLM respond in the wrong task. The StartInterruptionFrame should not trigger any new LLM response because that would be spoken in a different task.
Fixed an issue where tasks and threads could be paused because the executor didn't have more tasks available. This was causing issues when cancelling and recreating tasks during interruptions.

Assets 2

Loading

All reactions

v0.0.19

21 May 04:41

aconchillo

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.0.19

Changed

LLMUserResponseAggregator and LLMAssistantResponseAggregator internal messages are now exposed through the messages property.

Fixed

Fixed an issue where LLMAssistantResponseAggregator was not accumulating the full response but short sentences instead. If there's an interruption we only accumulate what the bot has spoken until now in a long response as well.

Assets 2

Loading

All reactions

v0.0.18

20 May 17:33

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.18

Fixed

Fixed an issue in DailyOuputTransport where transport messages were not being sent.

Assets 2

Loading

All reactions

v0.0.17

20 May 02:29

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.17

Added

Added google.generativeai model support, including vision. This new google service defaults to using gemini-1.5-flash-latest. Example in examples/foundational/12a-describe-video-gemini-flash.py.
Added vision support to openai service. Example in examples/foundational/12a-describe-video-gemini-flash.py.
Added initial interruptions support. The assistant contexts (or aggregators) should now be placed after the output transport. This way, only the completed spoken context is added to the assistant context.
Added VADParams so you can control voice confidence level and others.
VADAnalyzer now uses an exponential smoothed volume to improve speech detection. This is useful when voice confidence is high (because there's someone talking near you) but volume is low.

Fixed

Fixed an issue where TTSService was not pushing TextFrames downstream.
Fixed issues with Ctrl-C program termination.
Fixed an issue that was causing StopTaskFrame to actually not exit the PipelineTask.

Assets 2

Loading

All reactions

v0.0.16

17 May 01:16

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.16

Fixed

DailyTransport: don't publish camera and audio tracks if not enabled.
Fixed an issue in BaseInputTransport that was causing frames pushed downstream not pushed in the right order.

Assets 2

Loading

All reactions

v0.0.15

16 May 00:08

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.15

Fixed

Quick hot fix for receiving DailyTransportMessage.

Assets 2

Loading

All reactions

v0.0.14

15 May 23:00

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.14

Added

Added DailyTransport event on_participant_left.
Added support for receiving DailyTransportMessage.

Fixed

Images are now resized to the size of the output camera. This was causing images not being displayed.
Fixed an issue in DailyTransport that would not allow the input processor to shutdown if no participant ever joined the room.
Fixed base transports start and stop. In some situation processors would halt or not shutdown properly.

Assets 2

Loading

All reactions

v0.0.13

15 May 02:09

aconchillo

Compare

Choose a tag to compare

Loading

v0.0.13

Changed

MoondreamService argument model_id is now model.
VADAnalyzer arguments have been renamed for more clarity.

Fixed

Fixed an issue with DailyInputTransport and DailyOutputTransport that could cause some threads to not start properly.
Fixed STTService. Add max_silence_secs and max_buffer_secs to handle better what's being passed to the STT service. Also add exponential smoothing to the RMS.
Fixed WhisperSTTService. Add no_speech_prob to avoid garbage output text.

Assets 2

Loading

All reactions

Previous 1 2 3 4 5 6 Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.