Releases: pipecat-ai/pipecat
v0.0.22
Added
- Added Daily transport start_dialout() to be able to make phone or SIP calls. See https://reference-python.daily.co/api_reference.html#daily.CallClient.start_dialout
- Added Daily transport support for dial-in use cases.
- Added Daily transport events: on_dialout_connected, on_dialout_stopped, on_dialout_error and on_dialout_warning. See https://reference-python.daily.co/api_reference.html#daily.EventHandler
v0.0.21
Added
- Added vision support to Anthropic service.
- Added WakeCheckFilter which allows you to pass information downstream only if you say a certain phrase/word.
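
The wake-phrase idea can be sketched in plain Python. This is not the WakeCheckFilter implementation: the class name WakePhraseGate, its process() method, and the choice to forward text starting at the wake phrase are all assumptions made for illustration.

```python
import re


class WakePhraseGate:
    """Hypothetical helper: forwards text only once a wake phrase has been heard."""

    def __init__(self, wake_phrases):
        # Match each phrase as whole words, ignoring case.
        self._patterns = [
            re.compile(r"\b" + re.escape(p) + r"\b", re.IGNORECASE)
            for p in wake_phrases
        ]
        self._awake = False

    def process(self, text):
        """Return the text to forward downstream, or None to drop it."""
        if self._awake:
            return text
        for pattern in self._patterns:
            match = pattern.search(text)
            if match:
                self._awake = True
                # Forward from the wake phrase onward (an illustrative choice).
                return text[match.start():]
        return None


gate = WakePhraseGate(["hey robot"])
dropped = gate.process("what's the weather")
first = gate.process("okay, Hey Robot what's the weather")
later = gate.process("and tomorrow?")
```

Here the first utterance is dropped, the second is forwarded from the wake phrase onward, and everything after that passes through unchanged. A real filter would likely also go back to sleep after a timeout, which this sketch omits.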
Changed
- Filter has been renamed to FrameFilter and it's now under processors/filters.
Fixed
- Fixed Anthropic service to use new frame types.
- Fixed an issue in LLMUserResponseAggregator and UserResponseAggregator that would cause frames after a brief pause to not be pushed to the LLM.
- Clear the audio output buffer if we are interrupted.
- Re-add exponential smoothing after volume calculation. This makes sure the volume value being used doesn't fluctuate so much.
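
For readers unfamiliar with exponential smoothing, a minimal sketch follows. The function name exp_smoothing and the factor of 0.5 are assumptions chosen for illustration, not values taken from the release.

```python
def exp_smoothing(value, prev_value, factor):
    """Blend a new sample with the previous smoothed value.

    factor is in (0, 1]: 1.0 disables smoothing, smaller values smooth more.
    """
    return prev_value + factor * (value - prev_value)


# A raw volume signal that jumps between silence and full volume.
raw_volumes = [0.0, 1.0, 0.0, 1.0]
smoothed = []
prev = 0.0
for v in raw_volumes:
    prev = exp_smoothing(v, prev, factor=0.5)
    smoothed.append(prev)

print(smoothed)  # [0.0, 0.5, 0.25, 0.625]
```

The smoothed series moves toward each new sample instead of jumping to it, which is why the resulting volume value fluctuates less.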
v0.0.20
Added
- In order to improve interruptions we now compute a loudness level using pyloudnorm. The audio coming from WebRTC transports (e.g. Daily) has an Automatic Gain Control (AGC) algorithm applied to the signal; however, we don't do that on our local PyAudio signals. This means that currently incoming audio from PyAudio is kind of broken. We will fix it in future releases.
Fixed
- Fixed an issue where StartInterruptionFrame would cause LLMUserResponseAggregator to push the accumulated text, causing the LLM to respond in the wrong task. The StartInterruptionFrame should not trigger any new LLM response because that would be spoken in a different task.
- Fixed an issue where tasks and threads could be paused because the executor didn't have more tasks available. This was causing issues when cancelling and recreating tasks during interruptions.
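
The cancel-and-recreate pattern mentioned above can be sketched with plain asyncio. This only illustrates the general pattern and does not reproduce the executor fix itself; the speak() coroutine and its delay parameter are hypothetical stand-ins for a bot response.

```python
import asyncio


async def speak(text, spoken, delay):
    # Stand-in for a bot response: emit one word, then wait before the next.
    for word in text.split():
        spoken.append(word)
        await asyncio.sleep(delay)


async def main():
    spoken = []
    # Start a slow response, then let it run briefly.
    task = asyncio.create_task(speak("one two three", spoken, delay=3600))
    await asyncio.sleep(0.01)

    # Interruption: cancel the current task and wait for it to unwind.
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass
    interrupted_at = len(spoken)

    # Recreate the task for the new response.
    task = asyncio.create_task(speak("new reply", spoken, delay=0))
    await task
    return interrupted_at, spoken


interrupted_at, spoken = asyncio.run(main())
```

Awaiting the cancelled task before creating its replacement ensures the old response has fully stopped before the new one starts.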
v0.0.19
Changed
- LLMUserResponseAggregator and LLMAssistantResponseAggregator internal messages are now exposed through the messages property.
Fixed
- Fixed an issue where LLMAssistantResponseAggregator was not accumulating the full response but short sentences instead. If there's an interruption we only accumulate what the bot has spoken until now in a long response as well.
v0.0.18
Fixed
- Fixed an issue in DailyOutputTransport where transport messages were not being sent.
v0.0.17
Added
- Added google.generativeai model support, including vision. This new google service defaults to using gemini-1.5-flash-latest. Example in examples/foundational/12a-describe-video-gemini-flash.py.
- Added vision support to openai service. Example in examples/foundational/12a-describe-video-gemini-flash.py.
- Added initial interruptions support. The assistant contexts (or aggregators) should now be placed after the output transport. This way, only the completed spoken context is added to the assistant context.
- Added VADParams so you can control voice confidence level and others.
- VADAnalyzer now uses an exponential smoothed volume to improve speech detection. This is useful when voice confidence is high (because there's someone talking near you) but volume is low.
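
To illustrate why combining voice confidence with a smoothed volume helps, here is a minimal sketch. SpeechThresholds, its field names, and the default values are hypothetical and are not the actual VADParams fields.

```python
from dataclasses import dataclass


@dataclass
class SpeechThresholds:
    """Hypothetical parameter bundle; names and defaults are illustrative."""
    min_confidence: float = 0.7
    min_volume: float = 0.6


def is_speaking(confidence, smoothed_volume, thresholds):
    # Require both a confident voice detection and a loud enough signal,
    # so a nearby talker picked up at low volume is not treated as the user.
    return (
        confidence >= thresholds.min_confidence
        and smoothed_volume >= thresholds.min_volume
    )


thresholds = SpeechThresholds()
nearby_talker = is_speaking(confidence=0.9, smoothed_volume=0.2, thresholds=thresholds)
user = is_speaking(confidence=0.9, smoothed_volume=0.8, thresholds=thresholds)
```

With these assumed thresholds, the high-confidence but quiet signal is rejected while the loud one is accepted.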
Fixed
- Fixed an issue where TTSService was not pushing TextFrames downstream.
- Fixed issues with Ctrl-C program termination.
- Fixed an issue that was causing StopTaskFrame to actually not exit the PipelineTask.
v0.0.16
Fixed
- DailyTransport: don't publish camera and audio tracks if not enabled.
- Fixed an issue in BaseInputTransport that was causing frames pushed downstream to not be pushed in the right order.
v0.0.15
Fixed
- Quick hot fix for receiving DailyTransportMessage.
v0.0.14
Added
- Added DailyTransport event on_participant_left.
- Added support for receiving DailyTransportMessage.
Fixed
- Images are now resized to the size of the output camera. Previously, images were not being displayed.
- Fixed an issue in DailyTransport that would not allow the input processor to shut down if no participant ever joined the room.
- Fixed base transports start and stop. In some situations processors would halt or not shut down properly.
v0.0.13
Changed
- MoondreamService argument model_id is now model.
- VADAnalyzer arguments have been renamed for more clarity.
Fixed
- Fixed an issue with DailyInputTransport and DailyOutputTransport that could cause some threads to not start properly.
- Fixed STTService. Add max_silence_secs and max_buffer_secs to better handle what's being passed to the STT service. Also add exponential smoothing to the RMS.
- Fixed WhisperSTTService. Add no_speech_prob to avoid garbage output text.
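
One plausible reading of max_silence_secs and max_buffer_secs is sketched below: audio is buffered and handed to the STT service once enough consecutive silence has been seen or the buffer has grown too long. The AudioBuffer class, its add() method, and the default values are assumptions for illustration and are not the STTService implementation.

```python
class AudioBuffer:
    """Hypothetical buffer that decides when to hand audio to an STT service."""

    def __init__(self, max_silence_secs=0.3, max_buffer_secs=1.5):
        self.max_silence_secs = max_silence_secs
        self.max_buffer_secs = max_buffer_secs
        self._chunks = []
        self._buffer_secs = 0.0
        self._silence_secs = 0.0

    def add(self, chunk, duration_secs, is_silence):
        """Add a chunk; return the buffered chunks when it is time to flush, else None."""
        self._chunks.append(chunk)
        self._buffer_secs += duration_secs
        # Track only consecutive silence; any sound resets the counter.
        if is_silence:
            self._silence_secs += duration_secs
        else:
            self._silence_secs = 0.0

        if (
            self._silence_secs >= self.max_silence_secs
            or self._buffer_secs >= self.max_buffer_secs
        ):
            flushed = self._chunks
            self._chunks = []
            self._buffer_secs = 0.0
            self._silence_secs = 0.0
            return flushed
        return None


buf = AudioBuffer(max_silence_secs=0.5, max_buffer_secs=2.0)
r1 = buf.add("a", 0.25, is_silence=False)  # speech: keep buffering
r2 = buf.add("b", 0.25, is_silence=True)   # 0.25 s of silence: keep buffering
r3 = buf.add("c", 0.25, is_silence=True)   # 0.5 s of silence: flush
```

The silence limit ends an utterance promptly after the speaker pauses, while the buffer limit caps how much audio is sent in a single request.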