-
Notifications
You must be signed in to change notification settings - Fork 17
Backchannel based on VAD
sagatake edited this page Apr 7, 2025
·
5 revisions
Backchannel generation module based on VAD.
We implemented this module inspired by the previous backchannel module based on the following paper.
Elisabetta Bevacqua, Etienne de Sevin, Sylwia Julia Hyniewska, Catherine Pelachaud, A listener model: introducing personality traits. J. Multimodal User Interfaces 6(1-2): 27-38 (2012)
This module considers both of audio signal from the user and the agent.
Currently, this supports the following backchanels:
- Reactive backchannel: immediate backchannel when the agent recognize the start of the user's utterance
- Response backchannel: this is trrigered when the agent detect the end of user's utterance or inter-pausal unit (IPU)
Please follow common installation
- In Modular.jar, add Microphone module from [Add -> Input -> Microphone]
- In Modular.jar, add TurnManagement module from [Add -> Input -> Dialogue -> TurnManagement]
- Create the following connections in Modular.jar: Feedback -> TurnManagement, TurnManagement -> BehaviorPlanner
- When you are speaking something and Greta is not talking, the module randomly picks one XML file from bin/Examples/DemoEN/backchannel upon several rules written in /bin/Common/Data/TurnManagement/turnManager.py
- Some rules were based on the previous backchannels module
- For more details, please check the following sources
- Microphone java project at auxiliary/Microphone
- Python source code at bin/Common/Data/microphone
- Generating New Facial expressions
- Generating New Gestures
- Generating new Hand configurations
- Creating an Instance for Interaction
- Create a new virtual character
- Creating a Greta Module in Java
- Integration examples
- Development guideline
- Naming policy
- Github repository
- Technical showcase
- Python integration
- Known issues
- Technical Specifications
- Pitfalls for Greta development
-
FML/BML
-
Tools
-
AUs from external sources
-
Large language model (LLM)
-
Automatic speech recognition (ASR)
-
Extensions
Nothing to show :)
- Incrementality
- Motion Capture to BAP values
- Interruptions
- Back Channels
- Appreciation Generator
- SSI Greta Integration
- TopicPlanner
- Shore
- Disco
- Watson
- Object node controller
- OSC communication for Unity
- 3D pose estimation from RGB camera through VAE (PFE-OpenPose-to-VAE-to-BVH)
- HOW TO CREATE BEHAVIOR SET
- Projects
- Signals