Skip to content

Turn Management

sagatake edited this page Apr 4, 2025 · 5 revisions

This module controls turn taking behaviors based on audio-visual signals from both of the user and the agent. They are implemented with voice activity detection (VAD) model and voice activity projection (VAP) model.

There are two sub-modules are implemented in this module:

Common installation

  • Install conda or anaconda from https://www.anaconda.com/
  • Install python3 (usually installed with anaconda but not for some reasons [e.g. Path to "python.exe" is not set globally])
  • You can test it by loading Greta - Microphone - backchannel.xml from Modular.jar. If it is correctly installed, Greta will do some nodding to your utterance.

Getting started with Greta

Basics

Advanced

For developpers

Functionalities

Core functionality

Auxiliary functionality

Preview functionality (only in dev branch)

Nothing to show :)

Previous functionality (it might work, but not supported anymore)

Tips

Clone this wiki locally