vad v1 #7

Open: wants to merge 26 commits into base: main

Araxeus (Owner) commented Jul 13, 2024

  • Audio is now recorded through a VAD (voice activity detection) library that is inserted into the page. Recording ends automatically when the user stops speaking, instead of requiring a button press to stop.

  • The audio is sent to the server, which forwards it to OpenAI Whisper and receives a transcription back. The server then handles the transcription as before and sends back the response.
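
As a rough illustration of the step between recording and upload: a captured utterance has to be packaged as an audio file before it can be posted to the server and passed on for transcription. The sketch below is not taken from this PR. It assumes the VAD hands back mono samples as a `Float32Array` in the range [-1, 1] at 16 kHz, and `encodeWav` is a hypothetical helper name; the VAD library may already ship an equivalent utility.

```typescript
// Hypothetical helper (not from this PR): packs mono float samples
// into a 16-bit PCM WAV file. The 16 kHz default is an assumption
// about what the VAD delivers.
function encodeWav(samples: Float32Array, sampleRate: number = 16000): Uint8Array {
  const bytesPerSample = 2;
  const dataSize = samples.length * bytesPerSample;
  const buffer = new ArrayBuffer(44 + dataSize);
  const view = new DataView(buffer);

  // Writes an ASCII tag such as "RIFF" byte by byte.
  const writeAscii = (offset: number, text: string): void => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i));
    }
  };

  // 44-byte RIFF/WAVE header, all multi-byte fields little-endian.
  writeAscii(0, "RIFF");
  view.setUint32(4, 36 + dataSize, true); // file size minus 8 bytes
  writeAscii(8, "WAVE");
  writeAscii(12, "fmt ");
  view.setUint32(16, 16, true); // size of the fmt chunk
  view.setUint16(20, 1, true); // audio format 1 = PCM
  view.setUint16(22, 1, true); // channel count: mono
  view.setUint32(24, sampleRate, true);
  view.setUint32(28, sampleRate * bytesPerSample, true); // byte rate
  view.setUint16(32, bytesPerSample, true); // block align
  view.setUint16(34, 16, true); // bits per sample
  writeAscii(36, "data");
  view.setUint32(40, dataSize, true);

  // Clamp each sample to [-1, 1] and scale it to a signed 16-bit integer.
  for (let i = 0; i < samples.length; i++) {
    const clamped = Math.max(-1, Math.min(1, samples[i]));
    const scaled = clamped < 0 ? clamped * 0x8000 : clamped * 0x7fff;
    view.setInt16(44 + i * 2, Math.round(scaled), true);
  }
  return new Uint8Array(buffer);
}
```

The resulting bytes could then be wrapped in a `Blob` or `File` and posted to the server, which would pass the file on to the transcription endpoint described in the OpenAI reference linked below.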


VAD library used: https://github.com/ricky0123/vad (which makes https://github.com/snakers4/silero-vad accessible in the browser)

OpenAI reference: https://platform.openai.com/docs/api-reference/audio/createTranscription?lang=node

cons:

render bot commented Jul 13, 2024

Your Render PR Server URL is https://operate-pr-7.onrender.com.

Follow its progress at https://dashboard.render.com/web/srv-cq9a9qjv2p9s73egs4i0.

Araxeus force-pushed the vad branch 2 times, most recently from 6e17924 to 0310d81 (July 13, 2024 16:57)
Araxeus force-pushed the vad branch 2 times, most recently from 7ac7e29 to 8147bca (July 13, 2024 18:54)
Araxeus force-pushed the main branch 2 times, most recently from a81493f to b8e87f4 (July 14, 2024 01:26)
Araxeus added 4 commits July 14, 2024 15:21
might be overkill, but each of the complete examples was an instance that gpt4o actually failed at
Araxeus force-pushed the main branch 2 times, most recently from 7437c94 to 091339d (July 14, 2024 14:34)
Araxeus force-pushed the main branch 3 times, most recently from fffbbca to d01194f (December 25, 2024 19:54)