AudioTranscribeAI

Installation

This project is working with Python 3.8.8+, you should have it installed on your machine.

Set up the backend

Install the requirements

pip install -r requirements.txt

Set up the frontend

The frontend is inside the frontend/AudioTranscribeAI folder.

The application is built with node.js and vue.js.

So to start the server, you need to install the node.js and yarn package manager.

npm install --global yarn

Install the requirements

# if you are in the root directory
yarn --cwd frontend/AudioTranscribeAI/  
# or
cd frontend/AudioTranscribeAI 
# if you are in the frontend/AudioTranscribeAI directory
yarn

Run the application

Because we are using separate backend and frontend architecture, so we need to run both of them.

Run the backend

python app.py

Run the frontend

# if you are in the root directory
yarn --cwd frontend/AudioTranscribeAI/ dev 
# or
cd frontend/AudioTranscribeAI 
# if you are in the frontend/AudioTranscribeAI directory
yarn dev

Other commands for the frontend

Build the release version of frontend application

yarn --cwd frontend/AudioTranscribeAI/ build # if you are in the root directory
# or
cd frontend/AudioTranscribeAI 
yarn build # if you are in the frontend/AudioTranscribeAI directory

Architecture

Machine Learning Model

Speech Recognition

✨Model: openai/whisper-small

Large Language Model (Text Summarization and Question Answering)

✨Model:

Default: TinyLlama/TinyLlama-1.1B-Chat-v1.0
Alternative: meta-llama/Llama-2-7b-chat-hf

NLP Model

✨Model spacy: en_core_web_sm

Wikipedia Retrieval

PyDictionary package
pywikibot package
Text processing with nltk package

Backend

Frontend

Evaluation

For aduio model evaluation, we use the LibriSpeech dataset.

cd asr && python eval_asr.py

For the LLM evaluation on summarization, direct use the llm.py:

python llm/llm.py

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
asr		asr
audio_data		audio_data
data		data
docs		docs
frontend/AudioTranscribeAI		frontend/AudioTranscribeAI
keyword_wiki_retrieval		keyword_wiki_retrieval
llm		llm
llm_eval_results		llm_eval_results
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
app.py		app.py
log.py		log.py
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AudioTranscribeAI

Installation

Set up the backend

Install the requirements

Set up the frontend

Install the requirements

Run the application

Run the backend

Run the frontend

Other commands for the frontend

Build the release version of frontend application

Architecture

Machine Learning Model

Speech Recognition

Large Language Model (Text Summarization and Question Answering)

NLP Model

Wikipedia Retrieval

Backend

Frontend

Evaluation

Results

About

Releases

Packages

Contributors 3

Languages

Gary0232/AudioTranscribeAI

Folders and files

Latest commit

History

Repository files navigation

AudioTranscribeAI

Installation

Set up the backend

Install the requirements

Set up the frontend

Install the requirements

Run the application

Run the backend

Run the frontend

Other commands for the frontend

Build the release version of frontend application

Architecture

Machine Learning Model

Speech Recognition

Large Language Model (Text Summarization and Question Answering)

NLP Model

Wikipedia Retrieval

Backend

Frontend

Evaluation

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages