Resemble AI

All

43 repositories

resemble-node
Public
resemble.ai API SDK
TypeScript
•
MIT License
•3•10•2•1•Updated Jul 9, 2025Jul 9, 2025
Perth
Public
Open Audio Watermarking Tool
Python
•
MIT License
•15•229•5•0•Updated Jun 26, 2025Jun 26, 2025
xformers
Public
Hackable and optimized Transformers building blocks, supporting a composable construction.
Python
•
Other
•697•0•0•0•Updated Jun 23, 2025Jun 23, 2025
chatterbox
Public
SoTA open-source TTS
Python
•
MIT License
•1.2k•9.7k•86•23•Updated Jun 13, 2025Jun 13, 2025
monotonic_align
Public
Monotonic Alignment Search
Cython
•
MIT License
•14•96•0•0•Updated Jun 9, 2025Jun 9, 2025
flowhigh
Public
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
Python
•
MIT License
•8•8•0•0•Updated May 12, 2025May 12, 2025
espeak-ng
Public
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
C
•
GNU General Public License v3.0
•1.1k•4•0•0•Updated Mar 31, 2025Mar 31, 2025
agents
Public
Build real-time multimodal AI applications 🤖🎙️📹
Python
•
Apache License 2.0
•1.1k•5•0•0•Updated Mar 20, 2025Mar 20, 2025
agents-js
Public
Build realtime multimodal AI agents with Node.js
TypeScript
•
Apache License 2.0
•132•3•0•0•Updated Mar 18, 2025Mar 18, 2025
fairseq
Public
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python
•
MIT License
•6.6k•0•0•0•Updated Feb 27, 2025Feb 27, 2025
python-pesq
Public
A python package for calculating the PESQ.
Python
•
MIT License
•72•0•0•0•Updated Feb 22, 2025Feb 22, 2025
PyTSMod
Public
An open-source Python library for audio time-scale modification.
Python
•
GNU General Public License v3.0
•27•6•0•0•Updated Feb 13, 2025Feb 13, 2025
peft
Public
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python
•
Apache License 2.0
•2k•1•0•0•Updated Jan 2, 2025Jan 2, 2025
resemble-enhance
Public
AI powered speech denoising and enhancement
speech-processing denoise speech-enhancement speech-denoising
Python
•
MIT License
•224•1.9k•54•2•Updated Dec 3, 2024Dec 3, 2024
mup
Public
maximal update parametrization (µP)
Jupyter Notebook
•
MIT License
•104•0•0•1•Updated Sep 5, 2024Sep 5, 2024
resemble-live-sts-socket
Public
Python
•
MIT License
•0•5•1•0•Updated Sep 5, 2024Sep 5, 2024
resemble-examples
Public
Python
•1•5•1•0•Updated May 8, 2024May 8, 2024
aiortc
Public
WebRTC and ORTC implementation for Python using asyncio
Python
•
BSD 3-Clause "New" or "Revised" License
•831•0•0•0•Updated Mar 27, 2024Mar 27, 2024
aioice
Public
asyncio-based Interactive Connectivity Establishment (RFC 5245)
Python
•
BSD 3-Clause "New" or "Revised" License
•61•0•0•0•Updated Feb 15, 2024Feb 15, 2024
resemble-streaming-demo
Public
TypeScript
•3•16•0•0•Updated Dec 16, 2023Dec 16, 2023
resemble-go
Public
Go
•1•2•0•0•Updated Nov 13, 2023Nov 13, 2023
cog-whisper
Public
Run OpenAI Whisper as a Cog model
Python
•
Apache License 2.0
•50•2•0•0•Updated Nov 8, 2023Nov 8, 2023
transformer-deploy
Public
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
Python
•
Apache License 2.0
•153•0•0•0•Updated Oct 25, 2023Oct 25, 2023
Resemblyzer
Public
A python package to analyze and compare voices with deep learning
Python
•
Apache License 2.0
•452•3k•42•2•Updated Oct 12, 2023Oct 12, 2023
heroku-buildpack-ffmpeg-latest
Public
A Heroku buildpack for ffmpeg that always downloads the latest static build
Shell
•
MIT License
•724•0•0•0•Updated Aug 21, 2023Aug 21, 2023
g2pW
Public
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
Python
•
Apache License 2.0
•43•0•0•0•Updated Jul 8, 2023Jul 8, 2023
univnet
Public
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Python
•
BSD 3-Clause "New" or "Revised" License
•45•0•0•0•Updated May 19, 2023May 19, 2023
NeMo
Public
NeMo: a toolkit for conversational AI
Python
•
Apache License 2.0
•3k•9•0•0•Updated Jan 18, 2023Jan 18, 2023
phonemizer
Public
Simple text to phonemes converter for multiple languages
Python
•
GNU General Public License v3.0
•191•20•0•1•Updated Nov 21, 2022Nov 21, 2022
whisper
Public
Robust Speech Recognition via Large-Scale Weak Supervision
Jupyter Notebook
•
MIT License
•11k•1•0•0•Updated Oct 4, 2022Oct 4, 2022