Common Voice is a publicly available voice dataset, powered by the voices of volunteer contributors around the world. People who want to build voice applications can use the dataset to train machine learning models.
Learn more about our project on the onboarding page and check out the Common Voice Platform demo mode.
Pages | About |
---|---|
Onboarding | Are you new to the Common Voice project? Read this page! |
Localization | Translating project tools and material to be understood by contributors in their language |
Text Corpus | Gathering, validating and processing public domain sentences |
Voice Corpus | Recording and validating voice clips to create a public domain dataset |
Communities | Connect with the variety of communities participating in Common Voice |
Mobilization | Resources and tips for mobilizing your community |
CC0 Waiver Process | How to secure a CC0 license for a text corpus |
Variants for Languages | Community guidance on selecting variants for your language |
Our community playbook is a living document of our communities' history and knowledge. After reading the playbook chapters, you will understand:
- The goals and ethos of the Common Voice project
- The journey of a language onto Common Voice
- How to set up and maintain a language community as part of Common Voice
As you read the playbook:
🔨 Check the required skills for each section and look for people who have them.
💬 Check the channels section to learn how to set up your local forums and chat to communicate with other people in your language.
Each chapter covers: Purpose - Who we are - Success - How to join - What we do - Roles - Channels
To help you navigate the playbook quickly, this list pairs things you might want to do with the content that covers them.
I would like to:
- Add my language to Common Voice
- Contribute text to help build the Common Voice dataset
- Learn more about recording and validating voices on Common Voice
- Build a language community and access engagement materials
- Connect with language communities across Common Voice
- Localize the playbook for my language community
- Understand what Public Domain or CC0 means
Common Voice communities are governed by Mozilla's code of conduct and etiquette guidelines. We take this very seriously, and no violations are tolerated.
Please read the Mozilla Community Participation Guidelines before contributing to this project.
For more information on how to report violations of the Community Participation Guidelines, please read our 'How to Report' page.
Mozilla Foundation stewards the overall Common Voice project and is the ultimate decision-maker for its direction and goals. It also oversees the development of some tools and channels described here to support our communities. Read more about how the project is governed.
Common Voice language communities are self-organized, and you don’t need to ask for permission to participate in or mobilize any of these communities in your language. All the data generated by communities is published under open licenses. Some community roles exist formally and others informally, and all of them should follow the Mozilla leadership shared agreements.
The Common Voice Team at Mozilla Foundation shares weekly updates with the community on Discourse.
Common Voice has a variety of communities that support the project in different important areas. They are usually grouped by language.
👥 A language's journey onto Common Voice is made possible by communities of multidisciplinary teams of committed people. Roles range from those that need no coding to organizing roles. Our community mobilization resource and Community page can connect you to resources and existing language communities that can support you.
ℹ️ Note: Mozilla welcomes small and minority language communities, and we understand some of these goals may seem out of reach. In that case, feel free to share with us how they are different for you, and we will try to help. Connect with the Common Voice Team on Discourse or GitHub issues.
- Thank you to the authors, reviewers and maintainers of the community playbook. Without your support this playbook wouldn’t be possible!
- Learn more about how we maintain the playbook and about the playbook's license
- Content coordination: Hillary Juma, Common Voice Community Manager