Technology that speaks your language

Why should AI only work for a few of the world’s languages?

Our language is our story, our community, our culture. Let's create the datasets that we want to see in the world.

Get started

Common Voice is a free, open source platform for community-led data creation

Anyone can preserve, revitalise and elevate their language by sharing, creating and curating text and speech datasets.

Scripted Speech

Read sentences aloud in your language and contribute to the most diverse public participation speech dataset in the world.

Speak

Spontaneous Speech

Respond to prompts to create datasets for organic, colloquial contexts. Perfect for oral-first languages.

Coming soon

Language Text

Create or share public domain prompts, sentences, and text for translation, small language models, and more.

Add Text

Powered by global communities, for global communities — 130 languages and growing!

Participate in language community discussions, ask questions, and learn about upcoming events and talks.

Join us on Discord

Publicly accessible open speech datasets in 130+ languages

Datasets for ASR, STT, TTS, and other NLP contexts - created through community participation.

Explore datasets

Support open, community-led datasets

Partner with us

Civil society and researchers - create, host and share impactful datasets for free

Tech companies - invest in open dataset creation for a thriving multilingual AI ecosystem

Philanthropy - sponsor dataset creation to fuel local innovation and development

Our partners include...