The Indo-European languages include some 449 (SIL estimate, 2018 edition) languages and dialects spoken by about or more than 3.5 billion people (roughly half of the world population). Most of the major languages belonging to language branches and groups of Europe, and Western and southern Asia, belong to the Indo-European language family. Therefore, Indo-European is the biggest language family in the world by number of mother tongue speakers (but not by number of languages in which it is the 3rd or 5th biggest). Eight of the top ten biggest languages, by number of native speakers, are Indo-European. One of these languages, English, is the de facto World Lingua Franca with an estimate of over one billion second language speakers.
Each subfamily or linguistic branch in this list contains many subgroups and individual languages. Indo-European language family has 10 known branches or subfamilies, of which eight are living and two are extinct. The relation of Indo-European branches, how they are related to one another and branched from the ancestral proto-language is a matter of further research and not yet well known. There are some individual Indo-European languages that are unclassified within the language family, they are not yet classified in a branch and could be members of their own branch.
The 449 Indo-European languages identified in the SIL estimate, 2018 edition, are mostly living languages, however, if all the known extinct Indo-European languages are added, they number more than 800. This list includes all known Indo-European languages, living and extinct.
A distinction between a language and a dialect is not clear-cut and simple because there is, in many cases, several dialect continuums, transitional dialects and languages and also because there is no consensual standard to what amount of vocabulary, grammar, pronunciation and prosody differences there is a language or there is a dialect (mutual intelligibility can be a standard but there are closely related languages that are also mutual intelligible to some degree, even if it is an asymmetric intelligibility). Because of this, in this list, several dialect groups and some individual dialects of languages are shown (in italics), especially if a language is or was spoken by a large number of people and over a big land area, but also if it has or had divergent dialects.
At the end of the second millennium BC Indo-European speakers were many millions and lived in a vast geographical area in most of western and southern Eurasia (including western Central Asia).
In the following two millennia the number of speakers of Indo-European languages increased even further.
By geographical area, Indo-European languages remained spoken in big land areas, although most of western Central Asia and Asia Minor was lost to another language family (mainly Turkic) due to Turkic expansion, conquests and settlement (after the middle of the first millennium AD and the beginning and middle of the second millennium AD respectively) and also to Mongol invasions and conquests (that changed Central Asia ethnolinguistic composition). Another land area lost to non-Indo-European languages was today's Hungary due to Magyar/Hungarian (Uralic language speakers) conquest and settlement.
However, in the second half of the second millennium AD, Indo-European languages expanded their territories to North Asia (Siberia), through Russian expansion, and North America, South America, Australia and New Zealand as the result of the age of European discoveries and European conquests through the expansions of the Portuguese, Spanish, French, English and the Dutch (these peoples had the biggest continental or maritime empires in the world and their countries were major powers).
The contact between different peoples and languages, especially as a result of European colonization, also gave origin to the many pidgins, creoles and mixed languages that are mainly based in Indo-European languages (many of which are spoken in island groups and coastal regions).
Hypothetical relation to other language families and their proto-languages (controversial and yet unresolved issue of high level classification of known language families into larger clades of older age that descend from common ancestors in the remote past)
Late Proto-Indo-European (Last phase of indo-European as spoken language before splitting into several languages that originated in the regional dialects that diverged in time, and in space with Indo-European migrations, these languages were the direct ancestors of today's subfamilies or "branches" of descendant languages) (larger clades of Indo-European than the individual subfamilies or the way individual subfamilies are related to each other is still an unresolved issue)
Although all Indo-European languages descend from a common ancestor called Proto-Indo-European, the kinship between the subfamilies or branches (large groups of more closely related languages within the language family), that descend from other more recent proto-languages, is not the same because there are subfamilies that are closer or further, and they did not split-off at the same time, the affinity or kinship of Indo-European subfamilies or branches between themselves is still an unresolved and controversial issue (the reason for they are shown as separate and by alphabetical order in this list of Indo-European languages).
Using a mathematical analysis borrowed from evolutionary biology, Don Ringe and Tandy Warnow propose the following tree of Indo-European branches:
Armenian dialects, according to Adjarian (1909) (before 1st World War and Armenian Genocide). In many regions of the contiguous area shown in the map, Armenian speakers were the majority or a significant minority.
Modern geographical distribution of the Armenian language.
Area of Balto-Slavic dialect continuum (purple) with proposed material cultures correlating to speakers Balto-Slavic in Bronze Age (white). Red dots= archaic Slavic hydronyms.
Political map of Europe with countries where a Slavic language is a national language marked in shades of green and where a Baltic language is a national language marked in light orange. Wood green represents East Slavic languages, pale green represents West Slavic languages, and sea green represents South Slavic languages. Contemporary Baltic languages are all from the same group: Eastern Baltic
Baltic languages (extinct languages shown in stripes).
Slavic languages in Europe (2008). Areas where languages overlap are shown in stripes.
Russian Language - Map of all the areas where the Russian language is the language spoken by the majority of the population. Based on the latest census available per country (2013). Russian is the biggest Slavic language both in number of first language speakers and in geographical area where the language is spoken (a vast land area of Eastern Europe and North Asia - Siberia, i.e. most of Northern Eurasia).
A map of the modern distribution of the Celtic languages. Red: Welsh; Purple:Cornish; Black: Breton; Green: Irish Gaelic; Blue: Scottish Gaelic: Yellow: Manx Gaelic. Areas where languages overlap are shown in stripes.
Germanic languages and main dialect groups in Europe.
Germanic languages in the World. Countries and sub-national entities where one or more Germanic languages are spoken. Dark Red: First language; Red: Official or Co-Official language, Pink: Spoken by a significant minority as second language.
Pennsylvania German (Pennsylvania "Dutch") (Deitsch / Pennsilfaanisch Deitsch) (Pennsilfaanisch Deitsch is the self name or autonym of the language, "Deitsch" and "Dutch" are cognates but now have different meanings: one for Germanic language in a broad sense, not only for German in a narrow sense, and the other for specifically the Dutch or Nederlandic language, hence the name "Pennsylvania Dutch" for the language in English due to the similarity of names)
Modern Athenian / Metropolitan Athenian Greek (close to Standard Modern Greek) (not quite a Southern or Northern Greek dialect, although Standard Modern Greek is based predominantly on the southern dialects, especially those of the Peloponnese)
Northern-Central Anatolian Greek/Northern-Central Asia Minor Greek (more divergent than Western and Southern Anatolian Greek, that were more in contact with other Greek dialects, divergent enough to be considered separate languages although closely related to Modern Greek, they descend from Medieval or Byzantine Greek)
Geographic distribution of modern Indo-Iranian languages. Blue, dark purple and green colour shades: Iranic languages. Dark pink: Nuristani languages. Red, light purple and orange colour shades: Indo-Aryan languages. Areas where languages overlap are shown in stripes.
Distribution of major modern Iranian languages.
Geographic distribution of modern Iranian languages (Central Iran languages are shown in blue dots).
Distribution of language groups and major modern Indo-Aryan languages. Pink: Dardic; Dark Blue: Northwestern Indo-Aryan; Purple: Northern Indo-Aryan; Red: Western Indo-Aryan; Orange: Central and East Central Indo-Aryan; Yellow: Eastern Indo-Aryan; Green: Southern Indo-Aryan. Areas where languages overlap are shown in stripes.
Jassic(extinct)(Ossetic variant, more closely related to Digor, of a nomadic tribe, the Jassic people, settled in Hungary at the 13th century, in Jaszsag)(not confuse with the language of the Iazyges, a related but separate language)
Bodo Parja / Jharia (tribal dialect of Odia spoken mostly in Koraput district of Southern Odisha)
Desiya Odia or Koraputia Odia (spoken in Koraput, Kalahandi, Rayagada, Nabarangapur and Malkangiri Districts of Odisha and in the hilly regions of Vishakhapatnam, Vizianagaram District of Andhra Pradesh)
Sambalpuri / Western Odia (Kosali) (spoken in western Odisha, East India, in Bargarh, Bolangir, Boudh, Debagarh, Nuapada, Sambalpur, Subarnapur districts of Odisha and in Raigarh, Mahasamund, Raipur districts of Chhattisgarh state) (it is not to be confused with "Kosali", a term sometimes also used for Awadhi and related languages)
Reli / Relli (spoken in Southern Odisha and bordering areas of Andhra Pradesh)
Kupia (spoken by the Valmiki caste people in the Indian state of Telangana and Andhra Pradesh, mostly in Hyderabad, Mahabubnagar, Srikakulam, Vizianagaram, East Godavari and Visakhapatnam districts)
Tharu (थारु - Tharu) (not only one language) (Pre-Indo-European, Pre-Dravidian and Pre-Sino-Tibetan substrate of an unknown language or languages of a possible indigenous language family) (mainly in the Terai)
Iron Age Italy (c.500 B.C.). Italic languages in green colours.
European extent of Romance languages in the 20th century.
Romance languages in Europe (major dialect groups are also shown).
Romance languages in the World. Countries and sub-national entities where one or more Romance languages are spoken. Dark colours: First language, Light colours: Official or Co-Official language; Very Light colours: Spoken by a significant minority as first or second language. Blue: French; Green: Spanish; Orange: Portuguese; Yellow: Italian; Red: Romanian.
Southern Latin(retention of archaic features in the periphery of the Latin speaking world)
Insular Latin(Not Insular Romance)(Latin that was spoken by the insular populations of Corsica and Sardinia)
African Latin(Not African Romance)(West North Africa, in many regions of today's Maghreb)(Latin that was spoken by the African Romans in North Africa, especially in the Africa province, the origin of the name "Africa" that was later applied to the whole continent)
Latin Sociolects(most provinces)
Imperial Latin(Sociolect used by ruling class Romans)
Reggino(in the Metropolitan City of Reggio Calabria, especially on the Scilla–Bova line, and excluding the areas of Locri and Rosarno which represent the first isogloss which divide Sicilian from the continental varieties)
Modenese(spoken in the Province of Modena, although Bolognese is more widespread in the Castelfranco area. In the northern part of the province of Modena, the lowlands around the town of Mirandola, a Mirandolese sub-dialect of Modenese is spoken)
Reggiano(spoken in the Province of Reggio Emilia, although the northern parts, such as Guastalla, Luzzara and Reggiolo, of the province are not part of this group and closer to Mantovano)
Parmigiano(spoken in the Province of Parma. Those from the area refer to the Parmigiano spoken outside of Parma as Arioso or Parmense, although today's urban and rural dialects are so mixed that only a few speak the original. The language spoken in Casalmaggiore in the Province of Cremona to the north of Parma is closely related to Parmigiano)
Piacentino(spoken west of the River Taro in the Province of Piacenza and on the border with the province of Parma. The variants of Piacentino are strongly influenced by Lombard, Piedmontese, and Ligurian)
Western Waloon / Wallo-Picard (Walo-Picård) - the dialect closest to French proper and with a strong Picard influence, spoken in Charleroi (Tchårlerwè), Nivelles (Nivele), and Philippeville (Flipvile)
Central Waloon / Namurois (Walon do Mitan) - spoken in Namur (Nameur), the Wallon capital, and the cities of Wavre (Åve) and Dinant
Eastern Waloon / Liégeois (Walon do Levant) – in many respects the most conservative and idiosyncratic of the dialects, spoken in Liège (Lidje), Verviers (Vervî), Malmedy (Måmdi), Huy (Hu), and Waremme (Wareme)
Southern Waloon / Wallo-Lorrain (Walon Nonnrece) – close to the Lorrain and to a lesser extent Champenois languages, spoken in Bastogne, Marche-en-Famenne (Måtche-el-Fåmene), and Neufchâteau (Li Tchestea), all in the Ardennes region.
Poitevin-Saintongeais (Southwest Oïl) (South Gallo-Romance Occitan substrate)
Central Aragonese (roughly in the original area where the Romance language called "Navarro-Aragonese" originated) (extinct)(people shifted to an Aragonese Castilian variety with an Aragonese substrate)
Far-Eastern Leonese (Leonese of Palencia-Valladolid-Salamanca) (extinct) (in the past it was spoken in most of Palencia, Valladolid and Salamanca provinces but there people shifted to a Leonese Castilian variety)
Northern(some features are transitional to Galician)(a typical feature of the Northern Portuguese dialects is that they have betacism, i.e. they don't distinguish between b [b or β] and v [v] phonemes, i.e v [v] phoneme is absent)
Alto Minhoto(geographically in Minho Province but more closely related to the Transmontano dialect)
Guarda District dialect(more features in common with Northern dialects but in the phonetics distinguishes between b [b] and v [v] phonemes, a typical feature of the Central and Southern dialects)
Central-Southern(a typical feature of the Central and Southern Portuguese dialects is that in the phonetics they don't have betacism, i.e. they distinguish between b [b] and v [v] phonemes, i.e. v [v] phoneme is clearly pronounced)
Coastal Central(Extremaduran Portuguese)(Português Estremenho)(Transitional Northern-Southern)(basis of Modern Standard European Portuguese but not identical)(although in the 20th century a province in the Central Coastal Lowlands region was called Beira Litoral, i.e. Litoral / Coastal Beira, older and traditional Beira Province was an inland province in the Highlands, while all Central Coastal Lowlands region of Mainland Portugal was the province of Estremadura until the 18th century) ("Beira" name means edge, slope, mountain slope, or border, with the specific meaning of "Mountainous Borderland" or "Edge Borderland")(until the 14th century the broad or colective name for all the portuguese territories south of Douro river was "Extremadura", i.e. "Far Border Land", the name derives from "Extrema", "Extremada" - extreme in the sense of extreme borderland, far borderland)(this name is cognate and has equivalents with the Leonese, Castilian and Aragonese Extremaduras, that were also old Borderlands at the beginning of the Christian Reconquista)(therefore "Estremadura" and "Beira" names had the meaning of "Borderland" in the context of the Christian Reconquista)
Northern Coastal Central(more features in common with Central and Southern dialects, but in the phonetics, some areas, mainly in Aveiro county, don't distinguish between b [b] and v [v] phonemes, i.e. they don't have v [v] phoneme, a typical feature of the Northern dialects)
Southern Coastal Central(Standard European Portuguese is mainly based on this dialect)
Lisbon dialect(early Lisbon dialect, Lisboeta, was only spoken in Lisbon itself and was an enclave, however today it is spoken in Lisbon metropolitan area, and is a very widespread dialect, many dialects are under pressure and being replaced by the standard language that closely resembles Lisbon dialect)
Inland Southern Central(Beira-Baixa-Far Northern Alto-Alentejo)(a divergent group of Portuguese dialects in phonetics and some vocabulary, it forms its own dialectal group)(its more typical phonetic feature is the presence of the vowels ö [ø] and ü [y], phonemes that don't exist in the other Portuguese dialects or other Iberian Romance / Hispano Romance languages and dialects but are a typical common feature of the Gallo-Romance languages and dialects; several placenames / toponyms in Beira Baixa, roughly Castelo Branco County, and Far North Alto Alentejo, North Portalegre County, such as Proença, Old Occitan name of Provence, Ródão, from Rodano, a name for Rhodanus river, Tolosa, Occitan name of Toulouse, seem to testify an old Gallo-Romance presence of speakers in enclaves, they were assimilated to Galician-Portuguese but left a phonetic infuence in the dialect of this region; in the 13th century, speakers of this dialect group also settled in Western Algarve, at the end of the Portuguese Reconquista; in the 15th and 16th centuries, speakers of this dialect group, mixed with speakers of other dialectal groups, settled in several islands of the Archipelagos of the Azores and Madeira)(declining and extinct in many municipalities where it was spoken)
Far Northern Alto-Alentejo(South of Tagus river, geographically in Alentejo but closely related to the Beira Baixa dialect and not to the Alentejo dialect)
Far Western Algarvian(geographically in the Algarve but is more related to the Beira Baixa dialect and not to the Algarvian dialect, it is an Inland Southern Central dialect enclave in Far Southwestern Mainland Portugal)(has the ü [y] phoneme but doesn't have the ö [ø] phoneme)
Ribatejano(along Tagus River banks)(in Ribatejo Province) ("Ribatejo - Riba Tejo" name means "Tagus Banks", from "Riba" - River Bank and "Tejo" - the Tagus river)
Setubalense(in the Setubal Peninsula)(its more typical phonetic feature is that it doesn't distinguish between trilled r [r] and guttural r [ʁ] i.e. r is always pronounced as guttural r [ʁ])(overlaps and under pressure of the modern Lisbon metropolitan area dialect)
Alentejano(its more typical phonetic feature is the pronunciation of more open vowels than in Standard European Portuguese, final vowel e [e] is generaly pronounced as i [i] or the [i] vowel is added after a final consonant where Standard European Portuguese doesn't have a final vowel after a consonant, and has a distinct prosody)(in South Alto Alentejo and Baixo Alentejo Provinces) ("Alentejo - Além Tejo" name means "Beyond Tagus")
Islander(Geographical Grouping and not a Linguistic Genealogical one)(a divergent group of Portuguese dialects in phonetics and some vocabulary, several linguistic archaisms from Middle Portuguese when the islands were settled)(Azores and Madeira didn't had native Pre-European peoples)
Azorean(nine dialects in the nine islands of the Azores Archipelago, it's not only a single dialect)
Micaelense(São Miguel Island dialect)(its more typical phonetic feature is the presence of the vowels ö [ø] and ü [y] in its phonemes, a common phonetic feature with Inland Southern Central dialects, mainly Baixo Beirão dialect, and with the more distant Gallo-Romance languages and dialects, it has more vowels than Standard European Portuguese and several long vowels, and it has a "French-like" prosody)
Terceirense(Terceira Island dialect)(its more typical phonetic feature is the presence of the semivowels [j] and [w] before a vowel in many words where Standard European Portuguese only has one vowel and a "singing-like" prosody)
Faialense(Faial Island dialect)(Faial island dialect is closer to Standard European Portuguese than the dialects of other islands, initial Flemish settlers, that spoke the germanic Flemish dialect of Dutch, some years later were rapidly surpassed and assimilated by a big majority of Portuguese settlers that came from Coastal Central Portugal, whose dialect is the basis of European Standard Portuguese, and did not influenced Faial Island dialect)
Madeirense(Madeira Island dialect)(its more typical phonetic feature is the pronunciation of the vowel u [u], in many cases, as a Schwa [ə], where Micaelense and Baixo-Beirão dialects have ü [y] and the palatalization of l [l] to [λ] before i [i])
Amazonic Range(Serra Amazônica) / Deforestation Arc(Arco do Desflorestamento)
Southern / Broad Southern(one of its earlier centers, in the 16th century, was São Vicente, in the western half of the island with the same name, closely offshore of São Paulo State coast, in the eastern half of the island is Santos city)
Tocharian languagesA (blue), B (red) and C (green) in the Tarim Basin. Tarim oasis towns are given as listed in the Book of Han (c. 2nd century BC). The areas of the squares are proportional to population.
East Central Asia Indo-European / Wusun-Yuezhi? (Chinese exonyms for ancient Indo-European speaking peoples of East Central Asia, the endonyms are unknown, may have been the same or related to the same language subfamily of the "Tocharians" - Agni-Kuči, Indo-Aryan languages, another Indo-European subfamily or not directly related and belong to two different Indo-European subfamilies - one Tocharian and the other Iranian)
Yuezhiean (it was spoken by the Yuezhi, an ancient Indo-European speaking people, in the modern Chinese province of Gansu, during the 1st millennium BC, or in Dunhong, in the Tian Shan) (true Tocharian)
Greater-Yuezhiean (Dà Yuèzhī - 大月氏) (dialect ancestral to the hypothetical Kushanite language spoken in Kushana)
Lesser-Yuezhiean (Xiǎo Yuèzhī - 小月氏) (dialect ancestral to Tocharian)
Ossinean (Ancient language of the steppe, spoken by the Asii)
Gushiean (Language of an obscure ancient people on the Turpan Basin, known as the Jushi. It eventually diverged into two dialects, as noted by diplomats from the Han empire)
Nearer Gushiean (Dialect spoken in eastern Gansu in closer proximity to the Chinese)
Further Gushiean (Dialect spoken in Xinjian further away from the Chinese)
Yanghainese (Language of the Yanghai Tomb Culture, left a substrate in Gushiean, ~800 BCE)
Possible Indo-European languages (all extinct)
Unclassified languages that may have been Indo-European or members of other language families
Hypothetical Indo-European languages (all extinct)
Languages that may have existed and may have been Indo-European
Euphratic / Proto-Euphratean (a hypothetical early Indo-European language that influenced some languages of the Euphrates river basin, Euphratic languages possibly include Euphratic Proper, Zagrotic (Spoken in the Zagros), And Tigritic Spoken near the Tigres River)
Sorothaptic (a hypothetical pre-Celtic Bronze Age Indo-European language of the Urnfield culture in the Iberian Peninsula) (possibly part of an older Pre-Celtic Indo-European branch)