Indo-Aryan languages


The Indo-Aryan or Indic languages are a major language family of South Asia. They constitute a branch of the Indo-Iranian languages, themselves a branch of the Indo-European language family. In the early 21st century, Indo-Aryan languages were spoken by more than 800 million people, primarily in India, Bangladesh, Nepal, Pakistan and Sri Lanka. Moreover, there are large immigrant and expatriate Indo-Aryan-speaking communities in Northwestern Europe, Western Asia, North America and Australia. There are well over 200 known Indo-Aryan languages.
Modern Indo-Aryan languages are descended from Old Indo-Aryan languages such as early Sanskrit, through Middle Indo-Aryan languages. The largest in terms of L1 speakers are Hindi-Urdu, Bengali, Punjabi, Marathi,, Gujarati, Rajasthani, Bhojpuri, Odia, Maithili, Sindhi, Nepali, Assamese, Kamta and other languages, with a 2005 estimate placing the total number of native speakers at nearly 900 million.

Classification

There can be no definitive enumeration of Indo-Aryan languages because the dialects form a group of continua.
The classification follows Masica and Kausen.

Dardic

;Kashmiri - 5.6 million speakers
;Shina
;Pashayi - 400,000 speakers
;Kunar
;Chitral
;Kohistani

Northern Zone

;Eastern Pahari
;Central Pahari
;Western Pahari/ Himachali

Northwestern Zone

;Punjabi
;Sindhi

Western Zone

Ethnologue lists the following languages under the Western Zone that are not already covered in other subgroups:
;Rajasthani
;Gujarati - 49 million speakers
;Bhil - 10.4 million speakers
;Khandeshi - 1.9 million speakers
;Lambadi - 4.5 Million speakers
;Domari - 4 million speakers
;Romani - 1.5 million speakers

Central Zone (Madhya ''or'' Hindi)

;Parya - 4,000 speakers
;Western Hindi
;Eastern Hindi
Parya historically belonged to the Central Zone but lost intelligibility with other languages of the group due to geographic distance and numerous grammatical and lexical innovations.

Eastern Zone

These languages derive from Magadhi Prakrit.
Tharu - 1.9 million speakers
Bihari
Odia-Halbic
;Odia - 35 million speakers
;Halbic
Bengali–Assamese
;Assamese - 15 million speakers
;Bengali - 268 million speakers
Marathi-Konkani
This group of languages developed from Maharashtri Prakrit. It is not clear if Dakhini is part of Hindustani along with Standard Urdu or a separate Persian-influenced development from Marathi.
;Marathi - 93 million speakers
;Konkani - 2.26 million speakers
Insular Indo-Aryan
The Insular Indo-Aryan languages are descendant of Elu Prakrit and share several characteristics that set them apart significantly from the continental languages.
;Sinhala - 17 million speakers
;Maldivian - 340,000 speakers

Unclassified

The following languages are related to each other, but otherwise unclassified within Indo-Aryan:
Chinali–Lahul Lohar
The following other poorly attested languages are listed as unclassified within the Indo-Aryan family by Ethnologue 17:
Also Degaru, Mina, Bhalay and Gowlan are all names for the Gowli caste, rather than a language.
Kholosi

History

Proto-Indo-Aryan

Proto-Indo-Aryan, or sometimes Proto-Indic, is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the pre-Vedic Indo-Aryans. Proto-Indo-Aryan is meant to be the predecessor of Old Indo-Aryan which is directly attested as Vedic and Mitanni-Aryan. Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of archaic features lost in Vedic.

Indian subcontinent

Dates indicate only a rough time frame.
The earliest evidence of the group is from Vedic Sanskrit, that is used in the ancient preserved texts of the Indian subcontinent, the foundational canon of Hinduism known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age to the language of the Rigveda, but the only evidence of it is a few proper names and specialized loanwords.
From Vedic Sanskrit, "Sanskrit" developed as the prestige language of culture, science and religion as well as the court, theatre, etc. Modern Sanskrit is a continuation of Classical Sanskrit and is largely mutually unintelligible with Vedic Sanskrit.

Middle Indo-Aryan (Prakrits)

show some Middle Indo-Aryan characteristics along with Old Indo-Aryan, for example sapta in Old Indo-Aryan becomes satta. According to S.S. Misra this language can be similar to Buddhist-hybrid Sanskrit which might not be a mixed language but an early middle Indo-Aryan occurring much before Prakrit.
Outside the learned sphere of Sanskrit, vernacular dialects continued to evolve. The oldest attested Prakrits are the Buddhist and Jain canonical languages Pali and Ardhamagadhi Prakrit, respectively. By medieval times, the Prakrits had diversified into various Middle Indo-Aryan languages. Apabhraṃśa is the conventional cover term for transitional dialects connecting late Middle Indo-Aryan with early Modern Indo-Aryan, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the Śravakacāra of Devasena is now considered to be the first Hindi book.
The next major milestone occurred with the Muslim conquests in the Indian subcontinent in the 13th–16th centuries. Under the flourishing Turco-Mongol Mughal Empire, Persian became very influential as the language of prestige of the Islamic courts due to adoptation of the foreign language by the Mughal emperors. However, Persian was soon displaced by Hindustani. This Indo-Aryan language is a combination with Persian, Arabic, and Turkic elements in its vocabulary, with the grammar of the local dialects.
The two largest languages that formed from Apabhraṃśa were Bengali and Hindustani; others include Assamese, Sindhi, Gujarati, Odia, Marathi, and Punjabi.

New Indo-Aryan

Medieval Hindustani
In the Central Zone Hindi-speaking areas, for a long time the prestige dialect was Braj Bhasha, but this was replaced in the 19th century by Dehlavi-based Hindustani. Hindustani was strongly influenced by Persian, with these and later Sanskrit influence leading to the emergence of Modern Standard Hindi and Modern Standard Urdu as registers of the Hindustani language. This state of affairs continued until the division of the British Indian Empire in 1947, when Hindi became the official language in India and Urdu became official in Pakistan. Despite the different script the fundamental grammar remains identical, the difference is more sociolinguistic than purely linguistic. Today it is widely understood/spoken as a second or third language throughout South Asia and one of the most widely known languages in the world in terms of number of speakers.

Mitanni-Aryan

Some theonyms, proper names and other terminology of the Mitanni exhibit an Indo-Aryan superstrate, suggest that an Indo-Aryan elite imposed itself over the Hurrians in the course of the Indo-Aryan expansion. In a treaty between the Hittites and the Mitanni, the deities Mitra, Varuna, Indra, and the Ashvins are invoked. Kikkuli's horse training text includes technical terms such as aika, tera, panza, satta, na, vartana. The numeral aika "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian
Another text has babru, parita, and pinkara. Their chief festival was the celebration of the solstice which was common in most cultures in the ancient world. The Mitanni warriors were called marya, the term for "warrior" in Sanskrit as well; note mišta-nnu "payment ".
Sanskritic interpretations of Mitanni royal names render Artashumara as Ṛtasmara "who thinks of Ṛta", Biridashva as Prītāśva "whose horse is dear", Priyamazda as Priyamedha "whose wisdom is dear", Citrarata as Citraratha "whose chariot is shining", Indaruda/Endaruta as Indrota "helped by Indra", Shativaza as Sātivāja "winning the race price", Šubandhu as Subandhu "having good relatives", Tushratta as *tṷaiašaratha, Vedic Tvastar "whose chariot is vehement".

Romani, Lomavren, and Domari languages

Domari

is an Indo-Aryan language spoken by older Dom people scattered across the Middle East. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan. Based on the systematicity of sound changes, we know with a fair degree of certainty that the names Domari and Romani derive from the Indo-Aryan word ḍom.

Lomavren

is a nearly extinct mixed language, spoken by the Lom people, that arose from language contact between a language related to Romani and Domari and the Armenian language.

Romani

The Romani language is usually included in the Western Indo-Aryan languages. Romani—spoken mainly in various parts of Europe—is conservative in maintaining almost intact the Middle Indo-Aryan present-tense person concord markers, and in maintaining consonantal endings for nominal case—both features that have been eroded in most other modern languages of Central India. It shares an innovative pattern of past-tense person concord with the languages of the Northwest, such as Kashmiri and Shina. This is believed to be further proof that Romani originated in the Central region, then migrated to the Northwest.
There are no known historical documents about the early phases of the Romani language.
Linguistic evaluation carried out in the nineteenth century by Pott and Miklosich showed that the Romani language is to be classed as a New Indo-Aryan language, not Middle Indo-Aryan, establishing that the ancestors of the Romani could not have left India significantly earlier than AD 1000.
The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, and its reduction to just a two-way case system, nominative vs. oblique. A secondary argument concerns the system of gender differentiation. Romani has only two genders . Middle Indo-Aryan languages generally had three genders, and some modern Indo-Aryan languages retain this old system even today.
It is argued that loss of the neuter gender did not occur until the transition to NIA. Most of the neuter nouns became masculine while a few feminine, like the neuter अग्नि in the Prakrit became the feminine आग in Hindi and jag in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have been cited as evidence that the forerunner of Romani remained on the Indian subcontinent until a later period, perhaps even as late as the tenth century.

Phonology

Consonants

Stop positions

The normative system of New Indo-Aryan stops consists of five points of articulation: labial, dental, "retroflex", palatal, and velar, which is the same as that of Sanskrit. The "retroflex" position may involve retroflexion, or curling the tongue to make the contact with the underside of the tip, or merely retraction. The point of contact may be alveolar or postalveolar, and the distinctive quality may arise more from the shaping than from the position of the tongue. Palatals stops have affricated release and are traditionally included as involving a distinctive tongue position. Widely transcribed as, claims to be a more accurate rendering.
Moving away from the normative system, some languages and dialects have alveolar affricates instead of palatal, though some among them retain in certain positions: before front vowels, before, or when geminated. Alveolar as an additional point of articulation occurs in Marathi and Konkani where dialect mixture and others factors upset the aforementioned complementation to produce minimal environments, in some West Pahari dialects through internal developments, and in Kashmiri. The addition of a retroflex affricate to this in some Dardic languages maxes out the number of stop positions at seven, while a reduction to the inventory involves *ts >, which has happened in Assamese, Chittagonian, Sinhala, and Southern Mewari.
Further reductions in the number of stop articulations are in Assamese and Romany, which have lost the characteristic dental/retroflex contrast, and in Chittagonian, which may lose its labial and velar articulations through spirantisation in many positions.
Stop seriesLanguage
,,,, Hindi, Punjabi, Dogri, Sindhi, Gujarati, Sinhala, Odia, Standard Bengali, dialects of Rajasthani
,,,, Sanskrit, Maithili, Magahi, Bhojpuri
,,,, Nepali, dialects of Rajasthani, Northern Lahnda's Kagani, Kumauni, many West Pahari dialects
,,,,, Marathi, Konkani, certain W. Pahari dialects, Kashmiri
,,,,,, Shina, Bashkarik, Gawarbati, Phalura, Kalasha, Khowar, Shumashti, Kanyawali, Pashai
,,, Rajasthani's S. Mewari
,,,,, E. and N. dialects of Bengali
,, Assamese
,,, Romani
,, Sylheti
, Chittagonian

Nasals

Sanskrit was noted as having five nasal-stop articulations corresponding to its oral stops, and among modern languages and dialects Dogri, Kacchi, Kalasha, Rudhari, Shina, Saurasthtri, and Sindhi have been analysed as having this full complement of phonemic nasals , with the last two generally as the result of the loss of the stop from a homorganic nasal + stop cluster, though there are other sources as well.

Charts

The following are consonant systems of major and representative New Indo-Aryan languages, as presented in, though here they are in IPA. Parentheses indicate those consonants found only in loanwords: square brackets indicate those with "very low functional load". The arrangement is roughly geographical.

Language and dialect

In the context of South Asia, the choice between the appellations "language" and "dialect" is a difficult one, and any distinction made using these terms is obscured by their ambiguity. In one general colloquial sense, a language is a "developed" dialect: one that is standardised, has a written tradition and enjoys social prestige. As there are degrees of development, the boundary between a language and a dialect thus defined is not clear-cut, and there is a large middle ground where assignment is contestable.
There is a second meaning of these terms, in which the distinction is drawn on the basis of linguistic similarity. Though seemingly a "proper" linguistics sense of the terms, it is still problematic: methods that have been proposed for quantifying difference have not been seriously applied in practice; and any relationship established in this framework is relative.