|16th century CE-present|
|The Brahmic script and its descendants|
Gurmukhī (Punjabi: ਗੁਰਮੁਖੀ, [ˈɡʊɾmʊkʰiː], Shahmukhi: گُرمُکھی) is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). Commonly regarded as a Sikh script, Gurmukhi is used in Punjab, India as the official script of the Punjabi language.
The primary scripture of Sikhism, the Guru Granth Sahib, is written in Gurmukhī, in various dialects and languages often subsumed under the generic title Sant Bhasha or saint language, in addition to other languages like Persian and various phases of Indo-Aryan languages.
Modern Gurmukhī has thirty-five original letters, hence its common alternative term pentī or "the thirty-five," plus six additional consonants, nine vowel diacritics, two diacritics for nasal sounds, one diacritic that geminates consonants and three subscript characters.
The Gurmukhī script is generally believed to have roots in the Proto-Sinaitic alphabet by way of the Brahmi script, which developed further into the Northwestern group (Sharada, or Śāradā, and its descendants, including Landa and Takri), the Central group (Nagari and its descendants, including Devanagari, Gujarati and Modi) and the Eastern group (evolved from Siddhaṃ, including Bangla, Tibetan, and some Nepali scripts), as well as several prominent writing systems of Southeast Asia and Sinhala in Sri Lanka, in addition to scripts used historically in Central Asia for extinct languages like Saka and Tocharian. Gurmukhi is derived from Sharada in the Northwestern group, of which it is the only major surviving member, with full modern currency. Notable features:
|Possible derivation of Gurmukhi from earlier writing systems.[note 1] The Greek alphabet, also descended from Phoenician, is included for comparison.|
Gurmukhi evolved in cultural and historical circumstances notably different from other regional scripts, for the purpose of recording scriptures of Sikhism, a far less Sanskritized cultural tradition than others of the subcontinent. This independence from the Sanskritic model allowed it the freedom to evolve unique orthographical features. These include:
and other features.
From the 10th century onwards, regional differences started to appear between the Sharada script used in Punjab, the Hill States (partly Himachal Pradesh) and Kashmir. Sharada proper was eventually restricted to very limited ceremonial use in Kashmir, as it grew increasingly unsuitable for writing the Kashmiri language. With the last known inscription dating to 1204 C.E., the early 13th century marks a milestone in the development of Sharada. The regional variety in Punjab continued to evolve from this stage through the 14th century; during this period it starts to appear in forms closely resembling Gurmukhī and other Landa scripts. By the 15th century, Sharada had evolved so considerably that epigraphists denote the script at this point by a special name, Devāśeṣa. Tarlochan Singh Bedi (1999) prefers the name Pritham Gurmukhī, or Proto-Gurmukhī.
The Sikh gurus adopted proto-Gurmukhī to write the Guru Granth Sahib, the religious scriptures of the Sikhs. The Takri alphabet developed through the Devāśeṣa stage of the Sharada script from the 14th-18th centuries and is found mainly in the Hill States such as Chamba, Himachal Pradesh and surrounding areas, where it is called Chambyali, and in Jammu Division, where it is known as Dogri. The local Takri variants got the status of official scripts in some of the Punjab Hill States, and were used for both administrative and literary purposes until the 19th century. After 1948, when Himachal Pradesh was established as an administrative unit, the local Takri variants were replaced by Devanagari.
Meanwhile, the mercantile scripts of Punjab known as the Laṇḍā scripts were normally not used for literary purposes. Landa means alphabet "without tail", implying that the script did not have vowel symbols. In Punjab, there were at least ten different scripts classified as Laṇḍā, Mahajani being the most popular. The Laṇḍā scripts were used for household and trade purposes. Compared to the Laṇḍā, Sikh Gurus favored the use of Proto-Gurmukhī, because of the difficulties involved in pronouncing words without vowel signs.
The usage of Gurmukhī letters in the Guru Granth Sahib meant that the script developed its own orthographical rules. In the following epochs, Gurmukhī became the prime script applied for the literary writings of the Sikhs. The Singh Sabha Movement of the late 1800s, a movement to revitalize Sikh institutions which had declined during colonial rule after the fall of the Sikh Empire, also advocated for the usage of the Gurmukhi script for mass media, with print media publications and Punjabi-language newspapers established in the 1880s. Later in the 20th century, after the struggle of the Punjabi Suba movement, from the founding of modern India in the 1940s to the 1960s, the script was given the authority as the official script of the Punjab, India.
The prevalent view among Punjabi linguists is that as in the early stages the Gurmukhī letters were primarily used by the Guru's followers, Gurmukhs (literally, those who face, or follow, the Guru, as opposed to a Manmukh); the script thus came to be known as Gurmukhī, "the script of those guided by the Guru." Guru Angad is credited in the Sikh tradition with the creation and standardization of Gurmukhi script from earlier Śāradā-descended scripts native to the region. It is now the standard writing script for the Punjabi language in India. The original Sikh scriptures and most of the historic Sikh literature have been written in the Gurmukhi script.
Although the word Gurmukhī has been commonly translated as "from the Mouth of the Guru," the term used for the Punjabi script has somewhat different connotations. The opinion traditional scholars for this is that as the Sikh holy writings, before they were written down, were uttered by the Gurus, they came to be known as Gurmukhī or the "Utterance of the Guru". Consequently, the script that was used for scribing the utterance was also given the same name. The term that would mean "by the Guru's mouth" would be "Gurmū̃hī̃," which sounds considerably different but looks similar in Latin script.
The Gurmukhī alphabet contains thirty-five base letters (akkhar, plural akkharā̃), traditionally arranged in seven rows of five letters each. The first three letters, or mātarā vāhak ("vowel carrier"), are distinct because they form the basis for vowels and are not consonants, or vianjan, like the remaining letters are, and except for the second letter aiṛā are never used on their own; see § Vowel diacritics for further details. The pair of fricatives, or mūl varag ("base class"), share the row, which is followed by the next five sets of consonants, with the consonants in each row arranged by manner of articulation, and the rows by place of articulation, from the back to the front of the mouth. The arrangement, or varaṇmālā, is completed with the antim ṭolī, literally "ending group."
|Voice onset time →||Tenuis||Aspirates||Voiced Stops||Tonal||Nasals|
[ kə̀ ]
[ t͡ʃə̀ ]
[ ʈə̀ ]
[ t̪ə̀ ]
[ pə̀ ]
|Approximants and liquids|
The nasal letters ਙ /ŋəŋːaː/ and ਞ /ɲəɲːaː/ are not used in modern Gurmukhi. They cannot begin a syllable or be placed between two consonants, and the sounds they represent occur most often as allophones of [n] before specific consonant phonemes.
The pronunciation of ਵ can vary allophonically between /ʋ/ and /w/.
The most characteristic feature of the Punjabi language is its tone system. The script has no separate symbol for tones, but they correspond to the tonal consonants that once represented voiced aspirates as well as older *h. To differentiate between consonants, the Punjabi tonal consonants of the fourth column, ਘ kà, ਝ chà, ਢ ṭà, ਧ tà, and ਭ pà, are often transliterated in the way of the voiced aspirate consonants gha, jha, ḍha, dha, and bha respectively, although Punjabi lacks these sounds. Tones in Punjabi can be either rising, neutral, or falling; in the pronunciation of the names of the Gurmukhī letters, they are at the beginning of the word and as such produce the falling tone, hence the grave accent (à) as opposed to the acute. The tone changes to a rising one (á) and precedes the letter when it is in syllabic coda positions, and falling when in stem-medial positions after a short vowel and before a long vowel. The letters now always represent unaspirated consonants, and are unvoiced in initial positions and voiced elsewhere.
In addition to the 35 original letters, there are six supplementary consonants in official usage, referred to as the navīn ṭolī or navīn varag, meaning "new group," created by placing a dot (bindī) at the foot (pair) of the consonant to create pair bindī consonants. These are not present in the Guru Granth Sahib or old texts. These are used most often for loanwords, though not exclusively, and their usage is not always obligatory:
|ਸ਼||sasse pair bindī
[səsːeː pɛ:ɾᵊ bɪnd̪iː]
|ਖ਼||khakkhe pair bindī
[kʰəkʰːeː pɛ:ɾᵊ bɪnd̪iː]
|ਗ਼||gagge pair bindī
[gəgːeː pɛ:ɾᵊ bɪnd̪iː]
|ਜ਼||jajje pair bindī
[d͡ʒəd͡ʒːeː pɛ:ɾᵊ bɪnd̪iː]
|ਫ਼||phapphe pair bindī
[pʰəpʰːeː pɛ:ɾᵊ bɪnd̪iː]
|ਲ਼||lalle pair bindī
[ləlːeː pɛ:ɾᵊ bɪnd̪iː]
The character ਲ਼ /ɭ/, the only character not representing a fricative consonant, was only recently added to the Gurmukhī alphabet. It was not a part of the traditional orthography, as the distinctive phonological difference between 'l' and 'ɭ', while both native sounds, was not reflected in the script. Some sources do not consider it a separate letter. Other characters, like [ਕ਼] /qə/, are also on rare occasion used unofficially, chiefly for transliterating old writings in Persian and Urdu, the knowledge of which is less relevant in modern times.
Three "subscript" letters, called pairī̃ akkhar, or "letters at the foot" are utilised in Gurmukhī: forms of ਹ(h), ਰ(r), and ਵ(v).
The subscript ਰ(r) and ਵ(v) are used to make consonant clusters and behave similarly; subjoined ਹ(h) raises tone.
|Subscript letter||Name, original form||Usage|
|For example, the letter ਪ(p) with a regular ਰ(r) following it would yield the word ਪਰ /pəɾᵊ/ ("but"), but with a subjoined ਰ would appear as ਪ੍ਰ- (/prə-/), resulting in a consonant cluster, as in the word ਪ੍ਰਬੰਧਕ (/pɾəbə́nd̪əkᵊ/, "managerial, administrative"), as opposed to ਪਰਬੰਧਕ /pəɾᵊbə́nd̪əkᵊ/, the Punjabi form of the word used in natural speech in less formal settings (the Punjabi reflex for Sanskrit /pɾə-/ is /pəɾ-/) . This subscript letter is commonly used in Punjabi, not just for Sanskritized words, but also for personal names, some native dialectal words, loanwords from other languages like English, etc.|
|Used occasionally in Gurbani (Sikh religious scriptures) but rare in modern usage, it is largely confined to creating the cluster /sʋə-/ in words borrowed from Sanskrit, the reflex of which in Punjabi is /sʊ-/, e.g. Sanskrit ਸ੍ਵਪ੍ ਨ /s̪ʋɐ́p.n̪ɐ/→Punjabi ਸੁਪਨਾ /'sʊpᵊna:/, "dream," cf. Hindi-Urdu /səpna:/.
For example, ਸ with a subscript ਵ would produce ਸ੍ਵ (sʋə-) as in the Sanskrit word ਸ੍ਵਰਗ (/svəɾəgə/, "heaven"), but followed by a regular ਵ would yield ਸਵ- (səv-) as in the common word ਸਵਰਗ (/səʋəɾəgᵊ/, "heaven"), borrowed earlier from Sanskrit but subsequently changed. The natural Punjabi reflex, ਸੁਰਗ /sʊɾəgᵊ/, is also used in everyday speech.
|The most common subscript, this character does not create consonant clusters, but serves as part of Punjabi's characteristic tone system, indicating a raised tone. It behaves the same way in its use as the regular ਹ(h) does in non-word-initial positions. The regular ਹ(h) is pronounced at the beginning of words but not in other positions, where it instead raises the tone on the preceding vowel. The difference in usage is that the regular ਹ is used after vowels, and the subscript version is used when there is no vowel, and is attached to consonants.
For example, the regular ਹ is used after vowels as in ਮੀਂਹ (transcribed as mĩh (IPA: [míː]), "rain"). The subjoined ਹ(h) acts the same way but instead is used under consonants: ਚ(ch) followed by ੜ(ṛ) yields ਚੜ (chəṛ), but not until the rising tone is introduced via a subscript ਹ(h) does it properly spell the word ਚੜ੍ਹ (chə́ṛ, "climb").
This character's function is analogous to that of the udāt symbol (ੑ U+0A51), which occurs in older texts and indicates a high tone.
In addition to the three subscript letters, there is a half-form of the letter Yayya, /j/ ਯ→੍ਯ, also used exclusively for Sanskrit borrowings, and even then rarely. Use of the subjoined /ʋ/ and conjunct /j/, already rare, is increasingly scarce in modern contexts.
To express vowels (singular, sur), Gurmukhī, as an abugida, makes use of obligatory diacritics called lagā mātarā (plural lage mātare). Gurmukhī is similar to Brahmi scripts in that all consonants are followed by an inherent 'a' sound (unless at the end of a word when the 'a' is usually dropped). This inherent vowel sound can be changed by using dependent vowel signs which attach to a bearing consonant. In some cases, dependent vowel signs cannot be used – at the beginning of a word or syllable for instance – and so an independent vowel character is used instead.
Independent vowels are constructed using three bearer characters: ūṛā (ੳ), aiṛā (ਅ) and īṛī (ੲ). With the exception of aiṛā (which represents the vowel [ə] the bearer consonants are never used without additional vowel signs.
|Vowel||Transcription||IPA||Closest English equivalent|
|ਅ||(none)||ਕ||Muktā||a||[ə]||like a in about|
|ਆ||ਾ||ਕਾ||Kannā||ā||[aː] , [äː]||like a in car|
|ਇ||ਿ||ਕਿ||Sihārī||i||[ɪ]||like i in it|
|ਈ||ੀ||ਕੀ||Bihārī||ī||[iː]||like i in litre|
|ਉ||ੁ||ਕੁ||Onkaṛ||u||[ʊ]||like u in put|
|ਊ||ੂ||ਕੂ||Dulenkaṛ||ū||[uː]||like u in spruce|
|ਏ||ੇ||ਕੇ||Lāvā̃||e||[eː]||like e in Chile|
|ਐ||ੈ||ਕੈ||Dulāvā̃||ai||[ɛː]~[əɪ]||like e in sell|
|ਓ||ੋ||ਕੋ||Hōṛā||o||[oː]||like o in more|
|ਔ||ੌ||ਕੌ||Kanauṛā||au||[ɔː]~[əʊ]||like o in off|
Dotted circles represent the bearer consonant. Vowels are always pronounced after the consonant they are attached to. Thus, sihārī is always written to the left, but pronounced after the character on the right.
The short vowels [ɪ] and [ʊ], when paired with [h] to yield /ɪh/ and /ʊh/, represent [é] and [ó] with high tones respectively, e.g. ਕਿਹੜਾ kihṛā (IPA: [kéːɽaː]) 'which,' ਦੁਹਰਾ duhrā (IPA: [d̪óːɾaː]) 'repeat, reiterate, double.' The sequence of [əh]+[ɪ] or [ʊ] yield [ɛ́] and [ɔ́] respectively, e.g. ਮਹਿੰਗਾ mahingā (IPA: [mɛ́ːŋgaː]) 'expensive,' ਵਹੁਟੀ vahuṭī (IPA: [ʋɔ́ːʈiː]) 'bride.'
Ṭippī ( ੰ ) and bindī ( ਂ ) are used for producing a nasal phoneme depending on the following obstruent or a nasal vowel at the end of a word. All short vowels are nasalized using ṭippī and all long vowels are nasalized using bindī except for dulenkaṛ ( ੂ ), which uses ṭippi instead.
|Diacritic usage||Result||Examples (IPA)|
|Ṭippī on short vowel (/ə/, /ɪ/, /ʊ/), or long vowel /u:/, before a non-nasal consonant||Adds nasal consonant at same place of articulation as following consonant
(/ns/, /n̪t̪/, /ɳɖ/, /mb/, /ŋg/, /nt͡ʃ/ etc.)
|ਹੰਸ /ɦənsᵊ/ "goose"|
ਅੰਤ /ən̪t̪ᵊ/ "end"
ਗੰਢ /gə́ɳɖᵊ/ "knot"
ਅੰਬ /əmbᵊ/ "mango"
ਸਿੰਗ /sɪŋgᵊ/ "horn, antler"
ਕੁੰਜੀ / kʊɲd͡ʒiː/ "key"
ਗੂੰਜ /guːɲd͡ʒᵊ/ "roar"
ਲੂੰਬੜੀ /luːmbᵊɽiː/ "fox"
|Bindī over long vowel (/a:/, /e:/, /i:/, /o:/, /u:/, /ɛ:/, /ɔː/)
before a non-nasal consonant not including /h/
|Adds nasal consonant at same place of articulation as following consonant (/ns/, /n̪t̪/, /ɳɖ/, /mb/, /ŋg/, /nt͡ʃ/ etc.).
May also secondarily nasalize the vowel
|ਕਾਂਸੀ /kaːnsiː/ "bronze"|
ਕੇਂਦਰ /keːn̯d̯əɾᵊ/ "center, core, headquarters"
ਗੁਆਂਢੀ /gʊáːɳɖiː/ "neighbor"
ਚੌਂਕ /t͡ʃɔːŋkᵊ/ "crossroads, plaza"
ਜਾਂਚ /d͡ʒaːɲt͡ʃᵊ/ "trial, examination"
|Ṭippī over consonants followed by long vowel /u:/ (not stand-alone vowel ਊ),
at open syllable at end of word, or ending in /ɦ/
|Vowel nasalization||ਤੂੰ /t̪ũː/ "you"|
ਸਾਨੂੰ /saːnũː/ "to us"
ਮੂੰਹ /mũːɦ/ "mouth"
|Ṭippī on short vowel before nasal consonant (/n̪/ or /m/)||Gemination of nasal consonant
Ṭippī is used to geminate nasal consonants instead of addhak
|ਇੰਨਾ /ɪn̪:a:/ "this much"|
ਕੰਮ /kəm:ᵊ/ "work"
|Bindī over long vowel (/a:/, /e:/, /i:/, /o:/, /u:/, /ɛ:/, /ɔː/),
at open syllable at end of word, or ending in /ɦ/
|Vowel nasalization||ਬਾਂਹ /bã́h/ "arm"|
ਮੈਂ /mɛ̃ː/ "I, me"
ਅਸੀਂ /əsĩː/ "we, us"
ਤੋਂ /t̪õː/ "from"
ਸਿਊਂ /sɪ.ũː/ "sew"
Older texts may follow other conventions.
The use of addhak ( ੱ ) (IPA: ['ə́d̪:əkᵊ]) indicates that the following consonant is geminate, meaning that the subsequent consonant is doubled or reinforced. Consonant length is distinctive in the Punjabi language and the use of this diacritic can change the meaning of a word, for example:
|Without addhak||Transliteration||Meaning||With addhak||Transliteration||Meaning|
|ਪਤਾ||patā||'aware' (of something)||ਪੱਤਾ||pattā||'leaf'|
The halant ( ੍ ) character is not used when writing Punjabi in Gurmukhī. However, it may occasionally be used in Sanskritised text or in dictionaries for extra phonetic information. When it is used, it represents the suppression of the inherent vowel.
The effect of this is shown below:
The visarg symbol (ਃ U+0A03) is used very occasionally in Gurmukhī. It can represent an abbreviation, as the period is used in English, though the period for abbreviation, like commas, exclamation points, and other Western punctuation, is freely used in modern Gurmukhi.
|Hindu–Arabic numeral system|
|Positional systems by base|
|Non-standard positional numeral systems|
|List of numeral systems|
Gurmukhī has its own set of digits, used exactly as in other versions of the Hindu-Arabic numeral system. These are used extensively in older texts. In modern contexts, they are sometimes replaced by standard Western Arabic numerals.
The Unicode block for Gurmukhī is U+0A00–U+0A7F:
Official Unicode Consortium code chart (PDF)
Panjab Digital Library has taken up digitization of all available manuscripts of Gurmukhī Script. The script has been in formal use since the 1500s, and a lot of literature written within this time period is still traceable. Panjab Digital Library has digitized over 5 million pages from different manuscripts and most of them are available online.
|Wikimedia Commons has media related to Gurmukhi.|