Nāgarī-128 [ _naagaree-128 ]
(First Page; 1/3)
A Scheme for Transliteration from ĐevaNāgarī to Latin
I. Conventions / Terminology
The words ĐevaNāgarī and Nāgarī are synonyms.
Where a capital letter is expected or required, I have used a '_' (Low Line) followed by corresponding lower-case letter. 'Low Line' character is also called as 'underscore'. More on this later.
Using this scheme _naagaree-128, we may write, Hinđī as '_hindee' and ĐevaNāgarī as '_devanaagaree'.
I have generally written first occurrence of most Indic-words as per following pattern.
|Nīrajā-Latin-word (_naagaree-128-word; शब्द).|
For Example: Marāthī (_maraaThee; मराठी).
Apart from this scheme '_naagaree-128' we have been developing another transliteration scheme 'Nīrajā-Latin'. The aims of these two transliteration schemes for writing ĐevaNāgarī words in Latin are different.
|*||The Nāgarī alphabet is called Varṇamālā (_varNNamaalaa; वर्णमाला).|
|*||Akṣhar (AkShar; अक्षर) means letter/s.|
|*||Generally, Māŧrā (maatraa; मात्रा) means vowel mark/s.|
You may refresh your knowledge of Varṇamālā by clicking here.
The word 'Nāgarī' looks better than '_naagaree'; and is also easier to read. But think of typing; we can type '_naagaree' much faster than 'Nāgarī'.
There are 'file-systems' and cheap mobile phones that do not support accented Latin/Roman characters like ā, ī, ṇ etc. So if you want to use correctly transliterated Nāgarī words in SMSes, file-names, cheap mobile phones etc. then only continue reading.
If you want to use attractively transliterated Nāgarī words in regular English (western) text and signboards etc. check 'Nīrajā-Latin'.
_naagaree-128 is a lossless scheme; i.e. the Nāgarī text transliterated to Roman-Latin text using this scheme can be exactly transliterated back to Nāgarī.
Most encodings (including ASCII, Unicode, ISCII) support at least basic Latin alphabet and punctuation marks.
Till recently (2013!), Nāgarī texts were mainly encoded in various hacked-encodings. Even now invitation cards, sales brochures etc. are composed in hack-encodings because of easy availability of non-standard stylish fonts. But now useful Nāgarī text is generally encoded in Unicode, while basic Latin text is encoded either in ASCII, ANSI or Unicode. Using such schemes we can write Indian languages (like Hinđī, Marāthī, ...) in basic ASCII.
This scheme can be used for -
|1.||writing words of Indian languages in small (handheld) devices and file systems that are incapable of handling Unicode encoded Indic text, like old/cheap mobile phones and Internet!|
|2.||writing Indian languages in English (Roman-Latin) alphabet for those who are not comfortable with Indian scripts (like Nāgarī).|
|3.||to convey the proper pronunciations of Indic words written in Latin-Roman alphabet.|
|4.||entering Nāgarī text using English alphabets. But the use of such schemes for entering ĐevaNāgarī text is rather limited as efficient keyboard overlays/drivers (like SuNāgarī) for popular computer systems are available now-a-days.|
At present this scheme is only for regular Nāgarī alphabet used to write Hinđī, Marāthī etc. This scheme may be extended in future to include those characters which are sometimes used in Sanskṛiŧ only.
I designed this scheme in 2006-7, so that the ĐevaNāgarī words transliterated to Latin using this scheme -
|#||can be used as filenames.|
|#||resemble commonly transliterated words (by public) as closely as possible.|
Importance was also given to 'losslessness', 'simplicity' and 'clarity'.
_naagaree-128 is partially similar to an old scheme called 'Harvard-Kyoto'! Other schemes for writing Indian languages using Latin alphabet are IAST (International Alphabet of Sanskṛiŧ Transliteration), ITRANS, ...
We have been using _naagaree-128 satisfactorily, so you may try it. No warranties though. In future, the use of _naagaree-128 may be limited to fast typing!
Please remember that this scheme is case-sensitive.
In simple words, no capitalization is allowed.
Why this scheme is case-sensitive?
As there are fewer alphabets in English than Nāgarī and because we don't want to use extended Latin characters (only basic ASCII characters are supported on some file systems), so we have to use uppercase (capital) letters and lowercase letters for different Nāgarī Letters! For example 'd' is used for 'द' while 'D' is used for 'ड'.
Nāgarī has no concept of capitalization, so there is no problem in having this scheme as case-sensitive one.
This section is not a core part of _naagaree-128, you may skip it by clicking following link.
If you want to emphasize a word, prefix it with a 'Low Line' character ('_', which is also called 'Underscore'!).
For Example: _varNNamaalaa
This convention (of starting a word with '_') will also prevent automatic capitalization, which lot of word processors do without even asking the user!
Generally, the word Rām ('राम') gets written as 'Rama' in English text. According to _naagaree-128, we may write either '_raam' or '_raama'.
For ALL CAPS, you can use leading as well as trailing underscore! So _raamaayaNNa_ will mean RĀMĀYAṆA.