[Oriya-group]
Indic Sorting/collation challenging Problem - CLDR 1.3- data for Indic
Hariram Pansari
hrpansari at yahoo.com
Fri Jan 14 04:12:41 IST 2005
As CLDR feedback is being closed on 15.01.2004, I can
not put this issue in proper format in lack of time
and technical know-how, I hereby like to draw kind
attention of the Indic Unicode experts that :
Generally in Indic Unicode the Chandrabindu, Anuswar
and Bisarga is kept at begining and the VIRAM/Halant
is kept at end of all Matraas.
For example in Devanagari
Chandrabindu is 0901
Anuswar is 0902
Bisarga is 0903
and
Viram is at 094D.
This seems wrong and creates lot of problems in
practical uses, which are totaly unscientific in
computing.
Generally in all text books 0901 0902 0903 comes after
अ...औ. As these are Vowel modifiers so these can
be used on all matraas...
कं कां किं कीं कुं
कूं कृं कें कैं कों
कौं
कँ काँ किँ कीँ कुँ
कूँ ....
कः काः किः कीः कुः
कूः ....
क्रँ क्राँ क्रिँ
क्रीँ क्रीँ .... etc..
But as per traditional dictionary sorting order the
words with (अँ अं अः) comes first and
words without these comes afterwards... this seems
wrong with general commonsence also.
How to make a direct collation table in such a manner
that --
The characters/words without vowel modifiers should
come first and characters/words with vowel modifiers
should come afterwards, even if when appearing at the
end of a word???
For example
कहा sould come first
कहाँ sould come after
वर sould come first
वरं sould come after
As generally computer's default direct collation
adopts and shows result like above automatically. This
problem is already accepted as unsolved in ISCII_1991
BIS document also.
-----------
(2)
The Viram is placed at end (094D)
Whereas a character with viram has lower value than a
character without viram/halant
So how to make a direct collation in such a way
that
वाक् should come first and
वाक should come afterwards????
रम् should come first and
रम should come afterwards????
As the halant is vowel deletion/ommission sign
practically a charcter with it undoubtedly has a lower
value.
How to solve these two practical problems of Indic
Sorting order.
Would the Experts kindly pay attention on this
challenging issue and put these questions in CLDR for
Indic collation issues?
With regards.
Hariram Pansari
__________________________________
Do you Yahoo!?
Yahoo! Mail - Find what you need with new enhanced search.
http://info.mail.yahoo.com/mail_250
More information about the Oriya-group
mailing list