ஒரு பார்வை

June 13, 2006

Tamil Unicode ?! -7

Filed under: Tamil Unicode — CAPitalZ @ 12:04 pm

– Latin characters are in the order of English language [English characters are latin]
Devanagiri characters are in the order of Hindi language [even though other languages uses Devanagiri script, the order is in Hindi]
– Latin script has all character encoding [ accents of English, French, German.. etc]
– Chinese, Japan, Korea has all character encoding.

All south indian scripts are arranged accordingly for the order of Hindi language [in Devanagiri script]. People who knows Hindi can see it yourself. Every South Indian letters were exactly placed at the same place as Hindi language [again even though its Devanagiri, its NOT in the order of any other languages that uses Devanagiri]. It is this reason, Tamil characters were not in the order and some gaps, so as ALL south indian scripts.

Even if its justified that its better to have “ka” and “puLLi” than “k” mey, the binary order of Tamil characters were wrong, because its arranched to suite Hindi.

Yes, Unicode Consortium said collation is not their problem. That doesn’t mean we should encode our characters in the WRONG ORDER.

Why English is in correct order if collation is not an advantage if binary encoded correctly?
Why Hindi is in correct order if collation is not an advantage if binary encoded correctly?

May be many other languages have the same advantages as well, but I don’t know all.

Hindi:
http://www.unicode.org/charts/PDF/U0900.pdf

Tamil:
http://www.unicode.org/charts/PDF/U0B80.pdf

In Tamil there are 3 gaps between ஊ and எ. Its because Hindi has 3 characters in there, while Tamil doesn’t have those characters.

In Tamil there are 3 gaps between க and ங. Its because Hindi has 3 characters in there, while Tamil doesn’t have those characters.

The reason ஜ comes in between ச and ஞ, is because Hindi has the sound. Note there is also 2 gaps, is because Hindi has different characters in there.

Look at puLLi at the location 0902 [Hindi] and 0B82 [Tamil]. [100100000011 and 101110000011] positionally same location. Look at Aytham at location 0903 [Devanagiri sign visarga] and 0B83 [Tamil sign Visarga]

You can figure out the rest yourself.

Please don’t say, this is because you want to change scripts intermingly. At the cost of Tamil [and other Indian scripts] Hindi became efficient.

Is it really hard to accept this? Or, there are no programming people in this group.

பாகம் – 08 >>

<< Part – 06

_____
CAPital

Leave a Comment »

No comments yet.

RSS feed for comments on this post. TrackBack URI

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Blog at WordPress.com.

%d bloggers like this: