Fwd: Re: [indic] Indic Unicode Character level Text-Editor/Data-processor
hrpansari at yahoo.com
Sun Aug 27 23:44:52 IST 2006
Steve Smith <steve_smith at sall.com> wrote: From: "Steve Smith" <steve_smith at sall.com>
Subject: Re: [indic] Indic Unicode Character level Text-Editor/Data-processor
Date: Fri, 25 Aug 2006 04:47:04 -0400
ï»¿ As an example, searching for all occurrences of the sequence à¤¦à¤¯ will not find the word à¤¦à¤¯à¤¾ (or any sequence like à¤¦à¤¯à¥ or à¤¦à¤¯à¥), at least in Word 2003.
> This seriously hampers Advanced Indic Computing uses.
It certainly does. Searching for a sequence of characters should find that sequence if it occurs in the text, regardless of what precedes or follows. Perhaps the behaviour should be user-selectable (ie, an option like "match whole syllable", analagous to "match whole word" which is already an option).
Matching the character string should not be a problem. I think the difficulty is in rendering the result, ie, highlighting only part of the cluster. And what if your search includes a consonant which occurs as part of a cluster that gets rendered as a conjunct (eg, search for à¤ when the text contains à¤à¥à¤·) How do you highlight the à¤? It's a non-trivial problem. But personally, I would be happy to have the conjunct character (or entire syllable, as the case may be) highlighted. At least it would let me know that the search string was found.
I echo Hariram's query for information on applications where this does work.
----- Original Message -----
From: Hariram Pansari
To: Indic at unicode.org
Cc: nlpia2005 at yahoogroups.com
Sent: Friday, August 25, 2006 8:23 AM
Subject: [indic] Indic Unicode Character level Text-Editor/Data-processor
- in MS Windows XP / MS word XP/2003
- in Linux Kbabel
- in Yudit
- in all other Unicode softwares/text editors/Databases,
I request the list to inform:
If any Unicode charcter level Indic text/data editor/processor software is available?
Get your email and more, right on the new Yahoo.com -------------- next part --------------
An HTML attachment was scrubbed...
More information about the Oriya-group