Commit graph

16 commits

Author SHA1 Message Date
amontano
0c891ec96c The org.thdl.tib.scanner.Manipulate class was originally meant as a grab-bag of methods processing strings representing tibetan wylie. It doesn't make sense to leave there the wrap-up methods for converting from and to the various transcription schemes. Moved them to BasicTibetanTranscriptionConverter and updated all classes that point to them. 2006-04-24 19:19:04 +00:00
amontano
67bddb7a7e Updated the translation tool to accept Tibetan Unicode. For the application version, this means that the smart paste is unicode aware and will do the appropriate conversion. In the servlet version tibetan unicode can now be inputted in the form and tibetan machine uni is used to display the results.
The Manipulate class now includes: acipToWylie, wylieToAcip, and unicodeToWylie. They provide a simple interfase to David Chandler's converters that are used by the translation tool.
2006-04-24 06:09:17 +00:00
amontano
4f821b7965 fixed double quote problem that showed up on some dictionaries (that where pre-processed in excel) 2005-04-25 09:29:39 +00:00
amontano
686756116f Fixed importation errors for the translation tool. 2005-02-21 05:10:00 +00:00
amontano
afd3a95a21 Updated the dictionary structure to allow grouping of dictionaries, this is the first step to try to clean up a bit the massive repetitions in dictionaries. 2004-08-13 04:47:35 +00:00
amontano
07c045341b included a guessIfAcip method. Used by the translation tool to paste wylie or
ACIP accordingly.
2004-07-05 03:51:11 +00:00
dchandler
1bfd3772e6 TMW->ACIP is much improved. V and W were confused, # and * were
confused; many glyphs that should have yielded errors were not.

I've added a test case that transforms every TMW glyph save the one with
no TM mapping to ACIP.  I hand-checked that it was correct.

ACIP->TMW is fixed for # and *.  I never noticed it, but each needed an
extra swoosh (U+0F05).

Round-tripping would be good, as would testing real-world use of
TMW->ACIP.
2004-04-14 05:44:51 +00:00
amontano
e89c49651c Now translation tool accepts synonyms separated by ';' in the entry field. 2003-09-14 05:56:20 +00:00
dchandler
115d0e0e6c Fixed ACIP->TMW vowels like 'I etc.
Fixed ACIP->Unicode/TMW for BDE, which should be B-DE, not B+DE, because the former is legal Tibetan.

The ACIP->EWTS subroutine has improved.

TMW->Wylie and TMW->ACIP are improved in error cases.

TMW->ACIP has friendly embedded error messages now.
2003-09-12 05:06:37 +00:00
dchandler
316f59107b A preliminary TMW->ACIP converter is here. There are known bugs, mostly with rare punctuation. 2003-09-02 06:39:33 +00:00
amontano
35a9869aac 1. Fixed parsing error
2. Added support extreme uses of 'a' like le'u'i'o
3. Now parses correctly syllables that have the particles "ang" and "am" added to them. Second works only in "roman script" mode. The converter from tibetan script to roman script does not convert correctly this combinations. ("pa'ang" is converted wrongly into "pa'ng" and "pa'am" is converted wrongly into "pa'ma").
2003-03-23 20:27:54 +00:00
amontano
947ac5537a Updated on comments and made it a bit more consistent. 2002-11-03 17:42:11 +00:00
amontano
37b29c8d33 Added comments to all class headers. Comments to individual methods will
be added as needed.
2002-11-03 08:56:11 +00:00
dchandler
a6cc4a7ff3 Removed/commented out/tagged some unused local variables.
Added a JUnit test for the new Trie that fails at present since the Trie is
case-insensitive.  Running JUnit tests is not something our build system
knows about at present, but Eclipse 2.0 makes it very easy.

Fixed a few compiler errors due to imports I'd forgotten.
2002-11-02 16:01:40 +00:00
amontano
2f21cd29f9 updating acip's import stuff from BinaryFileGenerator 2002-10-12 01:53:17 +00:00
amontano
e0e125c76f includes static methods used by BinaryFileGenerator 2002-10-09 00:11:15 +00:00