Jskad/source/org/thdl/tib/text/tshegbar
dchandler 7198f23361
I really hesitate to commit this because I'm not sure what it brings to the
table exactly and I fear that it makes the ACIP->Tibetan converter code
a lot uglier.  The TODO(DLC)[EWTS->Tibetan] comments littered throughout
are part of the ugliness; each one marks an ugly spot.  If each were
addressed, the code could perhaps be made clean.

I've largely forgotten exactly what this change does, but it attempts to
improve EWTS->Tibetan conversion.  The lexer is probably really, really
primitive.  I concentrate here on converting a single tsheg bar rather than
a whole document.

Eclipse was used during part of my journey here and some imports were
reorganized merely because I could.  :)

(Eclipse was needed when the usual ant build failed to run a new test
EWTSTest.  And I wanted its debugger.)

Next steps: end-to-end EWTS tests should bring many problems to light.  Fix
those.  Triage all the TODO comments.

I don't know that I'll ever really trust the implementation.  The tests are
valuable, though.  A clean implementation of EWTS->Tibetan in Jython
might hold enough interest for me; I'd like to learn Python.
2005-06-20 06:18:00 +00:00
LegalTshegBar.java Added comments referring to a user guide in development now. 2003-12-06 20:26:15 +00:00
LegalTshegBarTest.java The ACIP "BNA" was converting to B-NA instead of B+NA, even though NA cannot take a BA prefix. This was because BNA was interpreted as root-suffix. In ACIP, BN is surely B+N unless N takes a B prefix, so root-suffix is out of the question. 2003-10-26 00:24:28 +00:00
package.html Now uses terminology from the Unicode standard. No more talk of 2002-12-15 03:35:24 +00:00
TibetanSyntaxException.java ant private-javadocs runs without warnings; cleaned up some 2003-04-13 01:46:20 +00:00
TransitionInstruction.java ant private-javadocs runs without warnings; cleaned up some 2003-04-13 01:46:20 +00:00
TshegBar.java Extended Wylie is referred to as THDL Extended Wylie or THDL Wylie 2002-12-15 06:57:32 +00:00
UnicodeCodepointToThdlWylie.java The ~M and ~M` EWTS change had not fully been made. Someone submitted a bug report 911472 that alerted me to this. 2004-03-07 17:02:35 +00:00
UnicodeCodepointToThdlWylieTest.java Added a test for org.thdl.tib.text.tshegbar.UnicodeCodepointToThdlWylie. 2003-03-22 04:55:17 +00:00
UnicodeConstants.java Fixed a bunch of bugs; supports le'u'i'o, sgom pa'am, etc. 2003-03-31 00:33:50 +00:00
UnicodeGraphemeCluster.java ant private-javadocs runs without warnings; cleaned up some 2003-04-13 01:46:20 +00:00
UnicodeGraphemeClusterTest.java Moved ad-hoc test.java test cases to UnicodeGraphemeClusterTest.java, 2003-03-22 03:55:39 +00:00
UnicodeReader.java ant private-javadocs runs without warnings; cleaned up some 2003-04-13 01:46:20 +00:00
UnicodeReadingStateMachineConstants.java The ~M and ~M` EWTS change had not fully been made. Someone submitted a bug report 911472 that alerted me to this. 2004-03-07 17:02:35 +00:00
UnicodeReadyThunk.java Now uses terminology from the Unicode standard. No more talk of 2002-12-15 03:35:24 +00:00
UnicodeUtils.java I really hesitate to commit this because I'm not sure what it brings to the 2005-06-20 06:18:00 +00:00
UnicodeUtilsTest.java The *->Unicode conversions were outputting Unicode that was not 2004-12-13 02:32:46 +00:00
ValidatingUnicodeReader.java I really hesitate to commit this because I'm not sure what it brings to the 2005-06-20 06:18:00 +00:00
ValidatingUnicodeReaderTest.java I really hesitate to commit this because I'm not sure what it brings to the 2005-06-20 06:18:00 +00:00