Commit graph

665 commits

Author SHA1 Message Date
dchandler
8ccf57dccb TMW->{Wylie,ACIP} conversions now preserve font size information. 2004-06-15 02:20:28 +00:00
amontano
aee8630986 updated version info and fixed a parsing error. 2004-06-14 03:42:35 +00:00
amontano
c5ab3ec552 updated info of translation tool for the webstart application 2004-06-14 03:40:14 +00:00
dchandler
e18a4417dc Added a FIXME comment. 2004-06-12 02:26:28 +00:00
dchandler
9f78cabb18 TMW->{Wylie,ACIP} conversions now preserve font size information. 2004-06-12 02:09:28 +00:00
dchandler
7acbce3361 Added errors 142 and 143, which are produced when converting yig chung
to a Unicode text file, which cannot support font size changes.
2004-06-06 21:59:16 +00:00
dchandler
1db0ec7bb5 Fixed javadoc comments. 2004-06-06 21:39:45 +00:00
dchandler
df262aa148 It is now a compile-time option whether to treat []- and {}-bracketed sequences
as text to be passed through (without the brackets in the case of {}) literally,
which is the case by default because Robert Chilton requested it, or the old,
ad-hoc mechanism which could be useful for finding some ugly input.

Made a couple of error messages a little more verbose now that we have
short-message mode.
2004-06-06 21:39:06 +00:00
dchandler
a69f7588b2 I broke warning 507 into two warnings, one high-priority (512) and one
low-priority (507).
2004-05-01 20:55:13 +00:00
dchandler
fd7cba4439 Changed menu item name. 2004-05-01 20:52:22 +00:00
dchandler
8a9271a3d8 I broke warning 507 into two warnings, one high-priority (512) and one
low-priority (507).
2004-05-01 20:49:53 +00:00
dchandler
31bdd39fec The TMW for 'da'i was converting to 'aad'i. Andres found this; it is bug
945744.  I've made it more correct -- 'ad'i is now produced.  The wrong stack
is thought to be the root stack still.
2004-05-01 19:11:15 +00:00
dchandler
1a055f3472 I don't think warning level "None" was really doing the trick. Fixed that.
You can now customize the severities of all warnings, even 504 and 510.

When warning level is "None", scanning, i.e. lexical analysis, is faster.
2004-04-25 00:37:57 +00:00
dchandler
e2d42f36eb Robert Chilton's experience inspired me to make the handling of errors and
warnings in ACIP->Tibetan conversion much more configurable.  You can
now choose from short or long error messages, for one thing.  You can change
the severity of almost all warnings.  Each error and warning has an error code.
Errors and warnings are better tested.

The converter GUI has a new checkbox for short messages; the converter
CLI has a new mandatory option for short messages.

I also fixed a bug whereby certain errors were not being appended to the
'errors' StringBuffer.
2004-04-24 17:49:16 +00:00
dchandler
cc5d096918 David Chapman's latest fix to tibwn.ini (clearing up an issue that Than or I
dropped the ball on) introduced two lines for 8,95.  This is a bad thing, so
I've taken out the second line.  I've also introduced a check in
TibetanMachineWeb.java such that we'll know that tibwn.ini has no such
error in the future just by running 'ant clean jskad-run' and making sure that
the GUI is indeed visible.

I also updated the test baselines now that F03A and 0F82 are squared away.
2004-04-24 13:23:56 +00:00
a1tsal
9e071ea178 Differentiated 0F82 (~M`) and F03A (nyi.zla editor's mark). 2004-04-21 10:04:11 +00:00
dchandler
72442788c1 This displayed poorly for me, so I untabified it. Whitespace changed only. 2004-04-18 18:56:01 +00:00
dchandler
0ee90a0fb0 Added many ACIP->TMW->ACIP tests. They found no bugs. 2004-04-17 17:28:26 +00:00
dchandler
63438d243b getACIP was getting EWTS, not ACIP. 2004-04-17 15:49:40 +00:00
dchandler
de3a19761e Fixes for javadoc tool. 2004-04-17 15:48:50 +00:00
dchandler
adcf9de952 Two new tests. 2004-04-17 15:14:46 +00:00
dchandler
1bfd3772e6 TMW->ACIP is much improved. V and W were confused, # and * were
confused; many glyphs that should have yielded errors were not.

I've added a test case that transforms every TMW glyph save the one with
no TM mapping to ACIP.  I hand-checked that it was correct.

ACIP->TMW is fixed for # and *.  I never noticed it, but each needed an
extra swoosh (U+0F05).

Round-tripping would be good, as would testing real-world use of
TMW->ACIP.
2004-04-14 05:44:51 +00:00
dchandler
244a9d1370 TiblEdit's diacritics panel now works -- dia.dat has been added to the
repository and to TiblEdit's jar.
2004-04-14 05:12:00 +00:00
dchandler
f09a03c9c7 tibbibl is the schema's name; TiblEdit is the application's name. 2004-04-11 14:44:06 +00:00
dchandler
56a02ba41d Fixed the worst TMW->ACIP bug, the one regarding U+0F04 and U+0F05.
TMW->EWTS requires no context information, but TMW->ACIP does.
2004-04-10 18:26:57 +00:00
dchandler
9e7ccf2894 TMW->Unicode conversions have changed; now using U+0F6A for the stacks
whose EWTS transliteration begins with "R+".

ACIP->* conversions and test baselines were updated to deal with the
"r+..."=>"R+..."  change.
2004-04-10 16:58:45 +00:00
dchandler
7eca276a62 TMW->Unicode conversions have changed; now using U+0F6A for the stacks
whose EWTS transliteration begins with "R+".

ACIP->* conversions and test baselines were updated to deal with the
"r+..."=>"R+..."  change.
2004-04-10 16:03:25 +00:00
dchandler
aff34174ab The new EWTS rule regarding R, W, and Y requires that these change. It
may also require changes to the following, but I'm going to ask if it really
should or not.

// Y+Y~185,3~~6,98~1,109~6,120~1,123~1,125~6,106~6,113~f61,fbb
// Y+r~186,3~~6,99~1,109~6,120~1,123~1,125~6,106~6,113~f61,fb2
// Y+w~187,3~~6,100~1,109~6,120~1,123~1,125~6,106~6,113~f61,fad
// Y+s~188,3~~6,101~1,109~6,120~1,123~1,125~6,106~6,113~f61,fb6

// W+y~69,4~~7,79~1,109~8,121~1,123~1,125~8,107~8,114~f5d,fb1
// W+r~70,4~~7,80~1,109~8,121~1,123~1,125~8,107~8,114~f5d,fb2
// W+n~195,4~~7,81~1,109~8,120~1,123~1,125~8,106~8,113~f5d,fa3
// W+W~194,4~~7,82~1,109~8,120~1,123~1,125~8,106~8,113~f5d,fba
2004-04-08 02:55:59 +00:00
dchandler
76356f4009 ACIP->Tibetan now gives an error when {?} is seen alone (not in {[?]} or {[*FOO?]}, but alone). Bug 860192 is fixed. 2004-03-15 00:49:01 +00:00
dchandler
542fb50bf1 The ~M and ~M` EWTS change had not fully been made. Someone submitted a bug report 911472 that alerted me to this. 2004-03-07 17:02:35 +00:00
dchandler
db2ef438d4 Andres requested customization like this. You can now override any property in build.xml using a file 'build.properties'. 2004-03-06 23:17:43 +00:00
dchandler
e0928d8472 New EWTS for 0F82 and 0F83. 2004-03-06 23:00:40 +00:00
amontano
bb8fa6c58f Now the clear button in the http servlet version actually clears. Also added "synchronized" to some methods to ensure that concurrent threads don't crash. 2004-03-03 00:33:18 +00:00
dchandler
d436a4d462 Removed David Chapman's recently added line for U+0F82 -- a line for U+0F82 already existed, and the new line had incorrect TM and incorrect TMW mappings. I changed the existing line for U+0F82 to use the EWTS {~M`}. 2004-03-02 04:29:41 +00:00
a1tsal
8eaaeaa202 Fix careless error: I had the same TMW character for ~M and ~M`! 2004-02-22 09:14:56 +00:00
a1tsal
b14833b5b9 Change ^M to ~M to conform to spec.
Introduce ~M` (for 0F82).
2004-02-20 15:07:49 +00:00
amontano
e5454d3720 Updated the translation tool to conform to the Personal Profile specification of Java.
Before it would run in pocket pc's through the more restricted personalJava specification
but Sun's vm for pocket pc's project was terminated. Now it is designed to run under
IBM's VM for pocket pc's called J9 which implements the Personal Profile specification.
Such specification also supports awt, but not swing so still there is no (hope for) support
of Tibetan script in the pocket pc's,
2004-02-07 18:21:17 +00:00
dchandler
274e1736be Deleted cut-and-paste goof. 2004-01-17 19:45:31 +00:00
dchandler
c69ba26c60 TString now has tracks what Roman transliteration system it is using. Next up is to make ACIPConverter handle EWTS or ACIP TStrings. 2004-01-17 19:28:54 +00:00
dchandler
48b4c5cb07 Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'. 2004-01-17 17:10:12 +00:00
dchandler
6fdb2a26bb Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'. 2004-01-17 16:52:38 +00:00
dchandler
9dd95c5524 I saw this error when I wasn't expecting it, so now, curious, I print more details. 2004-01-17 16:51:33 +00:00
dchandler
f490978227 Now using Jskad/build.properties file, which I keep looking like this:
halt.after.trouble=no
2004-01-17 16:43:47 +00:00
dchandler
4dd40809a5 A user reported that q` caused a crash with TCC keyboard #1. Fixed. TCC keyboard #1 does not support q~ though. 2003-12-21 06:27:36 +00:00
dchandler
c1aa81e943 RFE 860190: ACIP->Unicode now gives a warning when it outputs something that can't be represented in TMW. 2003-12-16 07:45:40 +00:00
dchandler
848349fd3a More tests. 2003-12-15 08:16:06 +00:00
dchandler
e7a9e7968f ACIP->Unicode now uses two characters for consonants instead of one. This matches the dislike for characters like U+0F77 etc.
ACIP->Tibetan was not giving an error for BCWA because it parsed like BCVA.  Fixed.
2003-12-15 07:32:14 +00:00
dchandler
e9f7b2dfed If you want curly brackets around folio markers, you'll have to set
the system property
thdl.acip.to.x.output.curly.brackets.around.folio.markers to true.
2003-12-14 08:47:03 +00:00
dchandler
8664571577 Warnings were not being detected correctly. Fixed.
ACIP->Unicode uses U+0020, ' ', for whitespace.  ACIP->TMW uses the
TMW whitespace for whitespace.
2003-12-14 08:38:10 +00:00
dchandler
01e65176d4 Using less memory and time to figure out if warnings occurred. 2003-12-14 07:41:15 +00:00