dchandler
96d0d0d9d0
My previous commit message failed to mention the following:
...
I refactored the code trying to fit it onto one screen. So not all of the
changes are material to the bug fix.
About this commit: TMW->Wylie for {b.s.d} now gives bsad instead of bas.d.
This fixes part of bug 998476, and is done because Andres thinks it'll work
most of the time. But don't be surprised if an exception comes up in the
future and we have to trivially change the code to catch it.
2005-02-05 22:37:02 +00:00
dchandler
287fc181a0
Fix for part of bug 998476.
2005-02-05 22:16:39 +00:00
dchandler
be632e1874
Cleaned up some code that is relevant to the bug I'm looking into. I need to
...
instrument it. Functionally, this change is a no-op. I just don't want to
confuse refactoring with the actual bug fix.
2005-02-05 19:29:37 +00:00
dchandler
00961b633f
Added a test case for bug 998476. No fix, just a test case verifying the bug.
2005-02-05 18:47:17 +00:00
dchandler
b4155c3264
After a1tsal's changes to tibwn.ini, the tests failed. I'm a bit disheartened
...
that more tests didn't fail.
2005-02-05 16:51:13 +00:00
dchandler
7304c770c9
Just a better comment.
2005-02-05 16:27:34 +00:00
amontano
78373b3094
updated the servlet version. logging still needs work.
2005-01-23 00:57:02 +00:00
a1tsal
28d46bb207
Add corrective comment regarding the bogus Unicode OM characters.
2005-01-19 01:07:52 +00:00
a1tsal
affccad9e5
Use 00A0 rather than 0020 for _, per unicode spec.
2005-01-17 08:58:56 +00:00
a1tsal
91d0e7f4da
Use "precomposed" sanskrit consonant combinations consistently throughout.
2005-01-17 08:49:04 +00:00
a1tsal
22cfee69db
Had wrong Unicode for n+n+y.
...
Had wrong Unicode for space -- but only in comment.
Cleaned up punctuation in another comment.
2005-01-16 01:17:44 +00:00
dchandler
0b0af67ed9
Ximalaya is not nearly as nice as Tibetan Machine Uni, so use the latter.
2005-01-04 02:20:59 +00:00
amontano
88ebeaed3b
Fixed pocket pc version to be completely swing-less.
2004-12-23 19:31:26 +00:00
dchandler
aa5d86a6e3
The *->Unicode conversions were outputting Unicode that was not
...
well-formed. They still do, but they do it less often.
Chris Fynn wrote this a while back:
By normal Tibetan & Dzongkha spelling, writing, and input rules
Tibetan script stacks should be entered and written: 1 headline
consonant (0F40->0F6A), any subjoined consonant(s) (0F90-> 0F9C),
achung (0F71), shabkyu (0F74), any above headline vowel(s) (0F72
0F7A 0F7B 0F7C 0F7D and 0F80); any ngaro (0F7E, 0F82 and 0F83).
Now efforts are made to ensure that the converters conform to the
above rules.
2004-12-13 02:32:46 +00:00
eg3p
c4f4288d2f
eliminated some unnecessary comments i had left in there
2004-08-19 19:51:21 +00:00
eg3p
a39d5c2ba3
changed all occurrences of 'Color.BLACK' to 'Color.black', since the former causes a runtime error on Mac OS X (for Java 1.3.1 at least). not sure why this is, but may be related to the bug mentioned at http://www.oxygenxml.com/forum/viewtopic.php?p=239
2004-08-19 14:59:06 +00:00
amontano
afd3a95a21
Updated the dictionary structure to allow grouping of dictionaries, this is the first step to try to clean up a bit the massive repetitions in dictionaries.
2004-08-13 04:47:35 +00:00
dchandler
6bb0646f1c
Fixed crashing bug reported by Teresa Lam. Added tests so that I'm fairly
...
certain that no more crashing bugs exist. Removed a marker for iffy code
after understanding that code via test cases.
2004-07-05 04:48:27 +00:00
dchandler
11c3898ad2
Now the Sambhota keyboard crashing bug.
...
Fixed crashing bug reported by Teresa Lam. Added tests so that I'm fairly
certain that no more crashing bugs exist. Removed a marker for iffy code
after understanding that code via test cases.
2004-07-05 04:46:39 +00:00
dchandler
e101cc8294
Now the Sambhota keyboard crashing bug.
...
Fixed crashing bug reported by Teresa Lam. Added tests so that I'm fairly
certain that no more crashing bugs exist. Removed a marker for iffy code
after understanding that code via test cases.
2004-07-05 04:36:35 +00:00
dchandler
6cbea9f894
Fixed crashing bug reported by Teresa Lam. Added tests so that I'm fairly
...
certain that no more crashing bugs exist. Removed a marker for iffy code
after understanding that code via test cases.
2004-07-05 04:10:38 +00:00
amontano
805124ebc0
fixed some stuff in dictionary importation,
2004-07-05 03:56:20 +00:00
amontano
0fd26a6c56
now uses strictduffpane instead of duffpane for "smart-pasting".
2004-07-05 03:54:45 +00:00
amontano
353db0c352
fixed a bit the interface for the handheld version of the translation tool.
2004-07-05 03:52:30 +00:00
amontano
07c045341b
included a guessIfAcip method. Used by the translation tool to paste wylie or
...
ACIP accordingly.
2004-07-05 03:51:11 +00:00
amontano
5b2bb9af11
identical to duffpane but customized to the translation tool. the only change is
...
that the paste method is over-ridden for "smart-pasting". If pasting TMW paste
as is. If pasting TM, converts to TMW. If neither of these fonts are used,
assumes transliteration and converts to TMW.
2004-07-05 03:49:50 +00:00
amontano
707ae5e526
updated version number
2004-07-05 03:45:02 +00:00
amontano
650109200f
fixed the paste. when pasting from text (non-rtf) it used to produce garbage.
...
now it interprets it as wylie. also made some attributes protected to inherit it.
2004-07-05 03:43:42 +00:00
dchandler
bff0e6b2fc
Fixed TMW->Wylie/ACIP when multiple font sizes are in use. I was not
...
incrementing the offset at which I was inserting text properly.
2004-06-25 00:22:10 +00:00
dchandler
14fb449f95
I thought my earlier commit preserved font size info for TMW->ACIP/Wylie
...
conversions. It was only at a very coarse level. The feature is now truly
here.
2004-06-20 02:57:28 +00:00
dchandler
8ccf57dccb
TMW->{Wylie,ACIP} conversions now preserve font size information.
2004-06-15 02:20:28 +00:00
amontano
aee8630986
updated version info and fixed a parsing error.
2004-06-14 03:42:35 +00:00
dchandler
e18a4417dc
Added a FIXME comment.
2004-06-12 02:26:28 +00:00
dchandler
9f78cabb18
TMW->{Wylie,ACIP} conversions now preserve font size information.
2004-06-12 02:09:28 +00:00
dchandler
7acbce3361
Added errors 142 and 143, which are produced when converting yig chung
...
to a Unicode text file, which cannot support font size changes.
2004-06-06 21:59:16 +00:00
dchandler
1db0ec7bb5
Fixed javadoc comments.
2004-06-06 21:39:45 +00:00
dchandler
df262aa148
It is now a compile-time option whether to treat []- and {}-bracketed sequences
...
as text to be passed through (without the brackets in the case of {}) literally,
which is the case by default because Robert Chilton requested it, or the old,
ad-hoc mechanism which could be useful for finding some ugly input.
Made a couple of error messages a little more verbose now that we have
short-message mode.
2004-06-06 21:39:06 +00:00
dchandler
a69f7588b2
I broke warning 507 into two warnings, one high-priority (512) and one
...
low-priority (507).
2004-05-01 20:55:13 +00:00
dchandler
fd7cba4439
Changed menu item name.
2004-05-01 20:52:22 +00:00
dchandler
8a9271a3d8
I broke warning 507 into two warnings, one high-priority (512) and one
...
low-priority (507).
2004-05-01 20:49:53 +00:00
dchandler
31bdd39fec
The TMW for 'da'i was converting to 'aad'i. Andres found this; it is bug
...
945744. I've made it more correct -- 'ad'i is now produced. The wrong stack
is thought to be the root stack still.
2004-05-01 19:11:15 +00:00
dchandler
1a055f3472
I don't think warning level "None" was really doing the trick. Fixed that.
...
You can now customize the severities of all warnings, even 504 and 510.
When warning level is "None", scanning, i.e. lexical analysis, is faster.
2004-04-25 00:37:57 +00:00
dchandler
e2d42f36eb
Robert Chilton's experience inspired me to make the handling of errors and
...
warnings in ACIP->Tibetan conversion much more configurable. You can
now choose from short or long error messages, for one thing. You can change
the severity of almost all warnings. Each error and warning has an error code.
Errors and warnings are better tested.
The converter GUI has a new checkbox for short messages; the converter
CLI has a new mandatory option for short messages.
I also fixed a bug whereby certain errors were not being appended to the
'errors' StringBuffer.
2004-04-24 17:49:16 +00:00
dchandler
cc5d096918
David Chapman's latest fix to tibwn.ini (clearing up an issue that Than or I
...
dropped the ball on) introduced two lines for 8,95. This is a bad thing, so
I've taken out the second line. I've also introduced a check in
TibetanMachineWeb.java such that we'll know that tibwn.ini has no such
error in the future just by running 'ant clean jskad-run' and making sure that
the GUI is indeed visible.
I also updated the test baselines now that F03A and 0F82 are squared away.
2004-04-24 13:23:56 +00:00
a1tsal
9e071ea178
Differentiated 0F82 (~M`) and F03A (nyi.zla editor's mark).
2004-04-21 10:04:11 +00:00
dchandler
72442788c1
This displayed poorly for me, so I untabified it. Whitespace changed only.
2004-04-18 18:56:01 +00:00
dchandler
0ee90a0fb0
Added many ACIP->TMW->ACIP tests. They found no bugs.
2004-04-17 17:28:26 +00:00
dchandler
63438d243b
getACIP was getting EWTS, not ACIP.
2004-04-17 15:49:40 +00:00
dchandler
de3a19761e
Fixes for javadoc tool.
2004-04-17 15:48:50 +00:00
dchandler
adcf9de952
Two new tests.
2004-04-17 15:14:46 +00:00
dchandler
1bfd3772e6
TMW->ACIP is much improved. V and W were confused, # and * were
...
confused; many glyphs that should have yielded errors were not.
I've added a test case that transforms every TMW glyph save the one with
no TM mapping to ACIP. I hand-checked that it was correct.
ACIP->TMW is fixed for # and *. I never noticed it, but each needed an
extra swoosh (U+0F05).
Round-tripping would be good, as would testing real-world use of
TMW->ACIP.
2004-04-14 05:44:51 +00:00
dchandler
244a9d1370
TiblEdit's diacritics panel now works -- dia.dat has been added to the
...
repository and to TiblEdit's jar.
2004-04-14 05:12:00 +00:00
dchandler
56a02ba41d
Fixed the worst TMW->ACIP bug, the one regarding U+0F04 and U+0F05.
...
TMW->EWTS requires no context information, but TMW->ACIP does.
2004-04-10 18:26:57 +00:00
dchandler
9e7ccf2894
TMW->Unicode conversions have changed; now using U+0F6A for the stacks
...
whose EWTS transliteration begins with "R+".
ACIP->* conversions and test baselines were updated to deal with the
"r+..."=>"R+..." change.
2004-04-10 16:58:45 +00:00
dchandler
7eca276a62
TMW->Unicode conversions have changed; now using U+0F6A for the stacks
...
whose EWTS transliteration begins with "R+".
ACIP->* conversions and test baselines were updated to deal with the
"r+..."=>"R+..." change.
2004-04-10 16:03:25 +00:00
dchandler
aff34174ab
The new EWTS rule regarding R, W, and Y requires that these change. It
...
may also require changes to the following, but I'm going to ask if it really
should or not.
// Y+Y~185,3~~6,98~1,109~6,120~1,123~1,125~6,106~6,113~f61,fbb
// Y+r~186,3~~6,99~1,109~6,120~1,123~1,125~6,106~6,113~f61,fb2
// Y+w~187,3~~6,100~1,109~6,120~1,123~1,125~6,106~6,113~f61,fad
// Y+s~188,3~~6,101~1,109~6,120~1,123~1,125~6,106~6,113~f61,fb6
// W+y~69,4~~7,79~1,109~8,121~1,123~1,125~8,107~8,114~f5d,fb1
// W+r~70,4~~7,80~1,109~8,121~1,123~1,125~8,107~8,114~f5d,fb2
// W+n~195,4~~7,81~1,109~8,120~1,123~1,125~8,106~8,113~f5d,fa3
// W+W~194,4~~7,82~1,109~8,120~1,123~1,125~8,106~8,113~f5d,fba
2004-04-08 02:55:59 +00:00
dchandler
76356f4009
ACIP->Tibetan now gives an error when {?} is seen alone (not in {[?]} or {[*FOO?]}, but alone). Bug 860192 is fixed.
2004-03-15 00:49:01 +00:00
dchandler
542fb50bf1
The ~M and ~M` EWTS change had not fully been made. Someone submitted a bug report 911472 that alerted me to this.
2004-03-07 17:02:35 +00:00
dchandler
e0928d8472
New EWTS for 0F82 and 0F83.
2004-03-06 23:00:40 +00:00
amontano
bb8fa6c58f
Now the clear button in the http servlet version actually clears. Also added "synchronized" to some methods to ensure that concurrent threads don't crash.
2004-03-03 00:33:18 +00:00
dchandler
d436a4d462
Removed David Chapman's recently added line for U+0F82 -- a line for U+0F82 already existed, and the new line had incorrect TM and incorrect TMW mappings. I changed the existing line for U+0F82 to use the EWTS {~M`}.
2004-03-02 04:29:41 +00:00
a1tsal
8eaaeaa202
Fix careless error: I had the same TMW character for ~M and ~M`!
2004-02-22 09:14:56 +00:00
a1tsal
b14833b5b9
Change ^M to ~M to conform to spec.
...
Introduce ~M` (for 0F82).
2004-02-20 15:07:49 +00:00
amontano
e5454d3720
Updated the translation tool to conform to the Personal Profile specification of Java.
...
Before it would run in pocket pc's through the more restricted personalJava specification
but Sun's vm for pocket pc's project was terminated. Now it is designed to run under
IBM's VM for pocket pc's called J9 which implements the Personal Profile specification.
Such specification also supports awt, but not swing so still there is no (hope for) support
of Tibetan script in the pocket pc's,
2004-02-07 18:21:17 +00:00
dchandler
274e1736be
Deleted cut-and-paste goof.
2004-01-17 19:45:31 +00:00
dchandler
c69ba26c60
TString now has tracks what Roman transliteration system it is using. Next up is to make ACIPConverter handle EWTS or ACIP TStrings.
2004-01-17 19:28:54 +00:00
dchandler
48b4c5cb07
Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'.
2004-01-17 17:10:12 +00:00
dchandler
6fdb2a26bb
Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'.
2004-01-17 16:52:38 +00:00
dchandler
9dd95c5524
I saw this error when I wasn't expecting it, so now, curious, I print more details.
2004-01-17 16:51:33 +00:00
dchandler
4dd40809a5
A user reported that q` caused a crash with TCC keyboard #1 . Fixed. TCC keyboard #1 does not support q~ though.
2003-12-21 06:27:36 +00:00
dchandler
c1aa81e943
RFE 860190: ACIP->Unicode now gives a warning when it outputs something that can't be represented in TMW.
2003-12-16 07:45:40 +00:00
dchandler
848349fd3a
More tests.
2003-12-15 08:16:06 +00:00
dchandler
e7a9e7968f
ACIP->Unicode now uses two characters for consonants instead of one. This matches the dislike for characters like U+0F77 etc.
...
ACIP->Tibetan was not giving an error for BCWA because it parsed like BCVA. Fixed.
2003-12-15 07:32:14 +00:00
dchandler
e9f7b2dfed
If you want curly brackets around folio markers, you'll have to set
...
the system property
thdl.acip.to.x.output.curly.brackets.around.folio.markers to true.
2003-12-14 08:47:03 +00:00
dchandler
8664571577
Warnings were not being detected correctly. Fixed.
...
ACIP->Unicode uses U+0020, ' ', for whitespace. ACIP->TMW uses the
TMW whitespace for whitespace.
2003-12-14 08:38:10 +00:00
dchandler
01e65176d4
Using less memory and time to figure out if warnings occurred.
2003-12-14 07:41:15 +00:00
dchandler
76c2e969ac
Fixed ACIP->Unicode bug for YYE etc., things with full-formed
...
subjoined consonants and vowels.
Fixed ACIP->TMW for YYA etc., things with full-formed subjoined
consonants.
2003-12-14 07:36:21 +00:00
dchandler
f625c937ee
ACIP {B} was not being treated like {BA}; instead, an error was resulting. All the five prefixes were affected.
2003-12-14 05:54:07 +00:00
dchandler
a0e6db11c0
Very minor cleanup.
2003-12-13 21:59:31 +00:00
dchandler
4c30657afa
Adding tests for an ACIP keyboard that will never work correctly, and
...
probably never even be useful. But they were lying around from a
while back, so here are the tests.
2003-12-13 21:34:33 +00:00
dchandler
02967539b0
Slightly improved Jskad's internal documentation. Links to converters' docs.
2003-12-10 07:04:35 +00:00
dchandler
581643cf59
{DAN,\nLHAG} used to be treated like {DAN, LHAG} but that got broken. Fixed.
...
Added tests for lexer's handling of ACIP spaces etc.
2003-12-10 06:55:16 +00:00
dchandler
8e673bbc2c
{NGA,} becomes {NGA\u0f0c,} now instead of {NGA\u0f0b,}.
...
Note: ACIP->Unicode for {NGA,} was not giving the Unicode that {NGA\u0f0b,} gives before.
2003-12-10 06:50:14 +00:00
dchandler
a466bad939
ACIP->TMW now supports EWTS PUA {\uF021}-style escapes. Our extended ACIP is thus TMW-complete and useful for testing.
2003-12-08 07:51:45 +00:00
dchandler
a39c5c12b0
ACIP->TMW now supports EWTS PUA {\uF021}-style escapes. Our extended ACIP is thus TMW-complete and useful for testing.
2003-12-08 07:15:27 +00:00
dchandler
8f7322a056
Use absolute paths when invoking the external viewer; it doesn't know what our current working directory is.
2003-12-08 06:53:37 +00:00
dchandler
b617f761d5
ACIP->TMW for {^GONG SA } used to fail; fixed.
2003-12-07 20:05:41 +00:00
dchandler
115534e688
ACIP->TMW for {^GONG SA } used to fail because we had \u0F38 in the ToWylie section. Now it's in the <?Input:Numbers?> section because I didn't want to introduce a new section. If WylieWord has trouble due to this misuse of the 'numbers' category, we'll introduce a new category, 'other'.
...
TMW->EWTS improved as a result -- {\u0F38.gonga sa } is produced now where {\u0F38agonga sa } was once produced. Even the better version is imperfect; see bug 855877.
2003-12-07 19:40:59 +00:00
dchandler
597cf408dd
Fixed help message.
2003-12-07 19:10:36 +00:00
dchandler
4adf87c401
Updated comments only.
2003-12-06 20:36:56 +00:00
dchandler
3f18623977
Added comments only.
2003-12-06 20:26:45 +00:00
dchandler
6232ee9170
Added comments referring to a user guide in development now.
2003-12-06 20:26:15 +00:00
dchandler
c43e9a446b
Revamped some ACIP->Tibetan error messages.
2003-12-06 20:19:40 +00:00
dchandler
c9c771d1ee
ACIP {&}, as in {KO&HAm,}, is supported.
2003-11-30 02:18:59 +00:00
dchandler
ac412c994b
Now {Pm} is treated like {PAm}; {Pm:} is like {PAm:}; {P:} is like {PA:}.
2003-11-30 02:06:48 +00:00
dchandler
e7c4cc1874
Updated to be in sync with latest EWTS draft.
2003-11-29 22:59:39 +00:00
dchandler
ffd041e32c
ACIP->TMW and ACIP->Unicode now allow for Unicode escapes like K\u0F84. This means that the lack of support for ACIP's backslash, '\\', is mitigated because you can turn ACIP {K\} into ACIP {K\u0F84}.
...
Support for U+F021-U+F0FF, the PUA that the latest EWTS uses, is not provided.
Also, we've traded some speed for memory -- DuffCode now uses bytes, not ints.
2003-11-29 22:57:12 +00:00
dchandler
dfaae4be93
ACIP->TMW and ACIP->Unicode now allow for Unicode escapes like K\u0F84. This means that the lack of support for ACIP's backslash, '\\', is mitigated because you can turn ACIP {K\} into ACIP {K\u0F84}.
...
Support for U+F021-U+F0FF, the PUA that the latest EWTS uses, is not provided.
2003-11-29 22:56:18 +00:00
dchandler
946d8cbc72
Updated the code I used for testing to generate the file containing all glyphs in TM and all glyphs but one in TMW.
2003-11-29 16:22:26 +00:00
dchandler
16bfeac641
These issues are non-issues; removing these comments.
2003-11-25 00:31:33 +00:00