Commit graph

547 commits

Author SHA1 Message Date
dchandler
2a47ff68a4 Added test cases based on real-world issues using Leigh Brasington's UDP which uses this ACIP->Unicode converter. 2006-01-30 06:00:39 +00:00
amontano
c3f85e7106 Made the about text a separate text file instead of having it embedded within the classes. Made other minor changes concerning getting of calls to deprecated methods. 2005-12-10 21:42:34 +00:00
amontano
86b79dba26 Fixed "{" "}" not being validated correctly as proper wylie. 2005-11-03 15:17:27 +00:00
dchandler
8d0b1b3afb Iris.lib.virginia is dead; long live orion.lib.virginia.edu and a new
path prefix.
2005-09-19 23:58:52 +00:00
dchandler
26f28760b8 Fixed a test I broke. 2005-09-19 21:06:41 +00:00
dchandler
b28e7e7c5c Iris is gone in favor of orion. Grep for 'iris' and you'll find just
a couple of references that I didn't grok.
2005-09-19 19:43:10 +00:00
amontano
efec654ee8 Added the following features: now starts with focus on the input pane, shortcut ctrl+T to translate, and after translating focus is automatically set on the first element. 2005-08-27 01:51:40 +00:00
dchandler
58819cf43d Now works with JDK < 1.4. 2005-08-22 22:42:40 +00:00
dchandler
852401c771 For Andres, who's now using Eclipse. I feel your pain! 2005-08-22 22:39:32 +00:00
dchandler
7656a4064b Friendlier error message. 2005-08-08 06:06:26 +00:00
dchandler
5788416629 A reverter that converts Unicode to computer-friendly (but not, yet,
human-friendly) EWTS is here in alpha mode.  It probably doesn't deal
well with non-Tibetan.
2005-08-01 05:54:20 +00:00
dchandler
00afd75362 The scaffolding for a Unicode->EWTS reverter. No guts yet. 2005-07-17 03:32:57 +00:00
eg3p
ebc11a3425 edited the javadocs to mention specific XSLT vendors.
also removed remark worrying about whether convertEwtsTo
should be concerned about what kind of String it returns;
no, it need not: it is just a java.lang.String which will be
treated as such and converted to an XSL String in the appropriate
encoding by the XSLT processor.
2005-07-15 01:29:31 +00:00
dchandler
551f4f094e More EWTS->Unicode tests. 2005-07-14 04:53:11 +00:00
dchandler
05214b8f14 EWTS->Uni was crashing for tabs. 2005-07-14 04:16:36 +00:00
dchandler
dc18165992 Added a class for performing EWTS->Unicode conversions during XSLT
transformations.  I haven't actually used it with Xalan XSLT yet, but
it ought to work if TibetanHTML did (which it must have at one point).

I do have a unit test, but an end-to-end test with Xalan is what we
need.
2005-07-13 07:25:18 +00:00
dchandler
6260c0889d Mentions how to salvage this class. 2005-07-13 06:23:02 +00:00
dchandler
bef1d1b625 Added boilerplate and a class comment and traded in tabs for four
spaces.  A unittest and an example would be great, but this is a
start.
2005-07-13 06:19:27 +00:00
dchandler
8ccd68789a Since I had Eclipse fired up, I had it automatically organized the
imports.  It made two errors, but the compiler found them.  I've cvs
tagged the tree before doing this, just in case.
2005-07-11 03:10:32 +00:00
dchandler
6d419fe641 Numerous EWTS->Unicode and especially EWTS->TMW improvements.
Fixed ordering of Unicode wowels.  [ku+A] gives the correct Unicode
now, e.g.

EWTS->TMW looks better for some wacky wowels like, I'm guessing here, [ku+A].

EWTS->TMW should now give errors any time the full input isn't used.
Previously, wacky wowels like [kai+-i] would lead to some droppage.

EWTS->TMW->Unicode testing is now in effect.  This found a ton of
EWTS->TMW bugs, most or all of which are fixed now.

TMW->Unicode is improved/fixed for {
\u5350,\u534D,\u0F88+k,\u0F88+kh,U }.  (Why U?  "\u0f75" is
discouraged in favor of "\u0f71\u0f74".)

NOTE: TMW_RTF_TO_THDL_WYLIETest is still disabled for the nightly
builds' sake, but I ran it in my sandbox and it passed.
2005-07-11 02:51:06 +00:00
dchandler
36122778b4 EWTS->TMW works now for [#] and for [//]. 2005-07-10 05:36:35 +00:00
dchandler
33fc836e81 EWTS->Unicode for // now produces \u0f0e as it should. 2005-07-10 05:01:03 +00:00
dchandler
64625fd445 Removed the 'Import Wylie' menu item; 'Launch Converter...' is the way to go.
Fixed the converter GUI to work perfectly (AFAIK) for EWTS->Tibetan.
2005-07-07 03:15:59 +00:00
dchandler
cddbbae9a1 Bulletproofed EWTS->Tibetan against nasty pseudo-EWTS like [RAM].
Renamed recoverACIP methods.
2005-07-07 02:54:36 +00:00
dchandler
982350371d EWTS->TMW fixes. Wowel handling still isn't perfect but I'm lazy.
Jskad now uses the new EWTS->TMW routine, not the old, and thus the
"(Buggy)" label is [unfairly, perhaps] dropped.
2005-07-07 01:30:03 +00:00
dchandler
0f99c402df My last commit left the tests broken. Doh.
Also, I'm enabling EWTS->Tibetan converters in the GUI so that I can
ask folks to try them out.
2005-07-06 22:55:19 +00:00
dchandler
b74af71efc Better, but still flawed, handling of EWTS [^] (i.e., U+0F39). 2005-07-06 22:26:55 +00:00
dchandler
f5d87ab226 Fixed EWTS->Tibetan [g.yogs] bug. 2005-07-06 18:37:22 +00:00
dchandler
63ff0fb0c9 Fixed important EWTS->Tibetan conversion bugs. [g.yogs] (and maybe
[hUM^]) are not yet converting correctly.

I have not yet committed the end-to-end test that I'm manually doing
to find these problems.  It will be another document for
TMW_RTF_TO_THDL_WYLIETest.java.  Note that thdl.debug=true is
essential to access the GUI for the EWTS->* converters.
2005-07-06 07:46:21 +00:00
dchandler
0b3a636f63 Tremendously better EWTS->Unicode and EWTS->TMW conversion, though still not tested end-to-end and without perfect unit tests. See EWTSTest.RUN_FAILING_TESTS, for example, to find imperfection. 2005-07-06 02:19:38 +00:00
dchandler
affb9e4b5e Still trying to get the tests to complete on thdl.org's servers.
This will surely work.
2005-06-25 21:51:13 +00:00
dchandler
1062ce9b6a Trying to make the tests run on thdl.org's servers. Yesterday's change didn't do it; maybe this will but it's just a guess as I can't log on to their servers without time and effort. Reverting yesterday's change since it didn't matter. 2005-06-23 18:46:39 +00:00
dchandler
b9f4ed21ab Disabling a test that ran for *way* too long on thdl.org servers 2005-06-22 19:01:17 +00:00
dchandler
2678fc134a Added UI for EWTS->Tibetan conversions. GUI is disabled except in
debug mode for now.

I tested against a really simple-but-real document, found a bug with '*', tried
to implement TMW vowel code but I don't trust it yet.  Differentiated EWTS
code from ACIP where needed.

Several bugs in ewts->tibetan have been exposed; see the TODO
comments.
2005-06-20 09:30:35 +00:00
dchandler
7198f23361 I really hesitate to commit this because I'm not sure what it brings to the
table exactly and I fear that it makes the ACIP->Tibetan converter code
a lot uglier.  The TODO(DLC)[EWTS->Tibetan] comments littered throughout
are part of the ugliness; they point to the ugliness.  If each were addressed,
cleanliness could perhaps be achieved.

I've largely forgotten exactly what this change does, but it attempts to
improve EWTS->Tibetan conversion.  The lexer is probably really, really
primitive.  I concentrate here on converting a single tsheg bar rather than
a whole document.

Eclipse was used during part of my journey here and some imports were
reorganized merely because I could.  :)

(Eclipse was needed when the usual ant build failed to run a new test
EWTSTest.  And I wanted its debugger.)

Next steps: end-to-end EWTS tests should bring many problems to light.  Fix
those.  Triage all the TODO comments.

I don't know that I'll ever really trust the implementation.  The tests are
valuable, though.  A clean implementation of EWTS->Tibetan in Jython
might hold enough interest for me; I'd like to learn Python.
2005-06-20 06:18:00 +00:00
amontano
f64bae8ea6 Fixed loading default dictionary at the beginning of the installed stand-alone version
but making it easily changeable if the user selects another dictionary.
2005-05-13 04:25:59 +00:00
amontano
e096aee67b Gave higher priority to the dictionary name specified in the properties file than to the dictionary name passed as an argument. 2005-04-26 05:25:19 +00:00
amontano
4f821b7965 fixed double quote problem that showed up on some dictionaries (that where pre-processed in excel) 2005-04-25 09:29:39 +00:00
amontano
83f5c19a13 Now it catches exception and displays it to System.err when vowel errors (oo, ie, ee, etc.) come up in converting from wylie to tmw. 2005-04-25 09:28:34 +00:00
amontano
b1cc500abf Fixed stability bug 2005-03-08 09:50:52 +00:00
amontano
57895d945f Fixed whitespaces that got screwed up. 2005-03-08 07:59:05 +00:00
amontano
92e9a15d84 Updated the version numbers to reflect the new release with the conversion errors fixed. 2005-03-07 05:48:18 +00:00
amontano
063e04b7c4 Fixed scope of class and methods. (from protected to public or private). 2005-03-06 10:02:43 +00:00
amontano
db86a19d2b Added code to not fall into a ArrayIndexOutOfBounds. Still not clear why it happens. 2005-03-06 10:00:24 +00:00
amontano
a86bad152f This classes were for private use for processing some dictionaries. They are out of place here. 2005-02-28 03:22:10 +00:00
amontano
d4dc004c2a Made some changes to the OnlineTranslationTool. Now when text is redisplayed as links, is shows up with the declension marks. 2005-02-28 03:04:01 +00:00
dchandler
ff130e6fb1 Organized imports in Eclipse. 2005-02-28 01:37:54 +00:00
dchandler
f8bd7e83cb Fixed comment. 2005-02-27 22:08:47 +00:00
dchandler
6bc72956b6 New tests based on Paul Hackett's e-mail from today. 2005-02-27 20:59:33 +00:00
dchandler
24403bb395 Jskad has two new menu items that are a shortcut for running a standalone
TMW->Unicode conversion: 'Save as Unicode RTF...' and 'Save as
Unicode UTF-8 Text...'.
2005-02-27 10:27:37 +00:00