Commit Graph

101 Commits

Author SHA1 Message Date
amontano 3cd1f09087 Updated URL's from orion...edu to thdl.org 2007-05-02 02:32:43 +00:00
amontano e5e6a64052 Added about info to the packaging of the translation tool. 2006-10-14 07:28:38 +00:00
dchandler ced3767335 New CVS infrastructure; new server name. 2006-05-24 19:13:49 +00:00
dchandler b28e7e7c5c Iris is gone in favor of orion. Grep for 'iris' and you'll find just
a couple of references that I didn't grok.
2005-09-19 19:43:10 +00:00
dchandler 5788416629 A reverter that converts Unicode to computer-friendly (but not, yet,
human-friendly) EWTS is here in alpha mode.  It probably doesn't deal
well with non-Tibetan.
2005-08-01 05:54:20 +00:00
dchandler 00afd75362 The scaffolding for a Unicode->EWTS reverter. No guts yet. 2005-07-17 03:32:57 +00:00
dchandler dc18165992 Added a class for performing EWTS->Unicode conversions during XSLT
transformations.  I haven't actually used it with Xalan XSLT yet, but
it ought to work if TibetanHTML did (which it must have at one point).

I do have a unit test, but an end-to-end test with Xalan is what we
need.
2005-07-13 07:25:18 +00:00
dchandler 0b3a636f63 Tremendously better EWTS->Unicode and EWTS->TMW conversion, though still not tested end-to-end and without perfect unit tests. See EWTSTest.RUN_FAILING_TESTS, for example, to find imperfection. 2005-07-06 02:19:38 +00:00
dchandler 7198f23361 I really hesitate to commit this because I'm not sure what it brings to the
table exactly and I fear that it makes the ACIP->Tibetan converter code
a lot uglier.  The TODO(DLC)[EWTS->Tibetan] comments littered throughout
are part of the ugliness; they point to the ugliness.  If each were addressed,
cleanliness could perhaps be achieved.

I've largely forgotten exactly what this change does, but it attempts to
improve EWTS->Tibetan conversion.  The lexer is probably really, really
primitive.  I concentrate here on converting a single tsheg bar rather than
a whole document.

Eclipse was used during part of my journey here and some imports were
reorganized merely because I could.  :)

(Eclipse was needed when the usual ant build failed to run a new test
EWTSTest.  And I wanted its debugger.)

Next steps: end-to-end EWTS tests should bring many problems to light.  Fix
those.  Triage all the TODO comments.

I don't know that I'll ever really trust the implementation.  The tests are
valuable, though.  A clean implementation of EWTS->Tibetan in Jython
might hold enough interest for me; I'd like to learn Python.
2005-06-20 06:18:00 +00:00
amontano b02f93dd13 Modified "create...jnlp-file" targets of jskad and the translation tool adding the lines in the jnlp file to prompt for the creation of short-cuts. 2005-03-08 07:02:56 +00:00
amontano 350ea96d2f Updated minimum version numbers for java in the translation tool targets.
Servlet version's minimum is now 1.3, which is the version number in UVa's iris server.
Handheld version's minimum is now 1.4. Seems high, but it has been designed for some
time to run on WebSphere Everyplace Micro Environment Personal Profile 1.0 and 1.4 is
the maximum it takes. I didn't find any compatible freeware Java Runtime Environment for windows ce.
For everything else, the minimum is still set at 1.2.
2005-03-07 05:57:46 +00:00
amontano fb5173c2e2 Added comments on what needs to be included within the options of the jvm
when starting tomcat for the debugger to work.
2005-02-28 15:05:47 +00:00
amontano b5a3de3a2a Added targets ttsc-debug and tt-servlet-debug, for being able to debug the
standalone and the servlet version of the translation tool respectively. The
specific parameters are for debugging in Netbeans IDE but probably could
be generalized. Added comments on how to debug the servlet version
via sockets or via shared memory.
2005-02-28 11:36:35 +00:00
amontano 5d99c0ce7f Changed the default vm for target and source to 1.2. Having 1.1 would yield the error
[javac] javac: invalid source release: 1.1
when compiling in JDK1.5.
2005-02-23 23:01:51 +00:00
amontano 82c6047cc2 Added the jvm.source to be able to specify the "source" parameter for javac, else I get an annoying error in java 1.5. David, should the default for both still be 1.1? 2005-02-21 08:08:22 +00:00
dchandler ca8b565fb0 Update about ANT_OPTS=-Xmx512m 2005-02-21 06:04:59 +00:00
dchandler 3e0168b384 Renamed ACIPConverter to TConverter. Added a needed parameter (the
only needed parameter in that class's interface AFAIK.
2005-02-21 01:35:23 +00:00
dchandler 37bf9a736d I did this stuff back in August. It's all in support of EWTS->Tibetan
conversion.  The tag 'TODO(DLC)[EWTS->Tibetan]' exists all over the
place.  EWTS->Tibetan isn't here yet; lexing isn't here yet; this is
mainly a refactoring so that the ACIP->Tibetan code can be reused to
do EWTS->Tibetan.

I'm committing this because tests pass (it shouldn't be breaking
anything), because I want a checkpoint, and because the laptop this
sandbox was on isn't my preferred development environment.
2005-02-21 01:16:10 +00:00
dchandler 8dcb623382 TMW->EWTS:
Fixed part of bug 998476 and part of an undocumented bug.  Discovered a
new bug, "aM" should be generated but only "M" is.

The undocumented bug was that laMA was generated when lAM should have been.

The part of bug 998476 that was fixed: laM, laH, etc. are now generated.

This does nothing about paN etc.

Some refactoring here; this is not a minimal diff.

Added tests of TMW->EWTS that use ACIP to get the TMW in place
because EWTS->TMW is a faulty keyboard at present.
2005-02-07 03:17:40 +00:00
dchandler 00961b633f Added a test case for bug 998476. No fix, just a test case verifying the bug. 2005-02-05 18:47:17 +00:00
amontano 533534eb9f changed the main class for java web-start and pocket pc versions of the translation tool. 2004-12-23 19:29:59 +00:00
dchandler 3115f22484 Now you may run a single JUnit test via a command-line like 'ant
check-one -Dsoletest=org.thdl.tib.text.tshegbar.UnicodeUtilsTest'.
This saves you the time of running the slow tests.
2004-12-13 02:29:05 +00:00
amontano c5ab3ec552 updated info of translation tool for the webstart application 2004-06-14 03:40:14 +00:00
dchandler e2d42f36eb Robert Chilton's experience inspired me to make the handling of errors and
warnings in ACIP->Tibetan conversion much more configurable.  You can
now choose from short or long error messages, for one thing.  You can change
the severity of almost all warnings.  Each error and warning has an error code.
Errors and warnings are better tested.

The converter GUI has a new checkbox for short messages; the converter
CLI has a new mandatory option for short messages.

I also fixed a bug whereby certain errors were not being appended to the
'errors' StringBuffer.
2004-04-24 17:49:16 +00:00
dchandler 244a9d1370 TiblEdit's diacritics panel now works -- dia.dat has been added to the
repository and to TiblEdit's jar.
2004-04-14 05:12:00 +00:00
dchandler f09a03c9c7 tibbibl is the schema's name; TiblEdit is the application's name. 2004-04-11 14:44:06 +00:00
dchandler db2ef438d4 Andres requested customization like this. You can now override any property in build.xml using a file 'build.properties'. 2004-03-06 23:17:43 +00:00
dchandler 6fdb2a26bb Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'. 2004-01-17 16:52:38 +00:00
dchandler 8411ebe932 You couldn't build from the src-dist. I think you can now; it's more
inclusive.
2003-12-13 21:46:09 +00:00
dchandler 31b3020d07 Added a test case that runs almost all the tsheg bars from all
non-reference, publicly available ACIP files (hundreds of megabytes of
them) through the converter.  The frequencies of these tsheg bars in
in the file, too.
2003-10-26 06:02:48 +00:00
dchandler 1afb3a0fdd ACIP->Unicode, without going through TMW, is now possible, so long as
\, the Sanskrit virama, is not used.  Of the 1370-odd ACIP texts I've
got here, about 57% make it through the gauntlet (fewer if you demand
a vowel or disambiguator on every stack of a non-Tibetan tsheg bar).
2003-08-18 02:38:54 +00:00
dchandler e21d3774a9 Added an unfinished ACIP->Tibetan converter. Once it works properly
for ACIP, it'll easily be made to work as a perfect EWTS
Wylie->Tibetan converter.  It has an extensive suite of tests for the
existing functionality.
2003-08-10 19:30:07 +00:00
dchandler 45ded5f4de The translation tool (two of four builds of it) now includes keyboard
documentation to go along with its new multi-keyboard support.
2003-08-09 16:20:18 +00:00
dchandler 1c29566aee I'm now using the Unix diff built in to Apache Jakarta Commons JRCS
(which I found on suigeneris.org, not apache.org) in order to bulletproof the
Tibetan Converter tests.  They used to fail due to nondeterminism in the
Java RTF writer; they should no longer fail.

I've also changed it so that the Tibetan Converter tests run in headless
mode, which means that they'll run on the nightly builds server.
2003-07-14 12:26:26 +00:00
dchandler 086f4bb6ec Renamed the Info menu Help.
Now using CalHTMLPane to surf the offline and the online help.
2003-07-05 22:25:21 +00:00
dchandler a463b686b3 Jskad now ships with both TibetanMachine and TibetanMachineWeb fonts
by default, not just TMW.  Thus users need not install these fonts on their
systems.
2003-07-05 18:00:29 +00:00
dchandler 689c1910aa To deal with java.swing.text.rtf bugs regarding hexadecimal escape
sequences, I've created RTFFixerInputStream.  It turns illegal hexadecimal
escapes into Unicode escapes.
2003-06-29 02:30:08 +00:00
dchandler f547734043 Added Than's converter GUI code; adapted it to work with Jskad's
converters.

TMW->Unicode now uses Ximalaya by default.
2003-06-24 03:02:29 +00:00
dchandler 0f77b32add It's not in use, but there's code to make Jskad a TM carrier just as
it is a TMW carrier.
2003-06-22 22:11:40 +00:00
dchandler 149eecc4fb Added a thdl-nightly-build target for nightly builds on
iris.lib.virginia.edu.  This will need to be edited to use the real
path on iris.
2003-06-19 01:58:59 +00:00
dchandler 1ae7948fed Nightly builds are now done using pserver CVS updates. The
'dc-nightly-build' target no longer does a cvs update, because that's
not always going to work.
2003-06-19 01:11:33 +00:00
dchandler ced830a7d3 Renamed TMW_RTF_TO_THDL_WYLIE TibetanConverter. 2003-06-15 19:19:23 +00:00
dchandler 56d5ac7210 TM->TMW and TMW->TM conversion in RTF is now supported. I've
noticed that formatting is mostly OK but sometimes gets bungled slightly.
I tried everything I could think of, and now I'm passing the buck to Java's
RTF support.

TMW_RTF_TO_THDL_WYLIE (now misnamed) support TMW->TM
conversion (but not TM->TMW).  There is an automated test case for a
TMW->TM conversion.

I have full confidence in this conversion.  Even the smallest glitch in the core
functionality (not formatting) would surprise me.
2003-05-31 23:18:17 +00:00
dchandler ec7fec695f Added some automated JUnit tests for TMW_RTF_TO_THDL_WYLIE. 2003-05-18 17:17:52 +00:00
dchandler e2a9720d9b I've added a command-line converter,
org.thdl.tib.input.TMW_RTF_TO_THDL_WYLIE.  It converts RTF files
consisting of TMW characters to the corresponding THDL Extended Wylie.

It supports --find-some-non-tmw mode, which allows you to ensure that no
unusual characters will spoil the conversion.  The converter has built-in
intelligence that allows it to handle Tahoma '{', '}', and '\\' characters
properly.

The converter works on mixed Roman/TMW also, but --find-some-non-tmw
and --find-all-non-tmw modes are not as useful.

Invoke org.thdl.tib.input.TMW_RTF_TO_THDL_WYLIE, which resides in
Jskad's jar, with no command-line options to see usage information.
2003-05-18 14:14:47 +00:00
dchandler 5e55d05cfe To make it easier to add a keyboard for third parties, all .rtf files
in Jskad/source now go into the Jskad jar.
2003-05-15 00:57:34 +00:00
dchandler 78dc46a979 Jskad keyboards are now configured via keyboards.ini, a file that has
comments that explain its function.  It's quite simple.  This is in
response to Jeff C. H. Wu's request.
2003-05-14 03:25:36 +00:00
dchandler 59175ccfd6 Added a few tests for the ACIP keyboard, which I've improved a bit.
Noted some failures.  "Fixed" the code to do what I want it to do for
the (no sanskrit stacking, tibetan stacking) case [which is exercised
by this keyboard only].
2003-04-14 23:55:00 +00:00
dchandler efa8fc1f25 DuffPane now has the start of a unit test suite. Invoke it via 'ant
clean check'.  Right now there are tests to ensure that typing certain
sequences of keys in the Extended Wylie keyboard gives the expected
Extended Wylie back when "Tools/Convert Tibetan to Wylie" is invoked.

The syntactically illegal d.wa now converts to Tibetan and then back
to d.wa (not dwa, as it did); likewise with the illegal g.wa.  wa
doesn't take any prefixes, but I prefer clean end-to-end
behavior. (jeskd doesn't go end-to-end, though.)

Note that you cannot successfully run the DuffPane tests on a Linux
box unless your DISPLAY variable is set correctly.  Thus, my nightly
builds will fail with an Error (as opposed to a Failure).
2003-04-14 05:22:27 +00:00
dchandler e95e41aef0 At Andres's request, the handheld version of the translation tool no
longer bothers with the TMW fonts, which don't work anyway on such
devices.
2003-04-02 04:06:30 +00:00