Commit graph

552 commits

Author SHA1 Message Date
micha3lp
e69aa769ee A temporary class to handle conversion from Wylie to THDL pronounciation 2006-08-12 23:15:22 +00:00
micha3lp
a15161fc07 *** empty log message *** 2006-08-12 22:31:51 +00:00
micha3lp
592d1d6870 1. Dictionary support in Jskad
2. Local fonts from the .jar file are used if TWM fonts not available on the system
3. Bug fix - SWING not rendering font correctly
2006-08-12 22:23:01 +00:00
amontano
0c891ec96c The org.thdl.tib.scanner.Manipulate class was originally meant as a grab-bag of methods processing strings representing tibetan wylie. It doesn't make sense to leave there the wrap-up methods for converting from and to the various transcription schemes. Moved them to BasicTibetanTranscriptionConverter and updated all classes that point to them. 2006-04-24 19:19:04 +00:00
amontano
67bddb7a7e Updated the translation tool to accept Tibetan Unicode. For the application version, this means that the smart paste is unicode aware and will do the appropriate conversion. In the servlet version tibetan unicode can now be inputted in the form and tibetan machine uni is used to display the results.
The Manipulate class now includes: acipToWylie, wylieToAcip, and unicodeToWylie. They provide a simple interfase to David Chandler's converters that are used by the translation tool.
2006-04-24 06:09:17 +00:00
dchandler
2a47ff68a4 Added test cases based on real-world issues using Leigh Brasington's UDP which uses this ACIP->Unicode converter. 2006-01-30 06:00:39 +00:00
amontano
c3f85e7106 Made the about text a separate text file instead of having it embedded within the classes. Made other minor changes concerning getting of calls to deprecated methods. 2005-12-10 21:42:34 +00:00
amontano
86b79dba26 Fixed "{" "}" not being validated correctly as proper wylie. 2005-11-03 15:17:27 +00:00
dchandler
8d0b1b3afb Iris.lib.virginia is dead; long live orion.lib.virginia.edu and a new
path prefix.
2005-09-19 23:58:52 +00:00
dchandler
26f28760b8 Fixed a test I broke. 2005-09-19 21:06:41 +00:00
dchandler
b28e7e7c5c Iris is gone in favor of orion. Grep for 'iris' and you'll find just
a couple of references that I didn't grok.
2005-09-19 19:43:10 +00:00
amontano
efec654ee8 Added the following features: now starts with focus on the input pane, shortcut ctrl+T to translate, and after translating focus is automatically set on the first element. 2005-08-27 01:51:40 +00:00
dchandler
58819cf43d Now works with JDK < 1.4. 2005-08-22 22:42:40 +00:00
dchandler
852401c771 For Andres, who's now using Eclipse. I feel your pain! 2005-08-22 22:39:32 +00:00
dchandler
7656a4064b Friendlier error message. 2005-08-08 06:06:26 +00:00
dchandler
5788416629 A reverter that converts Unicode to computer-friendly (but not, yet,
human-friendly) EWTS is here in alpha mode.  It probably doesn't deal
well with non-Tibetan.
2005-08-01 05:54:20 +00:00
dchandler
00afd75362 The scaffolding for a Unicode->EWTS reverter. No guts yet. 2005-07-17 03:32:57 +00:00
eg3p
ebc11a3425 edited the javadocs to mention specific XSLT vendors.
also removed remark worrying about whether convertEwtsTo
should be concerned about what kind of String it returns;
no, it need not: it is just a java.lang.String which will be
treated as such and converted to an XSL String in the appropriate
encoding by the XSLT processor.
2005-07-15 01:29:31 +00:00
dchandler
551f4f094e More EWTS->Unicode tests. 2005-07-14 04:53:11 +00:00
dchandler
05214b8f14 EWTS->Uni was crashing for tabs. 2005-07-14 04:16:36 +00:00
dchandler
dc18165992 Added a class for performing EWTS->Unicode conversions during XSLT
transformations.  I haven't actually used it with Xalan XSLT yet, but
it ought to work if TibetanHTML did (which it must have at one point).

I do have a unit test, but an end-to-end test with Xalan is what we
need.
2005-07-13 07:25:18 +00:00
dchandler
6260c0889d Mentions how to salvage this class. 2005-07-13 06:23:02 +00:00
dchandler
bef1d1b625 Added boilerplate and a class comment and traded in tabs for four
spaces.  A unittest and an example would be great, but this is a
start.
2005-07-13 06:19:27 +00:00
dchandler
8ccd68789a Since I had Eclipse fired up, I had it automatically organized the
imports.  It made two errors, but the compiler found them.  I've cvs
tagged the tree before doing this, just in case.
2005-07-11 03:10:32 +00:00
dchandler
6d419fe641 Numerous EWTS->Unicode and especially EWTS->TMW improvements.
Fixed ordering of Unicode wowels.  [ku+A] gives the correct Unicode
now, e.g.

EWTS->TMW looks better for some wacky wowels like, I'm guessing here, [ku+A].

EWTS->TMW should now give errors any time the full input isn't used.
Previously, wacky wowels like [kai+-i] would lead to some droppage.

EWTS->TMW->Unicode testing is now in effect.  This found a ton of
EWTS->TMW bugs, most or all of which are fixed now.

TMW->Unicode is improved/fixed for {
\u5350,\u534D,\u0F88+k,\u0F88+kh,U }.  (Why U?  "\u0f75" is
discouraged in favor of "\u0f71\u0f74".)

NOTE: TMW_RTF_TO_THDL_WYLIETest is still disabled for the nightly
builds' sake, but I ran it in my sandbox and it passed.
2005-07-11 02:51:06 +00:00
dchandler
36122778b4 EWTS->TMW works now for [#] and for [//]. 2005-07-10 05:36:35 +00:00
dchandler
33fc836e81 EWTS->Unicode for // now produces \u0f0e as it should. 2005-07-10 05:01:03 +00:00
dchandler
64625fd445 Removed the 'Import Wylie' menu item; 'Launch Converter...' is the way to go.
Fixed the converter GUI to work perfectly (AFAIK) for EWTS->Tibetan.
2005-07-07 03:15:59 +00:00
dchandler
cddbbae9a1 Bulletproofed EWTS->Tibetan against nasty pseudo-EWTS like [RAM].
Renamed recoverACIP methods.
2005-07-07 02:54:36 +00:00
dchandler
982350371d EWTS->TMW fixes. Wowel handling still isn't perfect but I'm lazy.
Jskad now uses the new EWTS->TMW routine, not the old, and thus the
"(Buggy)" label is [unfairly, perhaps] dropped.
2005-07-07 01:30:03 +00:00
dchandler
0f99c402df My last commit left the tests broken. Doh.
Also, I'm enabling EWTS->Tibetan converters in the GUI so that I can
ask folks to try them out.
2005-07-06 22:55:19 +00:00
dchandler
b74af71efc Better, but still flawed, handling of EWTS [^] (i.e., U+0F39). 2005-07-06 22:26:55 +00:00
dchandler
f5d87ab226 Fixed EWTS->Tibetan [g.yogs] bug. 2005-07-06 18:37:22 +00:00
dchandler
63ff0fb0c9 Fixed important EWTS->Tibetan conversion bugs. [g.yogs] (and maybe
[hUM^]) are not yet converting correctly.

I have not yet committed the end-to-end test that I'm manually doing
to find these problems.  It will be another document for
TMW_RTF_TO_THDL_WYLIETest.java.  Note that thdl.debug=true is
essential to access the GUI for the EWTS->* converters.
2005-07-06 07:46:21 +00:00
dchandler
0b3a636f63 Tremendously better EWTS->Unicode and EWTS->TMW conversion, though still not tested end-to-end and without perfect unit tests. See EWTSTest.RUN_FAILING_TESTS, for example, to find imperfection. 2005-07-06 02:19:38 +00:00
dchandler
affb9e4b5e Still trying to get the tests to complete on thdl.org's servers.
This will surely work.
2005-06-25 21:51:13 +00:00
dchandler
1062ce9b6a Trying to make the tests run on thdl.org's servers. Yesterday's change didn't do it; maybe this will but it's just a guess as I can't log on to their servers without time and effort. Reverting yesterday's change since it didn't matter. 2005-06-23 18:46:39 +00:00
dchandler
b9f4ed21ab Disabling a test that ran for *way* too long on thdl.org servers 2005-06-22 19:01:17 +00:00
dchandler
2678fc134a Added UI for EWTS->Tibetan conversions. GUI is disabled except in
debug mode for now.

I tested against a really simple-but-real document, found a bug with '*', tried
to implement TMW vowel code but I don't trust it yet.  Differentiated EWTS
code from ACIP where needed.

Several bugs in ewts->tibetan have been exposed; see the TODO
comments.
2005-06-20 09:30:35 +00:00
dchandler
7198f23361 I really hesitate to commit this because I'm not sure what it brings to the
table exactly and I fear that it makes the ACIP->Tibetan converter code
a lot uglier.  The TODO(DLC)[EWTS->Tibetan] comments littered throughout
are part of the ugliness; they point to the ugliness.  If each were addressed,
cleanliness could perhaps be achieved.

I've largely forgotten exactly what this change does, but it attempts to
improve EWTS->Tibetan conversion.  The lexer is probably really, really
primitive.  I concentrate here on converting a single tsheg bar rather than
a whole document.

Eclipse was used during part of my journey here and some imports were
reorganized merely because I could.  :)

(Eclipse was needed when the usual ant build failed to run a new test
EWTSTest.  And I wanted its debugger.)

Next steps: end-to-end EWTS tests should bring many problems to light.  Fix
those.  Triage all the TODO comments.

I don't know that I'll ever really trust the implementation.  The tests are
valuable, though.  A clean implementation of EWTS->Tibetan in Jython
might hold enough interest for me; I'd like to learn Python.
2005-06-20 06:18:00 +00:00
amontano
f64bae8ea6 Fixed loading default dictionary at the beginning of the installed stand-alone version
but making it easily changeable if the user selects another dictionary.
2005-05-13 04:25:59 +00:00
amontano
e096aee67b Gave higher priority to the dictionary name specified in the properties file than to the dictionary name passed as an argument. 2005-04-26 05:25:19 +00:00
amontano
4f821b7965 fixed double quote problem that showed up on some dictionaries (that where pre-processed in excel) 2005-04-25 09:29:39 +00:00
amontano
83f5c19a13 Now it catches exception and displays it to System.err when vowel errors (oo, ie, ee, etc.) come up in converting from wylie to tmw. 2005-04-25 09:28:34 +00:00
amontano
b1cc500abf Fixed stability bug 2005-03-08 09:50:52 +00:00
amontano
57895d945f Fixed whitespaces that got screwed up. 2005-03-08 07:59:05 +00:00
amontano
92e9a15d84 Updated the version numbers to reflect the new release with the conversion errors fixed. 2005-03-07 05:48:18 +00:00
amontano
063e04b7c4 Fixed scope of class and methods. (from protected to public or private). 2005-03-06 10:02:43 +00:00
amontano
db86a19d2b Added code to not fall into a ArrayIndexOutOfBounds. Still not clear why it happens. 2005-03-06 10:00:24 +00:00
amontano
a86bad152f This classes were for private use for processing some dictionaries. They are out of place here. 2005-02-28 03:22:10 +00:00