Commit graph

677 commits

Author SHA1 Message Date
amontano
db86a19d2b Added code to not fall into a ArrayIndexOutOfBounds. Still not clear why it happens. 2005-03-06 10:00:24 +00:00
amontano
fb5173c2e2 Added comments on what needs to be included within the options of the jvm
when starting tomcat for the debugger to work.
2005-02-28 15:05:47 +00:00
amontano
b5a3de3a2a Added targets ttsc-debug and tt-servlet-debug, for being able to debug the
standalone and the servlet version of the translation tool respectively. The
specific parameters are for debugging in Netbeans IDE but probably could
be generalized. Added comments on how to debug the servlet version
via sockets or via shared memory.
2005-02-28 11:36:35 +00:00
amontano
a86bad152f This classes were for private use for processing some dictionaries. They are out of place here. 2005-02-28 03:22:10 +00:00
amontano
d4dc004c2a Made some changes to the OnlineTranslationTool. Now when text is redisplayed as links, is shows up with the declension marks. 2005-02-28 03:04:01 +00:00
dchandler
ff130e6fb1 Organized imports in Eclipse. 2005-02-28 01:37:54 +00:00
dchandler
f8bd7e83cb Fixed comment. 2005-02-27 22:08:47 +00:00
dchandler
6bc72956b6 New tests based on Paul Hackett's e-mail from today. 2005-02-27 20:59:33 +00:00
dchandler
24403bb395 Jskad has two new menu items that are a shortcut for running a standalone
TMW->Unicode conversion: 'Save as Unicode RTF...' and 'Save as
Unicode UTF-8 Text...'.
2005-02-27 10:27:37 +00:00
amontano
107fcce565 Added debug info to the servlet version. 2005-02-27 05:48:03 +00:00
dchandler
9507ff3694 At David Germano's request, Jskad's UI now uses
'Tibetan Machine Web (non-Unicode)' rather than 'Tibetan'.

I often use the term 'Tibetan' to mean 'Tibetan (either in Unicode,
TM or TMW in RTF, or any other scheme where it appears as Tibetan
instead of Roman transliteration'.  So this is a good change in my opinion,
though 'TMW' or 'Legacy TMW' is shorter.
2005-02-27 00:14:37 +00:00
dchandler
06d93082cb Better help - OO.org 1.9.79 isn't broken. 2005-02-26 23:58:17 +00:00
dchandler
9cfedadab7 Jskad has a new menu item, 'Copy as Unicode', which does a
TMW->Unicode conversion.
2005-02-26 22:57:38 +00:00
dchandler
25f5440218 We had some useless code for enabling and disabling the clipboard; I've
killed it.  Jskad in applets would take QA and some code changes, I bet, so
a TODO is good enough.

Documented better the clipboard code.
2005-02-26 20:46:40 +00:00
amontano
5d99c0ce7f Changed the default vm for target and source to 1.2. Having 1.1 would yield the error
[javac] javac: invalid source release: 1.1
when compiling in JDK1.5.
2005-02-23 23:01:51 +00:00
dchandler
c16f633ecf Two things:
One, TMW->EWTS gives dbas and dngas instead of dabs and dangs
because Chris Fynn's e-mail from today has dbas and dngas.

Second, Down with ACIPRules.  Long live ACIPTraits.  EWTS->Tibetan
conversion is closer still.
2005-02-22 04:36:54 +00:00
amontano
82c6047cc2 Added the jvm.source to be able to specify the "source" parameter for javac, else I get an annoying error in java 1.5. David, should the default for both still be 1.1? 2005-02-21 08:08:22 +00:00
dchandler
03483d88c6 This memory hog of a test means that ACIP->TMW regressions will be
harder to introduce.
2005-02-21 06:08:19 +00:00
dchandler
ca8b565fb0 Update about ANT_OPTS=-Xmx512m 2005-02-21 06:04:59 +00:00
dchandler
3feef9232a I got this by running Ant with the environment variable ANT_OPTS set to
'-Xmx512m'.  Otherwise I ran out of memory.

This file allows for a sizeable ACIP->TMW regression test.
2005-02-21 05:52:16 +00:00
dchandler
4c268c5ea2 Refactored so that there can be an EWTS scanner and an ACIP scanner. 2005-02-21 05:37:01 +00:00
amontano
7854e4fd93 Changed minor optimization things 2005-02-21 05:27:36 +00:00
amontano
686756116f Fixed importation errors for the translation tool. 2005-02-21 05:10:00 +00:00
dchandler
4b4787411b All I *meant* to do with this commit (you tell me if I did more) was to change
from \r\r\n to \r\n.  I added these files on Linux with \r\n and should've
added them as binary or done a dos2unix first.
2005-02-21 05:05:13 +00:00
dchandler
3e0168b384 Renamed ACIPConverter to TConverter. Added a needed parameter (the
only needed parameter in that class's interface AFAIK.
2005-02-21 01:35:23 +00:00
dchandler
37bf9a736d I did this stuff back in August. It's all in support of EWTS->Tibetan
conversion.  The tag 'TODO(DLC)[EWTS->Tibetan]' exists all over the
place.  EWTS->Tibetan isn't here yet; lexing isn't here yet; this is
mainly a refactoring so that the ACIP->Tibetan code can be reused to
do EWTS->Tibetan.

I'm committing this because tests pass (it shouldn't be breaking
anything), because I want a checkpoint, and because the laptop this
sandbox was on isn't my preferred development environment.
2005-02-21 01:16:10 +00:00
dchandler
83f499b7a8 Formatting in TMW documents is not preserved. I've added an identity
tranformation, TMW->TMW, to help me debug this problem.
2005-02-13 00:34:47 +00:00
dchandler
9025fb42d6 TMW->EWTS 998476 partial fix: "aM" is generated now correctly. Before
you got "M".
2005-02-07 04:00:42 +00:00
dchandler
8dcb623382 TMW->EWTS:
Fixed part of bug 998476 and part of an undocumented bug.  Discovered a
new bug, "aM" should be generated but only "M" is.

The undocumented bug was that laMA was generated when lAM should have been.

The part of bug 998476 that was fixed: laM, laH, etc. are now generated.

This does nothing about paN etc.

Some refactoring here; this is not a minimal diff.

Added tests of TMW->EWTS that use ACIP to get the TMW in place
because EWTS->TMW is a faulty keyboard at present.
2005-02-07 03:17:40 +00:00
amontano
a82afad92c Updated methods to help with the dictionary clean up. 2005-02-06 23:19:44 +00:00
dchandler
96d0d0d9d0 My previous commit message failed to mention the following:
I refactored the code trying to fit it onto one screen.  So not all of the
changes are material to the bug fix.

About this commit: TMW->Wylie for {b.s.d} now gives bsad instead of bas.d.
This fixes part of bug 998476, and is done because Andres thinks it'll work
most of the time.  But don't be surprised if an exception comes up in the
future and we have to trivially change the code to catch it.
2005-02-05 22:37:02 +00:00
dchandler
287fc181a0 Fix for part of bug 998476. 2005-02-05 22:16:39 +00:00
dchandler
be632e1874 Cleaned up some code that is relevant to the bug I'm looking into. I need to
instrument it.  Functionally, this change is a no-op.  I just don't want to
confuse refactoring with the actual bug fix.
2005-02-05 19:29:37 +00:00
dchandler
00961b633f Added a test case for bug 998476. No fix, just a test case verifying the bug. 2005-02-05 18:47:17 +00:00
dchandler
b4155c3264 After a1tsal's changes to tibwn.ini, the tests failed. I'm a bit disheartened
that more tests didn't fail.
2005-02-05 16:51:13 +00:00
dchandler
7304c770c9 Just a better comment. 2005-02-05 16:27:34 +00:00
amontano
78373b3094 updated the servlet version. logging still needs work. 2005-01-23 00:57:02 +00:00
a1tsal
28d46bb207 Add corrective comment regarding the bogus Unicode OM characters. 2005-01-19 01:07:52 +00:00
a1tsal
affccad9e5 Use 00A0 rather than 0020 for _, per unicode spec. 2005-01-17 08:58:56 +00:00
a1tsal
91d0e7f4da Use "precomposed" sanskrit consonant combinations consistently throughout. 2005-01-17 08:49:04 +00:00
a1tsal
22cfee69db Had wrong Unicode for n+n+y.
Had wrong Unicode for space -- but only in comment.

Cleaned up punctuation in another comment.
2005-01-16 01:17:44 +00:00
dchandler
0b0af67ed9 Ximalaya is not nearly as nice as Tibetan Machine Uni, so use the latter. 2005-01-04 02:20:59 +00:00
amontano
88ebeaed3b Fixed pocket pc version to be completely swing-less. 2004-12-23 19:31:26 +00:00
amontano
533534eb9f changed the main class for java web-start and pocket pc versions of the translation tool. 2004-12-23 19:29:59 +00:00
dchandler
aa5d86a6e3 The *->Unicode conversions were outputting Unicode that was not
well-formed.  They still do, but they do it less often.

Chris Fynn wrote this a while back:

   By normal Tibetan & Dzongkha spelling, writing, and input rules
   Tibetan script stacks should be entered and written: 1 headline
   consonant (0F40->0F6A), any subjoined consonant(s) (0F90-> 0F9C),
   achung (0F71), shabkyu (0F74), any above headline vowel(s) (0F72
   0F7A 0F7B 0F7C 0F7D and 0F80); any ngaro (0F7E, 0F82 and 0F83).

Now efforts are made to ensure that the converters conform to the
above rules.
2004-12-13 02:32:46 +00:00
dchandler
3115f22484 Now you may run a single JUnit test via a command-line like 'ant
check-one -Dsoletest=org.thdl.tib.text.tshegbar.UnicodeUtilsTest'.
This saves you the time of running the slow tests.
2004-12-13 02:29:05 +00:00
eg3p
c4f4288d2f eliminated some unnecessary comments i had left in there 2004-08-19 19:51:21 +00:00
eg3p
a39d5c2ba3 changed all occurrences of 'Color.BLACK' to 'Color.black', since the former causes a runtime error on Mac OS X (for Java 1.3.1 at least). not sure why this is, but may be related to the bug mentioned at http://www.oxygenxml.com/forum/viewtopic.php?p=239 2004-08-19 14:59:06 +00:00
amontano
afd3a95a21 Updated the dictionary structure to allow grouping of dictionaries, this is the first step to try to clean up a bit the massive repetitions in dictionaries. 2004-08-13 04:47:35 +00:00
dchandler
6bb0646f1c Fixed crashing bug reported by Teresa Lam. Added tests so that I'm fairly
certain that no more crashing bugs exist.  Removed a marker for iffy code
after understanding that code via test cases.
2004-07-05 04:48:27 +00:00