dchandler
f28bee4c71
The appendage 'um is here too.
2003-10-18 05:10:49 +00:00
dchandler
8c99adeb63
TMW->EWTS, TMW->ACIP, and ACIP->Unicode/TMW now support more appendages. Personal correspondence with Robert Chilton led me to support, besides 'am, 'ang, 'o, 'i, and 'u, the following:
...
'e (used in foreign transliteration)
'ongs
'is
'os
'ur
'us
'ung
2003-10-18 03:04:47 +00:00
dchandler
5e18feb47d
ACIP now stacks greedily. TTTTTA is T+T+T+T+TA, even though that stack doesn't exist in TM or TMW. Robert Chilton, in personal correspondence, agreed that this is the way to do things.
...
ACIP handles the appendages 'AM, 'ANG, 'US, 'UR, 'I, 'O, and 'U correctly.
2003-10-16 04:15:10 +00:00
dchandler
5f4fbfab7c
Bulletproofing and debugging support.
2003-10-16 04:13:14 +00:00
dchandler
129ebccd67
In TCC #1 keyboard, h>cj now works. I may have fixed this in a terrible way, breaking other things even. Hard to say because I don't really understand the code I changed. But DuffPaneTest passes.
...
If we ever clean up the keyboards, the changes made here to tcc_keyboard.ini should probably be undone.
2003-10-12 18:16:17 +00:00
dchandler
d7fdacfcdc
Open menu is now Open..., Save as is now Save as...
2003-10-12 18:12:19 +00:00
dchandler
8dbfff17e1
All .rtf and .Rtf and .RTF files are selectable now.
2003-10-12 18:11:50 +00:00
dchandler
35209ce7fd
I'm going to have to debug this, and the tab stops make the source unreadable. I don't like messing with whitespace, but it seems like I'll be the main maintainer for a while, and the people after me can use cvs diff -wb. So I'm untabifying.
2003-10-12 16:44:28 +00:00
dchandler
749b8d6727
Added toString for debugging.
2003-10-04 16:33:47 +00:00
dchandler
b983af8031
r-t, not rt. This was why converting 'brtul' from TMW to Wylie didn't work.
2003-10-04 16:33:23 +00:00
dchandler
6a11eddb1e
Warning level "None" wasn't working.
2003-10-04 16:12:48 +00:00
dchandler
b10098cc61
"Most" warnings now excludes "the last stack has no vowel", making it much more useful.
2003-10-04 15:10:18 +00:00
dchandler
ee50291ed4
Andres found that "THAG PA" caused a NullPointerException. That's fixed.
...
Renamed ACIPString to TString -- we'll use this for EWTS and ACIP both.
TMW->ACIP for TMW9.61 should work now.
2003-10-04 01:22:59 +00:00
amontano
c8927b827c
Fixed bugs in the scanner. Added reference to yogacara bhumi in the about window.
2003-09-23 19:05:23 +00:00
amontano
e89c49651c
Now translation tool accepts synonyms separated by ';' in the entry field.
2003-09-14 05:56:20 +00:00
dchandler
115d0e0e6c
Fixed ACIP->TMW vowels like 'I etc.
...
Fixed ACIP->Unicode/TMW for BDE, which should be B-DE, not B+DE, because the former is legal Tibetan.
The ACIP->EWTS subroutine has improved.
TMW->Wylie and TMW->ACIP are improved in error cases.
TMW->ACIP has friendly embedded error messages now.
2003-09-12 05:06:37 +00:00
dchandler
16817d0b8e
Fixed Javadocs.
2003-09-10 01:19:05 +00:00
amontano
cc853be387
Fixed a bug with regards to the word order in the servlet version.
2003-09-09 16:02:03 +00:00
amontano
1467f9cd3f
Fixed display of servlet version and added option to include links to
...
other versions. See http://iris.lib.virginia.edu/tibetan/servlet/org.thdl.tib.scanner.OnLineScannerFilter?thdlBanner=on
2003-09-08 21:32:40 +00:00
amontano
73d01111ca
Fixed the "clicking on the translate button makes the thdl menu go away"
...
error. on the servlet version of the translation tool.
2003-09-08 16:39:18 +00:00
amontano
07fbbcaf45
Solved some sorting errors with the servlet version.
...
Also if the service parameter thdlBanner=anything is sent, the THDL's
java script menu is displayed (if it is running on the thdl server). There is
still a bug. Menu goes away when pressing "translate" button. See:
http://iris.lib.virginia.edu/tibetan/servlet/org.thdl.tib.scanner.OnLineScannerFilter?thdlBanner=on
2003-09-08 08:12:56 +00:00
dchandler
e42d76b3b8
Nicer default Latin font for ACIP->* conversions.
...
Performance improvement in non-color-coding mode.
2003-09-07 22:08:35 +00:00
dchandler
6872ea8028
Corrected the usage info.
2003-09-07 22:08:00 +00:00
dchandler
d8657abd44
ACIP font shrinking as in {KA (GA)} is now supported.
2003-09-07 18:30:59 +00:00
dchandler
07e360d9a8
The ACIP {NYA%} is supported. {NYAo} and {NYAx} are confusing to me,
...
because I don't know which glyphs o and x correspond to. For that
reason, they cause ERRORs.
The proposed THDL Extended Wylie ~X and X is now used for U+0F35 and
U+0F37 respectively.
2003-09-07 16:19:50 +00:00
amontano
f57cdda867
Now translation tool displays to where is it connected
2003-09-07 03:40:51 +00:00
amontano
b489034598
Fixed a call to a deprecated method
2003-09-07 03:39:08 +00:00
dchandler
0d6d6ed611
Added GUI support for color-coding. Added support for color-coding
...
and choosing the warning level to TibetanConverter.
Better error checking in the GUI converter.
2003-09-06 22:56:10 +00:00
dchandler
1308f14807
sanskrit=green, prefix-rule-afflicted-tsheg-bar=yellow
2003-09-05 06:05:46 +00:00
dchandler
899b042ec0
Preliminary, untested color support in ACIP->TMW conversion.
2003-09-05 05:54:35 +00:00
dchandler
717c3b94f3
Fixed ACIP->Unicode spaces/tshegs and newlines, especially with shads.
...
"NGA," becomes "NGA-tsheg-," automatically now.
2003-09-05 05:08:47 +00:00
dchandler
5c240ac072
From the converter GUI, you can now choose TMW->ACIP text and
...
TMW->Wylie text. All the conversions show you which format they take
as input and which format they give as output.
File filter for ACIP files added.
The GUI converter suggests a file extension wisely.
Fixed newline bug in ACIP->Unicode converter.
2003-09-05 02:05:34 +00:00
dchandler
4abbf6db37
--to-acip-text and --to-wylie-text added; these get you text files,
...
not RTF files like --to-acip and --to-wylie do. The GUI converter
doesn't yet allow you to get text files.
2003-09-04 05:16:47 +00:00
dchandler
cc615f34df
ACIP->TMW and ACIP->Unicode have my pre-stamp of non-approval. Except
...
for (NYAx} and {NYAo}, they're as good as I'll get them without input
from experts of the employ of a complementary, syllabary-based
approach.
2003-09-04 04:34:18 +00:00
dchandler
ae7a7577bc
ACIP->TMW and ACIP->Unicode are now smart about when a newline is
...
really a newline and when a space is really a tsheg. The space in {KA
,MDO} is a tsheg, but the space in {GA ,MDO} is not.
2003-09-04 04:13:01 +00:00
dchandler
d2749cecd0
ACIP->TMW and ACIP->Unicode are now smart about when a newline is
...
really a newline and when a space is really a tsheg. The space in {KA
,MDO} is a tsheg, but the space in {GA ,MDO} is not.
2003-09-04 04:04:21 +00:00
dchandler
72e531e515
Use shortened 'dreng-bu, not regular. As per TM glyphs. I suspect
...
that the following would look better with shortened 'dreng-bu also,
but I'm sticking with the TM/TMW docs:
dz+r~137,2~~4,46~1,110~4,120~1,123~1,126~4,106~4,113~f5b,fb2
dz+w~138,2~~4,47~1,110~4,120~1,123~1,126~4,106~4,113~f5b,fad
dz+h~139,2~~4,48~1,110~4,120~1,123~1,126~4,106~4,113~0F5C
dz+h+y~140,2~~4,49~1,110~4,121~1,123~1,126~4,107~4,114~f5c,fb1
dz+h+r~141,2~~4,50~1,110~4,121~1,123~1,126~4,107~4,114~f5c,fb2
dz+h+l~249,2~~4,51~1,110~4,123~1,123~1,126~4,110~4,117~f5c,fb3
dz+h+w~143,2~~4,52~1,110~4,122~1,123~1,126~4,108~4,115~f5c,fad
2003-09-04 03:46:35 +00:00
a1tsal
2f58ec2760
A bunch of Sanskrit stacks of the form ts+... and dz+...had 1,125 for their
...
drengbu, but that is actually a naro. I changed it to 1,123
(which is one of the two drengbus).
2003-09-04 02:06:58 +00:00
dchandler
316f59107b
A preliminary TMW->ACIP converter is here. There are known bugs, mostly with rare punctuation.
2003-09-02 06:39:33 +00:00
dchandler
cc9ab06864
Added utility routine. Better comments.
2003-08-31 20:38:28 +00:00
dchandler
045c4069c9
Preliminary ACIP->TMW support is in place. {DU} gives you something
...
less beautiful than what Jskad would give, so more work is needed.
2003-08-31 16:06:35 +00:00
a1tsal
1f4d53be2e
Moved ^M to punctuation section.
...
Removed obsolete comment.
2003-08-31 00:44:23 +00:00
a1tsal
522812996e
Remove unused sections of tibwn.ini.
2003-08-31 00:34:15 +00:00
dchandler
dd22e161a5
Code cleanup for Jskad's Tibetan font converter GUI.
2003-08-30 05:01:15 +00:00
dchandler
896344f2d1
David Chapman removed some lines from tibwn.ini. That breaks TM<->TMW
...
mappings, so I've put them back, but with the EWTS non-correspondences
\tmwXYYY.
Jskad no longer supports superscribed or subscribed numerals, because
EWTS does not.
2003-08-26 01:28:02 +00:00
a1tsal
ccdebf6719
Removed half numbers (no longer in EWTS)
...
Brought <?Other?> closer to EWTS
Removed __TILDE__ (no longer in EWTS)
Changed M^ to ^M per new EWTS draft
Added ai, au, -i from WW tibwn.ini -- they were missing in this version
2003-08-25 23:19:48 +00:00
dchandler
1982c5847b
Jskad's converter now has ACIP-to-Unicode built in. There are known
...
bugs; it is pre-alpha. It's usable, though, and finds tons of errors
in ACIP input files, with the user deciding just how pedantic to be.
The biggest outstanding bug is the silent one: treating { }, space, as
tsheg instead of whitespace when we ought to know better.
2003-08-24 06:40:53 +00:00
dchandler
d5ad760230
TMW->Wylie conversion now takes advantage of prefix rules, the rules
...
that say "ya can take a ga prefix" etc.
The ACIP->Unicode converter now gives warnings (optionally, and by
default, inline). This converter now produces output even when
lexical errors occur, but the output has errors and warnings inline.
2003-08-23 22:03:37 +00:00
dchandler
21ef657921
I'd broken the ACIP->Wylie for ACIP vowels {'A}, {'I}, etc.
2003-08-22 05:13:32 +00:00
dchandler
1afb3a0fdd
ACIP->Unicode, without going through TMW, is now possible, so long as
...
\, the Sanskrit virama, is not used. Of the 1370-odd ACIP texts I've
got here, about 57% make it through the gauntlet (fewer if you demand
a vowel or disambiguator on every stack of a non-Tibetan tsheg bar).
2003-08-18 02:38:54 +00:00