Commit graph

  • 542fb50bf1 The ~M and ~M` EWTS change had not fully been made. Someone submitted a bug report 911472 that alerted me to this. dchandler 2004-03-07 17:02:35 +0000
  • db2ef438d4 Andres requested customization like this. You can now override any property in build.xml using a file 'build.properties'. dchandler 2004-03-06 23:17:43 +0000
  • e0928d8472 New EWTS for 0F82 and 0F83. dchandler 2004-03-06 23:00:40 +0000
  • bb8fa6c58f Now the clear button in the http servlet version actually clears. Also added "synchronized" to some methods to ensure that concurrent threads don't crash. amontano 2004-03-03 00:33:18 +0000
  • d436a4d462 Removed David Chapman's recently added line for U+0F82 -- a line for U+0F82 already existed, and the new line had incorrect TM and incorrect TMW mappings. I changed the existing line for U+0F82 to use the EWTS {~M`}. dchandler 2004-03-02 04:29:41 +0000
  • 8eaaeaa202 Fix careless error: I had the same TMW character for ~M and ~M`! a1tsal 2004-02-22 09:14:56 +0000
  • b14833b5b9 Change ^M to ~M to conform to spec. Introduce ~M` (for 0F82). a1tsal 2004-02-20 15:07:49 +0000
  • e5454d3720 Updated the translation tool to conform to the Personal Profile specification of Java. Before it would run in pocket pc's through the more restricted personalJava specification but Sun's vm for pocket pc's project was terminated. Now it is designed to run under IBM's VM for pocket pc's called J9 which implements the Personal Profile specification. Such specification also supports awt, but not swing so still there is no (hope for) support of Tibetan script in the pocket pc's, amontano 2004-02-07 18:21:17 +0000
  • 274e1736be Deleted cut-and-paste goof. dchandler 2004-01-17 19:45:31 +0000
  • c69ba26c60 TString now has tracks what Roman transliteration system it is using. Next up is to make ACIPConverter handle EWTS or ACIP TStrings. dchandler 2004-01-17 19:28:54 +0000
  • 48b4c5cb07 Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'. dchandler 2004-01-17 17:10:12 +0000
  • 6fdb2a26bb Added a Unicode->ASCII dump for debugging *->Unicode conversions. To use it, use 'java -cp Jskad.jar org.thdl.util.VerboseUnicodeDump'. dchandler 2004-01-17 16:52:38 +0000
  • 9dd95c5524 I saw this error when I wasn't expecting it, so now, curious, I print more details. dchandler 2004-01-17 16:51:33 +0000
  • f490978227 Now using Jskad/build.properties file, which I keep looking like this: dchandler 2004-01-17 16:43:47 +0000
  • 4dd40809a5 A user reported that q` caused a crash with TCC keyboard #1. Fixed. TCC keyboard #1 does not support q~ though. dchandler 2003-12-21 06:27:36 +0000
  • c1aa81e943 RFE 860190: ACIP->Unicode now gives a warning when it outputs something that can't be represented in TMW. dchandler 2003-12-16 07:45:40 +0000
  • 848349fd3a More tests. dchandler 2003-12-15 08:16:06 +0000
  • e7a9e7968f ACIP->Unicode now uses two characters for consonants instead of one. This matches the dislike for characters like U+0F77 etc. dchandler 2003-12-15 07:32:14 +0000
  • e9f7b2dfed If you want curly brackets around folio markers, you'll have to set the system property thdl.acip.to.x.output.curly.brackets.around.folio.markers to true. dchandler 2003-12-14 08:47:03 +0000
  • 8664571577 Warnings were not being detected correctly. Fixed. dchandler 2003-12-14 08:38:10 +0000
  • 01e65176d4 Using less memory and time to figure out if warnings occurred. dchandler 2003-12-14 07:41:15 +0000
  • 76c2e969ac Fixed ACIP->Unicode bug for YYE etc., things with full-formed subjoined consonants and vowels. dchandler 2003-12-14 07:36:21 +0000
  • f625c937ee ACIP {B} was not being treated like {BA}; instead, an error was resulting. All the five prefixes were affected. dchandler 2003-12-14 05:54:07 +0000
  • a0e6db11c0 Very minor cleanup. dchandler 2003-12-13 21:59:31 +0000
  • fcb0c57792 I'm committing an old edit that may or may not help headless tests -- they seemed fine anyway, but what's the harm? dchandler 2003-12-13 21:50:23 +0000
  • 8411ebe932 You couldn't build from the src-dist. I think you can now; it's more inclusive. dchandler 2003-12-13 21:46:09 +0000
  • 4c30657afa Adding tests for an ACIP keyboard that will never work correctly, and probably never even be useful. But they were lying around from a while back, so here are the tests. dchandler 2003-12-13 21:34:33 +0000
  • 02967539b0 Slightly improved Jskad's internal documentation. Links to converters' docs. dchandler 2003-12-10 07:04:35 +0000
  • 581643cf59 {DAN,\nLHAG} used to be treated like {DAN, LHAG} but that got broken. Fixed. dchandler 2003-12-10 06:55:16 +0000
  • 8e673bbc2c {NGA,} becomes {NGA\u0f0c,} now instead of {NGA\u0f0b,}. dchandler 2003-12-10 06:50:14 +0000
  • a466bad939 ACIP->TMW now supports EWTS PUA {\uF021}-style escapes. Our extended ACIP is thus TMW-complete and useful for testing. dchandler 2003-12-08 07:51:45 +0000
  • a39c5c12b0 ACIP->TMW now supports EWTS PUA {\uF021}-style escapes. Our extended ACIP is thus TMW-complete and useful for testing. dchandler 2003-12-08 07:15:27 +0000
  • 8f7322a056 Use absolute paths when invoking the external viewer; it doesn't know what our current working directory is. dchandler 2003-12-08 06:53:37 +0000
  • b617f761d5 ACIP->TMW for {^GONG SA } used to fail; fixed. dchandler 2003-12-07 20:05:41 +0000
  • 115534e688 ACIP->TMW for {^GONG SA } used to fail because we had \u0F38 in the ToWylie section. Now it's in the <?Input:Numbers?> section because I didn't want to introduce a new section. If WylieWord has trouble due to this misuse of the 'numbers' category, we'll introduce a new category, 'other'. dchandler 2003-12-07 19:40:59 +0000
  • 597cf408dd Fixed help message. dchandler 2003-12-07 19:10:36 +0000
  • 4adf87c401 Updated comments only. dchandler 2003-12-06 20:36:56 +0000
  • 3f18623977 Added comments only. dchandler 2003-12-06 20:26:45 +0000
  • 6232ee9170 Added comments referring to a user guide in development now. dchandler 2003-12-06 20:26:15 +0000
  • c43e9a446b Revamped some ACIP->Tibetan error messages. dchandler 2003-12-06 20:19:40 +0000
  • c9c771d1ee ACIP {&}, as in {KO&HAm,}, is supported. dchandler 2003-11-30 02:18:59 +0000
  • ac412c994b Now {Pm} is treated like {PAm}; {Pm:} is like {PAm:}; {P:} is like {PA:}. dchandler 2003-11-30 02:06:48 +0000
  • e7c4cc1874 Updated to be in sync with latest EWTS draft. dchandler 2003-11-29 22:59:39 +0000
  • ffd041e32c ACIP->TMW and ACIP->Unicode now allow for Unicode escapes like K\u0F84. This means that the lack of support for ACIP's backslash, '\\', is mitigated because you can turn ACIP {K\} into ACIP {K\u0F84}. dchandler 2003-11-29 22:57:12 +0000
  • dfaae4be93 ACIP->TMW and ACIP->Unicode now allow for Unicode escapes like K\u0F84. This means that the lack of support for ACIP's backslash, '\\', is mitigated because you can turn ACIP {K\} into ACIP {K\u0F84}. dchandler 2003-11-29 22:56:18 +0000
  • 946d8cbc72 Updated the code I used for testing to generate the file containing all glyphs in TM and all glyphs but one in TMW. dchandler 2003-11-29 16:22:26 +0000
  • 16bfeac641 These issues are non-issues; removing these comments. dchandler 2003-11-25 00:31:33 +0000
  • d3d0ff23a8 Chris Fynn and Tony Duff answered my questions about U+0F3F and U+0F3E. dchandler 2003-11-25 00:28:18 +0000
  • b8608797aa Updated the code I used for testing to generate the file containing all glyphs in TM and all glyphs but one in TMW. dchandler 2003-11-24 05:59:32 +0000
  • 8d18ac53cb N+D+Ya, not N+D+ya, w+Wa, not w+wa .. use W, R, and Y where appropriate. dchandler 2003-11-24 05:50:42 +0000
  • 5d053b41fe Found another inconsistency between Unicode and the TM/TMW docs. I've sent e-mail to Tony Duff asking who's right, but I'm putting this in the errata under the assumption that even if Unicode is wrong, Unicode's wrong view will somehow rule the day. dchandler 2003-11-24 05:49:15 +0000
  • 9a247f5932 N+D+Ya, not N+D+ya, w+Wa, not w+wa .. use W, R, and Y where appropriate. dchandler 2003-11-24 04:55:11 +0000
  • 1ec668c018 Dza is not in the latest EWTS draft. dchandler 2003-11-24 04:28:55 +0000
  • f76c089366 Using Y, R, and W everywhere needed. R+... is never needed in TM/TMW, I concluded (with 50% certainty). dchandler 2003-11-24 04:05:59 +0000
  • 08c676c186 Bug fixes. Plus, now 99% in sync with the new EWTS draft. Search for 'DLC' to find a few open issues. dchandler 2003-11-24 02:40:40 +0000
  • 216c5b0d54 Fixed TWM->Wylie for achen. I even tested this by pretending achen could take a da prefix (when in reality it takes no prefixes). dchandler 2003-11-23 01:22:27 +0000
  • 37e8dfa917 The menu now says (Buggy) in front of "Convert Selection from Wylie to Tibetan" because this feature is, you guessed it, buggy. dchandler 2003-11-22 22:48:41 +0000
  • 93995fff6a Fixed broken link. dchandler 2003-11-22 20:50:37 +0000
  • 113480a882 X is now better supported, so this changed. dchandler 2003-11-15 20:00:59 +0000
  • 8d4fb5d13f We crashed before when '~' was entered. dchandler 2003-11-14 04:50:55 +0000
  • b59b86fd73 Commented this to mention some recent testing. dchandler 2003-11-11 03:45:58 +0000
  • 4023be9612 Better prettyprinting. Untested. dchandler 2003-11-11 03:43:26 +0000
  • 4e6a9c299f ACIP % {MTHAR%} and o {Ko} and ^ {^GONG SA} are now supported. A % always causes a warning. dchandler 2003-11-11 03:43:11 +0000
  • 2cb90bd231 ACIP->Tibetan converters now warn every time {%} is encountered that U+0F14 might've been intended. dchandler 2003-11-09 23:15:58 +0000
  • 084e12a02c Import Wylie is a buggy feature. The menu now calls it "(Buggy) Import Wylie...". t+s+w doesn't even convert correctly! dchandler 2003-11-09 01:25:58 +0000
  • 04816acb74 ACIP->Unicode was broken for KshR, ndRY, ndY, YY, and RY -- those stacks that use full-form subjoined RA and YA consonants. dchandler 2003-11-09 01:07:45 +0000
  • 8193cef5d1 Better comments. dchandler 2003-11-09 01:07:07 +0000
  • dbd9c80ca0 Special tests for rwa and r+wa, which are the only two different stacks with the same hash key modulo - and +. dchandler 2003-11-09 01:06:26 +0000
  • 85e1e0701e Fixed crashing bug in Import Wylie. dchandler 2003-11-08 23:32:53 +0000
  • 8fbd8850f8 New feature: Convert Selection from TWM to ACIP. dchandler 2003-11-08 23:22:06 +0000
  • bab47c4910 There are now extensive tests to make sure that each Tibetan stack in TMW can be typed in using EWTS and correctly converted to TMW and then back to EWTS. These tests unearthed new bugs in the Tibetan! 5.1 docs. dchandler 2003-11-08 22:11:24 +0000
  • 3fa417d3ee phywI, phywU, drwI and drwU now produce vowels and subjoined a-chungs. The Tibetan! 5.1 docs say I and U are not applicable to these stacks, but I say Jskad lets the user decide what's applicable. If you disagree, be sure to give an error message before dropping the I or U request -- we were silent. dchandler 2003-11-08 21:53:34 +0000
  • e058d6252e phywu and drwu now produce zhabs-kyus. The Tibetan! 5.1 docs say the zhabs-kyu is not applicable to these stacks, but I say Jskad lets the user decide what's applicable. If you disagree, be sure to give an error message before dropping the zhabs-kyu request -- we were silent. dchandler 2003-11-08 21:48:08 +0000
  • 55aaeef9d0 l+h+wu now produces a zhabs-kyu. The Tibetan! 5.1 docs say the zhabs-kyu is not applicable to l+h+w, but I say Jskad lets the user decide what's applicable. If you disagree, be sure to give an error message before dropping the zhabs-kyu request -- we were silent. dchandler 2003-11-08 21:23:50 +0000
  • 06edf17b04 Once again, the wrong 'dreng-bu glyphs were listed in the Tibetan! 5.1 docs -- they were na-ro glyphs, actually. dchandler 2003-11-08 21:17:18 +0000
  • f626a04d72 Tests t+r+n glyph. dchandler 2003-11-08 20:28:34 +0000
  • 74d6bc61ab The wrong 'dreng-bu glyphs were listed in the Tibetan! 5.1 docs -- they were na-ro glyphs, actually. dchandler 2003-11-08 20:25:16 +0000
  • a0ae0bf70d Fixes bug 800164. Jskad users can now enter t+r+n on the keyboard. Wylie Word should work for t+r+n too. dchandler 2003-11-08 17:50:10 +0000
  • 9ad767bd22 & -> &amp; dchandler 2003-11-08 03:51:38 +0000
  • 0ac90d7c0f Nathanial -> Nathaniel dchandler 2003-11-08 03:42:51 +0000
  • b1765ec307 & -> &amp; dchandler 2003-11-08 03:41:55 +0000
  • cb78fedc1b An attempt at bringing these pages up to date w.r.t. THDL HTML guidelines. dchandler 2003-10-31 03:58:01 +0000
  • c1a926429f Take two: An attempt at bringing this page up to date w.r.t. THDL HTML guidelines. dchandler 2003-10-31 03:43:16 +0000
  • 3b8c33bcd9 An attempt at bringing this page up to date w.r.t. THDL HTML guidelines. dchandler 2003-10-31 03:37:44 +0000
  • e3f1ed5914 Removed a DOS EOF character (^Z). I haven't a clue how it crept in -- the lexer doesn't let that kind of thing get into tsheg bars. dchandler 2003-10-27 13:58:45 +0000
  • 94a43d3f39 Now anything not clearly native Tibetan is colored green when coloring is enabled. G'EEm is "native", though -- the only "vowel" that implies non-nativeness is {:}, as in {KA:}. dchandler 2003-10-26 18:56:48 +0000
  • 5c36dd81d3 Fixed bug 830332, "Convert selected ACIP=>Tibetan busted". dchandler 2003-10-26 18:25:25 +0000
  • e74547d743 GA-YOGS now parses like G-YOGS and GAYOGS do. dchandler 2003-10-26 18:06:38 +0000
  • 61cf19932e ACIP {B5} and {7'} were problematic; that's fixed. dchandler 2003-10-26 17:47:35 +0000
  • ad7b20e485 Added yet more metadata. dchandler 2003-10-26 16:05:30 +0000
  • 1550fee41a Removed garbage. dchandler 2003-10-26 16:05:07 +0000
  • fe33d67573 Added more metadata. There are 35 million+ tsheg bars here. dchandler 2003-10-26 15:35:08 +0000
  • 050666d735 I'm committing this at 1:55 am EST on Sunday, October 26, 2003. There is no compelling technical reason, but this way I get to have two commits that are both before and after each other. dchandler 2003-10-26 06:56:12 +0000
  • 31b3020d07 Added a test case that runs almost all the tsheg bars from all non-reference, publicly available ACIP files (hundreds of megabytes of them) through the converter. The frequencies of these tsheg bars in in the file, too. dchandler 2003-10-26 06:02:48 +0000
  • 7ba1ad0735 Added a mechanism for end users to have the ACIP/EWTS=>Tibetan converters print all tsheg bars or all unique tsheg bars to standard output. This will be useful for getting a list of all the tsheg bars in ACIP texts, e.g., which can then go into PackageTest.java. A lot of postprocessing would be required to get frequency counts, but you could do it with a perl script, awk, etc. dchandler 2003-10-26 02:42:06 +0000
  • ef24c608bf Added a mechanism for end users to customize ACIP/EWTS=>Tibetan conversions by giving a list of substitutions to be performed. E.g., when I invoke Jskad via 'java -Dorg.thdl.tib.text.ttt.VerboseReplacementMap=false -Dorg.thdl.tib.text.ttt.ReplacementMap="KAsh=>K+sh" -jar Jskad.jar', then the ACIP KAsh becomes K+sh automatically. dchandler 2003-10-26 02:17:19 +0000
  • 6bda550157 The ACIP "BNA" was converting to B-NA instead of B+NA, even though NA cannot take a BA prefix. This was because BNA was interpreted as root-suffix. In ACIP, BN is surely B+N unless N takes a B prefix, so root-suffix is out of the question. dchandler 2003-10-26 00:32:55 +0000
  • d99ae50d8a The ACIP "BNA" was converting to B-NA instead of B+NA, even though NA cannot take a BA prefix. This was because BNA was interpreted as root-suffix. In ACIP, BN is surely B+N unless N takes a B prefix, so root-suffix is out of the question. dchandler 2003-10-26 00:24:28 +0000
  • 1415fc43e3 The ACIP "BNA" was converting to B-NA instead of B+NA, even though NA cannot take a BA prefix. This was because BNA was interpreted as root-suffix. In ACIP, BN is surely B+N unless N takes a B prefix, so root-suffix is out of the question. dchandler 2003-10-26 00:21:54 +0000
  • 306cf2817c Private correspondence with Robert Chilton led to me to add and remove a few prefix rules. BLC and BGL are here, BLK, BLG, BLNG, BLJ, BNG, BJ, BNY, BN, and BDZ are gone. dchandler 2003-10-25 21:47:34 +0000