Jskad

Author	SHA1	Message	Date
dchandler	cc615f34df	ACIP->TMW and ACIP->Unicode have my pre-stamp of non-approval. Except for {NYAx} and {NYAo}, they're as good as I'll get them without input from experts or the employ of a complementary, syllabary-based approach.	2003-09-04 04:34:18 +00:00
dchandler	ae7a7577bc	ACIP->TMW and ACIP->Unicode are now smart about when a newline is really a newline and when a space is really a tsheg. The space in {KA ,MDO} is a tsheg, but the space in {GA ,MDO} is not.	2003-09-04 04:13:01 +00:00
dchandler	d2749cecd0	ACIP->TMW and ACIP->Unicode are now smart about when a newline is really a newline and when a space is really a tsheg. The space in {KA ,MDO} is a tsheg, but the space in {GA ,MDO} is not.	2003-09-04 04:04:21 +00:00
dchandler	72e531e515	Use shortened 'dreng-bu, not regular. As per TM glyphs. I suspect that the following would look better with shortened 'dreng-bu also, but I'm sticking with the TM/TMW docs: dz+r~137,2~~4,46~1,110~4,120~1,123~1,126~4,106~4,113~f5b,fb2 dz+w~138,2~~4,47~1,110~4,120~1,123~1,126~4,106~4,113~f5b,fad dz+h~139,2~~4,48~1,110~4,120~1,123~1,126~4,106~4,113~0F5C dz+h+y~140,2~~4,49~1,110~4,121~1,123~1,126~4,107~4,114~f5c,fb1 dz+h+r~141,2~~4,50~1,110~4,121~1,123~1,126~4,107~4,114~f5c,fb2 dz+h+l~249,2~~4,51~1,110~4,123~1,123~1,126~4,110~4,117~f5c,fb3 dz+h+w~143,2~~4,52~1,110~4,122~1,123~1,126~4,108~4,115~f5c,fad	2003-09-04 03:46:35 +00:00
a1tsal	2f58ec2760	A bunch of Sanskrit stacks of the form ts+... and dz+... had 1,125 for their drengbu, but that is actually a naro. I changed it to 1,123 (which is one of the two drengbus).	2003-09-04 02:06:58 +00:00
dchandler	316f59107b	A preliminary TMW->ACIP converter is here. There are known bugs, mostly with rare punctuation.	2003-09-02 06:39:33 +00:00
dchandler	cc9ab06864	Added utility routine. Better comments.	2003-08-31 20:38:28 +00:00
dchandler	045c4069c9	Preliminary ACIP->TMW support is in place. {DU} gives you something less beautiful than what Jskad would give, so more work is needed.	2003-08-31 16:06:35 +00:00
a1tsal	1f4d53be2e	Moved ^M to punctuation section. Removed obsolete comment.	2003-08-31 00:44:23 +00:00
a1tsal	522812996e	Remove unused sections of tibwn.ini.	2003-08-31 00:34:15 +00:00
dchandler	dd22e161a5	Code cleanup for Jskad's Tibetan font converter GUI.	2003-08-30 05:01:15 +00:00
dchandler	896344f2d1	David Chapman removed some lines from tibwn.ini. That breaks TM<->TMW mappings, so I've put them back, but with the EWTS non-correspondences \tmwXYYY. Jskad no longer supports superscribed or subscribed numerals, because EWTS does not.	2003-08-26 01:28:02 +00:00
a1tsal	ccdebf6719	Removed half numbers (no longer in EWTS). Brought <?Other?> closer to EWTS. Removed __TILDE__ (no longer in EWTS). Changed M^ to ^M per new EWTS draft. Added ai, au, -i from WW tibwn.ini; they were missing in this version.	2003-08-25 23:19:48 +00:00
dchandler	1982c5847b	Jskad's converter now has ACIP-to-Unicode built in. There are known bugs; it is pre-alpha. It's usable, though, and finds tons of errors in ACIP input files, with the user deciding just how pedantic to be. The biggest outstanding bug is the silent one: treating { }, space, as tsheg instead of whitespace when we ought to know better.	2003-08-24 06:40:53 +00:00
dchandler	d5ad760230	TMW->Wylie conversion now takes advantage of prefix rules, the rules that say "ya can take a ga prefix" etc. The ACIP->Unicode converter now gives warnings (optionally, and by default, inline). This converter now produces output even when lexical errors occur, but the output has errors and warnings inline.	2003-08-23 22:03:37 +00:00
dchandler	21ef657921	I'd broken the ACIP->Wylie for ACIP vowels {'A}, {'I}, etc.	2003-08-22 05:13:32 +00:00
dchandler	1afb3a0fdd	ACIP->Unicode, without going through TMW, is now possible, so long as \, the Sanskrit virama, is not used. Of the 1370-odd ACIP texts I've got here, about 57% make it through the gauntlet (fewer if you demand a vowel or disambiguator on every stack of a non-Tibetan tsheg bar).	2003-08-18 02:38:54 +00:00
dchandler	245aac4911	I'm now stricter about accepting alphabetic characters. F, Q, X, a, b, c, d, e, ... do not belong in ACIP, so the scanner rejects them. This should make it even easier to distinguish automatically between Tibetan and English texts.	2003-08-17 02:38:58 +00:00
dchandler	39451d8879	Fixed a couple of small bugs. Only 250 errors are reported now; this is important if you try to convert an English document.	2003-08-17 02:12:49 +00:00
dchandler	4581a2d8ab	Improved the ACIP scanner (the part of the converter that says, "This is a correction, that's a comment, this is Tibetan, that's Latin (English), that's Tibetan inter-tsheg-bar punctuation, etc."). It now accepts more real-world ACIP files, i.e. it handles illegal constructs. The error checking is more user-friendly. There are now tests. Added some tsheg bars that Peter E. Hauer of Linguasoft sent me to the tests. Many thanks, Peter. I still need to implement rules that say, "This is not Tibetan, it must be Sanskrit, because that letter doesn't take a MA prefix."	2003-08-17 01:45:55 +00:00
dchandler	0b91ed0beb	I've improved the ACIP tsheg bar scanner to handle a lot of illegal constructions that occur in practice.	2003-08-16 16:13:53 +00:00
amontano	2a57439516	Updated the info displayed on the about window.	2003-08-14 14:16:49 +00:00
amontano	da384c6c2f	Now when loading, takes the default font options from the DuffPane.	2003-08-14 14:16:23 +00:00
dchandler	2b59d9838d	I now have a function that takes as input a String of ACIP and breaks up that String into tsheg bars, punctuation, etc., while finding errors. I've tested it some, but I'm not yet committing the tests. Next step: a converter that takes an ACIP file as input and outputs TMW+Latin.	2003-08-14 05:10:47 +00:00
dchandler	57f506384f	The ACIP->Tibetan converter now has perfect low-level functionality, and it has the capability to produce error messages and warnings that make sense to the user. One can now get the correct parse, if one exists, for an ACIP tsheg bar. One could even feed in ACIP and get a list of warnings about things as innocuous as PADMA, which a dumb converter would have trouble with. One could then turn ACIP into well-behaved ACIP for that dumb converter, if you really wanted to. Still to do: (1) scan ACIP files into tsheg bars; (2) produce TMW/Latin (from which you can get Unicode, etc.); (3) e-mail the illegal tsheg bars to the ACIP fellows so they can fix the affected documents (most of the Kangyur has unparseable creatures).	2003-08-12 04:13:11 +00:00
dchandler	87266646fb	Removed misinformation.	2003-08-10 19:33:01 +00:00
dchandler	e21d3774a9	Added an unfinished ACIP->Tibetan converter. Once it works properly for ACIP, it'll easily be made to work as a perfect EWTS Wylie->Tibetan converter. It has an extensive suite of tests for the existing functionality.	2003-08-10 19:30:07 +00:00
dchandler	39e0435b6b	Refactored this code so that Wylie->Tibetan and ACIP->Tibetan conversions can make use of it. Hooray for reuse.	2003-08-10 19:02:56 +00:00
dchandler	bcf1c12b6a	We now produce EWTS m.ya, g.rwa, d.rwa, and b.ya during TMW->Wylie. Our disambiguation is now perfect, happening when and only when it is necessary. These are all illegal, so it shouldn't affect many existing conversions. But if there were typos, it could.	2003-08-10 18:46:01 +00:00
dchandler	9093fd3c05	We now produce EWTS m.ya, g.rwa, d.rwa, and b.ya during TMW->Wylie. Our disambiguation is now perfect, happening when and only when it is necessary. These are all illegal, so it shouldn't affect many existing conversions. But if there were typos, it could.	2003-08-10 18:38:20 +00:00
dchandler	251d8feae5	brtan now gives TMW->Wylie brtan, not b.rtan. Etc. See bug report http://sourceforge.net/tracker/index.php?func=detail&aid=785791&group_id=61934&atid=502515.	2003-08-09 17:48:40 +00:00
dchandler	7dffc47cb7	'bad now gives TMW->Wylie 'bad, not TMW->Wylie 'abd. Andres came across this one, so we've added it to the list of ambiguous three-consonant combos.	2003-08-09 17:05:43 +00:00
amontano	52cdc17794	Added support for multiple keyboards and the ability to set preferences for the size of the Tibetan font and the type and size of the Roman font.	2003-08-09 08:00:58 +00:00
amontano	8e4b508de8	Made a new class for the preference window so that other software (i.e. the translation tool) can re-use that same code to set up the attributes of the Tibetan and Roman fonts.	2003-08-09 07:57:21 +00:00
amontano	ef0df405d9	Redesigned the interface of the handheld version.	2003-08-03 06:29:08 +00:00
amontano	2b5a5fe67a	Got rid of redundant code	2003-08-03 06:28:22 +00:00
amontano	cce779bf88	Added a wizard window to avoid using the command line as much as possible. This way, after clicking on the application, one can use the wizard to connect to the available on-line dicts, open a local dict, or generate a dict database.	2003-08-03 06:27:30 +00:00
dchandler	4caeafa1b1	You shouldn't have one of these without the other, now that there are two. This way neither TM nor TMW fonts will be loaded.	2003-07-26 00:55:32 +00:00
dchandler	2bb499e5a7	This was dying with a NullPointerException when you started it up using 'ant tt-run' with no dictionary. Now it starts up and shows you a nice error message, "Dictionary could not be loaded!", instead.	2003-07-26 00:53:59 +00:00
dchandler	e198519c5f	Jskad now supports EWTS ~, i.e. TMW8.91.	2003-07-25 02:35:31 +00:00
amontano	5df9b5b91a	now supports sorting	2003-07-25 01:43:58 +00:00
amontano	97f5fe91b3	when invalid wylie is encountered, instead of displaying a message it raises an exception.	2003-07-25 01:43:18 +00:00
amontano	7cdbf33333	changed it to support 30 dictionaries (instead of just 15)	2003-07-25 01:42:17 +00:00
amontano	7b04d7bca5	changed the "about" info	2003-07-25 01:41:30 +00:00
dchandler	a7f0c35738	Added a test for ts.ha vs. tsha ambiguity; there is no ambiguity.	2003-07-18 03:51:29 +00:00
dchandler	dc454b8c0c	More test cases related to the following: The Tibetan d.za was being converted into the Wylie dza incorrectly. This is a rare case, but I want TMW->Wylie to be perfectly unambiguous.	2003-07-18 02:31:02 +00:00
dchandler	f8c959bfb0	The Tibetan d.za was being converted into the Wylie dza incorrectly. This is a rare case, but I want TMW->Wylie to be perfectly unambiguous.	2003-07-18 00:30:27 +00:00
dchandler	1c29566aee	I'm now using the Unix diff built in to Apache Jakarta Commons JRCS (which I found on suigeneris.org, not apache.org) in order to bulletproof the Tibetan Converter tests. They used to fail due to nondeterminism in the Java RTF writer; they should no longer fail. I've also changed it so that the Tibetan Converter tests run in headless mode, which means that they'll run on the nightly builds server.	2003-07-14 12:26:26 +00:00
dchandler	06fb77a82b	Initial revision	2003-07-14 12:22:29 +00:00
dchandler	f900154e7a	Tests disambiguation in TMW->Wylie conversion.	2003-07-14 12:21:02 +00:00
dchandler	0622ac5062	Jskad no longer relies on the <?Consonants?>, <?Vowels?>, <?Other?>, or <?Numbers?> commands; it instead hard-codes the appropriate comma-delimited lists. This is cleaner because WylieWord and Jskad had different values for these lists.	2003-07-14 12:19:46 +00:00
dchandler	fb85f6e8ce	Fix comment.	2003-07-14 12:17:04 +00:00
dchandler	79b3b97326	Remove warning message from menu item.	2003-07-13 23:19:11 +00:00
dchandler	c986684beb	Updated help to talk about new features.	2003-07-13 22:51:35 +00:00
dchandler	f695b1a6c1	Updated baselines because conversions have improved since the last update.	2003-07-13 19:14:41 +00:00
dchandler	d10f97fc06	Disambiguation was not being used appropriately. This makes previous TMW->Wylie conversions with the new-and-improved TMW->Wylie algorithm faulty. Now I'm using it a little more than you need to, e.g. b.lha instead of blha is generated because bla and b.la are ambiguous.	2003-07-13 19:14:15 +00:00
dchandler	96afae795c	Disambiguation was not being used appropriately. This makes previous TMW->Wylie conversions with the new-and-improved TMW->Wylie algorithm faulty. Now I'm using it a little more than you need to, e.g. b.lha instead of blha is generated because bla and b.la are ambiguous.	2003-07-13 18:46:29 +00:00
dchandler	802e0cb588	If this method uses the Wylie representation, you get an infinite recursion when you do a TMW->Wylie conversion for a document with glyphs that have no known Wylie.	2003-07-13 17:40:02 +00:00
dchandler	a86a0f235b	I was missing a break; statement; this caused an Error to be thrown during some TMW->Wylie conversions. No conversions were erroneous, though.	2003-07-13 17:38:00 +00:00
dchandler	6677d1e245	Code cleanup.	2003-07-13 16:53:03 +00:00
dchandler	3b6eaa792e	Fixed javadocs.	2003-07-11 13:33:30 +00:00
dchandler	85176cd9f3	Put in a fix for a new bug in Swing's RTF support. This bug is w.r.t. escapes like \bullet, \emdash, etc., and this fix only works for Windows or OS/2 RTF files, not for Mac RTF files. So if you want a TM->TMW conversion to work, use MS Word for Windows, not for the Mac.	2003-07-11 13:30:22 +00:00
dchandler	d726bc0258	A couple of changes to TMW->Unicode thanks to Than's reply to my questions.	2003-07-09 01:44:15 +00:00
dchandler	9db233bdf8	Cosmetic change.	2003-07-08 14:31:14 +00:00
dchandler	02558a1d78	Jskad supports <7, >8, etc. again; it no longer supports the punctuation '<' and '>'. The current keyboard implementation makes this an either-or proposition, when fundamentally it need not be. Added a <?Numbers?> command and an <?Input:Numbers?> command to tibwn.ini; broke the numbers apart from the consonants. This facilitates the new-and-improved Tibetan->Wylie conversion. Tibetan->Wylie is now done by forming legal tsheg-bars. A legal tsheg bar is converted into perfect THDL Wylie. See code comments to learn what it thinks is a legal tsheg-bar, but it includes, e.g., bskyUMbsH minus the trailing punctuation (H). Illegal sequences, such as runs of transliterated Sanskrit, are turned into unambiguous Wylie; each glyph is followed by a vowel or a disambiguator ('.'). I've made it so that the illegal sequences are as beautiful as possible. You get 'pad+me', for example, not the equivalent but uglier 'pad+m.e.'.	2003-07-08 14:30:17 +00:00
dchandler	c04a3f189b	Rearranged the topics.	2003-07-08 12:50:27 +00:00
dchandler	23d18c925f	Tibetan! 5.1's docs were again faulty. fa and va were getting the wrong vowels.	2003-07-08 02:59:17 +00:00
dchandler	24ac6fd06c	The Trie of possible inputs fixed this bug.	2003-07-06 16:31:13 +00:00
dchandler	d88141512b	Small changes w.r.t. clearing preferences. Some code cleanup.	2003-07-06 16:24:29 +00:00
dchandler	086f4bb6ec	Renamed the Info menu Help. Now using CalHTMLPane to surf the offline and the online help.	2003-07-05 22:25:21 +00:00
dchandler	8c4ab30a52	Rearranged the Tools menu; made the converter smart about "find some..." and "find all..." modes.	2003-07-05 21:02:46 +00:00
dchandler	72d2eee503	Code cleanup.	2003-07-05 19:26:58 +00:00
dchandler	a463b686b3	Jskad now ships with both TibetanMachine and TibetanMachineWeb fonts by default, not just TMW. Thus users need not install these fonts on their systems.	2003-07-05 18:00:29 +00:00
dchandler	9effee0564	If you opened a file from the recently opened files list and very quickly mouse-clicked on the new Jskad window, you could cause an infinite regression of requestFocus() operations because the menu would try to get focus back. I grab focus from the menu now.	2003-07-05 02:30:00 +00:00
dchandler	51679c158b	Final fixes completed; recently opened files can now be selected from Jskad's file menu.	2003-07-05 02:15:33 +00:00
dchandler	4410b52c07	There's still a small bug in this, but here's the real stuff: Recently opened files can now be selected from Jskad's file menu. A Jskad now gives the focus to the DuffPane when that Jskad gets the focus.	2003-07-04 03:29:25 +00:00
dchandler	d863446d25	I think this compiles...	2003-07-04 02:32:40 +00:00
dchandler	407020108f	I didn't mean to commit the previous revision; I'm still tweaking it.	2003-07-04 02:32:03 +00:00
dchandler	9f0b1c3250	Recently opened files can now be selected from Jskad's file menu. A Jskad now gives the focus to the DuffPane when that Jskad gets the focus.	2003-07-04 02:31:23 +00:00
dchandler	7500b4e06b	Jskad won't allow you to exit by closing the last window anymore. Instead, you get a dialog box saying to use File/Exit.	2003-07-04 00:21:07 +00:00
dchandler	6c286573ba	Fixed Javadocs.	2003-07-04 00:12:59 +00:00
dchandler	0a1bc0d30b	getWylie now takes a parameter for error detection; I'm not detecting errors here though. Fixed a typo in a property name.	2003-07-01 23:20:08 +00:00
dchandler	0d1999d055	getWylie now takes a parameter for error detection; I'm not detecting errors here though.	2003-07-01 22:52:18 +00:00
dchandler	a48ec641d5	Better error messages in TMW->Wylie conversions. The user knows what's up.	2003-07-01 03:43:33 +00:00
dchandler	3113a4b8de	Some of the \tmw80.. mappings were out of date. 3+1/2 is not EWTS; took these out.	2003-07-01 03:42:30 +00:00
dchandler	e7e7c2bf15	The command-line tool runs in headless mode by default, so it will work on, e.g., a Linux console. The JUnit tests will too, though 'ant check' still fails because we don't sneak the -Djava.awt.headless=true into the process early enough.	2003-07-01 02:50:09 +00:00
dchandler	6151a7bc94	TMW->Wylie now occurs in the TibetanDocument, not in DuffPane, which means that the command-line tool can finally function with a headless graphics device. Hopefully it will speed things up, too. It also means that entering Roman text into the TMW->Unicode conversion and TMW->TM conversion will be easy.	2003-07-01 01:21:57 +00:00
dchandler	61d29fc355	The TMW->Wylie mapping was busted w.r.t. tshegs. Also, I now map both TMW7.90 and TMW7.91 to EWTS 'M'.	2003-07-01 00:17:18 +00:00
dchandler	229536884f	I've validated by hand the TM<->TMW mappings. A few things changed, so no previous TM->TMW or TMW->TM conversions can be trusted.	2003-06-30 02:24:11 +00:00
dchandler	dc03083433	I've validated by hand the TM<->TMW mappings. A few things changed, so no previous TM->TMW conversions can be trusted.	2003-06-30 02:22:09 +00:00
dchandler	58644a6ef9	Better error handling.	2003-06-30 02:20:52 +00:00
dchandler	b16fb8a85c	This is correct; the Tibetan! 5.1 documentation is not. This affects TM->TMW conversions. See http://sourceforge.net/tracker/index.php?func=detail&aid=746871&group_id=61934&atid=502515 for a full list of Tibetan! 5.1 documentation errors.	2003-06-29 22:11:00 +00:00
dchandler	aedef4b44d	An error now appears if you try to convert from format A to format B but no glyphs in format A appear. In this case, it is likely that you meant to convert a different file or do a different conversion.	2003-06-29 21:31:48 +00:00
dchandler	ee14b7b97f	Jskad now has the ability to open its buffer with an external viewer, e.g. Microsoft Word. Better OOM error handling in the GUI converter; untested, though.	2003-06-29 20:49:30 +00:00
dchandler	646e23b4a4	Tweaked the converter GUI so that you can open the old and the new files with the external viewer.	2003-06-29 16:45:15 +00:00
dchandler	3f76c3692d	Fixed Javadoc warnings.	2003-06-29 15:37:35 +00:00
dchandler	b841a7f14b	The converter GUI can now be run standalone or from Jskad's Tools menu. The converter GUI gives nicer error messages in at least one case.	2003-06-29 04:18:36 +00:00
dchandler	7938648ca8	TM->TMW conversion has no known bugs. Oddballs have been comprehensively handled.	2003-06-29 03:03:07 +00:00
dchandler	689c1910aa	To deal with javax.swing.text.rtf bugs regarding hexadecimal escape sequences, I've created RTFFixerInputStream. It turns illegal hexadecimal escapes into Unicode escapes.	2003-06-29 02:30:08 +00:00
dchandler	0b849aed97	Fixed comments w.r.t. javadoc warnings.	2003-06-29 02:22:20 +00:00
dchandler	4e279defb4	Fixed a couple of array bounds checks. Added support for two more oddballs. Deprecated the oddball lookup method because it drops up to 30 glyphs in TibetanMachine. The correct solution is to transform the RTF before Java's busted RTF readers ever see it. \'97 becomes \u151, e.g.	2003-06-28 16:33:58 +00:00
dchandler	2a359c45ef	Bad conversions were not leaving the unconvertible characters at the beginning of the document as they should and as they are documented to. They now do, and they bracket the bad characters with the TM or TMW for U+0F3C on the left and the TM or TMW for U+0F3D on the right. Some cleanup.	2003-06-28 16:20:19 +00:00
dchandler	c39d8d6326	My earlier code cleanup introduced this bug; TMW->TM conversion was busted.	2003-06-26 22:48:51 +00:00
dchandler	25510542b2	Now with a nicer error message in one case.	2003-06-26 22:48:05 +00:00
dchandler	c34259b105	Code cleanup.	2003-06-25 01:04:24 +00:00
dchandler	9e6c3009ac	Added an About button. Code cleanup. Changed the Cancel button to the Close button.	2003-06-25 00:49:11 +00:00
dchandler	569fba6467	Made the comments in the my_thdl_preferences.txt file use standard line separators.	2003-06-25 00:03:46 +00:00
dchandler	0f3c4174b6	Made the comments in the my_thdl_preferences.txt file more useful.	2003-06-24 23:48:00 +00:00
dchandler	33beb7b782	Bye bye debugging output.	2003-06-24 12:23:37 +00:00
dchandler	f547734043	Added Than's converter GUI code; adapted it to work with Jskad's converters. TMW->Unicode now uses Ximalaya by default.	2003-06-24 03:02:29 +00:00
dchandler	19d7cabfe6	Forget the final=faster myth.	2003-06-24 03:01:13 +00:00
dchandler	917864574c	Fixed a logic bug in mapTMWtoTM and mapTMtoTMW. You can now specify which Unicode font to use via 'java -Dthdl.tmw.to.unicode.font=Ximalaya ...'.	2003-06-23 01:58:11 +00:00
dchandler	b6d8fd89f9	When errors in (all but TMW->Wylie and Wylie->TMW) conversion occur, the troublesome glyphs are now put at the beginning of the document AFTER AN ACHEN. This makes a glyph like \tmw7095 visible atop the achen. Major fix to the handling of paragraphs in conversion; we were (for whatever reason) dropping paragraphs before.	2003-06-23 01:24:02 +00:00
dchandler	1f4343bed0	TMW->TM, TM->TMW, and TMW->Unicode conversions are all (at least 2) orders of magnitude faster.	2003-06-22 22:10:58 +00:00
dchandler	afe73c2228	The pseudo-file '-', referring to standard input, is now accepted as a command-line argument.	2003-06-22 21:05:16 +00:00
dchandler	900f7492b0	'ant clean check' was failing because I hadn't updated the --find-some-non-tmw and --find-all-non-tmw baselines. Code cleanup.	2003-06-22 16:11:58 +00:00
dchandler	66287f3cc9	Small TMW->Wylie performance improvements. TMW->Wylie is much faster than TMW->Unicode etc.; this is because many fewer replacements are made (i.e., more text is replaced each time a replacement is performed). I must find a way to still preserve formatting but do many fewer replacements in TMW->{Unicode,TM} and TM->TMW.	2003-06-22 04:32:59 +00:00
dchandler	6540b260bd	Fixes a (small, I think) TMW->Unicode performance glitch. I was inserting 5 characters at a time and then skipping ahead just one position. I don't think this affected correctness. I believe there's still a terrible (exponential?) slowdown as the input file gets bigger, however. Perhaps not, but while we run through the first 1000 TMW glyphs in 6 seconds, the 20th thousand takes at least 60 seconds. Is TMW->Wylie faster than TMW->Unicode? If so, why? Thought: don't use a DuffPane within TibetanConverter; it can only add overhead, right? My hprof profile said that the conversion was taking just a couple of percent of the work; the rest was going to display-related stuff that you should only see if you were displaying the document. I'm not!	2003-06-22 04:08:33 +00:00
dchandler	dfe64a1927	Added --find-some-non-tm and --find-all-non-tm modes to the converter to help ensure worry-free TM->TMW conversions.	2003-06-22 00:14:18 +00:00
dchandler	80101666c7	Included a fix from WylieWord's tibwn.ini. Removed some needless trailing tildes.	2003-06-21 02:35:21 +00:00
dchandler	9a41f512d9	It used to be the case that you could select 'Close', and then when asked "do you want to save?" you could press yes and then press cancel and Jskad would still exit. That's no longer the case. Added File->Exit to Jskad.	2003-06-21 02:07:51 +00:00
dchandler	45b87b0fb4	In Jskad, you can now clear the preferences and return to default values.	2003-06-21 01:26:17 +00:00
eg3p	fbb6245fdb	Added cut() and copy() methods to override JTextPane's methods of same name.	2003-06-20 15:27:20 +00:00
dchandler	5067683121	Edward corrected me; he had intended to have M map to 7.91, not 7.90.	2003-06-17 01:46:19 +00:00
dchandler	ced830a7d3	Renamed TMW_RTF_TO_THDL_WYLIE TibetanConverter.	2003-06-15 19:19:23 +00:00
dchandler	34a7b5da9b	This converter now performs TMW->Unicode conversions.	2003-06-15 18:38:42 +00:00
dchandler	da70434e52	Jskad now allows for TMW->Unicode conversion.	2003-06-15 16:27:36 +00:00
dchandler	af5b95b08d	A TMW->Unicode table is here. Note these issues, however: Is the EWTS '_' to be represented as U+0020, or is it a wider space? Does TMW9.42, Dza, map to U+0F5F,U+0F39? Does TMW6.60, r+y, map to U+0F62,U+0FBB or to U+0F6A,U+0FBB? (Likewise with r+w, TMW6.61, TMW6.62, etc.) Is U+0F7E a bindu? What Unicode does TMW7.96 map to, for example? What does TMW7.91 map to? Should TMW8.97 and TMW8.98 map to swastikas elsewhere in Unicode? If so, which codepoints? Likewise with TMW9.60, a Chinese character. Does TMW7.68 map to U+0F39? Does TMW7.74, the ITHI secret sign, have a Unicode mapping? f68,fa0,f80,f72 comes close, but fa0 would be too large, wouldn't it? What Unicode does TMW9.61 map to? Is it for sequences like f40,f7c,f60,f72? Or is it for f60,f72,f7c?	2003-06-15 03:25:45 +00:00
dchandler	b387c512e9	Fixed two bugs.	2003-06-15 03:08:57 +00:00
dchandler	189fef9aec	Made Jskad smart enough to handle a few more EWTS characters; some it can only convert to Wylie, others are live key sequences. This will make converting the shechen documents go more smoothly.	2003-06-09 13:35:43 +00:00
dchandler	09a55110b7	Handles more TibetanMachine oddballs.	2003-06-09 02:01:13 +00:00
dchandler	b9219640e5	Handles more TibetanMachine oddballs.	2003-06-09 01:53:01 +00:00
dchandler	e97e1c8464	Handles more TibetanMachine oddballs.	2003-06-09 01:20:32 +00:00
dchandler	651a599188	Fixed usage info.	2003-06-08 23:23:12 +00:00
dchandler	70b31558fa	Tried to fix a crashing bug that happened when you converted TM->TMW and then tried to convert that TMW to Wylie. I swear it's Java's problem (see the ugly stack trace in the code and decide for yourself), and I tried replacing rather than inserting-and-then-removing, but it didn't work. I've left these things as options.	2003-06-08 23:12:52 +00:00
dchandler	212414edef	TMW_RTF_TO_THDL_WYLIE now converts TM->TMW.	2003-06-08 22:43:27 +00:00
dchandler	32831b698f	If bad (oddball) TM glyphs appear, then converting to TMW causes, by default, all oddballs to appear once in the resulting document. This'll help me find the correct glyphs for the oddballs, and it'll prevent the average user from converting a document with oddballs.	2003-06-08 22:37:38 +00:00
dchandler	d45f5ab8c8	Improved performance (I suppose).	2003-06-03 23:49:34 +00:00
dchandler	7d768c9e06	Fixed a crashing bug that happened upon converting wylie to tibetan.	2003-06-03 23:45:15 +00:00
dchandler	0f724989b5	The Wylie 'M' used to map to TMW7.91, when it should map to TMW7.90. I've fixed that. I've also added a couple of Unicode mappings to give a flavor for how multi-codepoint mappings will be represented. TM->TMW conversion takes about 1 second per thousand glyphs on my PIII-550.	2003-06-01 23:05:32 +00:00
dchandler	54ca37c824	The Wylie 'M' used to map to TMW7.91, when it should map to TMW7.90. I've fixed that. I've also added a couple of Unicode mappings to give a flavor for how multi-codepoint mappings will be represented.	2003-06-01 19:14:08 +00:00
dchandler	e2caf99085	Some code cleanup. tibwn.ini must now have, in the Unicode column, either nothing, or 0FXX(,0FXX)*. E.g., 0F04,0F05 is valid. Debugging code ensures this is the case.	2003-06-01 18:09:49 +00:00
dchandler	1f6bb07d53	Fixes bogus Unicode mappings mentioned in http://sourceforge.net/tracker/index.php?func=detail&aid=746871&group_id=61934&atid=502515.	2003-06-01 04:02:04 +00:00
dchandler	7a8264d87c	Fixed typo.	2003-06-01 03:30:49 +00:00
dchandler	0235263ddf	TM->TMW and TMW->TM conversion in RTF is now supported. I've noticed that formatting is mostly OK but sometimes gets bungled slightly. I tried everything I could think of, and now I'm passing the buck to Java's RTF support. TMW_RTF_TO_THDL_WYLIE (now misnamed) supports TMW->TM conversion (but not TM->TMW). There is an automated test case for a TMW->TM conversion. I have full confidence in this conversion. Even the smallest glitch in the core functionality (not formatting) would surprise me. Note that the JUnit test TMW_RTF_TO_THDL_WYLIETest sometimes fails due to one- or two-line diffs between the actual and expected outputs. This is because Java's RTF support is not deterministic, I'm guessing, and is not a real failure. I'm too lazy to make a more elaborate sed/diff mechanism that works on all platforms, and that would complicate the build anyway.	2003-05-31 23:21:29 +00:00
dchandler	bfacd6c998	Accurate TM->TMW and TMW->TM mappings are now available. I've verified this extensively and have full confidence that these mappings agree with Tony Duff's Tibetan! 5.1 documentation (except as described below). To get them, I had to disregard Tony Duff's tables for a few glyphs: the characters with ordinal 32 and 45 (space and hyphen in Roman ASCII, space and tsheg in Tibetan). For these glyphs, we must have mappings from TibetanMachineSkt4.32 to something, etc., and those mappings were not present. I've normalized the mapping for these glyphs, as it is arbitrary because the same two glyphs just appear fifteen times each.	2003-05-31 20:13:15 +00:00
dchandler	a4bc23a9ab	Made performance improvements, doc improvements, and code cleanup to DuffCode.	2003-05-31 17:02:06 +00:00
dchandler	08d2ea3e2d	Jeff C. H. Wu found a bug whereby typing 'cuig' just after starting Jskad fails (by producing 'cug') although typing 'kcuig' succeeds. This is now fixed, and test cases now exist to ensure that the problem doesn't reappear.	2003-05-31 12:58:36 +00:00
dchandler	bc9a8f4754	Jeff C. H. Wu found a bug whereby typing 'cuig' just after starting Jskad fails (by producing 'cug') although typing 'kcuig' succeeds. This is now fixed.	2003-05-31 12:49:44 +00:00
dchandler	6f0390c5d6	By default (controllable via options.txt), Jskad now fixes the Tahoma curly brace problem upon opening any RTF document. The TMW_RTF_TO_THDL_WYLIE test baselines changed because I fixed (a while ago) some inconsistencies between the EWTS standard and Jskad. Conversion of TibetanMachineWeb8.40, @#, to Wylie now works correctly. Unfortunately, though, typing @# doesn't produce 8.40, it still produces 8.38 and 8.39, two glyphs.	2003-05-28 00:40:59 +00:00
dchandler	a144b125ca	I've made Jskad adhere to the THDL Extended Wylie spec. Some punctuation has changed {@, #, %, and $}. Fixed some errors in tibwn.ini so that all the TM<->TMW mappings are correct.	2003-05-26 13:11:51 +00:00
dchandler	ec7fec695f	Added some automated JUnit tests for TMW_RTF_TO_THDL_WYLIE.	2003-05-18 17:17:52 +00:00
dchandler	e2a9720d9b	I've added a command-line converter, org.thdl.tib.input.TMW_RTF_TO_THDL_WYLIE. It converts RTF files consisting of TMW characters to the corresponding THDL Extended Wylie. It supports --find-some-non-tmw mode, which allows you to ensure that no unusual characters will spoil the conversion. The converter has built-in intelligence that allows it to handle Tahoma '{', '}', and '\\' characters properly. The converter works on mixed Roman/TMW also, but --find-some-non-tmw and --find-all-non-tmw modes are not as useful. Invoke org.thdl.tib.input.TMW_RTF_TO_THDL_WYLIE, which resides in Jskad's jar, with no command-line options to see usage information.	2003-05-18 14:14:47 +00:00
dchandler	17ea8fdf2a	Copying from Word XP used to crash Jskad sometimes. Now you get a dialog box telling you something about RTF support in Java.	2003-05-15 01:41:56 +00:00
dchandler	78dc46a979	Jskad keyboards are now configured via keyboards.ini, a file that has comments that explain its function. It's quite simple. This is in response to Jeff C. H. Wu's request.	2003-05-14 03:25:36 +00:00
dchandler	dcb36ec338	Clearer status message; cleanup.	2003-05-14 02:37:28 +00:00
dchandler	8958366a07	Bad RTF now causes an error message to appear in the transcription instead of causing a fatal exception. The error allows you to look up the DuffCode that caused the trouble.	2003-05-14 01:37:49 +00:00
dchandler	8275afeb41	Bad RTF files cause a polite error message to appear instead of an exception to be thrown. Jskad windows now always have "Jskad" in their window titles.	2003-05-14 01:34:39 +00:00
eg3p	3e847ed009	DELETE was not working properly in Roman entry mode. Now it works ok.	2003-04-17 19:48:22 +00:00
amontano	0bacdcc229	fixed the paste problem for the translation tool	2003-04-17 11:12:59 +00:00
dchandler	59175ccfd6	Added a few tests for the ACIP keyboard, which I've improved a bit. Noted some failures. "Fixed" the code to do what I want it to do for the (no sanskrit stacking, tibetan stacking) case [which is exercised by this keyboard only].	2003-04-14 23:55:00 +00:00
dchandler	efa8fc1f25	DuffPane now has the start of a unit test suite. Invoke it via 'ant clean check'. Right now there are tests to ensure that typing certain sequences of keys in the Extended Wylie keyboard gives the expected Extended Wylie back when "Tools/Convert Tibetan to Wylie" is invoked. The syntactically illegal d.wa now converts to Tibetan and then back to d.wa (not dwa, as it did); likewise with the illegal g.wa. wa doesn't take any prefixes, but I prefer clean end-to-end behavior. (jeskd doesn't go end-to-end, though.) Note that you cannot successfully run the DuffPane tests on a Linux box unless your DISPLAY variable is set correctly. Thus, my nightly builds will fail with an Error (as opposed to a Failure).	2003-04-14 05:22:27 +00:00
dchandler	6636d03a41	ant private-javadocs runs without warnings; cleaned up some as-yet-unused code.	2003-04-13 01:46:20 +00:00
dchandler	644c0d3801	Updated the HTML help file; removed some useless code.	2003-04-13 01:17:10 +00:00
dchandler	daacf6ee3b	I've got too many sandboxes, so I'm committing these changes, half-done, from one sandbox so as to consolidate my sandboxes.	2003-04-12 20:56:20 +00:00
dchandler	6e05b60cff	I'll need these when I turn a sequence of UnicodeGraphemeClusters into LegalTshegBars.	2003-04-12 20:19:02 +00:00
dchandler	66e34aadfd	Code cleanup -- removed cruft.	2003-04-12 16:28:56 +00:00
dchandler	cbccfc5277	Fixed bug 718207. 'byungs now converts from Tibetan to Wylie correctly.	2003-04-10 02:14:15 +00:00
amontano	bc8b5f724b	nothing	2003-04-08 13:28:38 +00:00
eg3p	995817eb98	no message	2003-04-08 12:14:03 +00:00
dchandler	7dd67bbf6a	Now turns Tibetan into pa'am, not pa'm. Works with or without vowels in the part preceding the 'am or 'ang, overcoming the inconsistency that I'd put here for a short time.	2003-04-08 04:56:40 +00:00
dchandler	eb71fb6075	"sgom pa'am " is correct, not "sgom pa'm ".	2003-04-07 23:49:07 +00:00
eg3p	df4f8b8a45	processRomanChar now sets aside formatting like TAB, ENTER, etc.	2003-04-07 19:41:48 +00:00
eg3p	275cf9d79d	Improved handling of backspace based on my understanding of various known Java bugs. Those who mess around with backspace take note of the following: The Java bug database has several related bugs concerning the treatment of backspace. Here I adopt a solution based on the fix for bug 4402080, whose evaluation reads: "The text components now key off of KEY_TYPED with a keyChar == 8 to do the deletion. The motivation for this can be found in bug 4256901." (xxxxx@xxxxx 2001-01-05)	2003-04-07 16:41:49 +00:00
amontano	e7684dedcd	nothing	2003-04-05 00:03:44 +00:00
amontano	341bea3c16	Added a line to the paste method so that if text is selected, the pasted text replaces the selected text.	2003-04-03 05:17:40 +00:00
amontano	5423bc19d4	Updated the clipboard calls to DuffPane. Ed: there are some mistakes that didn't happen before. There are certain combinations that use a header letter that when pasted from the DuffPane to the DuffPane fail. Try writing "rgyas", copying it and pasting it beside it.	2003-04-03 05:16:14 +00:00
eg3p	7a495bc720	Made the following changes: (1) renamed DuffPane's copySelection, pasteSelection, etc. to copy, paste and so forth, which override JTextComponent's methods by those names: Andres, please change the translation tool accordingly to use these new methods if that is necessary; (2) in order to allow for easier integration of Jskad with other tools such as QuillDriver, I changed DuffPane to rely on a Keymap instead of a KeyListener for its default key intercepts; this addresses the comments to bug 617156. Note that I have been working on Mac OS X and have not extensively tested my changes on a PC yet.	2003-04-02 20:37:14 +00:00
amontano	2250e03766	Updated copyright and version info.	2003-04-01 13:38:16 +00:00
amontano	a7a573020f	Renamed LinkedList to SimplifiedLinkList and moved it from org.thdl.tib.scanner to org.thdl.util. This linked list was implemented because the VM running on handhelds does not include java.util.LinkedList.	2003-04-01 13:08:38 +00:00
dchandler	d836b850e8	"sgom pa'm ", not "sgom pa'am", is now used. "pe'm " was being produced already, so the code was inconsistent. If it turns out that "pe'am " is preferred, I'll fix it later. Consistency is very appealing.	2003-03-31 01:38:27 +00:00
dchandler	33b3080068	Fixed a bunch of bugs; supports le'u'i'o, sgom pa'am, etc. Better tests. As part of that, I had to break TibetanMachineWeb into TibetanMachineWeb+THDLWylieConstants, because I don't want the class-wide initialization code from TibetanMachineWeb causing errors in LegalTshegBarTest.	2003-03-31 00:33:50 +00:00
dchandler	1987f7d80a	b-r-g, b-l-g-s, etc., when converted from Tibetan to Wylie, give correct, unambiguous Wylie.	2003-03-30 21:49:55 +00:00
amontano	8565855dd1	Now the handheld version supports both portrait and landscape.	2003-03-30 17:09:09 +00:00
dchandler	f9670233ba	Removed documentation FIXMEs from this code; did away for good with some really iffy code that I think was behind the "Tibetan->Wylie conversion fails when keyboard isn't Extended Wylie" bug.	2003-03-30 16:13:00 +00:00
dchandler	58f7371e66	Revamped the "Tools>Convert Tibetan To Wylie" feature that converts TibetanMachineWeb glyphs to THDL Wylie. Three-glyph and four-glyph sequences with implicit "a" vowels are now handled correctly, except for disambiguation w.r.t. things like b-la-g vs. bla-g and d-wa vs. dwa. pa'am, pa'ang etc. now work too. Illegal Tibetan sequences now become very ugly, but "correct" Wylie. Correct in the sense that converting it back to glyphs should get you the glyphs you started with. I also made a change to TibetanMachineWeb.java that I hope will clear up problems with this feature when keyboards other than "Extended Wylie" are selected. Took nga out of the farRightSet [postsuffixes]; only da and sa belong there, right? I tried to get the system in a state such that I could run automated tests of this stuff, but I ran into difficulties. I have some manual test cases; ask if you're interested.	2003-03-30 02:31:16 +00:00
dchandler	2b81020b0e	More and better tests; fixed some bugs in LegalTshegBar.	2003-03-28 03:49:49 +00:00
amontano	35a9869aac	1. Fixed parsing error 2. Added support for extreme uses of 'a' like le'u'i'o 3. Now correctly parses syllables that have the particles "ang" and "am" added to them. The second works only in "roman script" mode. The converter from Tibetan script to roman script does not convert these combinations correctly. ("pa'ang" is converted wrongly into "pa'ng" and "pa'am" is converted wrongly into "pa'ma").	2003-03-23 20:27:54 +00:00
dchandler	08d2a5d702	Added a test for org.thdl.tib.text.tshegbar.UnicodeCodepointToThdlWylie.	2003-03-22 04:55:17 +00:00
dchandler	f2dcb0cbc3	I said I removed this earlier; I lied. Now it's gone.	2003-03-22 03:58:13 +00:00
dchandler	16cbfb6033	Moved ad-hoc test.java test cases to UnicodeGraphemeClusterTest.java, a JUnit test which can be run via 'ant check'. Removed test.java and its build process.	2003-03-22 03:55:39 +00:00
dchandler	395eca7bb1	Moved ad-hoc test.java test cases to LegalTshegBarTest.java, a JUnit test which can be run via 'ant check'.	2003-03-22 03:46:32 +00:00
dchandler	879b477902	Made some ad-hoc tests in test.java into JUnit tests, run by 'ant check'. NORM_NFD was replaced with NORM_NFKD in three cases in testMostlyNFKD.	2003-03-22 03:24:56 +00:00
dchandler	1e326bb06d	Removing these QuillDriver leftovers. They're still in the CVS Attic, if anyone needs them.	2003-03-22 02:38:24 +00:00
eg3p	fed25e27ee	No longer necessary now that Savant & QuillDriver have been moved out of THDL Tools.	2003-03-14 00:33:24 +00:00
eg3p	c280d0fc96	Savant and QuillDriver are being removed from THDL Tools and moved to a new site: Tools for Field Linguistics.	2003-03-13 20:00:51 +00:00
eg3p	6cc0c5e99b	Savant and QuillDriver are being removed from THDL Tools and moved to a new site: Tools for Field Linguistics.	2003-03-13 19:57:12 +00:00
eg3p	a98849d3eb	QD as XML editor. More details later.	2003-03-12 12:48:18 +00:00
eg3p	4070c5ccee	Latest QD	2003-03-12 12:46:44 +00:00
dchandler	9e0dc68d12	Feature Request 697358 is done. The working directory for Jskad is now a preference. In addition, Jskad now raises an error dialog when you try to "Save As" to a bad place or open a file that doesn't exist or isn't readable.	2003-03-11 01:03:19 +00:00
dchandler	aa144dd599	Javadoc 1.4.1_01 no longer has a single warning about this package.	2003-02-03 01:36:56 +00:00
dchandler	c379db6ff5	Javadoc 1.4.1_01 no longer has a single warning about this file as we use @ to represent the at sign @.	2003-02-03 01:36:08 +00:00
dchandler	e6a10d052f	Added a "Help" menu item that pulls up jskad_doc.html, which is now put into Jskad's JAR file. Doing so required that I cut out a lot of fancy HTML code. The correct fix is to use XML to store the meat and then use XSL to generate two forms of HTML: one dumb enough for Java, one for use on the THDL tools website.	2003-02-01 06:42:07 +00:00
dchandler	cf279bb620	Added a JScrollPane that views a noneditable HTML file found inside a JAR file.	2003-02-01 06:37:32 +00:00
dchandler	bde0cc8381	Slapped on copyright boilerplate.	2003-02-01 05:52:03 +00:00
dchandler	a1f6b9e117	Each class's author is now listed as Than.	2003-02-01 05:38:48 +00:00
dchandler	d453e801ef	Windows directory separators (backslashes) have been replaced with java.io.File.separatorChar. This means tibbibl puts its temporary files under Jskad/bin in my Linux sandbox.	2003-02-01 05:30:22 +00:00
dchandler	72ee4fc7d2	Added the initial version of Tibbibl, which Nathaniel Garson of UVa e-mailed to me. Tibbibl is an editor for XML-based bibliographies of Tibetan texts. All I did was change the package from org.thdl.xml to org.thdl.tib.bibl and add boilerplate; no changes to Than's code were made. Tibbibl features a diacritic input tool which Jskad might want to swipe.	2003-02-01 05:08:02 +00:00
dchandler	190a3d9b60	achen must appear before a vowel.	2003-01-05 05:58:32 +00:00
dchandler	fcb75c55eb	Small performance improvement involving String.intern(). Plus a little bit of code cleanup.	2003-01-05 05:57:44 +00:00
dchandler	e5a63df1c1	Added a class skeleton that may not stay for long. I'm committing in order to sync with my laptop, really. This stuff will disappear and reappear in better form later, after a holiday of coding and eggless, alcohol-free nog.	2002-12-20 04:46:13 +00:00
dchandler	fdfedb4419	Added some tests for org.thdl.tib.text.tshegbar. These tests are preliminary, and for this package only. I'm committing in order to sync with my laptop, really. This stuff will disappear and reappear in better form later, after a holiday of coding and eggless, alcohol-free nog.	2002-12-20 04:34:56 +00:00
dchandler	7ea185fa01	Renamed UnicodeCharToExtendedWylie to UnicodeCodepointToThdlWylie.java. Added a new class, UnicodeGraphemeCluster, that can tell you the components of a grapheme cluster from top to bottom. It does not yet have good error checking; it is not yet finished. Next is to parse clean Unicode into GraphemeClusters. After that comes scanning dirty Unicode into best-guess GraphemeClusters, and scanning dirty Unicode to get nice error messages.	2002-12-17 13:51:18 +00:00
dchandler	8e8a23c6a6	Extended Wylie is referred to as THDL Extended Wylie or THDL Wylie because a Japanese scholar has an "Extended Wylie" also. NFKD and NFD have a new brother, NFTHDL. I wish there weren't a need, but as my yet-to-be-put-into-CVS break-unicode-into-grapheme-clusters code demonstrates, the-need-is-there. forgive-me for the hyphens, it's late.	2002-12-15 06:57:32 +00:00
dchandler	a42347b224	Now uses terminology from the Unicode standard. No more talk of characters, for example. Normalization forms NFKD and NFD are supported for the Tibetan Unicode range. I don't like either, actually. I've tested NFKD, but I've not yet committed the tests.	2002-12-15 03:35:24 +00:00
eg3p	3199ff7926	There are two classes here. One renders XML transcripts in JTextPane, and the other uses XPath to navigate the transcripts. Neither is part of the build yet. I'll document them more fully later when I've got to a point where they are worth sharing.	2002-12-12 15:17:42 +00:00
eg3p	86c2374706	New QD files that don't do anything yet.	2002-12-10 20:53:55 +00:00
dchandler	26993a5093	So that Unicode escape sequences appear correctly in javadocs.	2002-12-09 02:35:39 +00:00
dchandler	2d6c8be804	So that Unicode escape sequences appear correctly in javadocs.	2002-12-09 02:29:09 +00:00
dchandler	22c6ec5406	Javadoc now works without warnings.	2002-12-09 01:48:34 +00:00
dchandler	f4a16f8e9d	This commit is for my benefit only; these classes are not ready for prime time, and the build system is not yet aware of them. I'm adding some classes for representing legal tsheg-bars (syllables, for the most part) in Unicode. These classes were designed bottom-up (OK, OK -- they weren't designed designed, but I had to write down everything I knew about Tibetan syntax somewhere). The classes are aware of extended wylie. I doubt the Javadocs work yet, and I'm still testing (and am not committing my testing code with these as it is not yet ready). Next on my list--fix these up to reflect my new awareness of suffix particles (like le'u'i'o) and add classes to support syntactically incorrect Unicode sequences. Then add a UnicodeReader, and we've got the back end of a Tibetan Unicode shaping system (like half of MS's Uniscribe or Apple's Worldscript or FreeType Layout or Omega's OTPs). A top-down design would not have included LegalTshegBar. But now that my itch has been scratched, potential uses are lingering about. For example, it would be nice to scan some input and break it into LegalTshegBars, punctuation/marks/signs, and illegal stacks. Then we could alert the client of the illegality, its precise form, and its precise location. The real system for turning a Unicode stream into an internal representation suitable for conversion to EWTS/ACIP/XHTML/what-have-you need not be aware of Tibetan syntax. But to make the very best conversion from Unicode to, e.g., EWTS, it is necessary to know that gaskad is better represented as gskad, but that jaskad is not the same as jskad.	2002-12-09 01:02:23 +00:00
dchandler	53aa2e2309	Added jskad_doc.html (a revision of which is up at http://iris.lib.virginia.edu/tibet/tools/jskad_doc.html) to the repository. The build puts this into Jskad's JARs, but Jskad itself does not allow for viewing it. In Java, that's a ten-minute job, but I haven't done it.	2002-12-07 17:53:24 +00:00
eg3p	9eedfcd909	This is Tashi's TibetanSyllable class for sorting Wylie Tibetan. It does not have many methods for determining the root letter, suffix, and so on, but these should be easy to add. David, please use this class to the extent that it and your new work overlap.	2002-12-05 01:48:41 +00:00
eg3p	d14aa87fda	Removed egotistical self-reference from About Jskad text.	2002-12-04 16:02:16 +00:00
eg3p	1aad72f81b	Just testing cvs commit from my Mac.	2002-12-04 15:16:40 +00:00
amontano	569b2bb608	changed them to public	2002-11-29 08:08:54 +00:00
michel_jacobson	f869f054b7	new applet for QT4J	2002-11-28 16:18:19 +00:00
michel_jacobson	3d215caf53	modifications to handle url in place of file and to be used with SmartQT4JApplet	2002-11-28 16:16:27 +00:00
amontano	178ffcb800	added documentation	2002-11-28 06:54:46 +00:00
amontano	c81241e309	put in comments the association of menus with shortcuts. With the shortcut sometimes it seems to copy stuff twice. Without the shortcut seems to work anyway.	2002-11-27 23:32:57 +00:00
amontano	4acb2aa77e	fixed grammar mistake	2002-11-27 23:31:22 +00:00
amontano	c12088ce5d	fixed the importing of dictionaries using '-' as a separator, without confusing such character with reverse vowel in the tibetanized sanskrit.	2002-11-27 23:30:44 +00:00
amontano	c13adf9d14	now having copied a selection, if you paste it over selected text, the selected text is substituted with the text being pasted.	2002-11-27 23:29:31 +00:00
amontano	93eeae2118	Fixed bug that recently made it crash. Enabled the property thdl.rely.on.system.tmw.fonts before the production of TibetanMachineWeb HTML. This avoids the call to readInFontFiles() within the TibetanMachineWeb class (which raises an exception when it cannot find for whatever reason the fonts). The servlet doesn't need to load the fonts anyway!	2002-11-23 21:13:47 +00:00
amontano	5432168694	Fixed bug that recently appeared that made it crash. Enabled the property thdl.rely.on.system.tmw.fonts before the production of TibetanMachineWeb HTML. This avoids	2002-11-23 21:03:33 +00:00
amontano	b73760009c	added warning against using tibetanmachineweb, while the html script is not working.	2002-11-23 01:57:00 +00:00
amontano	dbf900b08b	minor change	2002-11-22 22:51:11 +00:00
michel_jacobson	a7bc9e97c0	No longer needed. It has been replaced by SmartJMFApplet.java	2002-11-19 21:26:01 +00:00
amontano	5d205ca9d9	minor changes to about window.	2002-11-19 18:47:43 +00:00
amontano	06fa7f020e	added timestamp to about window.	2002-11-19 18:46:47 +00:00
amontano	1fb425c6cd	corrected possible error with the '-' being used as both marker separating definition and definiendum and valid wylie character (transliterated sanskrit).	2002-11-19 18:46:14 +00:00
michel_jacobson	cc0097f2ae	no message	2002-11-19 17:51:45 +00:00
dchandler	07fe242596	Very minor cleanup to fix Javadocs and make the source code more readable; comments added.	2002-11-18 21:33:44 +00:00
dchandler	d200b03d66	Updated the build system so that you must do a cvs checkout of the 'Fonts' module inside the 'Jskad' module. I.e., you must now have the tree like so: Jskad/ source/ dist/ Fonts/ TibetanMachineWeb/ . . . This is because the THDL tools now optionally (and by default) load the TibetanMachineWeb fonts automatically. Updated the build system so that the 'web-start-releases' and 'self-contained-dist' targets JAR up optional JARs to create double-clickable, self-contained joy. Even the TMW fonts are in the JARs now. Changed the strings describing two Jskad keyboards so that "keyboard" is no longer in the description. It's in the label next to the combo box. Jskad now saves preferences on exit or when the user selects a menu item (that is there for debugging mainly) to ~/my_thdl_preferences.txt on *nix or C:\my_thdl_preferences.txt on Win32. I don't know the correct Mac location. There's a new paradigm for telling org.thdl.util.ThdlOptions that a user preference has been changed. If, for example, a combo box is manipulated so that the ACIP keyboard is selected, then you must call a certain method in ThdlOptions.	2002-11-18 16:12:25 +00:00
amontano	77b8c5e424	Added timestamp to about window (of all versions of translation tool except servlet).	2002-11-17 09:09:10 +00:00
dchandler	5ffb813019	Jskad's "About" dialog box now lists the time of compilation. Ant creates source/org/thdl/util/ThdlVersion.java when you execute the jskad-compile target.	2002-11-16 19:18:44 +00:00
michel_jacobson	872232c108	no message	2002-11-15 20:38:25 +00:00
michel_jacobson	3e71ff8351	changed to work with lacito applet - but not done	2002-11-14 21:13:08 +00:00
eg3p	c9349f6846	These files are not used.	2002-11-12 16:47:02 +00:00
eg3p	7c47e89811	This file is no longer used.	2002-11-12 16:44:20 +00:00
dchandler	ecf61bc892	A DuffPane is now a TibetanPane. A TibetanPane is much more lightweight but does line breaks correctly. I.e., I refactored DuffPane into two classes. I did this trying to track down a subtle bug in line breaking: 'gye ' breaks after 'gy' sometimes, with the dreng bo on the next line, but only when you resize the window certain ways, and only in Savant (and maybe QD and the translation tool, I don't know) but not in Jskad. I was not successful in finding the bug, but it still exists when I use TibetanPanes instead of DuffPanes in org.thdl.savant.tib.*.	2002-11-08 04:11:42 +00:00
dchandler	04da61688d	A DuffPane is now a TibetanPane. A TibetanPane is much more lightweight but does line breaks correctly. I.e., I refactored DuffPane into two classes. I did this trying to track down a subtle bug in line breaking: 'gye ' breaks after 'gy' sometimes, with the dreng bo on the next line, but only when you resize the window certain ways, and only in Savant (and maybe QD and the translation tool, I don't know) but not in Jskad. I was not successful in finding the bug, but it still exists when I use TibetanPanes instead of DuffPanes in org.thdl.savant.tib.*.	2002-11-08 04:05:06 +00:00
dchandler	86e384352b	Jskad's "Do you want to save your changes before you quit?" dialog is now optional.	2002-11-08 03:58:35 +00:00
amontano	947ac5537a	Updated on comments and made it a bit more consistent.	2002-11-03 17:42:11 +00:00
dchandler	d462f4e41c	Fixes all known bugs with the ACIP keyboard except for one: ACIP's 'WA' represents Wylie's 'wa', but ACIP's 'ZHVA' represents Wylie's 'zhwa'. The key for wasur is the same as the key for the twentieth consonant in extended Wylie, but not in ACIP.	2002-11-03 17:34:33 +00:00
dchandler	22141248e7	Terribly minor cleanup.	2002-11-03 17:05:44 +00:00
dchandler	7adfddfb43	Fixed my fix to the "Jskad freezes on impossible input" bug. Typing 'lKU' in Extended Wylie is now equivalent to 'lU'. I'm not sure if this is a change or not.	2002-11-03 17:05:05 +00:00
amontano	37b29c8d33	Added comments to all class headers. Comments to individual methods will be added as needed.	2002-11-03 08:56:11 +00:00
eg3p	b4e4decc2e	Updates, including support for internationalized strings.	2002-11-02 22:11:02 +00:00
eg3p	fab76cb82e	no message	2002-11-02 22:10:12 +00:00
eg3p	392b2b180a	These files have been updated for use with Savant. That is, org.thdl.savant.SoundPanel has been eliminated in favour of these classes, which are shared between QD and Savant. The main change is that SmartMoviePanels can now communicate with the outside world, for example to send messages to a Savant text window telling it to update highlights.	2002-11-02 20:20:30 +00:00
eg3p	5bfaccded7	This class provides static methods for dealing with THDL's internationalization issues.	2002-11-02 20:11:42 +00:00
eg3p	59d65bedc3	Change scrolling policy w/in Savant. Now highlighted text stays in the middle of the window instead of at the bottom.	2002-11-02 20:10:05 +00:00
dchandler	de6ae79959	Fixes bug 624133, "Input freezes after impossible character". Try 'shsM' in ACIP or 'ShSm' in Extended Wylie to see the new behavior. We use a trie to store valid input sequences. In the future, we could use the same trie as a replacement for the more inefficient HashSets we use to store characters, vowels, and punctuation. For example, we'd use 'validInputSequences.put("K", new Pair("consonant", "k"))' when reading in the ACIP keyboard's description of the first consonant of the Tibetan alphabet in 'TibetanKeyboard.java'. Note that the current trie implementation is only useful for 7- or 8-bit transcription systems, and works best for tries with low average depth, which describes a transcription system's trie very well. If you used arbitrary Unicode in your keyboard, you'd need a different trie implementation. Improved the optional keyboard input mode status messages.	2002-11-02 18:44:24 +00:00
dchandler	a6cc4a7ff3	Removed/commented out/tagged some unused local variables. Added a JUnit test for the new Trie that fails at present since the Trie is case-insensitive. Running JUnit tests is not something our build system knows about at present, but Eclipse 2.0 makes it very easy. Fixed a few compiler errors due to imports I'd forgotten.	2002-11-02 16:01:40 +00:00
dchandler	b8391e923d	Borrowed a trie implementation from Apache's Xalan 2.4.0.	2002-11-02 13:39:29 +00:00
dchandler	29042638e2	In the ACIP keyboard, 'KEE' and 'KOO', which are equivalent to Wylie's 'kai' and 'kau', now work. The optional status messages have been improved.	2002-11-02 05:21:12 +00:00
dchandler	aa580e0bea	Undoing my erroneous commit of buggy code.	2002-11-02 03:46:44 +00:00
dchandler	abcf8f19b3	Factored TibetanDocument into two classes, one that is a DefaultStyledDocument, and another consisting entirely of static utility methods for processing Tibetan text. Moved TibetanDocument.DuffData into its own class. I think this makes things a bit more transparent, and gets us a little closer to making clean use of Swing.	2002-11-02 03:38:59 +00:00
dchandler	5249c48807	Factored TibetanDocument into two classes, one that is a DefaultStyledDocument, and another consisting entirely of static utility methods for processing Tibetan text. Moved TibetanDocument.DuffData into its own class. I think this makes things a bit more transparent, and gets us a little closer to making clean use of Swing.	2002-11-02 03:33:09 +00:00
eg3p	d070e470ef	Updated these files to use DuffPane instead of JTextPane and so take advantage of DLC's new line wrapping code.	2002-10-31 19:06:47 +00:00
dchandler	97c530e974	GHA and KR'i now work.	2002-10-28 05:31:19 +00:00
dchandler	1ecbfe6a7c	Fixed some Javadoc comments in preparation for putting up new Javadocs on http://thdltools.sf.net/.	2002-10-28 04:49:24 +00:00
dchandler	fd1b4dd468	Now breaks the line after the last whitespace, not the first. I cleaned things up a bit, and I've made logging optional since I don't yet trust the code fully. A Wylie underscore at the end of a line is worth looking into further, at the very least.	2002-10-28 04:12:49 +00:00
dchandler	8433369d60	Now with slightly better error handling.	2002-10-28 03:17:28 +00:00
dchandler	0ad135f8f1	This may well be a fix to the "Improper line wrapping" bug. The fix is basically that we use our own special ViewFactory, with a new subclass of LabelView (the view RTFEditorKit uses for the nitty gritty) that is aware of Tibetan. There are a couple of nasty hacks still here, and Swing's documentation for doing what I did was quite poor. I searched the web for hours, read the Javadocs and the tutorials, and consulted a Swing reference book, but I still don't have tremendous confidence in this solution. If it fundamentally doesn't work, though, we have to define our own first-class Document, Element hierarchy, ViewFactory, Views, and EditorKit. So let's hope it does work fundamentally. I can't say for sure if this even works, as I have yet to run this code on a machine where Jskad works properly. I had major trouble installing the TMW fonts on Linux, and have yet to resolve it, even after verifying via xlsfonts that the fonts were installed and then changing TibetanMachineWeb.java to look for them. Because I haven't tested this yet, a lot of nasty code is tagged 'DLC' and commented out.	2002-10-28 03:08:04 +00:00
dchandler	f26dd53da3	Changed the build so that Savant and QuillDriver's builds include Smart*Player.java, which are accessed via reflection. Cleaned up the code a bit so that it would compile in so doing. Changed the 'options.txt' preferences file to reflect the new method of selecting media players.	2002-10-27 19:12:13 +00:00
amontano	e4aa52a6eb	re-arranged the display. Now the buttons are closer to the text input.	2002-10-27 18:48:48 +00:00
amontano	8391f19a8d	copy and paste features are fixed.	2002-10-27 18:48:03 +00:00
amontano	7336d27a33	fixing the copy-paste issue for the translation tool.	2002-10-26 18:15:34 +00:00
dchandler	b6b8cd73ff	Moved JskadKeyboard-related code into separate files; made many things public.	2002-10-26 17:40:51 +00:00
dchandler	3ee1fbd3fa	Removed backup copies of .java files.	2002-10-26 15:57:06 +00:00
amontano	d35048a067	fixing copy and paste. works, except if pasted from a TextArea through the windows pop-up menu.	2002-10-26 15:49:55 +00:00
eg3p	34b660b8f9	Moved all media related stuff to new package. This makes more sense, since all this stuff is accessed by both Savant and QuillDriver.	2002-10-25 20:19:56 +00:00
eg3p	7f3f0eb8e1	Various changes related to Quicktime and JMF support, as well as keyboard modularization. Not quite done yet, though, so may not compile.	2002-10-25 20:18:22 +00:00
eg3p	27dfa66b02	Ongoing work with Andres to change paste so that isRomanEnabled = false implies auto conversion of Wylie to Tibetan. Doesn't work yet.	2002-10-25 19:47:14 +00:00
eg3p	91b8fd3cd9	Edited JskadKeyboard code slightly so that it is easier to use these keyboards outside of Jskad (for example from QuillDriver).	2002-10-25 19:41:43 +00:00
eg3p	8fbb971628	Removed proposed JSpinner reflection code which I moved to SimpleSpinner instead.	2002-10-25 19:39:32 +00:00
eg3p	107b4424b4	This is a class that uses reflection to manifest as JSpinner if JRE 1.4 is installed, otherwise as a numeric JTextField.	2002-10-25 19:36:40 +00:00
amontano	a2e8acca39	almost working but not quite	2002-10-25 17:59:27 +00:00
eg3p	d45a58e1ba	Changed paste so that if (isRomanEnabled = false), it will assume the text is Wylie and convert it to Tibetan.	2002-10-25 17:34:30 +00:00
dchandler	f6bcc49119	Fixed a bug I introduced when I made Tibetan keyboards more modular.	2002-10-23 04:03:17 +00:00
dchandler	8da821d503	Uses new methods for cutting and copying from a DuffPane.	2002-10-23 02:50:48 +00:00
eg3p	2e8608d13b	Miscellaneous minor changes.	2002-10-22 20:47:39 +00:00
dchandler	4eda412cb5	The enter and tab keys were causing edits regardless of setEditable(false); this is now fixed. Minor clean-up resulting from my aborted refactoring of the keyboard event handling code.	2002-10-22 03:53:33 +00:00
dchandler	4c0026ab4f	Removed the old installKeyboard routines.	2002-10-20 08:25:10 +00:00
dchandler	dc53ded878	Adding a new Tibetan keyboard now requires merely copying and pasting 3 lines. Quick reference .rtf files (on the Info menu) are optional. Added a first try at an ACIP keyboard. At the very least, ACIP's "GHA" is busted.	2002-10-20 08:02:16 +00:00
dchandler	2a923f83f8	Added a first attempt at an ACIP keyboard following their document http://www.asianclassics.org/download/tibetancode/ticode.pdf	2002-10-20 07:59:25 +00:00
dchandler	0097be4266	Fixed bug 617156, "DuffPane ignores setEditable(false)". I fixed this the easy way, by checking the value of isEditable() before cutting, pasting, or adding typed text. I may have missed a spot, but checking at a lower level is a bit less efficient. Fixing this the hard way, the keymaps-and-overridden-default-action way, seems like it will make the code uglier, not cleaner. And it won't get us closer to fixing the killer bug, 614475, "Improper Line Wrapping".	2002-10-20 05:54:29 +00:00
dchandler	44524d3c89	Bulletproofed so that this can run in the presence of a security manager.	2002-10-19 02:27:14 +00:00
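
Commit de6ae79959 above describes storing valid keyboard input sequences in a trie so that input no longer freezes after an impossible character. The sketch below illustrates that idea only; it is not the Xalan-derived trie the commits refer to. The class name InputSequenceTrie, its methods, and the string values are hypothetical, and it is case-sensitive by assumption because transcription keyboards distinguish case.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of a trie of valid input sequences (not the project's class). */
class InputSequenceTrie {
    private static final class Node {
        final Map<Character, Node> children = new HashMap<>();
        String value; // non-null only where a complete valid sequence ends
    }

    private final Node root = new Node();

    /** Registers a complete input sequence and a label for what it produces. */
    void put(String sequence, String value) {
        Node node = root;
        for (char c : sequence.toCharArray()) {
            node = node.children.computeIfAbsent(c, k -> new Node());
        }
        node.value = value;
    }

    /** Returns the label for a complete sequence, or null if it is not one. */
    String get(String sequence) {
        Node node = find(sequence);
        return node == null ? null : node.value;
    }

    /** True if at least one valid sequence starts with the keys typed so far. */
    boolean hasPrefix(String typed) {
        return find(typed) != null;
    }

    private Node find(String s) {
        Node node = root;
        for (char c : s.toCharArray()) {
            node = node.children.get(c);
            if (node == null) {
                return null; // no valid sequence continues this way
            }
        }
        return node;
    }

    public static void main(String[] args) {
        InputSequenceTrie trie = new InputSequenceTrie();
        trie.put("K", "consonant:k");
        trie.put("KH", "consonant:kh");
        System.out.println(trie.hasPrefix("K"));  // true: keep collecting keys
        System.out.println(trie.hasPrefix("KQ")); // false: impossible input, stop waiting
        System.out.println(trie.get("KH"));       // consonant:kh
    }
}
```

An input handler built on this would check hasPrefix after every keystroke and, when it returns false, flush or discard the pending keys instead of waiting for a completion that can never arrive.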

586 commits
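
Commits a42347b224 and 8e8a23c6a6 above mention support for the NFD and NFKD normalization forms in the Tibetan Unicode range. As a rough illustration of how the two forms differ, the sketch below uses java.text.Normalizer from later Java releases (Java 6 and up), which is an assumption of this example and not the code those commits added. The class and method names are hypothetical, and the project-specific NFTHDL form is not shown.

```java
import java.text.Normalizer;

/** Hypothetical demo of NFD vs. NFKD on two Tibetan code points. */
class TibetanNormalizationDemo {
    static String nfd(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFD);
    }

    static String nfkd(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFKD);
    }

    /** Renders each UTF-16 code unit as U+XXXX, separated by spaces. */
    static String hex(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", (int) c));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String gha = "\u0F43";        // TIBETAN LETTER GHA, canonical decomposition
        String tshegBstar = "\u0F0C"; // non-breaking tsheg, compatibility decomposition

        System.out.println(hex(nfd(gha)));         // U+0F42 U+0FB7
        System.out.println(hex(nfkd(gha)));        // U+0F42 U+0FB7
        System.out.println(hex(nfd(tshegBstar)));  // U+0F0C (unchanged by NFD)
        System.out.println(hex(nfkd(tshegBstar))); // U+0F0B (no-break property lost)
    }
}
```

The second pair of lines shows one reason a converter might be unhappy with either standard form: NFKD discards the distinction between the ordinary tsheg and its non-breaking variant, while NFD keeps it.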