Commit graph

238 commits

Author SHA1 Message Date
dchandler
7ea185fa01 Renamed UnicodeCharToExtendedWylie to
UnicodeCodepointToThdlWylie.java.

Added a new class, UnicodeGraphemeCluster, that can tell you
the components of a grapheme cluster from top to bottom.  It does not
yet have good error checking; it is not yet finished.

Next is to parse clean Unicode into GraphemeClusters.  After that comes
scanning dirty Unicode into best-guess GraphemeClusters, and scanning
dirty Unicode to get nice error messages.
2002-12-17 13:51:18 +00:00
dchandler
8e8a23c6a6 Extended Wylie is referred to as THDL Extended Wylie or THDL Wylie
because a Japanese scholar has an "Extended Wylie" also.

NFKD and NFD have a new brother, NFTHDL.  I wish there weren't a need,
but as my yet-to-be-put-into-CVS break-unicode-into-grapheme-clusters code
demonstrates, the-need-is-there.  forgive-me for the hyphens, it's late.
2002-12-15 06:57:32 +00:00
dchandler
a42347b224 Now uses terminology from the Unicode standard. No more talk of
characters, for example.

Normalization forms NFKD and NFD are supported for the Tibetan Unicode
range.  I don't like either, actually.  I've tested NFKD, but I've not yet
committed the tests.
2002-12-15 03:35:24 +00:00
eg3p
3199ff7926 There are two classes here. One renders XML transcripts in JTextPane, and the other uses XPath to navigate the transcripts. Neither is part of the build yet. I'll document them more fully later when I've got to a point where they are worth sharing. 2002-12-12 15:17:42 +00:00
eg3p
86c2374706 New QD files that don't do anything yet. 2002-12-10 20:53:55 +00:00
dchandler
26993a5093 So that Unicode escape sequences appear correctly in javadocs. 2002-12-09 02:35:39 +00:00
dchandler
2d6c8be804 So that Unicode escape sequences appear correctly in javadocs. 2002-12-09 02:29:09 +00:00
dchandler
35425f3753 So that javadoc build works again, this property is now just a string, not a
location.

Andres, this may mean that you need to do something different to build
javadocs with links to local JDOM/Sun javadocs.
2002-12-09 02:18:03 +00:00
dchandler
983a9e2045 ThdlVersion.java is now generated before making Javadocs. 2002-12-09 01:58:17 +00:00
dchandler
22c6ec5406 Javadoc now works without warnings. 2002-12-09 01:48:34 +00:00
dchandler
f4a16f8e9d This commit is for my benefit only; these classes are not ready for prime time,
and the build system is not yet aware of them.

I'm adding some classes for representing legal tsheg-bars (syllables, for the
most part) in Unicode.  These classes were designed bottom-up (OK, OK --
they weren't designed designed, but I had to write down everything I knew
about Tibetan syntax somewhere).  The classes are aware of extended
wylie.  I doubt the Javadocs work yet, and I'm still testing (and am not
committing my testing code with these as it is not yet ready).

Next on my list--fix these up to reflect my new awareness of suffix particles
(like le'u'i'o) add classes to support syntactically incorrect Unicode
sequences.  Then add a UnicodeReader, and we've got the back end of
a Tibetan Unicode shaping system (like half of MS's Uniscribe or Apple's
Worldscript or FreeType Layout or Omega's OTPs).

A top-down design would not have included LegalTshegBar.  But now that
my itch has been scratched, potential uses are lingering about.  For example,
it would be nice to scan some input and break it into LegalTshegBars,
punctuation/marks/signs, and illegal stacks.  Then we could alert the client
of the illegality, its precise form, and its precise location.

The real system for turning a Unicode stream into an internal representation
suitable for conversion to EWTS/ACIP/XHTML/what-have-you need not be
aware of Tibetan syntax.  But to make the very best conversion from
Unicode to, e.g., EWTS, it is necessary to konw that gaskad is better
represented as gskad, but that jaskad is not the same as jskad.
2002-12-09 01:02:23 +00:00
dchandler
03688b6137 Dza, fa, and va go together, not dza, fa, and va. 2002-12-08 21:08:58 +00:00
dchandler
53aa2e2309 Added jskad_doc.html (a revision of which is up at
http://iris.lib.virginia.edu/tibet/tools/jskad_doc.html) to the repository.  The
build puts this into Jskad's JARs, but Jskad itself does not allow for viewing
it.  In Java, that's a ten-minute job, but I haven't done it.
2002-12-07 17:53:24 +00:00
eg3p
9eedfcd909 This is Tashi's TibetanSyllable class for sorting Wylie Tibetan.
It does not have many methods for determining the root letter, suffix,
and so on, but these should be easy to add. David, please use this
class to the extent that it and your new work overlap.
2002-12-05 01:48:41 +00:00
eg3p
d14aa87fda Removed egotistical self-reference from About Jskad text. 2002-12-04 16:02:16 +00:00
eg3p
1aad72f81b Just testing cvs commit from my Mac. 2002-12-04 15:16:40 +00:00
eg3p
15404dc3b1 String resources for Spanish version of software. 2002-12-02 20:39:43 +00:00
dchandler
a3a1923d42 I haven't tested this, but it should make it easier to experiment with the links
from our Javadocs to Sun's Javadocs.
2002-12-01 18:53:09 +00:00
amontano
569b2bb608 changed them to public 2002-11-29 08:08:54 +00:00
michel_jacobson
f869f054b7 new applet for QT4J 2002-11-28 16:18:19 +00:00
michel_jacobson
3d215caf53 modifications to handle url in place of file and to be used with SmartQT4JApplet 2002-11-28 16:16:27 +00:00
amontano
178ffcb800 added documentation 2002-11-28 06:54:46 +00:00
dchandler
efa69fe225 At Andres' request, I added ConsoleScannerFilter to the build of both the
Standalone version and the Handheld version.
2002-11-28 02:51:29 +00:00
amontano
c81241e309 put in comments the association of menus with shortcuts. With the shortcut sometimes it seems to copy stuff twice. Without the shortcut seems to work anyway. 2002-11-27 23:32:57 +00:00
amontano
4acb2aa77e fixed grammar mistake 2002-11-27 23:31:22 +00:00
amontano
c12088ce5d fixed the importing of dictionaries using '-' as a separator, without confusing such character with reverse vowel in the tibetanized sanskrit. 2002-11-27 23:30:44 +00:00
amontano
c13adf9d14 now having copied a selection, if you paste it over selected text, the selected text is substituted with the text being pasted. 2002-11-27 23:29:31 +00:00
amontano
93eeae2118 Fixed bug that recently made it crash.
Enabled the property thdl.rely.on.system.tmw.fonts before the production of TibetanMachineWeb HTML. This avoids the call to readInFontFiles() within the TibetanMachineWeb class (which raises an exception when it cannot find for whatever reason the fonts). The servlet doesn't need to load the fonts anyway!
2002-11-23 21:13:47 +00:00
amontano
5432168694 Fixed bug that recently appeared that made it crash.
Enabled the property thdl.rely.on.system.tmw.fonts before
the production of TibetanMachineWeb HTML. This avoids
2002-11-23 21:03:33 +00:00
amontano
b73760009c added warning against using tibetanmachineweb, while the html script is not working. 2002-11-23 01:57:00 +00:00
amontano
dbf900b08b minor change 2002-11-22 22:51:11 +00:00
michel_jacobson
a7bc9e97c0 no more need. It have been replaced by SmartJMFApplet.java 2002-11-19 21:26:01 +00:00
amontano
5d205ca9d9 minor changes to about window. 2002-11-19 18:47:43 +00:00
amontano
06fa7f020e added timestamp to about window. 2002-11-19 18:46:47 +00:00
amontano
1fb425c6cd corrected possible error with the '-' being used as both marker separating definition and definiendum and valid wylie character (transliterated sanskrit). 2002-11-19 18:46:14 +00:00
michel_jacobson
cc0097f2ae no message 2002-11-19 17:51:45 +00:00
thangarson
384ad282b5 This is the SGML version of the TIBBIBL DTD used to catalog Tibetan
texts. All the catalogs of the Literary Collection are at this time using this
DTD and a proprietary program, Dynaweb. There is also an XML version
of it, xtibbibl.dtd, which has some slight modifications.
2002-11-19 16:51:23 +00:00
dchandler
cfb111eeb1 Added a nightly-build target that creates a zip file
dist/nightlyBuild.zip.  As the suggestive name implies, coupling this
with a shell script like '(cvs -f -q update -dP && ant nightly-build
&& scp dist/nightlyBuild.zip && ssh 'cd wherever/ && unzip foo') ||
email_error_message' would yield a nightly build process for us.
2002-11-18 22:16:07 +00:00
dchandler
07fe242596 Very minor cleanup to fix Javadocs and make the source code more
readable; comments added.
2002-11-18 21:33:44 +00:00
dchandler
0e70c7fb97 No longer including JMF or Savant in the JWS or self-contained JARs. 2002-11-18 17:16:34 +00:00
dchandler
718e2f1283 Ignores products of the revised build. 2002-11-18 16:16:17 +00:00
dchandler
d200b03d66 Updated the build system so that you must do a cvs checkout of the
'Fonts' module inside the 'Jskad' module.  I.e., you must now have the
tree like so:

Jskad/
   source/
   dist/
   Fonts/
       TibetanMachineWeb/
   .
   .
   .

This is because the THDL tools now optionally (and by default) load
the TibetanMachineWeb fonts automatically.

Updated the build system so that the 'web-start-releases' and
'self-contained-dist' targets JAR up optional JARs to create
double-clickable, self-contained joy.  Even the TMW fonts are in the
JARs now.

Changed the strings describing two Jskad keyboards so that "keyboard"
is no longer in the description.  It's in the label next to the combo
box.

Jskad now saves preferences on exit or when the user selects a menu
item (that is there for debugging mainly) to ~/my_thdl_preferences.txt
on *nix or C:\my_thdl_preferences.txt on Win32.  I don't know the
correct Mac location.

There's a new paradigm for telling org.thdl.util.ThdlOptions that a
user preference has been changed.  If, for example, a combo box is
manipulated so that the ACIP keyboard is selected, then you must call
a certain method in ThdlOptions.
2002-11-18 16:12:25 +00:00
amontano
77b8c5e424 Added timestamp to about window (of all versions of translation tool except servlet). 2002-11-17 09:09:10 +00:00
dchandler
5ffb813019 Jskad's "About" dialog box now lists the time of compilation. Ant
creates source/org/thdl/util/ThdlVersion.java when you execute the
jskad-compile target.
2002-11-16 19:18:44 +00:00
dchandler
9f232fc5e2 The JARs built no longer have "-JWS" and "-vanilla" suffixes.
Instead, there are dist/lib-jws/ and dist/lib-vanilla/ directories.
2002-11-16 18:04:47 +00:00
dchandler
5d212a0671 Now you can truly compile without either JMF or QT4J in your
extensions/ directory tree.  If and only if they are present,
integration with them is compiled in.

Removed a FIXME that I'd fixed a long time ago.
2002-11-16 16:35:25 +00:00
michel_jacobson
872232c108 no message 2002-11-15 20:38:25 +00:00
michel_jacobson
3e71ff8351 changed to work with lacito applet - but not done 2002-11-14 21:13:08 +00:00
eg3p
c9349f6846 These files are not used. 2002-11-12 16:47:02 +00:00
eg3p
7c47e89811 This file is no longer used. 2002-11-12 16:44:20 +00:00