Commit graph

78 commits

Author SHA1 Message Date
a1tsal
f205f7253d Incremental progress. Passes all conversion tests except non-TScript,
rejection, and the peculiar case kaR+Wabi.
2004-12-26 22:45:59 +00:00
a1tsal
d7f5b22a2d Progress: now passing most of TestConversion; exceptions are non-Tibetan
embeds, a few complex Skt stacks, and also it fails to reject some things
that should be unparseable.
2004-12-22 12:43:29 +00:00
a1tsal
de529e7187 Ye olde incremental progress.
Everything mostly works.
But everything is riddled with bugs.
2004-12-19 08:06:41 +00:00
a1tsal
129e7ed917 Incremental progress on Unicode. 2004-12-18 11:09:47 +00:00
a1tsal
cb6ffac9c8 Incremental progress on Unicode. 2004-12-18 05:44:54 +00:00
a1tsal
39752ad984 Whoops, +' is a foreign subjoined consonant. 2004-12-18 05:42:51 +00:00
a1tsal
4d4b1f9d81 GPL and note explaining GPL vs THDL OL added, and to .INP too.
CVS is whining about the GPL containing "escape characters" (presumably
control-L), but when I try to add it as binary, it complains that it's actually text.
I hope this works.
2004-12-17 02:04:15 +00:00
a1tsal
f0a91f9a79 TMW -> Unicode works modulo Word Unicode bugs. 2004-12-16 13:06:38 +00:00
a1tsal
b3eead0df6 Checkpoint: Typing EWTS to get Tibetan script mostly works.
Infinitely many bugs in Word Unicode handling remain to work around.
2004-12-16 08:02:06 +00:00
a1tsal
11dcda0532 Checkpoint: TMW-specific code removed from TScript parser. 2004-12-15 02:55:56 +00:00
a1tsal
a1dc82bd62 Check in incremental progress on Unicode.
Checkpointing here because I'm about to start overhauling the TMW
tsheg bar parser to accept Unicode Tibetan as well.
2004-12-15 02:23:53 +00:00
a1tsal
1c2fcb0910 Add reference to TMUni. 2004-12-13 01:30:05 +00:00
a1tsal
a4a07e5a6f Add the alpha Tibetan Machine Uni font and the "Unicode definition.txt" files.
Change tibwn.ini to overwrite -- previously it was set not to, which seems
completely wrong.
The Uni font is now set to overwrite; I hope this is the right thing.  For some
reason the TMW fonts had been set no-overwrite.  I'm hoping that is just
because I assumed they wouldn't change rather than because it doesn't
work to overwrite fonts or something.
2004-12-13 01:19:06 +00:00
a1tsal
23ddf2aac7 This file has the Wylie <-> Unicode correspondence, which actually can't
be recovered from tibwn.ini, oddly enough.
2004-12-12 22:31:11 +00:00
a1tsal
c93f7467a7 Unicode->Wylie now basically working. Typing EWTS to get Tibetan is
mostly not working but starting to show signs of life.
2004-12-12 12:03:39 +00:00
a1tsal
d211b73642 Incremental progress on Unicode. Wylie->Unicode is basically working
(but I haven't run extensive tests, and probably will break in some cases).
Unicode->Wylie is not working.  I'm checkpointing here because I'm
about to attack the Tibetan->Wylie code and I'm not sure my approach
is correct -- may need to back out.  (Current theory is to recycle the TMW->
Wylie code with small amounts of internal conditionality on TMW vs.
Unicode, but I'm not positive this can be made to work.)
2004-12-12 07:36:23 +00:00
a1tsal
4311f44042 Beginning of version 2.1 (with Unicode support).
It would cleaner to do a fork of "WylieWord development copy.dot",
which is what this, but the wincvs doc doesn't cover the fork command
at all, and I don't want to risk confusion.
2004-12-11 10:22:27 +00:00
a1tsal
92bf0142cd Minor changes. 2004-11-03 02:36:17 +00:00
a1tsal
e40cf55a30 Document new "F11 conversion uses matrices" option. 2004-11-03 02:35:50 +00:00
a1tsal
f6e5c6b045 Version 2.0p1:
Per request from Cathy Cantwell, add option to do F11 interlineal
  conversion without introducing tables, producing text only.  An
  unfortunate side-effect of this is that you get a one-time-only error
  message complaining about the options file (whose format has changed
  to store the value of this option).

  Changed the interlineal line breaking code to put only shad-like
  things at the end of lines, and not zla tse type things.  (But: it no
  longer breaks lines on zla tse type things at all; ideally it should
  break on those but put them at the beginning of lines.  On the other
  hand, you'd never expect to see a zla tse type thing other than after
  a shad type thing (or at the begnning of a text).)
2004-11-03 02:35:01 +00:00
a1tsal
79749d8909 Changed a bunch of Integer declarations to Long so that F11 conversion
wouldn't blow out on chunks of text longer than 32k characters (a
problem reported by Cathy Cantwell).  What a pathetic programming
language VBA is.
2004-11-02 07:15:14 +00:00
a1tsal
6f8ba71ad0 Fix problems reported by Julie Regan <juregan@hotmail.com> with
printing.  The page format and second-page-header were somehow
inherited from Matthew Kapstein's reader. This included line
numbering, non-standard margins, and the header.  Changed back to Word
defaults.
2004-11-01 09:31:14 +00:00
a1tsal
14206b789d Add note about test case for \ syntax. 2004-11-01 07:59:29 +00:00
a1tsal
0d78ad1444 Total rewrite of ConvertWylie() to fix the problem, reported by Cathy
Cantwell (SF RFE #1034292), that it stripped footnotes (and other
non-text items).

Per request of Cathy Cantwell (SF bug 1034688), make
HandleUserApostrophe respect Options.AutoFormatAsYouTypeReplaceQuotes.
I.e., typing a ' will no longer get curly single quotes if you have
turned off the "smart quotes" autocorrect option (which lots of
Tibetanists may have done, so a-chung comes out decently).

Fix PartOfTshegBar to not accept [, which it did, due to an
inexplicable Obiwan error.  Clarify code to make Obiwan error less
likely.

Fix PartOfTshegBar to not accept H.  This is bogus; H is a Sanskrit
letter, but tibwn.ini claims it is punctuation, and our code depends
on that.
2004-11-01 07:57:02 +00:00
a1tsal
47dbbdc16a Add test cases for bug reported by "Cathy Cantwell"
<catherine.cantwell@oriental-institute.oxford.ac.uk>
whereby TMW [m][kh]['][i] converted to EWTS makha'i.
2004-10-31 03:59:11 +00:00
a1tsal
4596ec268b Fix bug reported by "Cathy Cantwell"
<catherine.cantwell@oriental-institute.oxford.ac.uk>
whereby TMW [m][kh]['][i] converted to EWTS makha'i.
2004-10-31 03:58:37 +00:00
a1tsal
779db45e92 Version 2.0: no code changes from 2.0b3; just incremented the About version number. 2004-07-07 20:04:32 +00:00
a1tsal
a7cb875b1b Documented fix for Spanish keyboard DeadKey problem. 2004-07-07 19:55:26 +00:00
a1tsal
d3ed56655b Documented fix for Spanish keyboard DeadKey problem. 2004-05-02 13:12:32 +00:00
a1tsal
2a2c9112b3 Updated for latest theory of R, Y, W. 2004-04-21 19:30:10 +00:00
a1tsal
82ab632660 Fixed tiny typos. 2004-04-21 19:26:24 +00:00
a1tsal
c80fa8879d Extensive, ugly code changes to make R+ work.
Sean Something <knowone@zensearch.com> pointed out that a special ha
  glyph (1,102) is supposed to be used solely in the case of hU.
  Added code to do so.
2004-04-21 19:17:02 +00:00
a1tsal
0b6a4941f2 Remove from the rules cases that were previously exceptions but are now
handled by the nasalization rule.
2004-04-18 19:21:35 +00:00
a1tsal
2d1d85577d Update for the lastest hairifcation of the nasalization rule. 2004-04-18 19:20:49 +00:00
a1tsal
fd6d03d040 Change implementation of nasalization rule for lastest elaboration
thereof (n, not m, when root letter is b or ph but not pronounced as
  such).
2004-04-18 19:19:52 +00:00
a1tsal
a697cf2010 Added support for recently-added nasalization rule in THDL phonetics. 2004-04-12 08:28:35 +00:00
a1tsal
68ee1d083e Fixed phonetics in interlineal example. 2004-04-12 08:27:35 +00:00
a1tsal
e25e00e003 Added test cases for ' as root letter.
Added test case "lnga".
2004-04-12 08:26:45 +00:00
a1tsal
97c1b4db57 Added support for recently-added nasalization rule in THDL phonetics.
Fixed bug: Sean Something <knowone@zensearch.com> and Christopher
  Walker <cewalker@uchicago.edu> correctly reported that lnga did not
  not stack when typed.  (It gave "lang".  But it worked correctly in
  F9 conversion.)  This was a side-effect of the "sara" fixes in
  2.0b2.  Fixed by invoking the consonant stack buffering code when we
  get "ln".

  Changed both parsers to allow ' (a-chung) as root, in order to handle
  "'od" which Robert Walker <robertjwalker@onetel.net.uk> correctly
  reported failed in Wylie->Tibetan conversion.  (It also failed in the
  opposite direction.)

  Fix typo in comment at beginning of Wylie parser.
2004-04-12 08:23:34 +00:00
a1tsal
fbda68dcec Added test cases for nasalization rule. 2004-04-12 08:08:34 +00:00
a1tsal
e91d0dbf05 Minor additions, fixes, and clarifications. 2004-02-27 10:29:57 +00:00
a1tsal
57e14ce081 Added test cases for vowel.vowel. 2004-02-27 10:29:21 +00:00
a1tsal
b679f02228 Updated spelling of dorje (needs acute accent according to
revised phonetics scheme).

Document option to disable ] and \

Explained how to suppress ].

revised documentation of EWTS conformance.

change ^ to ~.

it said ctrl+alt+> was underdot; corrected to overdot
2004-02-27 10:25:59 +00:00
a1tsal
6e1c32279a Explained how to suppress ].
Change ^ to ~.

It said ctrl+alt+> was underdot; corrected to overdot.

Tiny clarification of F10.
2004-02-27 10:23:02 +00:00
a1tsal
9ec91bb19b dorje gets an acute accent 2004-02-27 10:21:32 +00:00
a1tsal
de4c935a2e Made installer not overwrite the fonts if they already exist;
this will prevent problems (as reported by DG) that occur when the
fonts are in use at installation time.
2004-02-27 10:19:06 +00:00
a1tsal
11fc084f45 Version 2.0b2 (release candidate).
Fix the "sara" problem some more.  Fairly extensive code
modification and clean-up to get all cases correct.

Added support for vowel.vowel syntax (which was added to the EWTS
syntax after the first beta release).

Added new, more rigorous automated Wylie<->TMW testing procedure
(working on a by-character basis, simulating user typing rather than
just F9, F10 conversion).

Fixed bug in HandleUserUCS that made typing Sh fail after "a".

Fixed ugliness induced by recently-added code for doing interlineal
from Startup, whereby the top row had whitespace inherited from
Normal, which might be doublespaced or something; fix forces 3
pixels above and below, same as the other rows.

Made the phonetics code throw away ., ~, `, and ^.

Made the phonetics code turn M into "ng".
2004-02-27 10:08:46 +00:00
a1tsal
616009fb00 Added various test cases.
Removed "mangs" as a rejection test case -- see SF bug 793596.
2004-02-24 15:07:56 +00:00
a1tsal
6e44a14ff8 Add TMW tsheg bar parser. Call it in FixTMW2WylieConversion. This
fixes various problems in TMW->Wylie Sanskrit conversion, viz.
  800153 (Typing sara gets sra), 811282 (F10 of ragyada is rgyad),
  and 811284 (F10 of gayarma gets ga.yarma).

  Removed the last of the (dangerous, bogus) globally-declared
  temporary variables.

  Fixed bug in R+ (the code I had written did the wrong thing in a
  complicated way; actually all that was needed was a single call to
  InsertLatinOrTibetanLetter, which already did the right thing in a
  simple way).
2004-02-24 14:59:30 +00:00
a1tsal
7660a4208a Added test cases for ~M and ~M`.
Added test cases for R+, +W, +Y, +R (and updated existing test cases
  that used the old forms).
2004-02-22 09:27:12 +00:00