Commit graph

74 commits

Author SHA1 Message Date
a1tsal
cb6ffac9c8 Incremental progress on Unicode. 2004-12-18 05:44:54 +00:00
a1tsal
39752ad984 Whoops, +' is a foreign subjoined consonant. 2004-12-18 05:42:51 +00:00
a1tsal
4d4b1f9d81 GPL and note explaining GPL vs THDL OL added, and to .INP too.
CVS is whining about the GPL containing "escape characters" (presumably
control-L), but when I try to add it as binary, it complains that it's actually text.
I hope this works.
2004-12-17 02:04:15 +00:00
a1tsal
f0a91f9a79 TMW -> Unicode works modulo Word Unicode bugs. 2004-12-16 13:06:38 +00:00
a1tsal
b3eead0df6 Checkpoint: Typing EWTS to get Tibetan script mostly works.
Infinitely many bugs in Word Unicode handling remain to work around.
2004-12-16 08:02:06 +00:00
a1tsal
11dcda0532 Checkpoint: TMW-specific code removed from TScript parser. 2004-12-15 02:55:56 +00:00
a1tsal
a1dc82bd62 Check in incremental progress on Unicode.
Checkpointing here because I'm about to start overhauling the TMW
tsheg bar parser to accept Unicode Tibetan as well.
2004-12-15 02:23:53 +00:00
a1tsal
1c2fcb0910 Add reference to TMUni. 2004-12-13 01:30:05 +00:00
a1tsal
a4a07e5a6f Add the alpha Tibetan Machine Uni font and the "Unicode definition.txt" files.
Change tibwn.ini to overwrite -- previously it was set not to, which seems
completely wrong.
The Uni font is now set to overwrite; I hope this is the right thing.  For some
reason the TMW fonts had been set no-overwrite.  I'm hoping that is just
because I assumed they wouldn't change rather than because it doesn't
work to overwrite fonts or something.
2004-12-13 01:19:06 +00:00
a1tsal
23ddf2aac7 This file has the Wylie <-> Unicode correspondence, which actually can't
be recovered from tibwn.ini, oddly enough.
2004-12-12 22:31:11 +00:00
a1tsal
c93f7467a7 Unicode->Wylie now basically working. Typing EWTS to get Tibetan is
mostly not working but starting to show signs of life.
2004-12-12 12:03:39 +00:00
a1tsal
d211b73642 Incremental progress on Unicode. Wylie->Unicode is basically working
(but I haven't run extensive tests, and probably will break in some cases).
Unicode->Wylie is not working.  I'm checkpointing here because I'm
about to attack the Tibetan->Wylie code and I'm not sure my approach
is correct -- may need to back out.  (Current theory is to recycle the TMW->
Wylie code with small amounts of internal conditionality on TMW vs.
Unicode, but I'm not positive this can be made to work.)
2004-12-12 07:36:23 +00:00
a1tsal
4311f44042 Beginning of version 2.1 (with Unicode support).
It would be cleaner to do a fork of "WylieWord development copy.dot",
which is what this is, but the wincvs doc doesn't cover the fork command
at all, and I don't want to risk confusion.
2004-12-11 10:22:27 +00:00
a1tsal
92bf0142cd Minor changes. 2004-11-03 02:36:17 +00:00
a1tsal
e40cf55a30 Document new "F11 conversion uses matrices" option. 2004-11-03 02:35:50 +00:00
a1tsal
f6e5c6b045 Version 2.0p1:
Per request from Cathy Cantwell, add option to do F11 interlineal
  conversion without introducing tables, producing text only.  An
  unfortunate side-effect of this is that you get a one-time-only error
  message complaining about the options file (whose format has changed
  to store the value of this option).

  Changed the interlineal line breaking code to put only shad-like
  things at the end of lines, and not zla tse type things.  (But: it no
  longer breaks lines on zla tse type things at all; ideally it should
  break on those but put them at the beginning of lines.  On the other
  hand, you'd never expect to see a zla tse type thing other than after
  a shad type thing (or at the beginning of a text).)
2004-11-03 02:35:01 +00:00
a1tsal
79749d8909 Changed a bunch of Integer declarations to Long so that F11 conversion
wouldn't blow out on chunks of text longer than 32k characters (a
problem reported by Cathy Cantwell).  What a pathetic programming
language VBA is.
2004-11-02 07:15:14 +00:00
a1tsal
6f8ba71ad0 Fix problems reported by Julie Regan <juregan@hotmail.com> with
printing.  The page format and second-page-header were somehow
inherited from Matthew Kapstein's reader. This included line
numbering, non-standard margins, and the header.  Changed back to Word
defaults.
2004-11-01 09:31:14 +00:00
a1tsal
14206b789d Add note about test case for \ syntax. 2004-11-01 07:59:29 +00:00
a1tsal
0d78ad1444 Total rewrite of ConvertWylie() to fix the problem, reported by Cathy
Cantwell (SF RFE #1034292), that it stripped footnotes (and other
non-text items).

Per request of Cathy Cantwell (SF bug 1034688), make
HandleUserApostrophe respect Options.AutoFormatAsYouTypeReplaceQuotes.
I.e., typing a ' will no longer get curly single quotes if you have
turned off the "smart quotes" autocorrect option (which lots of
Tibetanists may have done, so a-chung comes out decently).

Fix PartOfTshegBar to not accept [, which it did, due to an
inexplicable Obiwan error.  Clarify code to make Obiwan error less
likely.

Fix PartOfTshegBar to not accept H.  This is bogus; H is a Sanskrit
letter, but tibwn.ini claims it is punctuation, and our code depends
on that.
2004-11-01 07:57:02 +00:00
a1tsal
47dbbdc16a Add test cases for bug reported by "Cathy Cantwell"
<catherine.cantwell@oriental-institute.oxford.ac.uk>
whereby TMW [m][kh]['][i] converted to EWTS makha'i.
2004-10-31 03:59:11 +00:00
a1tsal
4596ec268b Fix bug reported by "Cathy Cantwell"
<catherine.cantwell@oriental-institute.oxford.ac.uk>
whereby TMW [m][kh]['][i] converted to EWTS makha'i.
2004-10-31 03:58:37 +00:00
a1tsal
779db45e92 Version 2.0: no code changes from 2.0b3; just incremented the About version number. 2004-07-07 20:04:32 +00:00
a1tsal
a7cb875b1b Documented fix for Spanish keyboard DeadKey problem. 2004-07-07 19:55:26 +00:00
a1tsal
d3ed56655b Documented fix for Spanish keyboard DeadKey problem. 2004-05-02 13:12:32 +00:00
a1tsal
2a2c9112b3 Updated for latest theory of R, Y, W. 2004-04-21 19:30:10 +00:00
a1tsal
82ab632660 Fixed tiny typos. 2004-04-21 19:26:24 +00:00
a1tsal
c80fa8879d Extensive, ugly code changes to make R+ work.
Sean Something <knowone@zensearch.com> pointed out that a special ha
  glyph (1,102) is supposed to be used solely in the case of hU.
  Added code to do so.
2004-04-21 19:17:02 +00:00
a1tsal
0b6a4941f2 Remove from the rules cases that were previously exceptions but are now
handled by the nasalization rule.
2004-04-18 19:21:35 +00:00
a1tsal
2d1d85577d Update for the latest hairification of the nasalization rule. 2004-04-18 19:20:49 +00:00
a1tsal
fd6d03d040 Change implementation of nasalization rule for latest elaboration
thereof (n, not m, when root letter is b or ph but not pronounced as
  such).
2004-04-18 19:19:52 +00:00
a1tsal
a697cf2010 Added support for recently-added nasalization rule in THDL phonetics. 2004-04-12 08:28:35 +00:00
a1tsal
68ee1d083e Fixed phonetics in interlineal example. 2004-04-12 08:27:35 +00:00
a1tsal
e25e00e003 Added test cases for ' as root letter.
Added test case "lnga".
2004-04-12 08:26:45 +00:00
a1tsal
97c1b4db57 Added support for recently-added nasalization rule in THDL phonetics.
Fixed bug: Sean Something <knowone@zensearch.com> and Christopher
  Walker <cewalker@uchicago.edu> correctly reported that lnga did not
  stack when typed.  (It gave "lang".  But it worked correctly in
  F9 conversion.)  This was a side-effect of the "sara" fixes in
  2.0b2.  Fixed by invoking the consonant stack buffering code when we
  get "ln".

  Changed both parsers to allow ' (a-chung) as root, in order to handle
  "'od" which Robert Walker <robertjwalker@onetel.net.uk> correctly
  reported failed in Wylie->Tibetan conversion.  (It also failed in the
  opposite direction.)

  Fix typo in comment at beginning of Wylie parser.
2004-04-12 08:23:34 +00:00
a1tsal
fbda68dcec Added test cases for nasalization rule. 2004-04-12 08:08:34 +00:00
a1tsal
e91d0dbf05 Minor additions, fixes, and clarifications. 2004-02-27 10:29:57 +00:00
a1tsal
57e14ce081 Added test cases for vowel.vowel. 2004-02-27 10:29:21 +00:00
a1tsal
b679f02228 Updated spelling of dorje (needs acute accent according to
revised phonetics scheme).

Document option to disable ] and \

Explained how to suppress ].

revised documentation of EWTS conformance.

change ^ to ~.

it said ctrl+alt+> was underdot; corrected to overdot
2004-02-27 10:25:59 +00:00
a1tsal
6e1c32279a Explained how to suppress ].
Change ^ to ~.

It said ctrl+alt+> was underdot; corrected to overdot.

Tiny clarification of F10.
2004-02-27 10:23:02 +00:00
a1tsal
9ec91bb19b dorje gets an acute accent 2004-02-27 10:21:32 +00:00
a1tsal
de4c935a2e Made installer not overwrite the fonts if they already exist;
this will prevent problems (as reported by DG) that occur when the
fonts are in use at installation time.
2004-02-27 10:19:06 +00:00
a1tsal
11fc084f45 Version 2.0b2 (release candidate).
Fix the "sara" problem some more.  Fairly extensive code
modification and clean-up to get all cases correct.

Added support for vowel.vowel syntax (which was added to the EWTS
syntax after the first beta release).

Added new, more rigorous automated Wylie<->TMW testing procedure
(working on a by-character basis, simulating user typing rather than
just F9, F10 conversion).

Fixed bug in HandleUserUCS that made typing Sh fail after "a".

Fixed ugliness induced by recently-added code for doing interlineal
from Startup, whereby the top row had whitespace inherited from
Normal, which might be doublespaced or something; fix forces 3
pixels above and below, same as the other rows.

Made the phonetics code throw away ., ~, `, and ^.

Made the phonetics code turn M into "ng".
2004-02-27 10:08:46 +00:00
a1tsal
616009fb00 Added various test cases.
Removed "mangs" as a rejection test case -- see SF bug 793596.
2004-02-24 15:07:56 +00:00
a1tsal
6e44a14ff8 Add TMW tsheg bar parser. Call it in FixTMW2WylieConversion. This
fixes various problems in TMW->Wylie Sanskrit conversion, viz.
  800153 (Typing sara gets sra), 811282 (F10 of ragyada is rgyad),
  and 811284 (F10 of gayarma gets ga.yarma).

  Removed the last of the (dangerous, bogus) globally-declared
  temporary variables.

  Fixed bug in R+ (the code I had written did the wrong thing in a
  complicated way; actually all that was needed was a single call to
  InsertLatinOrTibetanLetter, which already did the right thing in a
  simple way).
2004-02-24 14:59:30 +00:00
a1tsal
7660a4208a Added test cases for ~M and ~M`.
Added test cases for R+, +W, +Y, +R (and updated existing test cases
  that used the old forms).
2004-02-22 09:27:12 +00:00
a1tsal
c40a8a036c Fixed somewhat-bogus test case of second-suffix d removal 2004-02-22 09:25:37 +00:00
a1tsal
b705c780a7 Fixed SF bug 800193, which was due to Word's failure to inherit
styles from templates loaded from the Startup folder.  (They *are*
  inherited from templates loaded normally.)  Worked around the bug by
  having the code check whether the interlineal styles are defined,
  and if not, defining them programmatically.

  Added option to disable the EWTS interpretation of \ and ] when in
  non-Tibetan text.

  Added control-T as a way to get into Tibetan mode (for people who
  turn off ] and \).  This appears in the Tibetan menu as well.

  Implemented R+, +W, +Y, +R.

  Un-implemented tsa 'phru: it breaks the parser, and fixing it
  seems more trouble than it is worth.

  Remove a whole slew of styles that somehow virally infected the
  WylieWord template from David Germano's personal template.
2004-02-22 09:23:26 +00:00
a1tsal
77b3d4396b Change ^M to ~M and ^X to ~X to conform to spec change.
Introduce ~M` (for 0F82).
Make ^ be tsa phrul (per spec change).

Changed processing of <?Input:Numbers?> for recent change by DLC
whereby it contains lots of stuff other than numbers (ugh).
2004-02-20 15:16:16 +00:00
a1tsal
2e9ea92a3a Fixed phonetics code to strip post-suffix d (bug 800167 in
SourceForge).

  Implemented (somewhat kludgily) option for phonetics scheme to
  replace e with é iff it is the last letter of the last tsheg bar.
  This is required by the new THDL phonetics spec.

  New algorithm, per new THDL phonetics spec, for ba->wa processing.
  The heuristic is that it applies only to the last tsheg bar in
  multi-tsheg-bar words.  (Previously, ba always generated "?ba/wa?",
  which is maybe more correct but less attractive.)  This heuristic
  fails on, e.g., "tsheg bar".  Oh well.

  Rationalized format of phonetics file: > is used as separator in exceptions
  as well as rules.  (Previously, : was used in exceptions only.)
2004-02-20 09:37:23 +00:00