Initial check-in of 2.0b1 non-binary files.
This commit is contained in:
parent
fad893dbf6
commit
f25b0d4f40
8 changed files with 1191 additions and 0 deletions
115
Aro phonetics scheme.txt
Normal file
115
Aro phonetics scheme.txt
Normal file
|
@ -0,0 +1,115 @@
|
||||||
|
; Aro Gar phonetic rules for WylieWord.
|
||||||
|
|
||||||
|
; These rules attempt to capture the phonetic transcription scheme
|
||||||
|
; used by Ngak'chang Rinpoche. The results have not, however, yet
|
||||||
|
; been reviewed by Rinpoche, and probably differ from his intention
|
||||||
|
; in some ways.
|
||||||
|
|
||||||
|
; File format: there are two sections, for rules and exceptions.
|
||||||
|
;
|
||||||
|
; There is one rule per line. Typically a rule has two parts,
|
||||||
|
; separated by a space. The first part is a sequence of letters
|
||||||
|
; that, if it appears in a Wylie transliteration, is to be replaced
|
||||||
|
; by the second part in the phonetic transcription. The second
|
||||||
|
; part can be missing, in which case the first part is simply
|
||||||
|
; deleted from transcriptions when found.
|
||||||
|
;
|
||||||
|
; A semicolon precedes a comment. Blank lines are OK.
|
||||||
|
;
|
||||||
|
; The rules are applied in the order they appear in this file.
|
||||||
|
; Each rule is applied as many times as possible, but we never
|
||||||
|
; go back to a previous rule. (This simple rewrite-rule grammar is
|
||||||
|
; not sufficient to implement all phonetic schemes, at least not
|
||||||
|
; compactly. For example, it would be difficult to capture the
|
||||||
|
; effects of preinitial consonants on tone (as in the scheme used
|
||||||
|
; in Joe Wilson's book, for instance).) Also note that not even the
|
||||||
|
; whole of the present scheme is implemented using these rules. In
|
||||||
|
; particular, the deletion of prefix and superscript consonants,
|
||||||
|
; and of wa-zur, are done in program code, not using the rules here.
|
||||||
|
|
||||||
|
; Miscellaneous prefix transformations
|
||||||
|
g. ; delete this (representing g prefix, used before root y only)
|
||||||
|
dby y ; must come before db->w, for dbyang
|
||||||
|
dbr r ; must come before db->w, for dbral
|
||||||
|
db w ; must come before by->j
|
||||||
|
|
||||||
|
; c and ch are both transcribed ch. To get this we need a kludge
|
||||||
|
; (involving x), because the rule c -> ch would apply recursively.
|
||||||
|
ch c
|
||||||
|
c x
|
||||||
|
x ch
|
||||||
|
|
||||||
|
; Bad behavior from Y
|
||||||
|
py ch
|
||||||
|
phy ch
|
||||||
|
by j
|
||||||
|
my ny
|
||||||
|
|
||||||
|
; Retroflexes
|
||||||
|
kr tr
|
||||||
|
khr thr
|
||||||
|
gr dr
|
||||||
|
pr tr
|
||||||
|
phr thr
|
||||||
|
br dr
|
||||||
|
|
||||||
|
; Other bad behavior from R
|
||||||
|
mr m
|
||||||
|
sr s
|
||||||
|
|
||||||
|
; Uniquely random case
|
||||||
|
zl d
|
||||||
|
|
||||||
|
; Vowel transformations
|
||||||
|
;
|
||||||
|
; Note: this must be done before suffix-stripping.
|
||||||
|
; Before actually doing the transformations, we "hide" the n in ng, so that ng doesn't
|
||||||
|
; induce umlaut. This is gross; if we had a real grammar engine it wouldn't
|
||||||
|
; be necessary.
|
||||||
|
ng x
|
||||||
|
; Transformations of e: short before g, m, n; long otherwise
|
||||||
|
eg èg
|
||||||
|
em èm
|
||||||
|
en èn
|
||||||
|
e é
|
||||||
|
; Umlaut of a, o, u followed by d, n, l, s
|
||||||
|
ad èd
|
||||||
|
an èn
|
||||||
|
al èl
|
||||||
|
as è
|
||||||
|
od öd
|
||||||
|
on ön
|
||||||
|
ol öl
|
||||||
|
os ö
|
||||||
|
ud üd
|
||||||
|
un ün
|
||||||
|
ul ül
|
||||||
|
us ü
|
||||||
|
; restore ng
|
||||||
|
x ng
|
||||||
|
|
||||||
|
<?Exceptions?>
|
||||||
|
|
||||||
|
; There is one exception per line. Each exception consists of
|
||||||
|
; the transliteration (which may be several syllables separated
|
||||||
|
; by spaces), followed by a space, a colon, a space, and the
|
||||||
|
; pronunciation (which may also contain spaces). A semicolon
|
||||||
|
; precedes a comment. Blank lines are OK.
|
||||||
|
|
||||||
|
ba : wa ; mind you, ba (pronounced ba) means cow. But that's much rarer than wa.
|
||||||
|
bo : wo
|
||||||
|
ba'i : wa'i
|
||||||
|
bo'i : wo'i
|
||||||
|
bar : ?bar/war? ; bar = "middle"; could be either, so supply both and let user sort it out
|
||||||
|
bor : ?bor/wor? ; bor = "cast away"; could be either, so supply both and let user sort it out
|
||||||
|
rdo rje : dorjé
|
||||||
|
mkha' 'gro : khandro
|
||||||
|
sku mnye : kumnyé
|
||||||
|
sprul sku : tulku
|
||||||
|
mtsho rgyal : tsogyèl
|
||||||
|
rta mgrin: tamdrin
|
||||||
|
dga' ldan : ganden
|
||||||
|
dge 'dun : gendün
|
||||||
|
a mdo : amdo
|
||||||
|
srid pa : sipa
|
||||||
|
pad ma : pèma
|
76
Aro phonetics test cases.txt
Normal file
76
Aro phonetics test cases.txt
Normal file
|
@ -0,0 +1,76 @@
|
||||||
|
;
|
||||||
|
; These examples come from the draft (8/21/03) THDL Phonetics document
|
||||||
|
;
|
||||||
|
lha sa > lhasa
|
||||||
|
sa skya pa > sakyapa
|
||||||
|
; blo bzang > lobzang -- THDL exception not included in Aro exception dictionary
|
||||||
|
rnying ma pa > nyingmapa
|
||||||
|
rdo rje > dorjé
|
||||||
|
dge lugs pa > gélugpa
|
||||||
|
gzhis ka rtse > zhikatsé
|
||||||
|
mar me > marmé
|
||||||
|
bcu > chu
|
||||||
|
lce > ché
|
||||||
|
rin chen bzang po > rinchènzangpo
|
||||||
|
nag chu > nagchu
|
||||||
|
bka' rgyud pa > kagyüdpa
|
||||||
|
bsod nams> södnam
|
||||||
|
thub bstan > thubtèn
|
||||||
|
'jam dpal dbyangs > jampèlyang
|
||||||
|
dge legs > gélèg
|
||||||
|
kha btags > khatag
|
||||||
|
bka' shag > kashag
|
||||||
|
; sbra nag zhol > banakzhöl -- THDL exception not included in Aro exception dictionary
|
||||||
|
thabs > thab
|
||||||
|
lha sa ba > lhasawa
|
||||||
|
jo bo > jowo
|
||||||
|
dpa' bo > pawo
|
||||||
|
spyan ras gzig > chènrèzig
|
||||||
|
sbyin bdag > jindag
|
||||||
|
smyong > nyong
|
||||||
|
sgrol ma > drölma
|
||||||
|
rten 'brel > tèndrél
|
||||||
|
'bras spungs > drèpung
|
||||||
|
'phrin las > thrinlè
|
||||||
|
dbang > wang
|
||||||
|
dbral > rèl
|
||||||
|
dbyar kha > yarkha
|
||||||
|
zla ba > dawa
|
||||||
|
klad pa > lèdpa
|
||||||
|
glog > log
|
||||||
|
le'u > lé'u
|
||||||
|
pa'ang > pa'ang
|
||||||
|
ba'i > wa'i
|
||||||
|
rta mgrin > tamdrin
|
||||||
|
|
||||||
|
; Other tests, to exercise particular rules in the grammar that aren't covered in the rules above
|
||||||
|
g.yon > yön
|
||||||
|
phyag > chag
|
||||||
|
bkra shis > trashi
|
||||||
|
khros ma > thröma
|
||||||
|
sprul > trül
|
||||||
|
mri tam ga > mitamga
|
||||||
|
srid pa > sipa
|
||||||
|
pad ma > pèma
|
||||||
|
pan chen > pènchèn
|
||||||
|
ral pa can > rèlpachèn
|
||||||
|
thun > thün
|
||||||
|
dus gsum > düsum
|
||||||
|
sbed > béd
|
||||||
|
ces > ché
|
||||||
|
pa'i > pa'i
|
||||||
|
che'i > ché'i
|
||||||
|
gri'i > dri'i
|
||||||
|
po'i > po'i
|
||||||
|
le'u'i > lé'u'i
|
||||||
|
rdzogs > dzog
|
||||||
|
thug pa > thugpa
|
||||||
|
'debs > déb
|
||||||
|
sib sib > sibsib
|
||||||
|
lobs pa > lobpa
|
||||||
|
grub > drub
|
||||||
|
kla col > lachöl
|
||||||
|
|
||||||
|
; Aro-specific exceptions
|
||||||
|
mtsho rgyal > tsogyèl
|
||||||
|
|
45
Build procedure.txt
Normal file
45
Build procedure.txt
Normal file
|
@ -0,0 +1,45 @@
|
||||||
|
WylieWord Build Procedure
|
||||||
|
|
||||||
|
This explains how to make a new version of WylieWord.
|
||||||
|
|
||||||
|
The program source file is called "WylieWord developement copy.dot".
|
||||||
|
This is what you edit with the VBA editor.
|
||||||
|
|
||||||
|
When you have finished modifying it:
|
||||||
|
|
||||||
|
Do a search for, and remove, all debugging MsgBox calls.
|
||||||
|
|
||||||
|
I use "!!!" in comments to note known bugs that should be dealt with.
|
||||||
|
Search for "!!!" and make sure anything left in that state is
|
||||||
|
acceptable in the release.
|
||||||
|
|
||||||
|
Re-test everything that has changed since the last release, and
|
||||||
|
anything that might be affected by those changes.
|
||||||
|
|
||||||
|
Increment the version number in AboutWylieWord.
|
||||||
|
|
||||||
|
Do PrepareDistribution in immediate window.
|
||||||
|
This runs automated tests. Verify that the results are OK.
|
||||||
|
It also unbinds all the WylieWord keys.
|
||||||
|
This is critical to having WylieWord work correctly on
|
||||||
|
non-English keyboards.
|
||||||
|
After making the distribution, you will want to restore
|
||||||
|
the keybindings in the develoment copy by calling
|
||||||
|
WylieWordBindKeys.
|
||||||
|
|
||||||
|
Save "WylieWord developement copy.dot"
|
||||||
|
|
||||||
|
Copy it to the build folder.
|
||||||
|
Rename it Setup.doc.
|
||||||
|
|
||||||
|
The WylieWord installer is created using the program "SEAU",
|
||||||
|
available for $49 on the web. The .INP file is its configuration.
|
||||||
|
|
||||||
|
In SEAU:
|
||||||
|
|
||||||
|
Update the Files folder to be the WylieWord build folder.
|
||||||
|
Update the Archive output file location to be in that folder also.
|
||||||
|
Save the SEAU configuration (.INP file) if you have changed it.
|
||||||
|
Run the SEAU builder.
|
||||||
|
|
||||||
|
Test the resulting "WylieWord Setup.exe" file.
|
740
Conversion test cases.txt
Normal file
740
Conversion test cases.txt
Normal file
|
@ -0,0 +1,740 @@
|
||||||
|
// Test cases for Wylie -> [other representation] -> Wylie round-trip conversion.
|
||||||
|
//
|
||||||
|
// Each line in the file is a comment, blank, or one test case.
|
||||||
|
//
|
||||||
|
// A comment line starts with //.
|
||||||
|
//
|
||||||
|
// Each test case is understood as EWTS; it should convert back into itself.
|
||||||
|
|
||||||
|
// The following test cases are from DLC's DuffPaneTest.
|
||||||
|
tsha
|
||||||
|
tsa
|
||||||
|
dza
|
||||||
|
sha
|
||||||
|
nga
|
||||||
|
nag
|
||||||
|
nga /
|
||||||
|
bkra shis bde legs/
|
||||||
|
sgom pa'am
|
||||||
|
sgom pe'am
|
||||||
|
le'u'i'o
|
||||||
|
la'u'i'o/la'am/pa'ang/pe'ang
|
||||||
|
bras
|
||||||
|
dwa
|
||||||
|
gwa
|
||||||
|
gyug
|
||||||
|
g.yag
|
||||||
|
gyag
|
||||||
|
g.yas
|
||||||
|
gyas
|
||||||
|
'gas
|
||||||
|
gangs
|
||||||
|
gnags
|
||||||
|
'byung
|
||||||
|
'byungs
|
||||||
|
blags
|
||||||
|
mnags
|
||||||
|
gdams
|
||||||
|
'gams
|
||||||
|
|
||||||
|
// Random things known to be hard
|
||||||
|
|
||||||
|
// Difficult to parse deterministically
|
||||||
|
brgyud
|
||||||
|
|
||||||
|
// Featuritis
|
||||||
|
brtswand
|
||||||
|
|
||||||
|
// two-letter suffix (not to be treated as n suffix + g second-suffix)
|
||||||
|
dbang
|
||||||
|
// plus second sufix
|
||||||
|
dbangs
|
||||||
|
|
||||||
|
// A common word that is not grammatical, according to some
|
||||||
|
// sources, and that is also hard because br- looks like
|
||||||
|
// b should be the root
|
||||||
|
brlabs
|
||||||
|
|
||||||
|
// "'is" is the form of kyis used after bare vowel or a chung
|
||||||
|
re'u'is
|
||||||
|
he'is
|
||||||
|
|
||||||
|
// Bare vowels
|
||||||
|
a
|
||||||
|
i
|
||||||
|
'o
|
||||||
|
'a
|
||||||
|
// Bare polyvowels
|
||||||
|
a'o
|
||||||
|
i'u'i
|
||||||
|
'o'i
|
||||||
|
|
||||||
|
// things that at one time or another tickled bugs in WW code
|
||||||
|
gyurd
|
||||||
|
ga*ra
|
||||||
|
ha_sa
|
||||||
|
gangs
|
||||||
|
mngas
|
||||||
|
'gram
|
||||||
|
|
||||||
|
// The exceptions to the three-letter root rule
|
||||||
|
rags lags nags bags bangs gangs rangs langs nangs sangs babs rabs rams nams
|
||||||
|
|
||||||
|
// This is my old root-letter-capitalization and pronunciation test suite.
|
||||||
|
// It's should also be good for tickling conversion bugs.
|
||||||
|
grwa
|
||||||
|
bsgrubs
|
||||||
|
bzlog
|
||||||
|
brlabs
|
||||||
|
sprul
|
||||||
|
shabs
|
||||||
|
shes
|
||||||
|
ldan
|
||||||
|
mgon
|
||||||
|
rgyud
|
||||||
|
ten
|
||||||
|
rigs
|
||||||
|
chos
|
||||||
|
dbang
|
||||||
|
dbying
|
||||||
|
dbyang
|
||||||
|
spyang
|
||||||
|
phyug
|
||||||
|
byang
|
||||||
|
sbyin
|
||||||
|
myur
|
||||||
|
po'i
|
||||||
|
mri
|
||||||
|
srung
|
||||||
|
mkha'
|
||||||
|
'gro
|
||||||
|
rta'i
|
||||||
|
khwa
|
||||||
|
rdzogs
|
||||||
|
rdo
|
||||||
|
rje
|
||||||
|
zla
|
||||||
|
bar
|
||||||
|
ngan
|
||||||
|
rnga
|
||||||
|
g.yo
|
||||||
|
gyo
|
||||||
|
dpa'
|
||||||
|
'dag
|
||||||
|
klu
|
||||||
|
khyung
|
||||||
|
|
||||||
|
// These are from DLC's PackageTest
|
||||||
|
brtan
|
||||||
|
blta
|
||||||
|
blag
|
||||||
|
brag
|
||||||
|
bsabs
|
||||||
|
zungs
|
||||||
|
brtib
|
||||||
|
spyoms
|
||||||
|
sku'i
|
||||||
|
bskyangs
|
||||||
|
ag
|
||||||
|
bda'
|
||||||
|
dbang
|
||||||
|
dga'
|
||||||
|
dgra
|
||||||
|
dmar
|
||||||
|
gda'
|
||||||
|
|
||||||
|
// Sanskrit
|
||||||
|
|
||||||
|
// Tests from DLC
|
||||||
|
ai
|
||||||
|
heM hiM h-iM heM haiM hoM hauM hUM
|
||||||
|
pad+me
|
||||||
|
D+h+D+ha
|
||||||
|
gaD+h+D+ha
|
||||||
|
autapa
|
||||||
|
parItshatsawa
|
||||||
|
ke:
|
||||||
|
ka:
|
||||||
|
n+yadA
|
||||||
|
ak+Sha
|
||||||
|
Asa
|
||||||
|
AT+Ta
|
||||||
|
b+da
|
||||||
|
d+ba
|
||||||
|
d+ga
|
||||||
|
d+g+ra
|
||||||
|
d+gara
|
||||||
|
|
||||||
|
|
||||||
|
// Other Sanskrit
|
||||||
|
rak+Shasa
|
||||||
|
DAkinI
|
||||||
|
haM
|
||||||
|
pad+ma
|
||||||
|
samuts+tsaya
|
||||||
|
gang+gA
|
||||||
|
sid+d+hi
|
||||||
|
shrI
|
||||||
|
sat+t+wa
|
||||||
|
sarma
|
||||||
|
|
||||||
|
// bug ticklers
|
||||||
|
ragyad
|
||||||
|
'garma
|
||||||
|
'garama
|
||||||
|
gayaradasa
|
||||||
|
|
||||||
|
// Taken from THDL Sechen text collection
|
||||||
|
dashadigAMsarbatimirabi ni saranan+nAmashrIguh+yagar+b+hatatwaMsh+ts+yayastanataras+yaTa'ikAbiharatisma
|
||||||
|
shrI tan+t+radzamAyadzalasamudAyArtas+yadwArAt+nish+tsitasub+haShitaMguh+yapatimukhAgamanAmabiharatisma
|
||||||
|
guh+ya pa ti sa shad+h+ya laM kA ra nA ma shrA iguh+ya gar+b+ha ta twa pa in ish+tsa ya tan+t+ra s+ya pr-i t+t-i bi ha rti sma
|
||||||
|
|
||||||
|
// Bare vowels:
|
||||||
|
A I U -i -I
|
||||||
|
// leading weird vowels
|
||||||
|
aiga
|
||||||
|
-Iba
|
||||||
|
|
||||||
|
// M,H on bare vowel
|
||||||
|
aiM
|
||||||
|
eM
|
||||||
|
U^M
|
||||||
|
au^M
|
||||||
|
aH
|
||||||
|
aiH
|
||||||
|
|
||||||
|
// M and ^M in the middle of a word
|
||||||
|
gaMb+ha
|
||||||
|
hU^MhU^MhU^M
|
||||||
|
aMu^MraH
|
||||||
|
|
||||||
|
raMH
|
||||||
|
|
||||||
|
// Semivowel as subscript.
|
||||||
|
hr-I kl-iba
|
||||||
|
|
||||||
|
// Difficult because ny+d is not legal
|
||||||
|
ny+dza
|
||||||
|
|
||||||
|
raH
|
||||||
|
hai
|
||||||
|
hau
|
||||||
|
|
||||||
|
// One instance of every stack in tibwn.ini, with sequential vowels attached
|
||||||
|
k+Sha
|
||||||
|
k+ke
|
||||||
|
k+khi
|
||||||
|
k+ngo
|
||||||
|
k+tsu
|
||||||
|
k+tA
|
||||||
|
k+t+yI
|
||||||
|
k+t+rU
|
||||||
|
k+t+r+yai
|
||||||
|
k+t+wau
|
||||||
|
k+th-i
|
||||||
|
k+th+y-I
|
||||||
|
k+Na
|
||||||
|
k+ne
|
||||||
|
k+n+yi
|
||||||
|
k+pho
|
||||||
|
k+mu
|
||||||
|
k+m+yA
|
||||||
|
k+r+yI
|
||||||
|
k+w+yU
|
||||||
|
k+shai
|
||||||
|
k+sau
|
||||||
|
k+s+n-i
|
||||||
|
k+s+m-I
|
||||||
|
k+s+ya
|
||||||
|
k+s+we
|
||||||
|
kh+khi
|
||||||
|
kh+no
|
||||||
|
kh+lu
|
||||||
|
g+gA
|
||||||
|
g+g+hI
|
||||||
|
g+nyU
|
||||||
|
g+dai
|
||||||
|
g+d+hau
|
||||||
|
g+d+h+y-i
|
||||||
|
g+d+h+w-I
|
||||||
|
g+na
|
||||||
|
g+n+ye
|
||||||
|
g+pi
|
||||||
|
g+b+ho
|
||||||
|
g+b+h+yu
|
||||||
|
g+mA
|
||||||
|
g+m+yI
|
||||||
|
g+r+yU
|
||||||
|
g+hai
|
||||||
|
g+h+g+hau
|
||||||
|
g+h+ny-i
|
||||||
|
g+h+n-I
|
||||||
|
g+h+n+ya
|
||||||
|
g+h+me
|
||||||
|
g+h+li
|
||||||
|
g+h+yo
|
||||||
|
g+h+ru
|
||||||
|
g+h+wA
|
||||||
|
ng+kI
|
||||||
|
ng+k+tU
|
||||||
|
ng+k+t+yai
|
||||||
|
ng+k+yau
|
||||||
|
ng+kh-i
|
||||||
|
ng+kh+y-I
|
||||||
|
ng+ga
|
||||||
|
ng+g+re
|
||||||
|
ng+g+yi
|
||||||
|
ng+g+ho
|
||||||
|
ng+g+h+yu
|
||||||
|
ng+g+h+rA
|
||||||
|
ng+ngI
|
||||||
|
ng+tU
|
||||||
|
ng+nai
|
||||||
|
ng+mau
|
||||||
|
ng+y-i
|
||||||
|
ng+l-I
|
||||||
|
ng+sha
|
||||||
|
ng+he
|
||||||
|
ng+k+Shi
|
||||||
|
ng+k+Sh+wo
|
||||||
|
ng+k+Sh+yu
|
||||||
|
ts+tsA
|
||||||
|
ts+tshI
|
||||||
|
ts+tsh+wU
|
||||||
|
ts+tsh+rai
|
||||||
|
ts+nyau
|
||||||
|
ts+n+y-i
|
||||||
|
ts+m-I
|
||||||
|
ts+ya
|
||||||
|
ts+re
|
||||||
|
ts+li
|
||||||
|
ts+h+yo
|
||||||
|
tsh+thu
|
||||||
|
tsh+tshA
|
||||||
|
tsh+yI
|
||||||
|
tsh+rU
|
||||||
|
tsh+lai
|
||||||
|
dz+dzau
|
||||||
|
dz+dz+ny-i
|
||||||
|
dz+dz+w-I
|
||||||
|
dz+dz+ha
|
||||||
|
dz+h+dz+he
|
||||||
|
dz+nyi
|
||||||
|
dz+ny+yo
|
||||||
|
dz+nu
|
||||||
|
dz+n+wA
|
||||||
|
dz+mI
|
||||||
|
dz+yU
|
||||||
|
dz+rai
|
||||||
|
dz+wau
|
||||||
|
dz+h-i
|
||||||
|
dz+h+y-I
|
||||||
|
dz+h+ra
|
||||||
|
dz+h+le
|
||||||
|
dz+h+wi
|
||||||
|
ny+tso
|
||||||
|
ny+ts+mu
|
||||||
|
ny+ts+yA
|
||||||
|
ny+tshI
|
||||||
|
ny+dzU
|
||||||
|
ny+dz+yai
|
||||||
|
ny+dz+hau
|
||||||
|
ny+ny-i
|
||||||
|
ny+p-I
|
||||||
|
ny+pha
|
||||||
|
ny+ye
|
||||||
|
ny+ri
|
||||||
|
ny+lo
|
||||||
|
ny+shu
|
||||||
|
T+kA
|
||||||
|
T+TI
|
||||||
|
T+T+hU
|
||||||
|
T+nai
|
||||||
|
T+pau
|
||||||
|
T+m-i
|
||||||
|
T+y-I
|
||||||
|
T+wa
|
||||||
|
T+se
|
||||||
|
Th+yi
|
||||||
|
Th+ro
|
||||||
|
D+gu
|
||||||
|
D+g+yA
|
||||||
|
D+g+hI
|
||||||
|
D+g+h+rU
|
||||||
|
D+Dai
|
||||||
|
D+D+hau
|
||||||
|
D+D+h+y-i
|
||||||
|
D+n-I
|
||||||
|
D+ma
|
||||||
|
D+ye
|
||||||
|
D+ri
|
||||||
|
D+wo
|
||||||
|
D+hu
|
||||||
|
D+h+D+hA
|
||||||
|
D+h+mI
|
||||||
|
D+h+yU
|
||||||
|
D+h+rai
|
||||||
|
D+h+wau
|
||||||
|
N+T-i
|
||||||
|
N+Th-I
|
||||||
|
N+Da
|
||||||
|
N+D+ye
|
||||||
|
N+D+ri
|
||||||
|
N+D+r+yo
|
||||||
|
N+D+hu
|
||||||
|
N+NA
|
||||||
|
N+d+rI
|
||||||
|
N+mU
|
||||||
|
N+yai
|
||||||
|
N+wau
|
||||||
|
t+k-i
|
||||||
|
t+k+r-I
|
||||||
|
t+k+wa
|
||||||
|
t+k+se
|
||||||
|
t+gi
|
||||||
|
t+nyo
|
||||||
|
t+Thu
|
||||||
|
t+tA
|
||||||
|
t+t+yI
|
||||||
|
t+t+rU
|
||||||
|
t+t+wai
|
||||||
|
t+thau
|
||||||
|
t+th+y-i
|
||||||
|
t+n-I
|
||||||
|
t+n+ya
|
||||||
|
t+pe
|
||||||
|
t+p+ri
|
||||||
|
t+pho
|
||||||
|
t+mu
|
||||||
|
t+m+yA
|
||||||
|
t+yI
|
||||||
|
// t+rnU -- !!! I think that this is a bogus tibwn.ini entry, but I'm not sure
|
||||||
|
t+sai
|
||||||
|
t+s+thau
|
||||||
|
t+s+n-i
|
||||||
|
t+s+n+y-I
|
||||||
|
t+s+ma
|
||||||
|
t+s+m+ye
|
||||||
|
t+s+yi
|
||||||
|
t+s+ro
|
||||||
|
t+s+wu
|
||||||
|
t+r+yA
|
||||||
|
t+w+yI
|
||||||
|
t+k+ShU
|
||||||
|
th+yai
|
||||||
|
th+wau
|
||||||
|
d+g-i
|
||||||
|
d+g+y-I
|
||||||
|
d+g+ra
|
||||||
|
d+g+he
|
||||||
|
d+g+h+ri
|
||||||
|
d+dzo
|
||||||
|
d+du
|
||||||
|
d+d+yA
|
||||||
|
d+d+rI
|
||||||
|
d+d+wU
|
||||||
|
d+d+hai
|
||||||
|
d+d+h+nau
|
||||||
|
d+d+h+y-i
|
||||||
|
d+d+h+r-I
|
||||||
|
d+d+h+wa
|
||||||
|
d+ne
|
||||||
|
d+bi
|
||||||
|
d+b+ro
|
||||||
|
d+b+hu
|
||||||
|
d+b+h+yA
|
||||||
|
d+b+h+rI
|
||||||
|
d+mU
|
||||||
|
d+yai
|
||||||
|
d+r+yau
|
||||||
|
d+w+y-i
|
||||||
|
d+h-I
|
||||||
|
d+h+na
|
||||||
|
d+h+n+ye
|
||||||
|
d+h+mi
|
||||||
|
d+h+yo
|
||||||
|
d+h+ru
|
||||||
|
d+h+r+yA
|
||||||
|
d+h+wI
|
||||||
|
n+kU
|
||||||
|
n+k+tai
|
||||||
|
n+g+hau
|
||||||
|
n+ng-i
|
||||||
|
n+dz-I
|
||||||
|
n+dz+ya
|
||||||
|
n+De
|
||||||
|
n+ti
|
||||||
|
n+t+yo
|
||||||
|
n+t+ru
|
||||||
|
n+t+r+yA
|
||||||
|
n+t+wI
|
||||||
|
n+t+sU
|
||||||
|
n+thai
|
||||||
|
n+dau
|
||||||
|
n+d+d-i
|
||||||
|
n+d+d+r-I
|
||||||
|
n+d+ya
|
||||||
|
n+d+re
|
||||||
|
n+d+hi
|
||||||
|
n+d+h+ro
|
||||||
|
n+d+h+yu
|
||||||
|
n+nA
|
||||||
|
n+n+yI
|
||||||
|
n+pU
|
||||||
|
n+p+rai
|
||||||
|
n+phau
|
||||||
|
n+m-i
|
||||||
|
n+b+h+y-I
|
||||||
|
n+tsa
|
||||||
|
n+ye
|
||||||
|
n+ri
|
||||||
|
n+wo
|
||||||
|
n+w+yu
|
||||||
|
n+sA
|
||||||
|
n+s+yI
|
||||||
|
n+hU
|
||||||
|
n+h+rai
|
||||||
|
p+tau
|
||||||
|
p+t+y-i
|
||||||
|
p+t+r+y-I
|
||||||
|
p+da
|
||||||
|
p+ne
|
||||||
|
p+n+yi
|
||||||
|
p+po
|
||||||
|
p+mu
|
||||||
|
p+lA
|
||||||
|
p+wI
|
||||||
|
p+sU
|
||||||
|
p+s+n+yai
|
||||||
|
p+s+wau
|
||||||
|
p+s+y-i
|
||||||
|
b+g+h-I
|
||||||
|
b+dza
|
||||||
|
b+de
|
||||||
|
b+d+dzi
|
||||||
|
b+d+ho
|
||||||
|
b+d+h+wu
|
||||||
|
b+tA
|
||||||
|
b+nI
|
||||||
|
b+bU
|
||||||
|
b+b+hai
|
||||||
|
b+b+h+yau
|
||||||
|
b+m-i
|
||||||
|
b+h-I
|
||||||
|
b+h+Na
|
||||||
|
b+h+ne
|
||||||
|
b+h+mi
|
||||||
|
b+h+yo
|
||||||
|
b+h+ru
|
||||||
|
b+h+wA
|
||||||
|
m+nyI
|
||||||
|
m+NU
|
||||||
|
m+nai
|
||||||
|
m+n+yau
|
||||||
|
m+p-i
|
||||||
|
m+p+r-I
|
||||||
|
m+pha
|
||||||
|
m+be
|
||||||
|
m+b+hi
|
||||||
|
m+b+h+yo
|
||||||
|
m+mu
|
||||||
|
m+lA
|
||||||
|
m+wI
|
||||||
|
m+sU
|
||||||
|
m+hai
|
||||||
|
y+yau
|
||||||
|
y+r-i
|
||||||
|
y+w-I
|
||||||
|
y+sa
|
||||||
|
r+khe
|
||||||
|
r+g+hi
|
||||||
|
r+g+h+yo
|
||||||
|
r+ts+yu
|
||||||
|
r+tshA
|
||||||
|
r+dz+nyI
|
||||||
|
r+dz+yU
|
||||||
|
r+Tai
|
||||||
|
r+Thau
|
||||||
|
r+D-i
|
||||||
|
r+N-I
|
||||||
|
r+t+wa
|
||||||
|
r+t+te
|
||||||
|
r+t+si
|
||||||
|
r+t+s+no
|
||||||
|
r+t+s+n+yu
|
||||||
|
r+thA
|
||||||
|
r+th+yI
|
||||||
|
r+d+d+hU
|
||||||
|
r+d+d+h+yai
|
||||||
|
r+d+yau
|
||||||
|
r+d+h-i
|
||||||
|
r+d+h+m-I
|
||||||
|
r+d+h+ya
|
||||||
|
r+d+h+re
|
||||||
|
r+pi
|
||||||
|
r+b+po
|
||||||
|
r+b+bu
|
||||||
|
r+b+hA
|
||||||
|
r+m+mI
|
||||||
|
r+yU
|
||||||
|
r+wai
|
||||||
|
r+shau
|
||||||
|
r+sh+y-i
|
||||||
|
r+Sh-I
|
||||||
|
r+Sh+Na
|
||||||
|
r+Sh+N+ye
|
||||||
|
r+Sh+mi
|
||||||
|
r+Sh+yo
|
||||||
|
r+su
|
||||||
|
r+hA
|
||||||
|
r+k+ShI
|
||||||
|
l+g+wU
|
||||||
|
l+b+yai
|
||||||
|
l+mau
|
||||||
|
l+y-i
|
||||||
|
l+w-I
|
||||||
|
l+la
|
||||||
|
l+h+we
|
||||||
|
w+yi
|
||||||
|
w+ro
|
||||||
|
w+nu
|
||||||
|
w+wA
|
||||||
|
sh+tsI
|
||||||
|
sh+ts+yU
|
||||||
|
sh+tshai
|
||||||
|
sh+Nau
|
||||||
|
sh+n-i
|
||||||
|
sh+p-I
|
||||||
|
sh+b+ya
|
||||||
|
sh+me
|
||||||
|
sh+yi
|
||||||
|
sh+r+yo
|
||||||
|
sh+lu
|
||||||
|
sh+w+gA
|
||||||
|
sh+w+yI
|
||||||
|
sh+shU
|
||||||
|
Sh+kai
|
||||||
|
Sh+k+rau
|
||||||
|
Sh+T-i
|
||||||
|
Sh+T+y-I
|
||||||
|
Sh+T+ra
|
||||||
|
Sh+T+r+ye
|
||||||
|
Sh+T+wi
|
||||||
|
Sh+Tho
|
||||||
|
Sh+Th+yu
|
||||||
|
Sh+NA
|
||||||
|
Sh+N+yI
|
||||||
|
Sh+DU
|
||||||
|
Sh+thai
|
||||||
|
Sh+pau
|
||||||
|
Sh+p+r-i
|
||||||
|
Sh+m-I
|
||||||
|
Sh+ya
|
||||||
|
Sh+we
|
||||||
|
Sh+Shi
|
||||||
|
s+k+so
|
||||||
|
s+khu
|
||||||
|
s+ts+yA
|
||||||
|
s+TI
|
||||||
|
s+ThU
|
||||||
|
s+t+yai
|
||||||
|
s+t+rau
|
||||||
|
s+t+w-i
|
||||||
|
s+th-I
|
||||||
|
s+th+ya
|
||||||
|
s+n+ye
|
||||||
|
s+n+wi
|
||||||
|
s+pho
|
||||||
|
s+ph+yu
|
||||||
|
s+yA
|
||||||
|
s+r+wI
|
||||||
|
s+sU
|
||||||
|
s+s+wai
|
||||||
|
s+hau
|
||||||
|
s+w+y-i
|
||||||
|
h+ny-I
|
||||||
|
h+Na
|
||||||
|
h+te
|
||||||
|
h+ni
|
||||||
|
h+n+yo
|
||||||
|
h+pu
|
||||||
|
h+phA
|
||||||
|
h+mI
|
||||||
|
h+yU
|
||||||
|
h+lai
|
||||||
|
h+sau
|
||||||
|
h+s+w-i
|
||||||
|
h+w+y-I
|
||||||
|
k+Sh+Na
|
||||||
|
k+Sh+me
|
||||||
|
k+Sh+m+yi
|
||||||
|
k+Sh+yo
|
||||||
|
k+Sh+ru
|
||||||
|
k+Sh+lA
|
||||||
|
k+Sh+wI
|
||||||
|
a+yU
|
||||||
|
a+rai
|
||||||
|
a+r+yau
|
||||||
|
|
||||||
|
// Non-Tibetan, Non-Sanskrit words that should be convertible
|
||||||
|
fava
|
||||||
|
vafa
|
||||||
|
vI
|
||||||
|
fai
|
||||||
|
vo
|
||||||
|
fI/ fai/ fo/ fuM
|
||||||
|
|
||||||
|
// punctuation
|
||||||
|
012345678901234
|
||||||
|
!@#$%&*()_={}:;<>?
|
||||||
|
a_tsal852ja$)@#%(!Ta)0daM)%!@sa
|
||||||
|
// test special handing for @#
|
||||||
|
@# #@ @a#
|
||||||
|
|
||||||
|
// English insert
|
||||||
|
rta'i [(horse's)] mgrin
|
||||||
|
|
||||||
|
<?Reject?>
|
||||||
|
|
||||||
|
// The dbu can [m][ng][s] should be read "mngas".
|
||||||
|
mangs
|
||||||
|
|
||||||
|
// ??? legal???
|
||||||
|
pa'am'ang
|
||||||
|
// is this actually illegal?
|
||||||
|
pe'as
|
||||||
|
|
||||||
|
'angs
|
||||||
|
'ag
|
||||||
|
|
||||||
|
bskyUMbs
|
||||||
|
bskyUMbsHgro
|
||||||
|
b+g+ka
|
||||||
|
'ad
|
||||||
|
xan
|
||||||
|
tax
|
||||||
|
qan
|
||||||
|
taq
|
||||||
|
shae
|
||||||
|
aeb
|
||||||
|
.
|
||||||
|
+
|
||||||
|
.y
|
||||||
|
// Arguable if these are illegal
|
||||||
|
g.ra
|
||||||
|
r.ya
|
||||||
|
+a
|
||||||
|
a+ba
|
||||||
|
e+ya
|
||||||
|
ba+
|
||||||
|
b+
|
||||||
|
M
|
||||||
|
^M
|
||||||
|
Ma
|
||||||
|
Mam
|
||||||
|
^Ma
|
||||||
|
^Mam
|
||||||
|
r+ra
|
||||||
|
|
||||||
|
tat
|
||||||
|
sbla
|
4
Fonts note.txt
Normal file
4
Fonts note.txt
Normal file
|
@ -0,0 +1,4 @@
|
||||||
|
The TMW font files need to be included in a WylieWord distribution.
|
||||||
|
(They are referenced in the SEAU .INP file.)
|
||||||
|
They are not included in this repository because they are not really
|
||||||
|
part of the WylieWord project as such.
|
133
THDL phonetics scheme.txt
Normal file
133
THDL phonetics scheme.txt
Normal file
|
@ -0,0 +1,133 @@
|
||||||
|
; THDL Phonetics rules for WylieWord.
|
||||||
|
|
||||||
|
; File format: there are two sections, for rules and exceptions.
|
||||||
|
;
|
||||||
|
; There is one rule per line. Typically a rule has two parts,
|
||||||
|
; separated by a space. The first part is a sequence of letters
|
||||||
|
; that, if it appears in a Wylie transliteration, is to be replaced
|
||||||
|
; by the second part in the phonetic transcription. The second
|
||||||
|
; part can be missing, in which case the first part is simply
|
||||||
|
; deleted from transcriptions when found.
|
||||||
|
;
|
||||||
|
; A semicolon precedes a comment. Blank lines are OK.
|
||||||
|
;
|
||||||
|
; The rules are applied in the order they appear in this file.
|
||||||
|
; Each rule is applied as many times as possible, but we never
|
||||||
|
; go back to a previous rule. (This simple rewrite-rule grammar is
|
||||||
|
; not sufficient to implement all phonetic schemes, at least not
|
||||||
|
; compactly. For example, it would be difficult to capture the
|
||||||
|
; effects of preinitial consonants on tone (as in the scheme used
|
||||||
|
; in Joe Wilson's book, for instance).) Also note that not even the
|
||||||
|
; whole of the present scheme is implemented using these rules. In
|
||||||
|
; particular, the deletion of prefix and superscript consonants,
|
||||||
|
; and of wa-zur, are done in program code, not using the rules here.
|
||||||
|
|
||||||
|
; Miscellaneous prefix transformations
|
||||||
|
g. ; delete this (representing g prefix, used before root y only)
|
||||||
|
dby y ; must come before db->w, for dbyang
|
||||||
|
dbr r ; must come before db->w, for dbral
|
||||||
|
db w ; must come before by->j
|
||||||
|
|
||||||
|
; c and ch are both transcribed ch. To get this we need a kludge
|
||||||
|
; (involving x), because the rule c -> ch would apply recursively.
|
||||||
|
ch c
|
||||||
|
c x
|
||||||
|
x ch
|
||||||
|
|
||||||
|
; Bad behavior from Y
|
||||||
|
py ch
|
||||||
|
phy ch
|
||||||
|
by j
|
||||||
|
my ny
|
||||||
|
|
||||||
|
; Retroflexes
|
||||||
|
kr tr
|
||||||
|
khr thr
|
||||||
|
gr dr
|
||||||
|
pr tr
|
||||||
|
phr thr
|
||||||
|
br dr
|
||||||
|
|
||||||
|
; Other bad behavior from R
|
||||||
|
mr m
|
||||||
|
sr s
|
||||||
|
|
||||||
|
; Uniquely random case
|
||||||
|
zl d
|
||||||
|
|
||||||
|
; Umlaut of a, o, u followed by d, n, l, s
|
||||||
|
; Note: this must be done before suffix-stripping.
|
||||||
|
; Before actually doing the umlaut, we "hide" the n in ng, so that ng doesn't
|
||||||
|
; induce umlaut. This is gross; if we had a real grammar engine it wouldn't
|
||||||
|
; be necessary.
|
||||||
|
ng x
|
||||||
|
ad e
|
||||||
|
an en
|
||||||
|
al el
|
||||||
|
as e
|
||||||
|
od ö
|
||||||
|
on ön
|
||||||
|
ol öl
|
||||||
|
os ö
|
||||||
|
ud ü
|
||||||
|
un ün
|
||||||
|
ul ül
|
||||||
|
us ü
|
||||||
|
; restore ng
|
||||||
|
x ng
|
||||||
|
|
||||||
|
; Stripping of suffix d, s, and ' from i and e
|
||||||
|
; Note: this has already been done by the umlaut rules for some cases,
|
||||||
|
; which don't need to be repeated here.
|
||||||
|
id i
|
||||||
|
ed e
|
||||||
|
is i
|
||||||
|
es e
|
||||||
|
a' a
|
||||||
|
e' e
|
||||||
|
i' i
|
||||||
|
o' o
|
||||||
|
u' u
|
||||||
|
|
||||||
|
; Remove doubled vowels (e.g. pa'ang -> pang, not paang)
|
||||||
|
aa a
|
||||||
|
ee e
|
||||||
|
ii i
|
||||||
|
oo o
|
||||||
|
uu u
|
||||||
|
|
||||||
|
; Devoicing of suffix g, b
|
||||||
|
ag ak
|
||||||
|
eg ek
|
||||||
|
ig ik
|
||||||
|
og ok
|
||||||
|
ug uk
|
||||||
|
ab ap
|
||||||
|
eb ep
|
||||||
|
ib ip
|
||||||
|
ob op
|
||||||
|
ub up
|
||||||
|
|
||||||
|
<?Exceptions?>
|
||||||
|
|
||||||
|
; There is one exception per line. Each exception consists of
|
||||||
|
; the transliteration (which may be several syllables separated
|
||||||
|
; by spaces), followed by a space, a colon, a space, and the
|
||||||
|
; pronunciation (which may also contain spaces). A semicolon
|
||||||
|
; precedes a comment. Blank lines are OK.
|
||||||
|
|
||||||
|
ba : wa ; mind you, ba (pronounced ba) means cow. But that's much rarer than wa.
|
||||||
|
bo : wo
|
||||||
|
ba'i : wai
|
||||||
|
bo'i : woi
|
||||||
|
bar : ?bar/war? ; bar = "middle"; could be either, so supply both and let user sort it out
|
||||||
|
bor : ?bor/wor? ; bor = "cast away"; could be either, so supply both and let user sort it out
|
||||||
|
rdo rje : dorje
|
||||||
|
mkha' 'gro : khandro
|
||||||
|
sprul sku : tulku
|
||||||
|
rta mgrin: tamdrin
|
||||||
|
dga' ldan : ganden
|
||||||
|
dge 'dun : gendün
|
||||||
|
a mdo : amdo
|
||||||
|
blo bzang : lobzang
|
||||||
|
sbra nag zhol : banakzhöl
|
72
THDL phonetics test cases.txt
Normal file
72
THDL phonetics test cases.txt
Normal file
|
@ -0,0 +1,72 @@
|
||||||
|
;
|
||||||
|
; These examples come from the draft (8/21/03) THDL Phonetics document
|
||||||
|
;
|
||||||
|
lha sa > lhasa
|
||||||
|
sa skya pa > sakyapa
|
||||||
|
blo bzang > lobzang
|
||||||
|
rnying ma pa > nyingmapa
|
||||||
|
rdo rje > dorje
|
||||||
|
dge lugs pa > gelukpa
|
||||||
|
gzhis ka rtse > zhikatse
|
||||||
|
mar me > marme
|
||||||
|
bcu > chu
|
||||||
|
lce > che
|
||||||
|
rin chen bzang po > rinchenzangpo
|
||||||
|
nag chu > nakchu
|
||||||
|
bka' rgyud pa > kagyüpa
|
||||||
|
bsod nams> sönam
|
||||||
|
thub bstan > thupten
|
||||||
|
'jam dpal dbyangs > jampelyang
|
||||||
|
dge legs > gelek
|
||||||
|
kha btags > khatak
|
||||||
|
bka' shag > kashak
|
||||||
|
sbra nag zhol > banakzhöl
|
||||||
|
thabs > thap
|
||||||
|
lha sa ba > lhasawa
|
||||||
|
jo bo > jowo
|
||||||
|
dpa' bo > pawo
|
||||||
|
spyan ras gzig > chenrezik
|
||||||
|
sbyin bdag > jindak
|
||||||
|
smyong > nyong
|
||||||
|
sgrol ma > drölma
|
||||||
|
rten 'brel > tendrel
|
||||||
|
'bras spungs > drepung
|
||||||
|
'phrin las > thrinle
|
||||||
|
dbang > wang
|
||||||
|
dbral > rel
|
||||||
|
dbyar kha > yarkha
|
||||||
|
zla ba > dawa
|
||||||
|
klad pa > lepa
|
||||||
|
glog > lok
|
||||||
|
le'u > leu
|
||||||
|
pa'ang > pang
|
||||||
|
ba'i > wai
|
||||||
|
rta mgrin > tamdrin
|
||||||
|
|
||||||
|
; Other tests, to exercise particular rules in the grammar that aren't covered in the rules above
|
||||||
|
g.yon > yön
|
||||||
|
phyag > chak
|
||||||
|
bkra shis > trashi
|
||||||
|
khros ma > thröma
|
||||||
|
sprul > trül
|
||||||
|
mri tam ga > mitamga
|
||||||
|
srid pa > sipa
|
||||||
|
pad ma > pema
|
||||||
|
pan chen > penchen
|
||||||
|
ral pa can > relpachen
|
||||||
|
thun > thün
|
||||||
|
dus gsum > düsum
|
||||||
|
sbed > be
|
||||||
|
ces > che
|
||||||
|
pa'i > pai
|
||||||
|
che'i > chei
|
||||||
|
gri'i > dri
|
||||||
|
po'i > poi
|
||||||
|
le'u'i > leui
|
||||||
|
rdzogs > dzok
|
||||||
|
thug pa > thukpa
|
||||||
|
'debs > dep
|
||||||
|
sib sib > sipsip
|
||||||
|
lobs pa > loppa
|
||||||
|
grub > drup
|
||||||
|
kla col > lachöl
|
6
tibwn.ini note.txt
Normal file
6
tibwn.ini note.txt
Normal file
|
@ -0,0 +1,6 @@
|
||||||
|
WylieWord uses the same tibwn.ini file as Jskad.
|
||||||
|
|
||||||
|
The Jskad version is canonical, so tibwn.ini does not appear here.
|
||||||
|
|
||||||
|
You need to get it from the Jskad SF repository and include it in
|
||||||
|
WylieWord builds.
|
Loading…
Reference in a new issue