WylieWord/THDL phonetics test cases.txt

138 lines
2.6 KiB
Plaintext
Raw Normal View History

;
; These examples mostly come from the THDL Phonetics document (Jan 2004 draft)
;
dag pa > dakpa
ring po > ringpo
rin chen > rinchen
lab > lap
dum bu > dumbu
dmar po > marpo
ril bu > rilbu
sa skya pa > sakyapa
blo bzang > lozang
rnying ma pa > nyingmapa
rdo rje > dorj<72>
dge lugs pa > gelukpa
gzhis ka rtse > zhikats<74>
mar me > marm<72>
dge bshes > gesh<73>
bcu > chu
gcig pa > chikpa
nag chu > nakchu
'phag pa > pakpa
gser thang > sertang
khang tshan > khangtsen
lce > ch<63>
rin chen bzang po > rinchenzangpo
bka' rgyud > kagy<67>
bsod nams> s<>nam
yul > y<>l
dus tshod > d<>ts<74>
bon po > b<>npo
sde dge > deg<65>
brgyad > gy<67>
dge rgan > gegen
ral pa can > relpachen
tshe ring > tsering
byes > j<>
bstan 'dzin > tendzin
'jam dpal dbyangs > jampelyang
dge legs > gelek
kha btags > khatak
sngags pa > ngakpa
byang chub > jangchup
thub bstan > tupten
tabs > tap
bka' shag > kashak
sbra nag zhol > banakzh<7A>l
thabs > tap
lha sa ba > lhasawa
jo bo > jowo
dpa' bo > pawo
gsal bar > selwar
; nga'i deb > ng<6E> dep -- can't do this one, it depends on word segmentation
bar ba > barwa
spyan ras gzig > chenrezik
phyag > chak
sbyin bdag > jindak
smyong > nyong
dmyal ba > nyelwa
sgrol ma > dr<64>lma
'bras spungs > drepung
'phrin las > trinl<6E>
srung ma > sungma
rdzun smra ba > dz<64>nmawa
klad pa > lepa
glog > lok
zla ba > dawa
lha sa > lhasa
lho phyogs > lhochok
lhun grub > lh<6C>ndrup
dbang > wang
dbyar kha > yarkha
dbral > rel
le'u > leu
khyi'u > khyiu
pa'ang > pang
gri'i > dri
'gro ba'i > drow<6F>
rgyal bu'i > gyelb<6C>
rin po che'i > rinpoch<63>
bdag po'i > dakp<6B>
le'u'i > le<6C>
rta mgrin > tamdrin
g.yon > y<>n
phyag > chak
bkra shis > trashi
khros ma > tr<74>ma
sprul > tr<74>l
mri tam ga > mitamga
srid pa > sipa
pad ma > pema
pan chen > penchen
thun > t<>n
dus gsum > d<>sum
sbed > b<>
ces > ch<63>
btsan dbang > tsenwang
tshong khang > tsongkhang
rdzong > dzong
stabs > tap
thug pa > tukpa
debs > dep
sib sib > sipsip
lobs pa > loppa
grub > drup
kla col > lach<63>l
spyan snga ba > chenngawa
sems dpa'i > semp<6D>
bon po'i > b<>np<6E>
rdzogs > dzok
; Tests of nasalization rule (taken from specification document as of 15 Apr 04)
bka' 'gyur > kangyur
dge 'dun > gend<6E>n
ngos 'dzin > ng<6E>ndzin
rig 'dzin > rindzin
mkha' 'gro > khandro
dkyil 'khor > kyinkhor
chos 'phel > ch<63>mpel
dpal 'bar > pembar
sku 'bum > kumbum
rgyu 'bras > gyundr<64>
dpal 'byor > penjor
; These are exceptions: by the rule they should be kyandro, tendrel, landr<64>
skyabs 'gro > kyamdro
rten 'brel > tendrel
lam 'bras > lamdr<64>
; Other random tests
phreng > treng
snrub > nup
; Test of second-suffix d removal. Made-up word because I don't know real ones.
srand > sen
; Test that we don't spazz out on single-letter words.
a > a
ai > ai