WylieWord/THDL phonetics test cases.txt

137 lines
2.6 KiB
Text

;
; These examples mostly come from the THDL Phonetics document (Jan 2004 draft)
;
dag pa > dakpa
ring po > ringpo
rin chen > rinchen
lab > lap
dum bu > dumbu
dmar po > marpo
ril bu > rilbu
sa skya pa > sakyapa
blo bzang > lozang
rnying ma pa > nyingmapa
rdo rje > dorjé
dge lugs pa > gelukpa
gzhis ka rtse > zhikatsé
mar me > marmé
dge bshes > geshé
bcu > chu
gcig pa > chikpa
nag chu > nakchu
'phag pa > pakpa
gser thang > sertang
khang tshan > khangtsen
lce > ché
rin chen bzang po > rinchenzangpo
bka' rgyud > kagyü
bsod nams> sönam
yul > yül
dus tshod > dütsö
bon po > bönpo
sde dge > degé
brgyad > gyé
dge rgan > gegen
ral pa can > relpachen
tshe ring > tsering
byes > jé
bstan 'dzin > tendzin
'jam dpal dbyangs > jampelyang
dge legs > gelek
kha btags > khatak
sngags pa > ngakpa
byang chub > jangchup
thub bstan > tupten
tabs > tap
bka' shag > kashak
sbra nag zhol > banakzhöl
thabs > tap
lha sa ba > lhasawa
jo bo > jowo
dpa' bo > pawo
gsal bar > selwar
; nga'i deb > ngé dep -- can't do this one, it depends on word segmentation
bar ba > barwa
spyan ras gzig > chenrezik
phyag > chak
sbyin bdag > jindak
smyong > nyong
dmyal ba > nyelwa
sgrol ma > drölma
'bras spungs > drepung
'phrin las > trinlé
srung ma > sungma
rdzun smra ba > dzünmawa
klad pa > lepa
glog > lok
zla ba > dawa
lha sa > lhasa
lho phyogs > lhochok
lhun grub > lhündrup
dbang > wang
dbyar kha > yarkha
dbral > rel
le'u > leu
khyi'u > khyiu
pa'ang > pang
gri'i > dri
'gro ba'i > drowé
rgyal bu'i > gyelbü
rin po che'i > rinpoché
bdag po'i > dakpö
le'u'i > leü
rta mgrin > tamdrin
g.yon > yön
phyag > chak
bkra shis > trashi
khros ma > tröma
sprul > trül
mri tam ga > mitamga
srid pa > sipa
pad ma > pema
pan chen > penchen
thun > tün
dus gsum > düsum
sbed > bé
ces > ché
btsan dbang > tsenwang
tshong khang > tsongkhang
rdzong > dzong
stabs > tap
thug pa > tukpa
debs > dep
sib sib > sipsip
lobs pa > loppa
grub > drup
kla col > lachöl
spyan snga ba > chenngawa
sems dpa'i > sempé
bon po'i > bönpö
rdzogs > dzok
; Tests of nasalization rule (taken from specification document as of 15 Apr 04)
bka' 'gyur > kangyur
dge 'dun > gendün
ngos 'dzin > ngöndzin
rig 'dzin > rindzin
mkha' 'gro > khandro
dkyil 'khor > kyinkhor
chos 'phel > chömpel
dpal 'bar > pembar
sku 'bum > kumbum
rgyu 'bras > gyundré
dpal 'byor > penjor
; These are exceptions: by the rule they should be kyandro, tendrel, landré
skyabs 'gro > kyamdro
rten 'brel > tendrel
lam 'bras > lamdré
; Other random tests
phreng > treng
snrub > nup
; Test of second-suffix d removal. Made-up word because I don't know real ones.
srand > sen
; Test that we don't spazz out on single-letter words.
a > a
ai > ai