Provides the classes to take Tibetan language passages and divide the passages up into their component phrases and words, and display corresponding dictionary definitions.

This tool helps Tibetan to English translators partially automate the translation process. In the Tibetan language, the boundaries of individual words are not marked in any manner such as the way in which spaces separate and mark words in English. Instead, there is a punctuation mark called a "tsheg" which separates each syllable. Thus while syllabic boundaries are utterly explicit, word boundaries are often unclear. One of the main difficulties beginning students thus have with translating Tibetan texts is figuring out where each word ends and the next word starts, and determining what series of syllables to look up in the dictionary either as constituting a single word or a larger compound phrase. This entails a very time consuming process of looking up multiple combinations of syllables to determine which are found within a given dictionary.

It partially automates that process by breaking up a sentence/paragraph entered in Extended Wylie or Tibetan script into the biggest component parts it can find in multiple dictionary databases. Then for each component part found, it displays its stored definitions and relevant information. This will thus often yield only the definition of a long phrase, rather than its component words, but one can also search for the syllables of that phrase one by one separately.

The tool can run on-line through a:

The tool can also run off-line in:

The classes designed to be run from the command-line are:

Notes on Input:

Author: Andrés Montano Pellegrini

Related Documentation

@see org.thdl.tib.text @see org.thdl.tib.input