diff --git a/source/org/thdl/tib/text/ttt/LotsOfTshegBarsTest.java b/source/org/thdl/tib/text/ttt/LotsOfTshegBarsTest.java
index 2cfc090..ab74bc1 100644
--- a/source/org/thdl/tib/text/ttt/LotsOfTshegBarsTest.java
+++ b/source/org/thdl/tib/text/ttt/LotsOfTshegBarsTest.java
@@ -43,11 +43,17 @@ import junit.framework.TestCase;
  * S0291M.ACT, S0319M.ACT, S0986I2.INC, S0986I3.INC, SL6850I2.ACT,
  * TD4105E.ACT.
  *
- * I had the converter output all tsheg bars. I then ran 'sort
+ * I had the converter output all tsheg bars using the
+ * org.thdl.tib.text.ttt.OutputAllTshegBars option. I then ran 'sort
  * -d' on that, piped it to 'uniq -c', and ran awk -- '{y =
  * sprintf("\"%s", $2); x = sprintf(" th(%10s\");\t// %d", y, $1);
  * print x;}' on that. And you get the following.
  *
+ * There were approximately 35,253,990 tsheg bars counted. Note
+ * that I use the term tsheg bar to refer even to
+ * polysyllablic, non-native patches that literally occur between
+ * tsheg-like things.
+ *
  * @author David Chandler
  */
 public class LotsOfTshegBarsTest extends TestCase {