Commit Graph

23 Commits

Author SHA1 Message Date
James Bowes abdecf3fe3 Fix division for byte count node size 2012-10-25 11:51:37 -03:00
Michael Hrivnak e6bef92ab7 fixed bug where 4 bytes of data would be written with only 3 expected. 2012-10-23 19:51:14 -04:00
Michael Hrivnak 1abcc9b425 implementing the byte length field for number of nodes. this makes it possible to correctly represent 128 or more nodes. 2012-10-23 17:26:39 -04:00
James Bowes 6c00822523 Remove extra debug spew from thing.rb 2012-08-12 09:35:26 -03:00
James Bowes 16345dbad2 Decoding working for C 2012-08-11 14:16:29 -03:00
James Bowes 11fd9f1f4a Add huffman decoding for C 2012-08-09 17:51:05 -03:00
James Bowes 4b82b83e02 make thing.rb executable 2012-08-08 09:45:31 -03:00
Vincent Batts 9ca686aa6f adding a 'p' option, to see the parent tree format 2012-08-06 17:40:57 -04:00
Vincent Batts 34514563b0 derp 2012-08-06 17:25:05 -04:00
Vincent Batts e994597d42 adding a #to_h method for the Node object 2012-08-06 17:21:44 -04:00
Vincent Batts 0d71eb9e15 seperating output for verbosity 2012-08-06 17:04:30 -04:00
Vincent Batts d5e899f804 get_child feels like java 2012-08-06 16:47:54 -04:00
Vincent Batts 3e9789880d don't let the lookup return nil 2012-08-06 14:37:28 -04:00
Vincent Batts 28b6092ea3 adding logging to track where this nil is comming from 2012-08-06 14:32:17 -04:00
James Bowes 6036500b74 Add c and d subcommands 2012-08-01 06:48:24 -03:00
James Bowes 227e8de979 Fix bug in duplicate detection.
Each node is written to disk as a list of (path, node pointer) pairs.
The duplicate detection code was considering the node's children and the
node's name. If we only look for  the children, we can find much more
duplicates.

Previous duplicate detection went from 424 nodes to 127. New duplicate
detection reduces to 48 nodes.

With this better duplicate detection, the prefix compression doesn't
appear to be useful anymore. comment it out.

Trims an extra 40 bytes off my sample data.
2012-07-28 12:46:03 -03:00
James Bowes 7742eeb024 POC 2012-07-27 16:41:44 -03:00
James Bowes abfdbebe28 checkpoint 2012-07-27 14:47:20 -03:00
James Bowes ddf7d89408 temp 2012-07-26 17:04:52 -03:00
James Bowes a5b7fd02ac poc with de-duped full nodes 2012-07-26 16:38:10 -03:00
James Bowes 606b0ea5e6 class based 2012-07-26 14:21:16 -03:00
James Bowes 4e0f638cd2 print out original stored value 2012-07-26 13:56:45 -03:00
James Bowes afb59bf7fa init 2012-07-26 13:18:58 -03:00