Commit graph

19 commits

Author SHA1 Message Date
b5fd3c6008 stylistic tweaks 2012-08-06 14:51:56 -04:00
168d256fea adding a README 2012-08-06 14:49:56 -04:00
3e9789880d don't let the lookup return nil 2012-08-06 14:37:28 -04:00
28b6092ea3 adding logging to track where this nil is comming from 2012-08-06 14:32:17 -04:00
James Bowes
f4777de387 Merge branch 'vbatts/master'
Conflicts:
	thing.rb
2012-08-01 06:53:38 -03:00
James Bowes
6036500b74 Add c and d subcommands 2012-08-01 06:48:24 -03:00
50aeed0b53 show the size of the *.bin written 2012-07-31 13:47:01 -04:00
Vincent Batts
99eccf44c5 more Makefile tweaks 2012-07-30 12:13:33 -04:00
Vincent Batts
6b3bd894d6 Makefile cleanup 2012-07-30 12:09:02 -04:00
James Bowes
227e8de979 Fix bug in duplicate detection.
Each node is written to disk as a list of (path, node pointer) pairs.
The duplicate detection code was considering the node's children and the
node's name. If we only look for  the children, we can find much more
duplicates.

Previous duplicate detection went from 424 nodes to 127. New duplicate
detection reduces to 48 nodes.

With this better duplicate detection, the prefix compression doesn't
appear to be useful anymore. comment it out.

Trims an extra 40 bytes off my sample data.
2012-07-28 12:46:03 -03:00
James Bowes
a8a7fd57f6 Add start of C based decoder 2012-07-28 10:51:53 -03:00
James Bowes
427caabb1b add huffman implementation 2012-07-27 16:42:02 -03:00
James Bowes
7742eeb024 POC 2012-07-27 16:41:44 -03:00
James Bowes
abfdbebe28 checkpoint 2012-07-27 14:47:20 -03:00
James Bowes
ddf7d89408 temp 2012-07-26 17:04:52 -03:00
James Bowes
a5b7fd02ac poc with de-duped full nodes 2012-07-26 16:38:10 -03:00
James Bowes
606b0ea5e6 class based 2012-07-26 14:21:16 -03:00
James Bowes
4e0f638cd2 print out original stored value 2012-07-26 13:56:45 -03:00
James Bowes
afb59bf7fa init 2012-07-26 13:18:58 -03:00