diff --git a/awqutils/README.md b/examples/awqutils/README.md similarity index 97% rename from awqutils/README.md rename to examples/awqutils/README.md index da7a85e92..481cbcde0 100644 --- a/awqutils/README.md +++ b/examples/awqutils/README.md @@ -60,7 +60,7 @@ Several quantization methods are supported. They differ in the resulting model d |Llama 7B | ms/tok @ 4th | xxx | xx | xx | xx | |Llama 7B | ms/tok @ 8th | xxx | xx | xx | xx | |Llama 7B | bits/weight | 16.0 | 4.5 | 5.0 | 2.6 | -|AWQ-LLama 7B| perplexity | 5.9175 | 6.0252 | xxxxxx | xxxxx | +|AWQ-LLama 7B| perplexity | 5.9175 | 6.0252 | 5.9987 | xxxxx | |AWQ-LLama 7B| file size | 12.9G | 3.5G | 3.9G | 2.7G | |AWQ-LLama 7B| ms/tok @ 4th | xxx| xxx | xxx | xxx | |AWQ-LLama 7B| ms/tok @ 8th | xxx| xx | xx | xx | diff --git a/awqutils/apply_awq.py b/examples/awqutils/apply_awq.py similarity index 100% rename from awqutils/apply_awq.py rename to examples/awqutils/apply_awq.py diff --git a/awqutils/requirements.txt b/examples/awqutils/requirements.txt similarity index 100% rename from awqutils/requirements.txt rename to examples/awqutils/requirements.txt