Merge pull request #1 from shiqinwen/shiqinwen-correct-readme-quantize-param

Correct the parameters of type given.
This commit is contained in:
Wen Shi 2023-04-28 23:58:52 +08:00 committed by GitHub
commit ef7bfbad54
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23

View file

@ -271,7 +271,7 @@ python3 -m pip install -r requirements.txt
python3 convert.py models/7B/
# quantize the model to 4-bits (using q4_0 method)
./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin q4_0
./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2
# run the inference
./main -m ./models/7B/ggml-model-q4_0.bin -n 128