Merge pull request #1 from shiqinwen/shiqinwen-correct-readme-quantize-param
Correct the type parameter given to quantize in the README.
commit ef7bfbad54
1 changed file with 1 addition and 1 deletion
@@ -271,7 +271,7 @@ python3 -m pip install -r requirements.txt
 python3 convert.py models/7B/
 
 # quantize the model to 4-bits (using q4_0 method)
-./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin q4_0
+./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2
 
 # run the inference
 ./main -m ./models/7B/ggml-model-q4_0.bin -n 128
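The change swaps the type name `q4_0` for the numeric value `2` in the quantize step. A minimal sketch of a wrapper that accepts either spelling and always emits the numeric form might look like the following. It assumes only the pairing visible in this diff (`q4_0` and `2` name the same type); the function names `quant_type_arg` and `quantize_cmd` are hypothetical helpers, not part of the repository, and the command is printed rather than executed.

```shell
#!/usr/bin/env bash
# Hypothetical helper: normalize a quantization type to the numeric form.
# Only the q4_0 <-> 2 pairing is taken from the diff; other values are rejected.
quant_type_arg() {
  case "$1" in
    q4_0|2) echo 2 ;;
    *) echo "unknown quantization type: $1" >&2; return 1 ;;
  esac
}

# Hypothetical helper: print the quantize command for a model directory.
# Printing instead of running keeps the sketch usable without a model on disk.
quantize_cmd() {
  local dir="$1" qtype
  qtype="$(quant_type_arg "$2")" || return 1
  echo "./quantize ${dir}/ggml-model-f16.bin ${dir}/ggml-model-q4_0.bin ${qtype}"
}

quantize_cmd ./models/7B q4_0
```

Whether a given build of `quantize` accepts the name, the number, or both is not stated in this commit, so check the binary's usage output before relying on either form.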