From dfa380dff4a85daccab82d294456463210c10924 Mon Sep 17 00:00:00 2001 From: BarfingLemurs <128182951+BarfingLemurs@users.noreply.github.com> Date: Fri, 13 Oct 2023 19:45:27 -0400 Subject: [PATCH] Update README.md --- README.md | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index 0a9d0bad9..1d9288516 100644 --- a/README.md +++ b/README.md @@ -10,7 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++ ### Hot topics -- ‼️ BPE tokenizer update: existing Falcon 180B `.gguf` models will need to be reconverted +- ‼️ BPE tokenizer update: existing Falcon and Starcoder `.gguf` models will need to be reconverted - ‼️ Breaking change: `rope_freq_base` and `rope_freq_scale` must be set to zero to use the model default values: [#3401](https://github.com/ggerganov/llama.cpp/pull/3401) - Parallel decoding + continuous batching support added: [#3228](https://github.com/ggerganov/llama.cpp/pull/3228) \ **Devs should become familiar with the new API** @@ -91,7 +91,7 @@ as the main playground for developing new features for the [ggml](https://github - [X] [OpenBuddy 🐶 (Multilingual)](https://github.com/OpenBuddy/OpenBuddy) - [X] [Pygmalion/Metharme](#using-pygmalion-7b--metharme-7b) - [X] [WizardLM](https://github.com/nlpxucan/WizardLM) -- [X] [[Baichuan 1&2](https://huggingface.co/models?search=baichuan-inc/Baichuan) + [derivations](https://huggingface.co/hiyouga/baichuan-7b-sft) +- [X] [Baichuan 1&2](https://huggingface.co/models?search=baichuan-inc/Baichuan) + [derivations](https://huggingface.co/hiyouga/baichuan-7b-sft) - [X] [Aquila 1&2](https://huggingface.co/models?search=BAAI/Aquila) - [X] [Starcoder models](https://github.com/ggerganov/llama.cpp/pull/3187) - [X] [Mistral AI v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) @@ -208,7 +208,8 @@ https://user-images.githubusercontent.com/1991296/224442907-7693d4be-acaa-4e01-8 ## Usage -**Quickstart:** You will find prebuilt Windows binaries on the release page. +#### **Quickstart:** +You will find prebuilt Windows binaries on the release page. Simply download and extract the zip package of choice (e.g. `llama-b1380-bin-win-avx2-x64.zip`) @@ -218,7 +219,8 @@ From the unzipped folder, open a terminal/cmd window here and place a pre-conver .\main -m llama-2-7b.Q4_0.gguf -n 128 ``` -**Build:** Here are the end-to-end binary build and model conversion steps for the LLaMA-7B model. +#### **Build:** +Here are the end-to-end binary build and model conversion steps for the LLaMA-7B model. ### Get the Code