Pascal Patry 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								9379d3cc17 
								
							 
						 
						
							
							
								
								readme : rename result_format to response_format ( #9300 )  
							
							
							
						 
						
							2024-09-04 09:45:40 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									daghanerdonmez 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								9beb2dda03 
								
							 
						 
						
							
							
								
								readme : fix typo [no ci] ( #8389 )  
							
							... 
							
							
							
							Bakus-Naur --> Backus-Naur 
							
						 
						
							2024-07-09 09:16:00 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								139cc621e9 
								
							 
						 
						
							
							
								
								json: restore default additionalProperties to false, fix some pattern escapes (#8180 )  
							
							... 
							
							
							
							* json: expand ESCAPED_IN_REGEXPS_BUT_NOT_IN_LITERALS charset
* json: revert default of additionalProperties to false
* Update README.md 
							
						 
						
							2024-06-28 09:26:45 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								cb0b06a8a6 
								
							 
						 
						
							
							
								
								json: update grammars/README w/ examples & note about additionalProperties (#8132 )  
							
							... 
							
							
							
							* json: update grammars/README
* mention broken prefixItems
* add mention to llama-gbnf-validator
* json: explicit type: object for nested items object in cli example 
							
						 
						
							2024-06-27 22:08:42 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								1c641e6aac 
								
							 
						 
						
							
							
								
								build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )  
							
							... 
							
							
							
							* `main`/`server`: rename to `llama` / `llama-server` for consistency w/ homebrew
* server: update refs -> llama-server
gitignore llama-server
* server: simplify nix package
* main: update refs -> llama
fix examples/main ref
* main/server: fix targets
* update more names
* Update build.yml
* rm accidentally checked in bins
* update straggling refs
* Update .gitignore
* Update server-llm.sh
* main: target name -> llama-cli
* Prefix all example bins w/ llama-
* fix main refs
* rename {main->llama}-cmake-pkg binary
* prefix more cmake targets w/ llama-
* add/fix gbnf-validator subfolder to cmake
* sort cmake example subdirs
* rm bin files
* fix llama-lookup-* Makefile rules
* gitignore /llama-*
* rename Dockerfiles
* rename llama|main -> llama-cli; consistent RPM bin prefixes
* fix some missing -cli suffixes
* rename dockerfile w/ llama-cli
* rename(make): llama-baby-llama
* update dockerfile refs
* more llama-cli(.exe)
* fix test-eval-callback
* rename: llama-cli-cmake-pkg(.exe)
* address gbnf-validator unused fread warning (switched to C++ / ifstream)
* add two missing llama- prefixes
* Updating docs for eval-callback binary to use new `llama-` prefix.
* Updating a few lingering doc references for rename of main to llama-cli
* Updating `run-with-preset.py` to use new binary names.
Updating docs around `perplexity` binary rename.
* Updating documentation references for lookup-merge and export-lora
* Updating two small `main` references missed earlier in the finetune docs.
* Update apps.nix
* update grammar/README.md w/ new llama-* names
* update llama-rpc-server bin name + doc
* Revert "update llama-rpc-server bin name + doc"
This reverts commit e474ef1df4 
							
						 
						
							2024-06-13 00:41:52 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								b61eb9644d 
								
							 
						 
						
							
							
								
								json: refine constraint for whitespace to avoid runaways yet allow pretty print ( #7866 )  
							
							
							
						 
						
							2024-06-11 02:22:57 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								396b18dfec 
								
							 
						 
						
							
							
								
								json: document schema conversion in GBNF readme, align manual grammar examples & converters (#7841 )  
							
							... 
							
							
							
							* json: fix char pattern in grammar converters
* json: prevent number precision & whitespace runaways in example grammars
* json: add doc to grammar readme 
							
						 
						
							2024-06-11 01:00:30 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								55b2d0849d 
								
							 
						 
						
							
							
								
								grammars: x{min,max} repetition operator ( #6640 )  
							
							... 
							
							
							
							* grammars: x{min,max} repetition operator + tweak +/*/? to avoid duplication of original over alternates
* grammars: handle `x{n}` and fix `x{n,n}`
* grammars: document new repetition operators
* grammars: uniform use of int for min & max
* grammars: refactor parser test
* grammar: parsing tests w/ natural pretty print of updated expectations
* grammars: much prettier print of expectations (+ TEST_GRAMMAR_PARSER_PRINT_ALL=1 to force all)
* grammars: improve test pretty print again
* grammars: pretty print rules and chars
* grammars: fix copy rule skipping
* grammars: disallow `a{,}` (not allowed in regexps)
* Update common/grammar-parser.cpp
Co-authored-by: Clint Herron <hanclinto@gmail.com>
* grammars: fix copy rule skipping (again) & display of expectations
* grammars: more test cases
* grammars: update reps parsing to bring ? / * / + closer to before
* json: use new GBNF repetitions{m,n} syntax
* grammars: update performance gotchas w/ repetition advice
* Update examples/json_schema_to_grammar.py
Co-authored-by: Clint Herron <hanclinto@gmail.com>
* Update examples/server/public/json-schema-to-grammar.mjs
Co-authored-by: Clint Herron <hanclinto@gmail.com>
* grammars: comment on rule repetitions
* grammars: ensure unambiguous number alternatives
* grammar: nit typo switched error msgs
* grammar: nit numbering in comment
* json: update numeric rule to be unambiguous
* Apply suggestions from code review
Co-authored-by: Clint Herron <hanclinto@gmail.com>
* Update examples/server/public/json-schema-to-grammar.mjs
Co-authored-by: Clint Herron <hanclinto@gmail.com>
* json: fix integral-part
* grammar: add repetition tests
---------
Co-authored-by: Clint Herron <hanclinto@gmail.com> 
							
						 
						
							2024-06-06 10:07:06 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									omahs 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								04976db7a8 
								
							 
						 
						
							
							
								
								docs: fix typos ( #7124 )  
							
							... 
							
							
							
							* fix typo
* fix typos
* fix typo
* fix typos
* fix typo
* fix typos 
							
						 
						
							2024-05-07 18:20:33 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								ab9a3240a9 
								
							 
						 
						
							
							
								
								JSON schema conversion:  ⚡ ️ faster repetitions, min/maxLength for strings, cap number length ( #6555 )  
							
							... 
							
							
							
							* json: rename python schema converter to make import easier
* server: skip null json_schema / grammar fields
* json: deps management for primitive rules (+ allow null values)
* json: optimize repetitions for minItems/maxItems and regexps: `a{,3}` goes from `"a"? "a"? "a"?` (explosive combos) to `(a (a (a)?)?)?`
* grammars: add troubleshooting section to readme
* json: cap length of numbers to 15 digits before/after decimal point
(avoids infinite gen, e.g. "one third" -> `0.333333333333...`)
* json: unify all repetition code (w/ or w/o sep)
* json: support string minLength/maxLength
* server+json: update server/README w/ result_format
* nits
* json: fix type error w/ python 3.8
* json: fix server/README (json_schema in /completion vs. result_format in /v1/chat/completions)
* json: simplify DOT `{"type": "string", "pattern": "^.$"}`
* json: remove recursion in opt_repetitions (avoids Python stack overflow)
* json: rm dead code
* json: rm useless assert & ggml.h import 
							
						 
						
							2024-04-12 19:43:38 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ExtReMLapin 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								3de31677d3 
								
							 
						 
						
							
							
								
								grammars : blacklists character control set ( #5888 )  
							
							... 
							
							
							
							* Prevent control characters from being served in json string
* Prevent control characters from being served in json string (array) 
							
						 
						
							2024-03-05 18:33:08 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								82cb31eb93 
								
							 
						 
						
							
							
								
								Revert "grammars : don't allow to output unescaped new line in string ( #5885 )"  
							
							... 
							
							
							
							This reverts commit b1a4e994fd 
							
						 
						
							2024-03-05 15:56:24 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ExtReMLapin 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								b1a4e994fd 
								
							 
						 
						
							
							
								
								grammars : don't allow to output unescaped new line in string ( #5885 )  
							
							... 
							
							
							
							* Don't allow grammar json array to output unescaped new line in string
* Don't allow new line in json object string 
							
						 
						
							2024-03-05 15:44:29 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Richard Kiss 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								532dd74e38 
								
							 
						 
						
							
							
								
								Fix some documentation typos/grammar mistakes ( #4032 )  
							
							... 
							
							
							
							* typos
* Update examples/parallel/README.md
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com>
---------
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com> 
							
						 
						
							2023-11-11 23:04:58 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								921772104b 
								
							 
						 
						
							
							
								
								speculative : add grammar support ( #2991 )  
							
							... 
							
							
							
							* speculative : add grammar support
* grammars : add json_arr.gbnf
* grammar : add comments to new grammar file
* grammar : remove one nested level
* common : warm-up with 2 tokens - seems to work better
* speculative : print draft token pieces
* speculative : reuse grammar parser + better logs and comments
* speculative : avoid grammar_mem
* make : fix speculative build 
							
						 
						
							2023-09-05 08:46:17 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ben Siraphob 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								d8d6977f48 
								
							 
						 
						
							
							
								
								examples : add C grammar ( #2357 )  
							
							
							
						 
						
							2023-09-01 16:32:14 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Evan Jones 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								f5fe98d11b 
								
							 
						 
						
							
							
								
								docs : add grammar docs ( #2701 )  
							
							... 
							
							
							
							* docs : add grammar docs
* tweaks to grammar guide
* rework GBNF example to be a commented grammar 
							
						 
						
							2023-08-22 21:01:57 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Evan Jones 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8183159cf3 
								
							 
						 
						
							
							
								
								examples : generate JSON according to schema ( #1887 )  
							
							... 
							
							
							
							* examples : add JSON schema grammars
* complete JSON grammar
* ensure primitive types can be used as root of schema
* support integer type and adjust usage text 
							
						 
						
							2023-08-02 22:05:44 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Evan Jones 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								84e09a7d8b 
								
							 
						 
						
							
							
								
								llama : add grammar-based sampling ( #1773 )  
							
							... 
							
							
							
							* llama, main : constrain sampling to grammar
* allow loading grammar from file
* fix whitespace errors
* handle & print parser errors
* add comments to grammar syntax and allow newlines where unambiguous
* add missing include
* support alternates in root rule
* fix bugs with empty token and EOS
* adjust JSON grammar
* remove swp file
* rewrite ternary expressions
Co-authored-by: Henri Vasserman <henv@hot.ee>
* use struct for grammar elements and add Unicode support
* add unicode escapes
* add inverse char ranges
* only sample full tokens (no peeking or truncation)
* llama : minor style changes
blindly applied in online editor - hopefully I didn't break something
* update help text
* add warning message if EOS is disabled
---------
Co-authored-by: Henri Vasserman <henv@hot.ee>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2023-07-23 23:58:10 -04:00