cosmopolitan

mirror of https://github.com/jart/cosmopolitan.git synced 2025-07-06 19:28:29 +00:00

Author	SHA1	Message	Date
Justine Tunney	4a8a81eb9f	Fix llama.com interactive mode regressions	2023-05-13 00:09:38 -07:00
Justine Tunney	45186c74ac	Introduce -q (quiet flag) and improve ctrl-c ux	2023-05-12 09:46:07 -07:00
Justine Tunney	80c174d494	Clean up llama.com anti/stop/reverse-prompt code Example use case for JSON completion: $ m=opt $ make -j16 m=$m o/$m/third_party/ggml/llama.com $ o/$m/third_party/ggml/llama.com -m llama.bin -p '{"key": "life", "val": ' -r '}' 42} This provides better control. More sophisticated facilities for controlling text generation will be provided soon enough.	2023-05-12 08:20:58 -07:00
Justine Tunney	ca990ef091	Make `llama.com -h` print to stdout	2023-05-10 04:55:59 -07:00
Justine Tunney	5f57fc1f59	Upgrade llama.cpp to e6a46b0ed1884c77267dc70693183e3b7164e0e0	2023-05-10 04:20:48 -07:00
Justine Tunney	3dac9f8999	Use Companion AI in llama.com by default	2023-04-30 23:08:15 -07:00
Justine Tunney	b31ba86ace	Introduce prompt caching so prompts load instantly This change also introduces an ephemeral status line in non-verbose mode to display a load percentage status when slow operations are happening.	2023-04-28 16:15:26 -07:00
Justine Tunney	1c2da3a55a	Make shell usability improvements to llama.cpp - Introduce -v and --verbose flags - Don't print stats / diagnostics unless -v is passed - Reduce --top_p default from 0.95 to 0.70 - Change --reverse-prompt to no longer imply --interactive - Permit --reverse-prompt specifying custom EOS if non-interactive	2023-04-28 02:54:11 -07:00
Justine Tunney	e8b43903b2	Import llama.cpp https://github.com/ggerganov/llama.cpp 0b2da20538d01926b77ea237dd1c930c4d20b686 See third_party/ggml/README.cosmo for changes	2023-04-27 14:37:14 -07:00

9 commits