Commit graph

  • c0e1cb53c7
    🙈 Add output chat to .gitignore Nato Boram 2023-03-18 14:45:10 -04:00
  • c21c89edca
    Update README.md LostRuins 2023-03-19 00:50:03 +08:00
  • 42f307ef6a
    Update README.md LostRuins 2023-03-19 00:21:59 +08:00
  • 2b188521a1
    Merge branch 'ggerganov:master' into concedo LostRuins 2023-03-19 00:20:09 +08:00
  • 5a6f3b01bd update readme Concedo 2023-03-19 00:19:34 +08:00
  • 0dc3ab930c Updated binaries Concedo 2023-03-19 00:09:00 +08:00
  • e3d85aa08b Merge branch 'master' into concedo Concedo 2023-03-19 00:07:32 +08:00
  • 2c8f870f53 Created a python bindings for llama.cpp and emulated a simple Kobold HTTP API Endpoint Concedo 2023-03-19 00:07:11 +08:00
  • edc17cfa9f
    Remove direct access to std streams from llama_main Thiago Padilha 2023-03-18 12:20:20 -03:00
  • 1088d2dd04
    Move model loading back to main.cpp Thiago Padilha 2023-03-18 12:12:00 -03:00
  • e3648474d6
    Add main.cpp back, and invoke llama_main from it Thiago Padilha 2023-03-18 11:58:11 -03:00
  • 82e70dbfe0
    Move struct definitions in llama.cpp to llama.h Thiago Padilha 2023-03-18 11:52:55 -03:00
  • 51d003e885
    Move main.cpp to llama.cpp Thiago Padilha 2023-03-18 11:49:09 -03:00
  • b64ca1c07c
    Merge pull request #40 from rupeshs/windows-console-ansi-color-fix Kevin Kwok 2023-03-18 07:37:29 -07:00
  • d3f202d57b
    Remove unused code since n_vocab is model.hparams.n_vocab (#262) master-d3f202d Alex Nguyen 2023-03-18 20:51:49 +07:00
  • fd73543510 make publishable Emanuel Seemann 2023-03-18 14:34:32 +01:00
  • 8bb0dd55f4
    Merge pull request #1 from MariusCiocanel/MariusCiocanel-curl-instead-of-wget-1 Marius Ciocanel 2023-03-18 13:12:18 +00:00
  • bb60fdaf32
    Update command for downloading the weights to use curl Marius Ciocanel 2023-03-18 13:10:25 +00:00
  • 51fa40be1b Remove unused code since n_vocab is model.hparams.n_vocab Tien Dung 2023-03-18 19:31:24 +07:00
  • 60f519c74a add self to license Emanuel Seemann 2023-03-18 13:29:43 +01:00
  • 092393781f add all llamacpypy Emanuel Seemann 2023-03-18 13:26:50 +01:00
  • eb3d30e53d add modules Emanuel Seemann 2023-03-18 13:24:14 +01:00
  • e03e359730
    fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -04:00
  • a81d0c2a17
    Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -07:00
  • a83e2e7a24 Windows console ANSI color issue fixed Rupesh Sreeraman 2023-03-18 16:41:20 +05:30
  • a44ccef6ac
    Merge branch 'master' into optimize-convert tpoisonooo 2023-03-18 18:41:28 +08:00
  • 4a524c51ba commenting out aarch antimatter15 2023-03-18 01:15:51 -07:00
  • ddc4e24cb8 maybe macos-arm64 is case sensitive antimatter15 2023-03-18 01:11:02 -07:00
  • 564b861bac archiving artifacts antimatter15 2023-03-18 01:06:44 -07:00
  • 3f7d187b6b more copying stuff antimatter15 2023-03-18 00:55:00 -07:00
  • 1c62e35984 create release antimatter15 2023-03-18 00:37:34 -07:00
  • 7e126618c4 ci releases for mac and linux antimatter15 2023-03-18 00:34:53 -07:00
  • 501a8e19d9 adding to credit section antimatter15 2023-03-18 00:28:05 -07:00
  • 1cb9215e5d removing random prompt generation antimatter15 2023-03-18 00:19:52 -07:00
  • e95e64bd49 Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • b2de7f18df
    CI Improvements (#230) anzz1 2023-03-18 09:27:12 +02:00
  • 7b24407613
    Merge pull request #31 from anzz1/ci_test Kevin Kwok 2023-03-18 00:06:16 -07:00
  • 96e0519ae1 extending context window antimatter15 2023-03-17 23:46:31 -07:00
  • 97d327e1bf
    Update chat.cpp Kevin Kwok 2023-03-17 23:43:09 -07:00
  • 7cd84a7027
    Update README.md Kevin Kwok 2023-03-17 22:57:27 -07:00
  • 4cf24a4df4
    Fix n^2 loop in tokenization Gary Linscott 2023-03-17 22:34:11 -07:00
  • 0b5448a3a4
    Implement system polyfill for win32 / posix.1 Justine Tunney 2023-03-17 21:22:40 -07:00
  • a19b5a4adc Merge remote-tracking branch 'origin/master' into concedo Concedo 2023-03-18 10:52:54 +08:00
  • b10383fbff merge oKatanaaa 2023-03-18 02:02:02 +00:00
  • a292747893
    Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +01:00
  • c9f670a177
    Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • 4f54609110
    Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +02:00
  • e81b9c81c1
    Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +02:00
  • d0f855a8df
    Update README.md Kevin Kwok 2023-03-17 11:13:47 -07:00
  • 367946c668
    Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +00:00
  • 7566d1ab9b Insert single space in front of the prompt Jakub Horak 2023-03-17 18:08:03 +01:00
  • f60e972eb5
    better CI naming convention anzz1 2023-03-17 19:44:02 +02:00
  • 7e1041a730 Implement non-greedy tokenizer that tries to maximize token lengths Jakub Horak 2023-03-17 17:35:41 +01:00
  • 6adc143e31 Don't tell users to use a bad number of threads Stephan Walter 2023-03-17 18:27:22 +01:00
  • e7bdee6cc9
    CI fine tuning anzz1 2023-03-17 19:17:59 +02:00
  • 1e82fa82c6
    Merge pull request #26 from mcmonkey4eva/master Kevin Kwok 2023-03-17 09:44:40 -07:00
  • 71d097898d
    Merge pull request #12 from anzz1/ci_test Kevin Kwok 2023-03-17 09:44:26 -07:00
  • 197df5f096
    Merge pull request #18 from jxy/limit_tokens Kevin Kwok 2023-03-17 09:44:12 -07:00
  • 6b0df5ccf3
    add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +08:00
  • c6a0f83b12
    Docker image shouldn't be published on PRs Sebastián A 2023-03-17 13:36:28 -03:00
  • c0f6681694
    Merge branch 'master' into fix-cmake-pthread Sebastián A 2023-03-17 13:33:42 -03:00
  • af02c94798 add easy Windows install instructions to the readme Alex "mcmonkey" Goodwin 2023-03-17 04:49:41 -07:00
  • 584add5c18 fix publish permission Bernat Vadell 2023-03-17 10:54:21 +01:00
  • 2af23d3043
    🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +01:00
  • 79a48d9876 feat: include all-in-one command tool & update readme.md Bernat Vadell 2023-03-17 10:31:53 +01:00
  • fb324e035a improvement(tools): add example tpoisonooo 2023-03-17 16:56:45 +08:00
  • 3c7cb413fb improvement(tools): optimize with argparse tpoisonooo 2023-03-17 16:53:53 +08:00
  • 7f2a53363c
    CI Improvements anzz1 2023-03-17 10:08:49 +02:00
  • 50fa1a006e Merge branch 'master' into feat/dockerize Bernat Vadell 2023-03-17 09:02:11 +01:00
  • db4945c22a fix(ggml): compile error on centos 7.9 and gcc7 tpoisonooo 2023-03-17 15:06:09 +08:00
  • 3877722a08 fix compile error on centos 7.5 chenliangyu 2023-03-17 15:00:12 +08:00
  • 5be098f51e Compute remaining tokens along the way and exit if over Xiao-Yong Jin 2023-03-17 00:20:24 -05:00
  • 5b8023d935
    Implement prototype for instant mmap() loading Justine Tunney 2023-03-16 22:03:09 -07:00
  • 904d2a8d6a
    Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +01:00
  • f8db3d6cd9 Improved quantize script SuajCarrot 2023-03-16 21:15:41 -06:00
  • 3d4854455c ban eos token Concedo 2023-03-17 11:01:02 +08:00
  • 27990d54ed minor change (+1 squashed commits) oKatanaaa 2023-03-16 22:22:42 +00:00
  • 197020deee Use F16 for memory_k and memory_v Ty Everett 2023-03-14 23:10:12 -07:00
  • 07d2da0cb0 forgot the newline that needs to be sent to the model Rickey Bowers Jr 2023-03-16 20:54:06 -06:00
  • 8470458b64 fix coloring of last n_batch of prompt, and refactor line input Rickey Bowers Jr 2023-03-16 20:23:32 -06:00
  • e8fc1d5491
    Merge branch 'fix-cmake-pthread' of https://github.com/mmyjona/llama.cpp into fix-cmake-pthread mmyjona 2023-03-17 10:15:35 +08:00
  • 88f776b972
    separate make and cmake workflow mmyjona 2023-03-17 10:15:06 +08:00
  • 376a4a2ab0
    Merge branch 'master' into ci_test anzz1 2023-03-17 04:11:13 +02:00
  • 8a3d6f2ea4
    CI: Enable Linux/MacOS builds anzz1 2023-03-17 04:00:40 +02:00
  • f2eda96137
    Fix Makefile anzz1 2023-03-17 03:58:34 +02:00
  • 235a4115df
    Update build.yml Kevin Kwok 2023-03-16 18:39:41 -07:00
  • 72f9fbe4d1
    Fix #2 Kevin Kwok 2023-03-16 18:36:09 -07:00
  • 2af2331add
    Merge pull request #3 from bigattichouse/master Kevin Kwok 2023-03-16 18:30:17 -07:00
  • bf244623ef
    Merge pull request #5 from anzz1/master Kevin Kwok 2023-03-16 18:26:03 -07:00
  • b4db6b7235 makefile: Fix CPU feature detection on Haiku Alexander von Gluck IV 2023-03-16 20:20:01 -05:00
  • fc924e9aba minor change oKatanaaa 2023-03-16 22:38:06 +00:00
  • 7252a2b658 refactor: make weights load faster oKatanaaa 2023-03-16 22:22:42 +00:00
  • c2577fdb04
    Merge branch 'master' into master qunash 2023-03-17 00:31:52 +03:00
  • 0660aeed69 process the scanf() output so Ubuntu 22 compiler doesn't error due to default warn_unused_result instead of Makefile -Wunused-result Michael E. Johnson 2023-03-16 16:18:12 -05:00
  • 7b8858415e
    Scale buf_size linearly with n_ctx hx507 2023-03-17 05:11:49 +08:00
  • c8917ca358
    Fix Windows CI and autorelease anzz1 2023-03-16 23:08:18 +02:00
  • 6d973a699f
    Fix CMake anzz1 2023-03-16 23:07:23 +02:00
  • 9f90347596
    Update main.cpp Justin Suess 2023-03-16 12:45:24 -04:00
  • e750735216 first commit antimatter15 2023-03-16 05:56:54 -07:00
  • 721311070e
    Update README.md Georgi Gerganov 2023-03-16 15:00:09 +02:00