Commit graph

  • 7a50208155
    Merge 5293e17154 into aaa5505307 JohnnyB 2025-02-09 01:59:23 +08:00
  • af63886030 return reasoning_content before content ochafik 2025-02-08 17:58:46 +00:00
  • c0f972bb45 Use --reasoning-format, remove forced thinking for now ochafik 2025-02-08 17:58:33 +00:00
  • 6bf10c8b64 server : (webui) increase edit textarea size woof-dog 2025-02-08 18:52:38 +01:00
  • e78170eefb trigger ci again ochafik 2025-02-08 17:11:34 +00:00
  • b2b8c6140a revert changes ochafik 2025-02-08 17:09:17 +00:00
  • 98046495bc
    Merge d875c8e919 into aaa5505307 Jianlin Shi 2025-02-08 17:09:02 +00:00
  • 84fe6c4e93 format code Xuan Son Nguyen 2025-02-08 18:02:32 +01:00
  • 69fa94af58 add headers Xuan Son Nguyen 2025-02-08 17:56:30 +01:00
  • 8e092c4a15 add webworker Xuan Son Nguyen 2025-02-08 17:54:54 +01:00
  • aaa5505307
    server : minor log updates (#11760) Georgi Gerganov 2025-02-08 18:08:43 +02:00
  • 5f15932ab4 check curl version *before* the choco deps install (temp) ochafik 2025-02-08 15:26:20 +00:00
  • d37d309335 Install alternative curl to maybe avoid flaky auto certification revocation ochafik 2025-02-08 15:17:48 +00:00
  • bdcf8b6a56
    cont : fix mmap flag print (#11699) Georgi Gerganov 2025-02-08 16:49:38 +02:00
  • cc2c712cf9 Merge remote-tracking branch 'origin/master' into r1-toolcall ochafik 2025-02-08 14:35:10 +00:00
  • 4d3465c5ae
    ggml: Fix data race in ggml threadpool (#11736) b4671 Karol Kontny 2025-02-08 15:30:53 +01:00
  • 84919d2fbf better state management Xuan Son Nguyen 2025-02-08 15:25:39 +01:00
  • d86e23101e
    server : minor log updates gg/server-logs Georgi Gerganov 2025-02-08 16:23:37 +02:00
  • e1f03c4009 handle python exception Xuan Son Nguyen 2025-02-08 15:20:24 +01:00
  • 22e826336a fix multiple lines output and color scheme Xuan Son Nguyen 2025-02-08 15:16:26 +01:00
  • 19a95daf78 adapt layout on mobile view Xuan Son Nguyen 2025-02-08 14:58:30 +01:00
  • 6f1fcbcc0f bring back sticky copy button Xuan Son Nguyen 2025-02-08 14:53:51 +01:00
  • fbf2853f54 fix overflow for long output lines Xuan Son Nguyen 2025-02-08 14:41:20 +01:00
  • 586d8c3b86
    Merge a9db9b0048 into d80be897ac Aleksei Nikiforov 2025-02-08 13:20:11 +00:00
  • ff7fc14db0
    Merge 2ae9bb7764 into d80be897ac Isaac McFadyen 2025-02-08 13:20:05 +00:00
  • 64a2a00246
    Merge a444c15209 into d80be897ac Brian 2025-02-08 13:20:01 +00:00
  • 69c7a14e13
    Merge 178ad4e8c9 into d80be897ac Olivier Chafik 2025-02-08 13:19:59 +00:00
  • 79b81d9fa5
    Merge bddb1efc1f into d80be897ac Daniel Bevenius 2025-02-08 13:19:56 +00:00
  • 8acc7258af
    Merge 421e1f0cbd into d80be897ac Olivier Chafik 2025-02-08 13:19:56 +00:00
  • c75d758318
    Merge 27f59dbaaa into d80be897ac Daniel Bevenius 2025-02-08 13:19:49 +00:00
  • 5802ac4e85
    Merge 951f1d9053 into d80be897ac Andrei 2025-02-08 20:18:54 +08:00
  • be22b41fe3 build Xuan Son Nguyen 2025-02-08 13:12:38 +01:00
  • 115f75c5b1 fix auto scroll Xuan Son Nguyen 2025-02-08 13:03:33 +01:00
  • 483a3bc2ad add python code interpreter Xuan Son Nguyen 2025-02-08 13:01:02 +01:00
  • 422e53e607 redo Settings modal UI Xuan Son Nguyen 2025-02-07 22:53:34 +01:00
  • 2b5737f675 There's a better way of clearing lines Eric Curtin 2025-02-08 10:23:34 +00:00
  • 0d058857d7
    Merge 715682d21a into d80be897ac code 2025-02-08 18:06:15 +08:00
  • fc5cbf6e20
    Merge bf3ea1fd6e into d80be897ac magicse 2025-02-08 17:49:21 +08:00
  • d80be897ac
    CUDA: fix min. version for movmatrix (#11751) Johannes Gäßler 2025-02-08 10:46:07 +01:00
  • 3ab410f55f
    readme : update front-end framework (#11753) Nikolaos Pothitos 2025-02-08 11:43:04 +02:00
  • 0cf867160c
    server : (webui) fix numeric settings being saved as string (#11739) Xuan-Son Nguyen 2025-02-08 10:42:34 +01:00
  • 4b5a710e69
    readme : update front-end framework Nikolaos Pothitos 2025-02-08 10:52:48 +02:00
  • e9121333b9 CUDA: fix min. version for movmatrix Johannes Gäßler 2025-02-08 09:03:09 +01:00
  • 20b7af024b
    Merge 0dd48a6952 into d2fe216fb2 a3sh 2025-02-08 00:32:50 -05:00
  • 2f1e248444
    Merge 739648f3e6 into d2fe216fb2 Heiner 2025-02-07 22:12:35 -06:00
  • f3cdb201ef
    In streaming output mode, the content in delta is missing from the second to last data #11746 WangxuP 2025-02-08 11:28:32 +08:00
  • 2cf0d9ef3b ggml-cpu-aarch64: Fix compilation issues Dongyan Qian 2025-02-08 10:05:32 +08:00
  • 2745606c18 vulkan: Make Vulkan optional at runtime (#11493). Danny Milosavljevic 2025-01-29 18:47:51 +01:00
  • 4ff7009c3c blas build with openblas/blis/etc: use libraries and linker flags from from pkg-config Alexey Korepanov 2025-02-07 20:43:13 +00:00
  • 0a8995a375 adding test case for velvet chat template f.buciuni 2025-02-07 19:56:23 +01:00
  • 39795570db updating velvet chat template f.buciuni 2025-02-07 19:54:55 +01:00
  • 66e6d10b61 fixing position of LLM_CHAT_TEMPLATE_VELVET in enum f.buciuni 2025-02-07 19:53:16 +01:00
  • a354774283 add some more comments Xuan Son Nguyen 2025-02-07 18:30:17 +01:00
  • a54c54aef2 server : (webui) fix numeric settings being saved as string Xuan Son Nguyen 2025-02-07 18:25:51 +01:00
  • e04880fa54 More updates for review comments Charles Xu 2025-02-07 17:42:54 +01:00
  • 9edd10737b updates for review comments Charles Xu 2025-02-07 16:09:58 +01:00
  • 4c3a33b81b ggml: Fix data race in ggml threadpool Karol Kontny 2025-02-07 14:11:31 +01:00
  • d2fe216fb2
    Make logging more verbose (#11714) b4667 Eric Curtin 2025-02-07 14:42:46 +00:00
  • 15404ca5a7
    Merge b790a7ff29 into ed926d8833 Don Mahurin 2025-02-07 09:32:17 -05:00
  • ed926d8833
    llama : fix defrag logic (#11707) b4666 Georgi Gerganov 2025-02-07 16:05:34 +02:00
  • 2d219b389e
    vocab : ignore invalid UTF-8 input in the BPE tokenizer (#11729) Christian Fillion 2025-02-07 08:55:47 -05:00
  • 333820d749
    llama : fix progress dots (#11730) magicse 2025-02-07 15:48:47 +02:00
  • 72a78b802b
    Update llama.cpp magicse 2025-02-07 13:18:28 +02:00
  • 1edd618084
    Update server.cpp magicse 2025-02-07 13:13:01 +02:00
  • 1caeddea91
    Update llama.cpp magicse 2025-02-07 12:59:34 +02:00
  • cff1c3bc99
    ignore invalid UTF-8 input in the BPE tokenizer Christian Fillion 2025-02-07 03:43:14 -05:00
  • d6aab89361
    Merge bd9c319515 into c026ba3c23 Milot Mirdita 2025-02-07 11:30:40 +01:00
  • 8ba7b28c68
    Merge 74342d48c2 into c026ba3c23 Eugeniusz 2025-02-07 10:27:18 +00:00
  • c026ba3c23
    vulkan: print shared memory size (#11719) b4663 Jeff Bolz 2025-02-07 04:26:03 -06:00
  • 7ee953a64a
    llama : add llama_sampler_init for safe usage of llama_sampler_free (#11727) b4662 Christian Fillion 2025-02-07 04:33:27 -05:00
  • ec3bc8270b
    SYCL: remove XMX info from print devices (#11712) b4661 Akarshan Biswas 2025-02-07 14:57:53 +05:30
  • b7552cfcbc
    common : add default embeddings presets (#11677) b4660 Daniel Bevenius 2025-02-07 09:15:22 +01:00
  • 9d86a0442d removing whitespaces in src/lla-a-chat.cpp fbuciuni90 2025-02-07 08:12:02 +00:00
  • 861d3b99de
    cont : clamp fragmentation to 0.0 Georgi Gerganov 2025-02-07 09:50:32 +02:00
  • 225bbbfa39
    ggml : optimize and build warning fix for LoongArch (#11709) b4659 Jinyang He 2025-02-07 15:38:31 +08:00
  • f9ede5230c
    add llama_sampler_init for safe usage of llama_sampler_free Christian Fillion 2025-02-07 02:16:30 -05:00
  • 85b0a8f13a lasx_ext16_32: Initialize the vector Dongyan Qian 2025-02-07 14:42:38 +08:00
  • 78507168e9 Add Janus Attention Pool with Latent Query support in CLIP model ravenouse 2025-02-07 06:04:41 +00:00
  • 9245d7a95a
    Merge 044d4998ae into 855cd0734a uvos 2025-02-07 12:52:38 +08:00
  • 701623a8d0 docs: add OpenCL Li He 2025-02-04 22:42:33 -08:00
  • 4306c3615f
    Merge 63978cb6dc into 855cd0734a Zhenwei Jin 2025-02-06 22:17:22 +00:00
  • 855cd0734a
    llama : fix old glm4 models (#11670) b4658 tv1wnd 2025-02-06 22:48:51 +01:00
  • 01feb09107 ggml : draft commit, replace reallocation of vector for set_tensor by reserve inital vector and use it lexasub 2025-02-07 00:25:50 +04:00
  • 0dab5bdb0c vulkan: print shared memory size Jeff Bolz 2025-02-06 14:22:26 -06:00
  • fa55281759 separate vision ctx and llm ctx Xuan Son Nguyen 2025-02-06 20:32:09 +01:00
  • 8a59053f63
    sync : ggml b4657 Georgi Gerganov 2025-02-06 21:23:03 +02:00
  • 1d20e53c40
    rpc: fix known RCE in rpc-server (ggml/1103) Patrick Peng 2025-02-06 09:29:13 -05:00
  • ff77b15845 Merge branch 'master' into xsn/vision_2 Xuan Son Nguyen 2025-02-06 18:08:37 +01:00
  • 52b0bb3731
    Update src/llama-chat.cpp Francesco Buciuni 2025-02-06 17:44:45 +01:00
  • 3df9d221ed
    Update include/llama.h Francesco Buciuni 2025-02-06 17:39:47 +01:00
  • 99be555369
    Update convert_hf_to_gguf.py Francesco Buciuni 2025-02-06 17:38:58 +01:00
  • 07e1d0a14c
    Update convert_hf_to_gguf.py Francesco Buciuni 2025-02-06 17:38:30 +01:00
  • 2fb3c32a16
    server : (webui) migrate project to ReactJS with typescript (#11688) Xuan-Son Nguyen 2025-02-06 17:32:29 +01:00
  • 1dc99ef775 fix code block cannot be selected while generating Xuan Son Nguyen 2025-02-06 17:08:04 +01:00
  • 67b38f5849 Supporting Velvet model fbuciuni90 2025-02-06 16:02:00 +00:00
  • 12af7ede88 Make logging more verbose Eric Curtin 2025-02-06 15:36:05 +00:00
  • 629093c041
    Merge 90a0349349 into 9ab42dc722 LostRuins Concedo 2025-02-06 22:52:28 +08:00
  • fde80bf512
    SYCL: remove XMX info from print devices for now Akarshan Biswas 2025-02-06 20:08:00 +05:30
  • 1c5a87b5e9 vulkan: account for lookup tables when checking shared memory size Jeff Bolz 2025-01-29 16:10:26 -06:00
  • fa7d604067
    Merge 9c972aa43c into 9ab42dc722 katsu560 2025-02-06 13:36:21 +01:00