Commit graph

  • d349f61380 Change model name Billel Mokeddem 2024-12-16 07:52:25 +00:00
  • 5cb6209de5 Fixes for building SYCL backend for AMD GPUs lhl 2024-12-16 16:44:57 +09:00
  • 5d46c48137 fix musa build on aarch64 Huaishun Hu 2024-12-16 14:42:53 +08:00
  • 2607b7de0f
    SYCL: Integrate debug logs with GGML_LOG and other fixes Akarshan Biswas 2024-12-16 11:27:38 +05:30
  • c656d929ff multi row k quant shaders! Eve 2024-12-15 23:34:50 -05:00
  • 663716ce42 only s_warptile_mmq needs to be run with 32 threads or more Eve 2024-12-15 15:47:24 -05:00
  • 4ddd199f6f
    llava : Allow locally downloaded models for QwenVL (#10833) Bartowski 2024-12-15 15:43:25 -05:00
  • a0974156f3
    llama : add Deepseek MoE v1 & GigaChat models (#10827) b4333 Valentin Mamedov 2024-12-16 00:02:46 +07:00
  • 87cf323cef
    scripts : change build path to "build-bench" for compare-commits.sh (#10836) Georgi Gerganov 2024-12-15 18:44:47 +02:00
  • 19ce4b64b7
    SYCL: Add pragma directive to suppress warning spam Akarshan Biswas 2024-12-15 21:02:43 +05:30
  • 7778b89d30 add test for no dangling pointers Johannes Gäßler 2024-12-15 15:18:01 +01:00
  • 5ed4403558
    SYCL: Add back static to ggml_backend_buffer_is_sycl_split function Akarshan Biswas 2024-12-15 19:22:52 +05:30
  • 0662a86809
    SYCL: remove extra space Akarshan Biswas 2024-12-15 19:18:52 +05:30
  • f8603b0cc0
    SYCL: fix assertions and add proper comments Akarshan Biswas 2024-12-15 19:11:43 +05:30
  • 0c6eafdac1 change placement of gigachat chat template Valentin Mamedov 2024-12-15 20:32:42 +07:00
  • 6cdb3d86f9 Merge remote-tracking branch 'upstream/master' into gigachat-model Valentin Mamedov 2024-12-15 20:24:29 +07:00
  • 78ef42665b move deepseek above deepseek2 Valentin Mamedov 2024-12-15 20:22:41 +07:00
  • da40c42062
    SYCL: common.cpp try to migrate away from tensor->backend Akarshan Biswas 2024-12-15 18:41:58 +05:30
  • 6ee759966c use std::string instead of static char Judd 2024-12-15 20:01:04 +08:00
  • 5478bbcd17
    server: (UI) add syntax highlighting and latex math rendering (#10808) b4331 Vinesh Janarthanan 2024-12-15 05:55:54 -06:00
  • b5ae1ddff9
    gguf-py : bump to v0.13.0 gguf-v0.13.0 Georgi Gerganov 2024-12-15 13:16:42 +02:00
  • 39f8347504 use id for color; simple_hash removed. Judd 2024-12-15 19:00:36 +08:00
  • 3e92f4ecbe
    cont [no ci] gg/unicode-refactor Georgi Gerganov 2024-12-15 12:36:03 +02:00
  • 8c2233ac06
    rm trailing space Xuan Son Nguyen 2024-12-15 11:35:15 +01:00
  • 7a20c287c7
    unicode : improve naming style Georgi Gerganov 2024-12-15 12:24:04 +02:00
  • 7e9208e408
    scripts : change build path to "build-bench" for compare-commits.sh gg/compare-change-path Georgi Gerganov 2024-12-15 11:47:30 +02:00
  • 8e69669007
    Fix compilation on Pop!_OS 22.04 LTS CUDA Mika Pi 2024-12-15 00:43:30 -08:00
  • 5806435526 Merge remote-tracking branch 'upstream/master' into gigachat-model Valentin Mamedov 2024-12-15 14:32:26 +07:00
  • 6e13df8d57 remove comments Valentin Mamedov 2024-12-15 14:05:17 +07:00
  • 43c679507f fix order of deepseek and deepseek2 in constants; mark shared exp as deepseek arch need Valentin Mamedov 2024-12-15 13:53:42 +07:00
  • b32159c8a7 fix order of deepseek and deepseek2, move gigachat temlate to the end of func Valentin Mamedov 2024-12-15 13:42:33 +07:00
  • 66e59b0155 lint llama.cpp Valentin Mamedov 2024-12-15 13:37:12 +07:00
  • 35bff171af
    Migrate to tensor->buffer for checking backend buffer type: 1 Akarshan Biswas 2024-12-15 11:45:43 +05:30
  • 7e3feff073 tool-call: stabilize server tests ochafik 2024-12-15 00:16:12 +00:00
  • 89d604f2c8
    server: Fix has_next_line in JSON response (#10818) gguf-v0.12.0 b4329 Michelle Tan 2024-12-14 22:29:45 +00:00
  • 7bfd83ce05 fix memory leak failure Johannes Gäßler 2024-12-14 22:43:58 +01:00
  • 107b3538d0
    Define model_path Bartowski 2024-12-14 16:19:31 -05:00
  • b01af274c7
    Allow locally downloaded models for QwenVL Bartowski 2024-12-14 15:54:22 -05:00
  • e52aba537a
    nix: allow to override rocm gpu targets (#10794) Evgeny Kurnevsky 2024-12-14 18:17:36 +00:00
  • fecf662ec1 try Windows fix Johannes Gäßler 2024-12-14 18:07:46 +01:00
  • baa8b5d2d2 try macOS fix Johannes Gäßler 2024-12-14 16:43:39 +01:00
  • f220234fe1 Clean up: Fix lint. MichelleTPY 2024-12-14 15:42:55 +00:00
  • 558e690614 Clean up: Fix lint. MichelleTPY 2024-12-14 15:41:15 +00:00
  • 7bfcd0a8dd Merge remote-tracking branch 'origin/master' into tool-call ochafik 2024-12-14 15:08:00 +00:00
  • 1e2115ffb9 tool-calls: shorter name: grammar_triggers ochafik 2024-12-14 15:05:18 +00:00
  • 055053c859 Merge remote-tracking branch 'origin/master' into tool-call ochafik 2024-12-14 15:04:45 +00:00
  • 299d681c52 tests: add tests for GGUF Johannes Gäßler 2024-12-10 15:50:27 +01:00
  • 0579e3bf65 Refactor: Add llamma_ prefix in unicode.h unicode.cpp MichelleTPY 2024-12-14 14:18:25 +00:00
  • 858dad8d91 latex codeblock as code Xuan Son Nguyen 2024-12-14 14:46:55 +01:00
  • 7985295afb fix format Valentin Mamedov 2024-12-14 20:42:59 +07:00
  • 64c16c4ae0 Merge branch 'master' into vulkan Zhiyuan Li 2024-12-14 21:28:50 +08:00
  • 89714175e7 remove comment Valentin Mamedov 2024-12-14 20:22:19 +07:00
  • f3d0a23fe5 delete comments Valentin Mamedov 2024-12-14 20:20:45 +07:00
  • ca168fc7a4 add readme Valentin Mamedov 2024-12-14 20:00:01 +07:00
  • 2d30fd4457 improve template code Valentin Mamedov 2024-12-14 19:59:06 +07:00
  • 504121ec4b fix warnings; remove ggml_backend_sched_splits_fdump_dot. Judd 2024-12-14 20:55:16 +08:00
  • ba1cb19cdd
    llama : add Qwen2VL support + multimodal RoPE (#10361) b4327 HimariO 2024-12-14 20:43:46 +08:00
  • 9f89d7d8e4 Merge remote-tracking branch 'fork/master' Valentin Mamedov 2024-12-14 15:32:30 +03:00
  • da8cf83f86 Add deepseek v1 arch & gigachat template Valentin Mamedov 2024-12-14 15:23:40 +03:00
  • 7db99a044e Address code review comment: type check for has_new_line in unit test MichelleTPY 2024-12-14 12:05:39 +00:00
  • cd4015643f add comment Xuan Son Nguyen 2024-12-14 12:58:09 +01:00
  • bb4e17fc70 fix latex rendering Xuan Son Nguyen 2024-12-14 12:55:21 +01:00
  • 10aa898c83 ability to add a demo conversation for dev Xuan Son Nguyen 2024-12-14 12:35:59 +01:00
  • 046c0d77a9 llama : use zero value of n_swa to distinguish Phi-4 from other PHI3 models Stanisław Szymczyk 2024-12-14 12:00:19 +01:00
  • c7fdbd3735 convert-hf : use zero value of sliding_window to distinguish Phi-4 from other PHI3 models Stanisław Szymczyk 2024-12-14 11:59:59 +01:00
  • 520e8a0377 convert-hf : do not use model name to distinguish Phi-4 from Phi-3 Stanisław Szymczyk 2024-12-14 11:28:14 +01:00
  • 12d8cd683d add ggml_backend_sched_dump_dot Judd 2024-12-14 18:09:39 +08:00
  • f96909e2fd
    remote old rope_section compare operator HimariO 2024-12-14 11:58:38 +08:00
  • 6110a9b36e
    Merge branch 'master' into vulkan_llvmpipe Eve 2024-12-14 01:33:59 +00:00
  • 4de800cc82
    Merge branch 'ggerganov:master' into server-update-JSON-response Michelle Tan 2024-12-14 00:43:39 +00:00
  • 29e6298d2e Remove has_new_line unit test changes. MichelleTPY 2024-12-14 00:36:33 +00:00
  • a2e03b826f fix: Use gpt2 tokenizer for roberta and add eos/bos tokens Gabe Goodhart 2024-12-13 16:41:40 -07:00
  • 56eea0781c
    Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771) b4326 cduk 2024-12-13 23:21:49 +01:00
  • 3c8a053459
    Merge branch 'ggerganov:master' into server-update-JSON-response Michelle Tan 2024-12-13 21:44:11 +00:00
  • a76c56fa1a
    Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693) b4325 lhez 2024-12-13 12:23:52 -08:00
  • 2e0f15fd91
    Merge 8545425976 into c27ac678dd Wang Qin 2024-12-13 20:47:10 +01:00
  • 220cf7f780 Add unit test to check has_new_line JSON response MichelleTPY 2024-12-13 19:22:15 +00:00
  • 9697d07b21 opencl: update log message for unsupported GPUs Max Krasnyansky 2024-12-13 10:31:54 -08:00
  • dbaa360a55 opencl: check for various requirements, allow deprecated API Li He 2024-12-12 23:06:17 -08:00
  • b41b6e679f opencl: fix MSVC builds (string length error) Max Krasnyansky 2024-12-12 22:03:27 -08:00
  • b25a4caaf4 opencl: fail gracefully if opencl devices are not available Max Krasnyansky 2024-12-12 14:51:08 -08:00
  • c971a1885d opencl: fix compiler warnings with GCC and Clang Max Krasnyansky 2024-12-12 12:31:55 -08:00
  • 3bc085b359 opencl: use pools for tensor_extra Li He 2024-12-11 23:19:52 -08:00
  • 74a9bafcb9 opencl: remove limits on tensor_extra Li He 2024-12-11 21:46:03 -08:00
  • 70063c6c0c opencl: replace some more OPENCL2 leftovers Max Krasnyansky 2024-12-11 21:38:24 -08:00
  • c64ef0fb5c opencl: remove copyright marker since main license already covers Li He 2024-12-11 15:15:46 -08:00
  • e447dbcc01 opencl: rename backend - funcs, structs, etc opencl2 -> opencl Li He 2024-12-11 14:48:26 -08:00
  • 22411ab58f opencl: make OpenCL required, remove redundant lib and inc directories Li He 2024-12-11 14:07:36 -08:00
  • 97a12703dd opencl: rename kernel files ggml-opencl2 -> ggml-opencl Li He 2024-12-10 23:24:34 -08:00
  • 34f2fc15ea opencl: rename backend opencl2 -> opencl Li He 2024-12-10 22:17:24 -08:00
  • e9a97381f2 opencl: use GGML_LOG_xxx instead of fprintf(stderr, ...) Li He 2024-12-10 20:42:15 -08:00
  • 9a9d92b0b9 opencl: use cl_ulong for sizes and strides Max Krasnyansky 2024-12-07 18:02:15 -08:00
  • c21fc8c5f9 opencl: use cl_ulong for all offsets Max Krasnyansky 2024-12-07 17:44:42 -08:00
  • 31f305ea01 opencl: use ulong for offsets and strides in ADD kernel Max Krasnyansky 2024-12-07 17:35:26 -08:00
  • 0451edd936 opencl: cleanup ggml-opencl2 header file Max Krasnyansky 2024-12-07 16:49:01 -08:00
  • 66d4330377 opencl: Clean up small-alloc in CMake files Li He 2024-11-28 23:05:51 -08:00
  • 969a00a4b9 opencl: CI workflow fixes Max Krasnyansky 2024-11-28 16:37:03 -08:00
  • 4bca601be6 opencl: fix embed tool invocation with python3 Max Krasnyansky 2024-11-28 16:24:44 -08:00
  • 9b6540b6f9 opencl-ci: use RUNNER_TEMP instead of github.workspace Max Krasnyansky 2024-11-28 15:57:36 -08:00
  • d24b360255 opencl: fixed merge conflict (MUSA added twice in cmake) Max Krasnyansky 2024-11-28 15:37:54 -08:00