common: llama_load_model_from_url split support (#6192)

* llama: llama_split_prefix fix strncpy does not include string termination
common: llama_load_model_from_url:
 - fix header name case sensitive
 - support downloading additional split in parallel
 - hide password in url

* common: EOL EOF

* common: remove redundant LLAMA_CURL_MAX_PATH_LENGTH definition

* common: change max url max length

* common: minor comment

* server: support HF URL options

* llama: llama_model_loader fix log

* common: use a constant for max url length

* common: clean up curl if file cannot be loaded in gguf

* server: tests: add split tests, and HF options params

* common: move llama_download_hide_password_in_url inside llama_download_file as a lambda

* server: tests: enable back Release test on PR

* spacing

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* spacing

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* spacing

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

This commit is contained in:

Pierrick Hymbert

2024-03-23 18:07:00 +01:00

• committed by

GitHub

parent 1997577d5e

commit f482bb2e49

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

10 changed files with 200 additions and 63 deletions

									
										1

.github/workflows/server.yml
									
										vendored
									
										View file
										
				@ -35,7 +35,6 @@ jobs:

				        include:

				          - build_type: Release

				            sanitizer: ""

				            disabled_on_pr: true

				      fail-fast: false # While -DLLAMA_SANITIZE_THREAD=ON is broken

				    container:

Rows
Columns

common: llama_load_model_from_url split support (#6192)

1 .github/workflows/server.yml vendored Unescape Escape View file

1

.github/workflows/server.yml vendored

View file