* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
		
			
				
	
	
		
46 lines, 1.9 KiB, Text
			
		
		
	
	
 1052 207 19 207 19109 4223
 37 100014 71 6245

 207
 243
 300
 184
 185
 185 185
 185 185 185
 184 185
 17464 1843
 37727 1843
 17464 5427
 37727 5427
 37727 5427 0
 17464 11 1843 0
 37727 11 1843 0
 437 317 12356 99 234 13 14743
 86 15 19 23 207 22 83 3970 27519 26016 3944 14025
 1603 6476 620 91754
 71374 209 71374 114 71374 228 155 240 220 71374 224 155 240 211 71374 231 71374 115 71374 240 155 240 210 71374 240 71374 95 71374 114 71374 214 71374 210 71374 236 71374 214 155 240 210 71374 218
 10044 95300 334 8754 8 33701 114 350 222 10044 221 104 46713 334 34732 996 24250 262 80923 8 207 37103 214 334 5956 89213 344 643 895 1377 10728 8
 17464
 37727
 207 37727
 243 37727
 300 37727
 300 37727 185 300 37727
 334
 185 403
 6 2906
 17464 11 320 6 436 0 1724 418 340 33701 210 3025 19017 612 9407 2681 16 18 16 19 16 20 16 1398 68940 239
 15278 3033
 18
 18 18
 18 18 18
 18 18 18 18
 18 18 18 18 18
 18 18 18 18 18 18
 18 18 18 18 18 18 18
 18 18 18 18 18 18 18 18
 18 18 18 18 18 18 18 18 18
 34 32555 242 64 23708 32555 216 83
 1763 2550
 185 207 185 185 207 185 185 185 207 11969 486 22504 185 243 185 300 185 251 185 663 185 10044 95300 334 8754 8 33701 114 350 222 10044 221 104 46713 334 34732 996 24250 262 80923 8 207 37103 214 12356 99 234 10044 99 234 207 18 207 18 18 207 18 18 18 207 18 18 18 18 207 18 18 18 18 18 207 18 18 18 18 18 18 207 18 18 18 18 18 18 18 207 18 18 18 18 18 18 18 18 207 18 13 18 207 18 526 18 207 18 1204 18 207 71374 209 71374 114 71374 228 155 240 220 71374 224 155 240 211 71374 231 71374 115 71374 240 155 240 210 71374 240 71374 95 71374 114 71374 214 71899 210 3025 19017 612 9407 2681 16 18 16 19 16 20 16 1398 68940 239 78827 55170 76659 620 91754 31116 36804 4885 4885 10897 4390 4390 41047 15278 3033 14986 5675 304 6 313 803 655 33326 362 6 82 745 11 655 1374 340 2049 30 655 44 441 2049 304 6 647 1099 359 11 655 35 340 837 742 10842 30 1003 6 10699 245 6 75 43