* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
		
			
				
	
	
		
46 lines, 1.7 KiB, Text
	
	
			
		
		
	
	
 1142 220 19 220 27154 4038
 37 51853 261

 220
 256
 262
 197
 198
 271
 1432
 1602
 9906 1917
 22691 1917
 9906 4435
 22691 4435
 22691 4435 0
 9906 11 1917 0
 22691 11 1917 0
 420 374 11410 99 247 13 11055
 86 23904 220 22 83 2005 42908 11729 3013 17156
 79862 102118 13373 64571 34694 3114 112203 80112
 21549 222 98629 241 45358 233 21549 237 45358 224 21549 244 21549 115 21549 253 45358 223 21549 253 21549 95 98629 227 21549 223 21549 249 21549 227 45358 223 21549 231
 9468 248 222 320 8416 8 27623 114 102470 9468 234 104 31643 320 36773 100166 98634 8 26602 227 320 3323 43465 430 706 1202 1866 4037 8
 9906
 22691
 220 22691
 256 22691
 262 22691
 262 22691 198 262 22691
 320
 198 284
 6 11639
 9906 11 379 65948 0 2650 527 499 27623 223 949 37046 101067 19000 23182 102301 9263 18136 16 36827 21909
 17523 3001
 18
 1644
 8765
 8765 18
 8765 1644
 8765 8765
 8765 8765 18
 8765 8765 1644
 8765 8765 8765
 34 91163 101798
 2624 2402
 198 4815 15073 66597 8004 1602 2355 79772 11187 9468 248 222 320 8416 8 27623 114 102470 9468 234 104 31643 320 36773 100166 98634 8 26602 227 11410 99 247 9468 99 247 220 18 220 1644 220 8765 220 8765 18 220 8765 1644 220 8765 8765 220 8765 8765 18 220 8765 8765 1644 220 18 13 18 220 18 497 18 220 18 1131 18 220 21549 222 98629 241 45358 233 21549 237 45358 224 21549 244 21549 115 21549 253 45358 223 21549 253 21549 95 98629 227 76460 223 949 37046 101067 19000 23182 102301 9263 18136 16 36827 21909 56560 54337 19175 102118 13373 64571 34694 3114 112203 80112 3436 106451 14196 14196 74694 3089 3089 29249 17523 3001 27708 7801 358 3077 1027 364 83 820 568 596 1070 11 364 793 499 2771 30 364 44 539 2771 358 3358 1304 433 11 364 35 499 1093 1063 15600 30 1226 6 43712 264 64966 43