* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
46 lines, 1.7 KiB, Text
 1122 220 19 220 26062 3951
 37 50753 261

 220
 256
 262
 197
 198
 271
 1406
 1572
 9707 1879
 21927 1879
 9707 4337
 21927 4337
 21927 4337 0
 9707 11 1879 0
 21927 11 1879 0
 419 374 11162 99 247 13 10821
 86 15 19 23 220 22 83 1963 41808 11472 2940 16739
 78762 14144 1456 13073 63471 33594 3038 133178 79012
 146394 97529 241 44258 233 146568 44258 224 147603 20879 115 146280 44258 223 146280 147272 97529 227 147805 148301 147270 44258 223 146848
 145836 320 8252 8 26525 114 378 235 149921 30543 320 35673 99066 97534 8 25521 227 320 3243 42365 429 702 1181 1828 3950 8
 9707
 21927
 220 21927
 256 21927
 262 21927
 262 21927 198 262 21927
 320
 198 284
 6 11385
 9707 11 379 64848 0 2585 525 498 26525 223 937 104100 18493 22377 99257 16 18 16 19 16 20 16 35727 21216
 17085 2928
 18
 18 18
 18 18 18
 18 18 18 18
 18 18 18 18 18
 18 18 18 18 18 18
 18 18 18 18 18 18 18
 18 18 18 18 18 18 18 18
 18 18 18 18 18 18 18 18 18
 34 90063 128324
 2560 2347
 198 4710 14731 65497 7847 1572 2303 78672 10947 145836 320 8252 8 26525 114 378 235 149921 30543 320 35673 99066 97534 8 25521 227 11162 99 247 149955 220 18 220 18 18 220 18 18 18 220 18 18 18 18 220 18 18 18 18 18 220 18 18 18 18 18 18 220 18 18 18 18 18 18 18 220 18 18 18 18 18 18 18 18 220 18 13 18 220 18 496 18 220 18 1112 18 220 146394 97529 241 44258 233 146568 44258 224 147603 20879 115 146280 44258 223 146280 147272 97529 227 144534 937 104100 18493 22377 99257 16 18 16 19 16 20 16 35727 21216 55460 53237 18658 14144 1456 13073 63471 33594 3038 133178 79012 3355 4605 4605 13874 13874 73594 3014 3014 28149 17085 2928 26610 7646 358 3003 1012 364 83 813 566 594 1052 11 364 787 498 2704 30 364 44 537 2704 358 3278 1281 432 11 364 35 498 1075 1045 15243 30 1205 6 42612 264 63866 43