Georgi Gerganov
2b3389677a
ggml : refactor rope norm/neox (#7634)

* ggml : unify rope norm/neox (CPU)
* ggml : fix compile warning
* ggml : remove GLM rope mode
ggml-ci
* metal : better rope implementation
ggml-ci
* cuda : better rope implementation
ggml-ci
* naming : n_orig_ctx -> n_ctx_orig
ggml-ci
* dev : add reminders to update backends
ggml-ci
* vulkan : fix ggml_rope_ext() usage
* cuda : fix array size + indents
ggml-ci
2024-06-05 11:29:20 +03:00
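
For context on the "norm" and "neox" modes named in the commit above, here is a minimal sketch of how the two rotary-embedding layouts are commonly described. It is an illustration in Python, not the ggml code. The function name `rope`, its parameters, and the default `freq_base` are hypothetical, and the sketch ignores any frequency or context scaling. The assumption is that "norm" rotates adjacent pairs `(x[2i], x[2i+1])`, while "neox" rotates pairs split across the two halves, `(x[i], x[i + n/2])`.

```python
import math

def rope(x, pos, mode="norm", freq_base=10000.0):
    """Rotate vector x for position pos (hypothetical helper, not ggml's API)."""
    n = len(x)
    if n % 2 != 0:
        raise ValueError("vector length must be even")
    half = n // 2
    out = list(x)
    for i in range(half):
        # One rotation angle per pair; the frequency decays geometrically with i.
        theta = pos * freq_base ** (-2.0 * i / n)
        c, s = math.cos(theta), math.sin(theta)
        if mode == "norm":
            a, b = 2 * i, 2 * i + 1      # adjacent elements form a pair
        elif mode == "neox":
            a, b = i, i + half           # pair spans the two halves
        else:
            raise ValueError(f"unknown mode: {mode}")
        out[a] = x[a] * c - x[b] * s
        out[b] = x[a] * s + x[b] * c
    return out

norm_out = rope([1.0, 3.0, 2.0, 4.0], pos=5, mode="norm")
neox_out = rope([1.0, 2.0, 3.0, 4.0], pos=5, mode="neox")
```

Under these assumptions the two modes differ only in which elements are paired, so one implementation parameterized by the pair indices can serve both. That is the sense in which the sketch "unifies" them; how the actual commit does it is not shown here.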
								
woachk
9e405b6e2e
kompute : implement op_getrows_f32 (#6403)

op_getrows_f32 is required since https://github.com/ggerganov/llama.cpp/pull/6122
for the Vulkan w/ Kompute backend to be functional.
As such, implement this op to make this backend functional again.
2024-06-03 08:32:16 +03:00
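
As a rough illustration of what a "get rows" operation on f32 data computes, the sketch below gathers rows of a matrix by index, as in an embedding lookup. It is a plain-Python assumption about the op's semantics, not the Kompute shader from the commit above. The name `get_rows_f32` mirrors the commit subject, but the signature is hypothetical.

```python
def get_rows_f32(src, row_ids):
    """Gather the rows of src selected by row_ids (hypothetical signature).

    src is a list of rows, each a list of floats; row_ids is a list of ints.
    """
    n_rows = len(src)
    out = []
    for i in row_ids:
        if not 0 <= i < n_rows:
            raise IndexError(f"row index {i} out of range for {n_rows} rows")
        out.append(list(src[i]))  # copy so the output does not alias the source
    return out

embeddings = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1]]
picked = get_rows_f32(embeddings, [2, 0, 2])
```

Rows may be selected in any order and more than once, so the output has one row per index rather than one per source row.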
								
Jared Van Bortel
fbf1ddec69
Nomic Vulkan backend (#4456)

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Co-authored-by: niansa <anton-sa@web.de>
Co-authored-by: Adam Treat <treat.adam@gmail.com>
Co-authored-by: Aaron Miller <apage43@ninjawhale.com>
Co-authored-by: ToKiNoBug <tokinobug@163.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com>
2024-01-29 15:50:50 -05:00