cturan/llama.cpp

Author	SHA1 Message	Date
bandoti	095231dfd3 cmake : fix transient definitions in find pkg (#3411)	2 years ago
Kevin Ji	ea55295a74 docker : ignore Git files (#3314)	2 years ago
vvhg1	c97f01c362 infill : add new example + extend server API (#3296)	2 years ago
slaren	f5ef5cfb18 ggml-cuda : perform cublas mat mul of quantized types as f16 (#3412)	2 years ago
slaren	40e07a60f9 llama.cpp : add documentation about rope_freq_base and scale values (#3401)	2 years ago
Georgi Gerganov	bc34dd4f5b train : fix KQ_pos allocation (#3392)	2 years ago
Cebtenzzre	2777a84be4 llama : quantize up to 31% faster on Linux and Windows with mmap (#3206)	2 years ago
BarfingLemurs	0a4a4a0982 readme : update hot topics + model links (#3399)	2 years ago
Andrew Duffy	569550df20 readme : add link to grammars app (#3388)	2 years ago
Jhen-Jie Hong	c71bf2c45c swift : fix build on xcode 15 (#3387)	2 years ago
Cebtenzzre	bc39553c90 build : enable more non-default compiler warnings (#3200)	2 years ago
Hua Jiang	0ccfc62a96 ggml_tensor: update the structure comments. (#3283)	2 years ago
Qu Zongfu	7f1a0fe709 ggml : release the requested thread pool resource (#3292)	2 years ago
slaren	16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301)	2 years ago
Eve	0512d66670 ci : multithreaded builds (#3311)	2 years ago
xaedes	0e76a8992c train : finetune LORA (#2632)	2 years ago
Cebtenzzre	2db94d98ed gguf : basic type checking in gguf_get_* (#3346)	2 years ago
Cebtenzzre	ecf90b1a51 gguf : make token scores and types optional (#3347)	2 years ago
Georgi Gerganov	2619109ad5 ci : disable freeBSD builds due to lack of VMs (#3381)	2 years ago
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	2 years ago
Kevin Ji	45855b3f1c docs : mark code as Bash (#3375)	2 years ago
Pierre Alexandre SCHEMBRI	4aea3b846e readme : add Mistral AI release 0.1 (#3362)	2 years ago
slaren	da0400344b ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370)	2 years ago
Zhang Peiyuan	e519621010 convert : remove bug in convert.py permute function (#3364)	2 years ago
Richard Roberson	ac43576124 make-ggml.py : compatibility with more models and GGUF (#3290)	2 years ago
Cebtenzzre	20c7e1e804 gguf : fix a few general keys (#3341)	2 years ago
Rickard Hallerbäck	dc6897404e metal : reusing llama.cpp logging (#3152)	2 years ago
Jag Chadha	527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342)	2 years ago
BarfingLemurs	ffe88a36a9 readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340)	2 years ago
DAN™	99115f3fa6 cmake : fix build-info.h on MSVC (#3309)	2 years ago
