cturan/llama.cpp

Аутор	SHA1 Порука	Датум
compilade	fa79495bb4 llama : fix pre-tokenization of non-special added tokens (#8228)	пре 1 година
bandoti	17eb6aa8a9 vulkan : cmake integration (#8119)	пре 1 година
Georgi Gerganov	c917b67f06 metal : template-ify some of the kernels (#8447)	пре 1 година
Georgi Gerganov	4e24cffd8c server : handle content array in chat API (#8449)	пре 1 година
Georgi Gerganov	6af51c0d96 main : print error on empty input (#8456)	пре 1 година
Daniel Bevenius	f53226245f llama : suppress unary minus operator warning (#8448)	пре 1 година
Douglas Hanley	c3ebcfa148 server : ensure batches are either all embed or all completion (#8420)	пре 1 година
Armen Kaleshian	8a4441ea1a docker : fix filename for convert-hf-to-gguf.py in tools.sh (#8441)	пре 1 година
Jiří Podivín	5aefbce27a convert : remove fsep token from GPTRefactForCausalLM (#8237)	пре 1 година
Georgi Gerganov	71c1121d11 examples : sprintf -> snprintf (#8434)	пре 1 година
Georgi Gerganov	370b1f7e7a ggml : minor naming changes (#8433)	пре 1 година
Chen Xi	b549a1bbef [SYCL] fix the mul_mat_id ut issues (#8427)	пре 1 година
Nicholai Tukanov	368645698a ggml : add NVPL BLAS support (#8329) (#8425)	пре 1 година
Daniel Bevenius	b078c619aa cuda : suppress 'noreturn' warn in no_device_code (#8414)	пре 1 година
Johannes Gäßler	808aba3916 CUDA: optimize and refactor MMQ (#8416)	пре 1 година
Georgi Gerganov	a977c11544 gitignore : deprecated binaries	пре 1 година
compilade	9a55ffe6fb tokenize : add --no-parse-special option (#8423)	пре 1 година
Georgi Gerganov	7a221b672e llama : use F32 precision in Qwen2 attention and no FA (#8412)	пре 1 година
Clint Herron	278d0e1846 Initialize default slot sampling parameters from the global context. (#8418)	пре 1 година
Clint Herron	dd07a123b7 Name Migration: Build the deprecation-warning 'main' binary every time (#8404)	пре 1 година
AidanBeltonS	f4444d992c [SYCL] Use multi_ptr to clean up deprecated warnings (#8256)	пре 1 година
Georgi Gerganov	6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394)	пре 1 година
Dibakar Gope	0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)	пре 1 година
M. Yusuf Sarıgöz	83321c6958 gguf-py rel pipeline (#8410)	пре 1 година
Borislav Stanimirov	cc61948b1f llama : C++20 compatibility for u8 strings (#8408)	пре 1 година
Borislav Stanimirov	7a80710d93 msvc : silence codecvt c++17 deprecation warnings (#8395)	пре 1 година
fairydreaming	a8be1e6f59 llama : add assert about missing llama_encode() call (#8400)	пре 1 година
RunningLeon	e4dd31ff89 py : fix converter for internlm2 (#8321)	пре 1 година
laik	8f0fad42b9 py : fix extra space in convert_hf_to_gguf.py (#8407)	пре 1 година
Clint Herron	a59f8fdc85 Server: Enable setting default sampling parameters via command-line (#8402)	пре 1 година

Новије Старије

Историја ревизија Пронађи

Историја ревизија