cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
Georgi Gerganov	47182dd03f llama : update llama_model API names (#11063)	1 year ago
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 year ago
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
Diego Devesa	dca1d4b58a ggml : fix BLAS with unsupported types (#9775)	1 year ago
Georgi Gerganov	df270ef745 llama : refactor sampling v2 (#9294)	1 year ago
Georgi Gerganov	370b1f7e7a ggml : minor naming changes (#8433)	1 year ago
Olivier Chafik	8843a98c2b Improve usability of --model-url & related flags (#6930)	1 year ago
Michael Klimenko	52bb63c708 refactor : switch to emplace_back to avoid extra object (#5291)	1 year ago
Kawrakow	f4d7e54974 SOTA 3-bit quants (#5196)	1 year ago
Georgi Gerganov	bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)	2 years ago
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 years ago
slaren	16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301)	2 years ago
Cebtenzzre	8781013ef6 make : restore build-info.h dependency for several targets (#3205)	2 years ago
Cebtenzzre	e6616cf0db examples : add compiler version and target to build info (#2998)	2 years ago
Cebtenzzre	3aefaab9e5 check C++ code with -Wmissing-declarations (#3184)	2 years ago
Cebtenzzre	00d62adb79 fix some warnings from gcc and clang-tidy (#3038)	2 years ago
Georgi Gerganov	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	2 years ago
Stephan Walter	1b107b8550 ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)	2 years ago
Didzis Gosko	527b6fba1d llama : make model stateless and context stateful (llama_state) (#1797)	2 years ago
Borislav Stanimirov	9cbf50c041 build : fix and ignore MSVC warnings (#1889)	2 years ago
Kawrakow	99009e72f8 ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)	2 years ago
Stephan Walter	dc271c52ed Remove unused n_parts parameter (#1509)	2 years ago
DannyDaemonic	f4cef87edf Add git-based build information for better issue tracking (#1232)	2 years ago
Kawrakow	38de86a711 llama : multi-threaded quantization (#1075)	2 years ago
Georgi Gerganov	eb17a026fd quantize-stats : fix bug in --type argument	2 years ago
Pavol Rusnak	c56b715269 Expose type name from ggml (#970)	2 years ago
Georgi Gerganov	9190e8eac8 llama : merge llama_internal.h into llama.h	2 years ago
comex	f963b63afa Rewrite loading code to try to satisfy everyone:	2 years ago
unbounded	62cfc54f77 Add quantize-stats command for testing quantization (#728)	2 years ago
