cturan/llama.cpp

Author	SHA1 Message	Date
Kawrakow	e37e69dcc3 10X faster BPE tokenizer (#2876)	2 years ago
xaedes	44c117f41e train : mem usage and other improvements (#2439)	2 years ago
Johannes Gäßler	6b73ef1201 YAML result logging + preset script (#2657)	2 years ago
grahameth	be475f60af llama.cpp : fix wrong vsnprintf call in MS compiler (#2856)	2 years ago
Georgi Gerganov	c10704d01e llama : fix MPI threads (close #2827)	2 years ago
Kawrakow	463173a6c0 llama : speedup tokenization (#2831)	2 years ago
Georgi Gerganov	eaa13a48ff falcon : fix CUDA inference by making K and Q contiguous (#2830)	2 years ago
Kawrakow	a6d1189fdd k_quants tuning for Falcon-7b (#2816)	2 years ago
Georgi Gerganov	d0cee0d36d gguf : add 64-bit support (GGUF v2) (#2821)	2 years ago
Georgi Gerganov	edd4c14817 llama : more tokenizer fixes (#2810)	2 years ago
Przemysław Pawełczyk	1591e2e590 ggml : detect SSSE3 (#2825)	2 years ago
Tim Miller	c7d92e6dfe llama : use Unicode Escape Sequence to replace encoded characters (#2814)	2 years ago
Cebtenzzre	741ca7dd1c llama : move #includes out of _GNU_SOURCE conditional (#2817)	2 years ago
Cebtenzzre	50526f37eb llama : use std::abs in llama_sample_tail_free (#2800)	2 years ago
Georgi Gerganov	04f4b1eb10 k-quants : remove unnecessary tensor shape restrictions (#2811)	2 years ago
Kawrakow	7592375403 Better perplexity for 2- and 3-bit quantization for LLaMA-v2-70B (#2807)	2 years ago
klosax	2ba83c8685 Fix spm whitespaces (#2806)	2 years ago
Matt Pulver	c82742ac9c llama : add llama_beam_search() (#2267)	2 years ago
slaren	154725c543 llama-bench : add model sizes (#2771)	2 years ago
Henri Vasserman	6bbc598a63 ROCm Port (#1087)	2 years ago
Georgi Gerganov	3f460a2b72 cuda : add RoPE kernel for mode == 2 (NeoX) (#2760)	2 years ago
slaren	0d3094f0c7 gguf : add rope_freq_base parameter for CodeLlama (#2769)	2 years ago
Shouzheng Liu	38b16dfca6 metal : bug-fix when enable ggml-alloc (#2757)	2 years ago
slaren	fea95c682d fix convert.py for codellama, add llama 34B to the list of recognized models (#2768)	2 years ago
Georgi Gerganov	c3e53b421a llama : escape all U+2581 in a string (#2750)	2 years ago
Evan Jones	6e91a1b070 llama : fix grammar sometimes generating null char (#2756)	2 years ago
Georgi Gerganov	cf658adc83 llm : add Falcon support (#2717)	2 years ago
Kerfuffle	777f42ba18 Improve handling of special tokens in GGML to GGUF converter (#2725)	2 years ago
goerch	46ef5b5fcf llama : fix whitespace escaping in tokenizer (#2724)	2 years ago
Georgi Gerganov	deb7dfca4b gguf : add ftype meta info to the model (#2710)	2 years ago
