cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	bc5ba007b2 server : check that the prompt fits in the slot's context (#10030)	1 year ago
Molly Sophia	11d47057a5 Rwkv chat template fix (#10001)	1 year ago
Molly Sophia	4ff7fe1fb3 llama : add chat template for RWKV-World + fix EOT (#9968)	1 year ago
compilade	1927378bcc convert : refactor rope_freqs generation (#9396)	1 year ago
nopperl	f99d3f8367 py : add model class for Chameleon conversion (#9683)	1 year ago
Georgi Gerganov	f4d2b8846a llama : add reranking support (#9510)	1 year ago
nopperl	9a913110cf llama : add support for Chameleon (#8543)	1 year ago
Gabe Goodhart	3d6bf6919f llama : add IBM Granite MoE architecture (#9438)	1 year ago
Gabe Goodhart	0d2ec43833 llama : support IBM Granite architecture (#9412)	1 year ago
compilade	d54c21df7e convert : identify missing model files (#9397)	1 year ago
Shane A	0aadac10c7 llama : support OLMoE (#9462)	1 year ago
CarryFun	95ca85168b llama : support MiniCPM3 (#9322)	1 year ago
Csaba Kecskemeti	3c7989fd29 py : add "LLaMAForCausalLM" conversion support (#9485)	1 year ago
daminho	c837981bba py : add Phi-1.5/Phi-2 tokenizer (#9361)	1 year ago
Molly Sophia	39f852f440 py : add special tokens in hf_converter for RWKV v6 (#9428)	1 year ago
Molly Sophia	0b4ac75772 RWKV v6: Add time_mix_decay_w1/w2 in quant exclusion list (#9387)	1 year ago
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	1 year ago
Molly Sophia	8f1d81a0b6 llama : support RWKV v6 models (#8980)	1 year ago
Carsten Kragelund Jørgensen	75e1dbbaab llama : fix llama3.1 rope_freqs not respecting custom head_dim (#9141)	1 year ago
Xuan Son Nguyen	3ba780e2a8 lora : fix llama conversion script with ROPE_FREQS (#9117)	1 year ago
Younes Belkada	b40eb84895 llama : support for `falcon-mamba` architecture (#9074)	1 year ago
Minsoo Cheong	c679e0cb5c llama : add EXAONE model support (#9025)	1 year ago
Yoshi Suhara	2a24c8caa6 Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922)	1 year ago
Esko Toivonen	6bda7ce6c3 llama : add pre-tokenizer regexes for BLOOM and gpt3-finnish (#8850)	1 year ago
fairydreaming	7c3f55c100 Add support for encoder-only T5 models (#8900)	1 year ago
compilade	3a14e00366 gguf-py : simplify support for quant types (#8838)	1 year ago
Douglas Hanley	cdd1889de6 convert : add support for XLMRoberta embedding models (#8658)	1 year ago
Sigbjørn Skjæret	b72c20b85c Fix conversion of unnormalized BF16->BF16 weights (#7843)	1 year ago
Jeffrey Morgan	b5e95468b1 llama : add support for llama 3.1 rope scaling factors (#8676)	1 year ago
Fan Shupei	8a4bad50a8 llama: use sliding window for phi3 (#8627)	1 year ago

