Joan Fontanals
|
b83cc3f5b3
llama : add Jina Embeddings architecture (#6826)
|
1 year ago |
Georgi Gerganov
|
9cb317f77e
ggml : full ALiBi support (#7192)
|
1 year ago |
slaren
|
e849648888
llama-bench : add pp+tg test type (#7199)
|
1 year ago |
Georgi Gerganov
|
18e437665c
metal : fix flash attention kernel requirements (#7169)
|
1 year ago |
Georgi Gerganov
|
8c660242d7
convert : print "ignore_merges" field
|
1 year ago |
slaren
|
25c6e82e7a
llama : use n_vocab to differentiate between mistral 7B and llama3 8B (#7200)
|
1 year ago |
Justine Tunney
|
4e3880978f
Fix memory bug in grammar parser (#7194)
|
1 year ago |
HanishKVC
|
f89fe2732c
Main+: optionally allow special tokens from user in interactive mode (#7097)
|
1 year ago |
Andrei
|
d11afd6652
llava : fix moondream support (#7163)
|
1 year ago |
Ouadie EL FAROUKI
|
8c570c9496
Minor arithmetic improvement to mmvq wrapper kernel (#7172)
|
1 year ago |
slaren
|
eaf4bd8b39
eval-callback : fix conversion to float (#7184)
|
1 year ago |
0cc4m
|
befddd0f15
Vulkan Bugfixes and Improvements (#7084)
|
1 year ago |
Georgi Gerganov
|
d46dbc76f8
readme : add scheduled server workflow status badge
|
1 year ago |
l3utterfly
|
0961d86604
readme : add app (#6371)
|
1 year ago |
jaime-m-p
|
43248e5594
llama3 custom regex split (#6965)
|
1 year ago |
Johannes Gäßler
|
a743d76a01
CUDA: generalize FP16 fattn vec kernel (#7061)
|
1 year ago |
Galunid
|
f31ec120bc
Add warning if token is invalid (#7173)
|
1 year ago |
Daniel Bevenius
|
fd9f92b154
llama : update llama_timings.n_p_eval setting (#7160)
|
1 year ago |
Sigbjørn Skjæret
|
22842164bc
gguf-py : add special token modification capability (#7166)
|
1 year ago |
Albert Jin
|
4734524882
opencl : alignment size converted from bits to bytes (#7090)
|
1 year ago |
Ahmet Zeer
|
07cd41d096
TypoFix (#7162)
|
1 year ago |
Jared Van Bortel
|
4426e2987b
cmake : fix typo (#7151)
|
1 year ago |
compilade
|
f98eb31c51
convert-hf : save memory with lazy evaluation (#7075)
|
1 year ago |
agray3
|
bc4bba364f
Introduction of CUDA Graphs to LLama.cpp (#6766)
|
1 year ago |
Johannes Gäßler
|
c12452c7ae
JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
|
1 year ago |
Georgi Gerganov
|
9da243b36a
Revert "llava : add support for moondream vision language model (#6899)"
|
1 year ago |
JohnnyB
|
bd1871fa2b
server : add themes + favicon (#6848)
|
1 year ago |
Gilad S
|
26458af1d6
metal : use `vm_allocate` instead of `posix_memalign` on macOS (#7078)
|
1 year ago |
Dawid Potocki
|
83330d8cd6
main : add --conversation / -cnv flag (#7108)
|
1 year ago |
Eve
|
465263d0cf
sgemm : AVX Q4_0 and Q8_0 (#6891)
|
1 year ago |