Georgi Gerganov
|
62bfef5194
metal : disable FA kernel for HS=256 (#7556)
|
1 vuosi sitten |
Georgi Gerganov
|
eaf6e03174
llama : add comments about experimental flags (#7544)
|
1 vuosi sitten |
Brian
|
d6ef0e77dd
github: add self sorted issue ticket forms (#7543)
|
1 vuosi sitten |
Georgi Gerganov
|
dff451cfa1
flake.lock: Update (#7540)
|
1 vuosi sitten |
Brian
|
d298382ad9
main: replace --no-special with --special (#7534)
|
1 vuosi sitten |
Galunid
|
32a28217f4
Fix aya-23 conversion scripts (#7539)
|
1 vuosi sitten |
Bartowski
|
c429b33beb
llama : add Smaug 70B support (#7402)
|
1 vuosi sitten |
Aarni Koskela
|
9146d36fe7
Readme: add akx/ggify to tools (#1484)
|
1 vuosi sitten |
HanishKVC
|
b9adcbbf92
SimpleChat Completion Mode flexibility and cleanup, Settings gMe, Optional sliding window (#7480)
|
1 vuosi sitten |
Georgi Gerganov
|
9588f196b1
train : change default FA argument (#7528)
|
1 vuosi sitten |
Brian
|
3cbd23ed88
labeler: added Apple Metal detector (+Kompute) (#7529)
|
1 vuosi sitten |
Justine Tunney
|
00c6390793
main : don't print special tokens with --grammar (#6923)
|
1 vuosi sitten |
Masaya, Kato
|
faa0e6979a
ggml: aarch64: SVE kernels for q8_0_q8_0, q4_0_q8_0 vector dot (#7433)
|
1 vuosi sitten |
Elton Kola
|
9791f40258
android : module (#7502)
|
1 vuosi sitten |
Xuan Son Nguyen
|
902184dd3a
fix missing slash in `fs_get_cache_directory()` (#7503)
|
1 vuosi sitten |
Mikko Juola
|
57684331fc
Make tokenize CLI tool have nicer command line arguments. (#6188)
|
1 vuosi sitten |
compilade
|
b83bab15a5
gguf-py : fix and simplify quantized shape round-trip (#7483)
|
1 vuosi sitten |
Georgi Gerganov
|
d041d2ceaa
flake.lock: Update (#7232)
|
1 vuosi sitten |
Brian
|
27891f6db0
docker.yml: disable light-intel and server-intel test (#7515)
|
1 vuosi sitten |
fairydreaming
|
fbca2f27fc
Add support for ArcticForCausalLM (#7020)
|
1 vuosi sitten |
Neo Zhang
|
0df0aa8e43
add build shared lib in win release package (#7438)
|
1 vuosi sitten |
Georgi Gerganov
|
74f33adf5f
readme : remove trailing space (#7469)
|
1 vuosi sitten |
Georgi Gerganov
|
1debe72737
ggml : silence UB sanitizer error during iq2_xxs quantization (#0)
|
1 vuosi sitten |
Tristan Druyen
|
007489e895
Fix phi3 chat template confusion with zephyr (#7449)
|
1 vuosi sitten |
Raj Hammeer Singh Hada
|
8b94e799df
readme : add Bunny in supported models [no ci] (#7469)
|
1 vuosi sitten |
Daniel Bevenius
|
3015851c5a
llama : add getters for n_threads/n_threads_batch (#7464)
|
1 vuosi sitten |
Georgi Gerganov
|
55ac3b7aea
ci : use Pythia models instead of OpenLlama (#7470)
|
1 vuosi sitten |
Victor Nogueira
|
dacfcebd60
readme : add GPT-NeoX + Pythia to the list of supported models (#7491)
|
1 vuosi sitten |
fairydreaming
|
9b82476ee9
Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461)
|
1 vuosi sitten |
Georgi Gerganov
|
a61a94e543
llama : rename n_ctx -> cache.size, less confusing (#0)
|
1 vuosi sitten |