Neo Zhang | e2b065071c | [SYCL]fix ggml_sycl_mul_mat_id() to match the change of api (#7436) | 1 year ago
Georgi Gerganov | 0548a4187f | ggml : generalize GGML_OP_CONCAT (#7563) | 1 year ago
mgroeber9110 | 9335b969e8 | server: do not remove whitespace at the start of a completion chunk (#7524) | 1 year ago
Nathan Epstein | c41767154e | Markdownish code block fix (#7571) | 1 year ago
Ikko Eltociear Ashimine | 74b239b3d5 | llava : update clip.h (#7580) | 1 year ago
Djip007 | 852aafb163 | update HIP_UMA #7399 (#7414) | 1 year ago
kunnis | 0136966daf | adding in x64 targets to cmake presets (#7574) | 1 year ago
Johannes Gäßler | 10b1e45876 | make: add --device-debug to NVCC debug flags (#7542) | 1 year ago
agray3 | 197c00681b | Allow multiple copy function pointers for CUDA graph kernel param updates (#7565) | 1 year ago
AidanBeltonS | 95f84d5ce8 | Fix q_xxs using mul_mat_q (#7459) | 1 year ago
AidanBeltonS | 5487593bc7 | Add freq factors (#7495) | 1 year ago
Georgi Gerganov | 1d8fca72ae | metal : add GGML_OP_REPEAT kernels (#7557) | 1 year ago
Georgi Gerganov | 62bfef5194 | metal : disable FA kernel for HS=256 (#7556) | 1 year ago
Georgi Gerganov | eaf6e03174 | llama : add comments about experimental flags (#7544) | 1 year ago
Brian | d6ef0e77dd | github: add self sorted issue ticket forms (#7543) | 1 year ago
Georgi Gerganov | dff451cfa1 | flake.lock: Update (#7540) | 1 year ago
Brian | d298382ad9 | main: replace --no-special with --special (#7534) | 1 year ago
Galunid | 32a28217f4 | Fix aya-23 conversion scripts (#7539) | 1 year ago
Bartowski | c429b33beb | llama : add Smaug 70B support (#7402) | 1 year ago
Aarni Koskela | 9146d36fe7 | Readme: add akx/ggify to tools (#1484) | 1 year ago
HanishKVC | b9adcbbf92 | SimpleChat Completion Mode flexibility and cleanup, Settings gMe, Optional sliding window (#7480) | 1 year ago
Georgi Gerganov | 9588f196b1 | train : change default FA argument (#7528) | 1 year ago
Brian | 3cbd23ed88 | labeler: added Apple Metal detector (+Kompute) (#7529) | 1 year ago
Justine Tunney | 00c6390793 | main : don't print special tokens with --grammar (#6923) | 1 year ago
Masaya, Kato | faa0e6979a | ggml: aarch64: SVE kernels for q8_0_q8_0, q4_0_q8_0 vector dot (#7433) | 1 year ago
Elton Kola | 9791f40258 | android : module (#7502) | 1 year ago
Xuan Son Nguyen | 902184dd3a | fix missing slash in `fs_get_cache_directory()` (#7503) | 1 year ago
Mikko Juola | 57684331fc | Make tokenize CLI tool have nicer command line arguments. (#6188) | 1 year ago
compilade | b83bab15a5 | gguf-py : fix and simplify quantized shape round-trip (#7483) | 1 year ago
Georgi Gerganov | d041d2ceaa | flake.lock: Update (#7232) | 1 year ago