R0CKSTAR
|
c35e586ea5
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)
|
1 سال پیش |
Georgi Gerganov
|
19514d632e
cmake : do not hide GGML options + rename option (#9465)
|
1 سال پیش |
Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
1 سال پیش |
Xuan Son Nguyen
|
feff4aa846
server : add loading html page while model is loading (#9468)
|
1 سال پیش |
Ahmad Tameem
|
2b00fa7997
riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)
|
1 سال پیش |
slaren
|
fb3f249815
make : do not run llama-gen-docs when building (#9399)
|
1 سال پیش |
Xuan Son Nguyen
|
bfe76d4a17
common : move arg parser code to `arg.cpp` (#9388)
|
1 سال پیش |
Xuan Son Nguyen
|
1b9ae5189c
common : refactor arg parser (#9308)
|
1 سال پیش |
Georgi Gerganov
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 سال پیش |
0cc4m
|
5fd89a70ea
Vulkan Optimizations and Fixes (#8959)
|
1 سال پیش |
Georgi Gerganov
|
272e3bd95e
make : fix llava obj file race (#8946)
|
1 سال پیش |
tc-mb
|
3071c0a5f2
llava : support MiniCPM-V-2.5 (#7599)
|
1 سال پیش |
Pablo Duboue
|
ebd541a570
make : clean llamafile objects (#8923)
|
1 سال پیش |
slaren
|
15fa07a5c5
make : use C compiler to build metal embed object (#8899)
|
1 سال پیش |
Clint Herron
|
ed9d2854c9
Build: Fix potential race condition (#8781)
|
1 سال پیش |
R0CKSTAR
|
e54c35e4fb
feat: Support Moore Threads GPU (#8383)
|
1 سال پیش |
slaren
|
2b1f616b20
ggml : reduce hash table reset cost (#8698)
|
1 سال پیش |
Xuan Son Nguyen
|
be6d7c0791
examples : remove `finetune` and `train-text-from-scratch` (#8669)
|
1 سال پیش |
Xuan Son Nguyen
|
de280085e7
examples : Fix `llama-export-lora` example (#8607)
|
1 سال پیش |
Georgi Gerganov
|
938943cdbf
llama : move vocab, grammar and sampling into separate files (#8508)
|
1 سال پیش |
Johannes Gäßler
|
5e116e8dd5
make/cmake: add missing force MMQ/cuBLAS for HIP (#8515)
|
1 سال پیش |
bandoti
|
17eb6aa8a9
vulkan : cmake integration (#8119)
|
1 سال پیش |
Nicholai Tukanov
|
368645698a
ggml : add NVPL BLAS support (#8329) (#8425)
|
1 سال پیش |
Clint Herron
|
dd07a123b7
Name Migration: Build the deprecation-warning 'main' binary every time (#8404)
|
1 سال پیش |
Georgi Gerganov
|
6b2a849d1f
ggml : move sgemm sources to llamafile subfolder (#8394)
|
1 سال پیش |
Dibakar Gope
|
0f1a39f343
ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)
|
1 سال پیش |
Clint Herron
|
e500d6135a
Deprecation warning to assist with migration to new binary names (#8283)
|
1 سال پیش |
Johannes Gäßler
|
a03e8dd99d
make/cmake: LLAMA_NO_CCACHE -> GGML_NO_CCACHE (#8392)
|
1 سال پیش |
Brian
|
f7cab35ef9
gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048)
|
1 سال پیش |
Clint Herron
|
3e2618bc7b
Adding step to `clean` target to remove legacy binary names to reduce upgrade / migration confusion arising from #7809. (#8257)
|
1 سال پیش |