Diego Devesa
|
6adc3c3ebc
llama : add thread safety test (#14035)
|
há 7 meses atrás |
bandoti
|
0dbcabde8c
cmake: clean up external project logic for vulkan-shaders-gen (#14179)
|
há 7 meses atrás |
Đinh Trọng Huy
|
ad590be98c
model : add NeoBERT (#14164)
|
há 7 meses atrás |
uvos
|
7d6d91babf
HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202)
|
há 7 meses atrás |
Georgi Gerganov
|
d3e64b9f49
llama : rework embeddings logic (#14208)
|
há 7 meses atrás |
Charles Xu
|
3ba0d843c6
ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206)
|
há 7 meses atrás |
Bartowski
|
0bf49eb668
convert : remove arcee change in convert_hf_to_gguf_update.py (#14207)
|
há 7 meses atrás |
Đinh Trọng Huy
|
4ad243677b
gguf-py : allow key override when adding value to GGUFWriter (#14194)
|
há 7 meses atrás |
Jeff Bolz
|
c89c2d1ab9
vulkan: mutex around vkQueueSubmit (#14127)
|
há 7 meses atrás |
xctan
|
3555b3004b
ggml-cpu : rework weak alias on apple targets (#14146)
|
há 7 meses atrás |
Bartowski
|
d7da8dc83a
model : Add support for Arcee AI's upcoming AFM model (#14185)
|
há 7 meses atrás |
Eric Curtin
|
cd355eda7d
server : When listening on a unix domain socket don't print http:// and port (#14180)
|
há 7 meses atrás |
Ed Addario
|
30e5b01de2
quantize : change int to unsigned int for KV overrides (#14197)
|
há 7 meses atrás |
uvos
|
e54b394082
CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (#14196)
|
há 7 meses atrás |
uvos
|
2c2caa4443
HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (#14183)
|
há 7 meses atrás |
Georgi Gerganov
|
5fce5f948d
kv-cache : fix use-after-move of defrag info (#14189)
|
há 7 meses atrás |
Mikko Juola
|
9ae4143bc6
model : add dots.llm1 architecture support (#14044) (#14118)
|
há 7 meses atrás |
Georgi Gerganov
|
c311ac664d
cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188)
|
há 7 meses atrás |
Georgi Gerganov
|
b9912ac570
batch : auto-gen positions + verify multi-sequence input (#14177)
|
há 7 meses atrás |
Pepijn de Vos
|
00ba772610
docs : remove WIP since PR has been merged (#13912)
|
há 7 meses atrás |
Piotr
|
3cb203c89f
llama-chat : Do not throw when tool parsing fails (#14012)
|
há 7 meses atrás |
Aman Gupta
|
2e42be42bd
compare-llama-bench: add option to plot (#14169)
|
há 7 meses atrás |
Georgi Gerganov
|
fb85a288d7
vocab : fix build (#14175)
|
há 7 meses atrás |
Svetlozar Georgiev
|
40643edb86
sycl: fix docker image (#14144)
|
há 7 meses atrás |
Guy Goldenberg
|
3cfbbdb44e
Merge commit from fork
|
há 7 meses atrás |
Georgi Gerganov
|
80709b70a2
batch : add LLAMA_BATCH_DEBUG environment variable (#14172)
|
há 7 meses atrás |
ddpasa
|
26ff3685bf
docs : Update multimodal.md (#14122)
|
há 7 meses atrás |
Georgi Gerganov
|
60c666347b
batch : rework llama_batch_allocr (#14153)
|
há 7 meses atrás |
Georgi Gerganov
|
b7cc7745e3
readme : remove survey link (#14168)
|
há 7 meses atrás |
Christian Kastner
|
cc8d081879
cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167)
|
há 7 meses atrás |