Georgi Gerganov
|
afa8a9ec9b
llama : add `llama_vocab`, functions -> methods, naming (#11110)
|
1 year ago |
Vinesh Janarthanan
|
c05e8c9934
gguf-py: fixed local detection of gguf package (#11180)
|
1 year ago |
Daniel Bevenius
|
2739a71e4b
convert : sort print supported models [no ci] (#11179)
|
1 year ago |
Daniel Bevenius
|
ba8a1f9c5b
examples : add README.md to tts example [no ci] (#11155)
|
1 year ago |
Daniel Bevenius
|
ff3fcabc72
convert : add --print-supported-models option (#11172)
|
1 year ago |
0cc4m
|
c3f9d25706
Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (#11161)
|
1 year ago |
Molly Sophia
|
ee7136c6d1
llama: add support for QRWKV6 model architecture (#11001)
|
1 year ago |
Akarshan Biswas
|
c6860cc734
SYCL: Refactor ggml_sycl_compute_forward (#11121)
|
1 year ago |
Tei Home
|
1204f97270
doc: add cuda guide for fedora (#11135)
|
1 year ago |
Daniel Bevenius
|
8eceb888d7
server : add tooltips to settings and themes btn (#11154)
|
1 year ago |
Pierrick Hymbert
|
f8feb4b01a
model: Add support for PhiMoE arch (#11003)
|
1 year ago |
Georgi Gerganov
|
be0e950c91
media : remove old img [no ci]
|
1 year ago |
Xuan Son Nguyen
|
d9feae1c06
llama-chat : add phi 4 template (#11148)
|
1 year ago |
hydai
|
8d59d91171
fix: add missing msg in static_assert (#11143)
|
1 year ago |
Vinesh Janarthanan
|
8a1d9c25fa
gguf-py : move scripts directory (#11116)
|
1 year ago |
Eric Curtin
|
1bf839b1e8
Enhance user input handling for llama-run (#11138)
|
1 year ago |
Xuan Son Nguyen
|
f7cd13301c
ci : use actions from ggml-org (#11140)
|
1 year ago |
Xuan Son Nguyen
|
4d2b3d8804
lora : improve compat with `mergekit-extract-lora` (#11131)
|
1 year ago |
Georgi Gerganov
|
c07d437bbd
llama : avoid hardcoded QK_K (#11061)
|
1 year ago |
Georgi Gerganov
|
99a3755a3c
sync : ggml
|
1 year ago |
Radoslav Gerganov
|
c792dcf488
ggml : allow loading backend with env variable (ggml/1059)
|
1 year ago |
Xuan Son Nguyen
|
80ccf5d725
ci : pin dependency to specific version (#11137)
|
1 year ago |
Georgi Gerganov
|
a3c1232c3f
arg : option to exclude arguments from specific examples (#11136)
|
1 year ago |
amritahs-ibm
|
8cef75c743
llamafile : ppc64le MMA INT8 implementation (#10912)
|
1 year ago |
Georgi Gerganov
|
0d52a69e4b
ci : fix cmake option (#11125)
|
1 year ago |
Mathieu Baudier
|
02f0430141
Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (#11117)
|
1 year ago |
ag2s20150909
|
bec2183f2c
fix: Vulkan shader gen binary path when Cross-compiling (#11096)
|
1 year ago |
Johannes Gäßler
|
53ff6b9b9f
GGUF: C++ refactor, backend support, misc fixes (#11030)
|
1 year ago |
Diego Devesa
|
017cc5f446
ggml-backend : only offload from host buffers (fix) (#11124)
|
1 year ago |
Diego Devesa
|
a3d50bc022
ggml-backend : only offload from host buffers (#11120)
|
1 year ago |