| baby-llama | afefa319f1 ggml : change ggml_scale to take a float instead of tensor (#4573) | 2 years ago |
| batched | b0034d93ce examples : add passkey test (#3856) | 2 years ago |
| batched-bench | e7e4df031b llama : ggml-backend integration (#4766) | 2 years ago |
| batched.swift | 5c9f90cba1 swift : fix prompt tokenization logic (#4321) | 2 years ago |
| beam-search | 5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720) | 2 years ago |
| benchmark | 147b17ac94 2-bit quantizations (#4897) | 2 years ago |
| convert-llama2c-to-ggml | cafcd4f895 ggml : remove n_dims from ggml_tensor (#4469) | 2 years ago |
| embedding | b12fa0d1c1 build : link against build info instead of compiling against it (#3879) | 2 years ago |
| export-lora | 930f907d3e export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894) | 2 years ago |
| finetune | 152d9d05e0 finetune : print sample-start/include-sample-start (#5072) | 2 years ago |
| gguf | 32259b2dad gguf : simplify example dependencies | 2 years ago |
| imatrix | 15bceec2d7 imatrix : keep intermediate imatrix results (#5077) | 2 years ago |
| infill | 35a2ee9143 Remove unused data and add fixes (#5154) | 2 years ago |
| jeopardy | a8777ad84e parallel : add option to load external prompt file (#3416) | 2 years ago |
| llama-bench | e7e4df031b llama : ggml-backend integration (#4766) | 2 years ago |
| llama.android | 256d1bb0dd android : use release cmake build type by default (#5123) | 2 years ago |
| llama.swiftui | e790eef21c llama.swiftui : update models layout (#4826) | 2 years ago |
| llava | 6db2b41a76 llava : support for Yi-VL and fix for mobileVLM (#5093) | 2 years ago |
| lookahead | 9494d7c477 english : use `typos` to fix comments and logs (#4354) | 2 years ago |
| lookup | 7082d24cec lookup : add prompt lookup decoding example (#4484) | 2 years ago |
| main | 722d33f34e main : add parameter --no-display-prompt (#4541) | 2 years ago |
| main-cmake-pkg | 82d6eab224 main-cmake-pkg : fix build issue (#4665) | 2 years ago |
| parallel | 6b0a7420d0 llama : KV cache view API + better KV cache management (#4170) | 2 years ago |
| passkey | b0034d93ce examples : add passkey test (#3856) | 2 years ago |
| perplexity | 44879ee885 Additional KL-divergence statistics (#5081) | 2 years ago |
| quantize | 66d575c45c llama : add Q3_K_XS (#5060) | 2 years ago |
| quantize-stats | bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309) | 2 years ago |
| save-load-state | df845cc982 llama : minimize size used for state save/load (#4820) | 2 years ago |
| server | 39baaf55a1 docker : add server-first container images (#5157) | 2 years ago |
| simple | 23b5e12eb5 simple : update error message for KV cache check (#4324) | 2 years ago |
| speculative | e0324285a5 speculative : threading options (#4959) | 2 years ago |
| tokenize | 28a2e6e7d4 tokenize example: Respect normal add BOS token behavior (#4126) | 2 years ago |
| train-text-from-scratch | 381ee19572 finetune : fix ggml_allocr lifetimes (tmp workaround) (#5033) | 2 years ago |
| CMakeLists.txt | 4be5ef556d metal : remove old API (#4919) | 2 years ago |
| Miku.sh | 019fe257bb MIKU MAYHEM: Upgrading the Default Model for Maximum Fun 🎉 (#2287) | 2 years ago |
| alpaca.sh | a17a2683d8 alpaca.sh : update model file name (#2074) | 2 years ago |
| base-translate.sh | 96e80dabc6 examples : improve base-translate.sh script (#4783) | 2 years ago |
| chat-13B.bat | d9ad104440 Create chat-13B.bat (#592) | 2 years ago |
| chat-13B.sh | 6daa09d879 examples : read chat prompts from a template file (#1196) | 2 years ago |
| chat-persistent.sh | ac2219fef3 llama : fix session saving/loading (#3400) | 2 years ago |
| chat-vicuna.sh | c36e81da62 examples : add chat-vicuna.sh (#1854) | 2 years ago |
| chat.sh | 8341a25957 main : log file (#2748) | 2 years ago |
| gpt4all.sh | 107980d970 examples : add -n to alpaca and gpt4all scripts (#706) | 2 years ago |
| json-schema-to-grammar.py | 7c2227a197 chmod : make scripts executable (#2675) | 2 years ago |
| llama.vim | 125d03a503 llama.vim : added api key support (#5090) | 2 years ago |
| llama2-13b.sh | 73643f5fb1 gitignore : changes for Poetry users + chat examples (#2284) | 2 years ago |
| llama2.sh | 73643f5fb1 gitignore : changes for Poetry users + chat examples (#2284) | 2 years ago |
| llm.vim | ad9ddcff6e llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879) | 2 years ago |
| make-ggml.py | ac43576124 make-ggml.py : compatibility with more models and GGUF (#3290) | 2 years ago |
| pydantic-models-to-grammar-examples.py | d292f4f204 examples : make pydantic scripts pass mypy and support py3.8 (#5099) | 2 years ago |
| pydantic_models_to_grammar.py | d292f4f204 examples : make pydantic scripts pass mypy and support py3.8 (#5099) | 2 years ago |
| reason-act.sh | 7c2227a197 chmod : make scripts executable (#2675) | 2 years ago |
| server-llama2-13B.sh | 7c2227a197 chmod : make scripts executable (#2675) | 2 years ago |