Georgi Gerganov
|
483bab2e3d
Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)
|
2 years ago |
Jed Fox
|
404e1da38e
Fix quantize script not finding models in parent directory (#428)
|
2 years ago |
Georgi Gerganov
|
4cc053b6d5
Remove oboslete command from Docker script
|
2 years ago |
Georgi Gerganov
|
0ba5a3a9a5
Obsolete
|
2 years ago |
rabidcopy
|
2e17dfd80a
Replace EOS with newline to prevent context/memory being flushed by EOS in interactive mode (#333)
|
2 years ago |
Timmy Knight
|
20a1a4e09c
Fix GPTQ converter (#423)
|
2 years ago |
nusu-github
|
ad072fc5ad
Generate library with CMake (#430)
|
2 years ago |
anzz1
|
ea10d3ded2
Command line args bounds checking (#424)
|
2 years ago |
Ben Siraphob
|
a18c19259a
Fix Nix build
|
2 years ago |
Stephan Walter
|
a50e39c6fe
Revert "Delete SHA256SUMS for now" (#429)
|
2 years ago |
Kerfuffle
|
a140219e81
Fix Makefile echo escape codes (by removing them). (#418)
|
2 years ago |
Gary Mulder
|
8a3e5ef801
Move model section from issue template to README.md (#421)
|
2 years ago |
anzz1
|
8eea5ae0e5
Delete SHA256SUMS for now (#416)
|
2 years ago |
Georgi Gerganov
|
93208cfb92
Adjust repetition penalty ..
|
2 years ago |
Georgi Gerganov
|
03ace14cfd
Add link to recent podcast about whisper.cpp and llama.cpp
|
2 years ago |
anzz1
|
e4412b45e3
CI: CMake: Separate build and test steps (#376)
|
2 years ago |
tjohnman
|
f7dc43bc0d
Fix instruct mode broken by PR #354 (#409)
|
2 years ago |
Gary Mulder
|
ee8a788786
Update issue template so people will use it (#404)
|
2 years ago |
Stephan Walter
|
69c92298a9
Deduplicate q4 quantization functions (#383)
|
2 years ago |
Valentyn Bezshapkin
|
97940520e8
fix: add POSIX functionality for Linux compilation (#51)
|
2 years ago |
tjohnman
|
305ba6f0e6
Don't force immediate interactive without `-i` (#354)
|
2 years ago |
Erik Scholz
|
4122dffff9
cmake: make llama an actual library (#392)
|
2 years ago |
Erik Scholz
|
56e659a0b2
fix perplexity after c-api refactor (#390)
|
2 years ago |
Gary Linscott
|
40ea807a97
Add details on perplexity to README.md (#395)
|
2 years ago |
Yusuf Kağan Hanoğlu
|
d5850c53ca
Add missing header for memcpy (#386)
|
2 years ago |
Georgi Gerganov
|
ae44e23ee3
When seed <= 0 - use the clock to generate one
|
2 years ago |
Georgi Gerganov
|
928480ef5b
Init llama_context_params properly from CLI (#370)
|
2 years ago |
Georgi Gerganov
|
56817b1f88
Remove temporary notice and update hot topics
|
2 years ago |
Georgi Gerganov
|
f5a77a629b
Introduce C-style API (#370)
|
2 years ago |
Gary Mulder
|
da0e9fe90c
Add SHA256SUMS file and instructions to README how to obtain and verify the downloads
|
2 years ago |