Henri Vasserman
|
6bbc598a63
ROCm Port (#1087)
|
2 anni fa |
Georgi Gerganov
|
44d5462b5c
readme : fix link
|
2 anni fa |
Georgi Gerganov
|
c7868b0753
minor : fix trailing whitespace
|
2 anni fa |
Georgi Gerganov
|
79da24b58c
readme : update hot topics
|
2 anni fa |
Evan Jones
|
f5fe98d11b
docs : add grammar docs (#2701)
|
2 anni fa |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 anni fa |
Adrian
|
2d8b76a110
Add link to clojure bindings to Readme. (#2659)
|
2 anni fa |
Georgi Gerganov
|
7af633aec3
readme : incoming BREAKING CHANGE
|
2 anni fa |
mdrokz
|
eaf98c2649
readme : add link to Rust bindings (#2656)
|
2 anni fa |
Johannes Gäßler
|
0992a7b8b1
README: fix LLAMA_CUDA_MMV_Y documentation (#2647)
|
2 anni fa |
Henri Vasserman
|
6ddeefad9b
[Zig] Fixing Zig build and improvements (#2554)
|
2 anni fa |
Johannes Gäßler
|
25d43e0eb5
CUDA: tuned mul_mat_q kernels (#2546)
|
2 anni fa |
ldwang
|
220d931864
readme : add Aquila-7B model series to supported models (#2487)
|
2 anni fa |
Yiming Cui
|
a312193e18
readme : Add Chinese LLaMA-2 / Alpaca-2 to supported models (#2475)
|
2 anni fa |
Johannes Gäßler
|
0728c5a8b9
CUDA: mmq CLI option, fixed mmq build issues (#2453)
|
2 anni fa |
Johannes Gäßler
|
11f3ca06b8
CUDA: Quantized matrix matrix multiplication (#2160)
|
2 anni fa |
niansa/tuxifan
|
edcc7ae7d2
Obtaining LLaMA 2 instructions (#2308)
|
2 anni fa |
Johannes Gäßler
|
70d26ac388
Fix __dp4a documentation (#2348)
|
2 anni fa |
Jose Maldonado
|
91171b8072
make : fix CLBLAST compile support in FreeBSD (#2331)
|
2 anni fa |
wzy
|
78a3d13424
flake : remove intel mkl from flake.nix due to missing files (#2277)
|
2 anni fa |
wzy
|
45a1b07e9b
flake : update flake.nix (#2270)
|
2 anni fa |
Jiří Podivín
|
27ab66e437
py : turn verify-checksum-models.py into executable (#2245)
|
2 anni fa |
Chad Brewbaker
|
917831c63a
readme : fix zig build instructions (#2171)
|
2 anni fa |
Evan Miller
|
5656d10599
mpi : add support for distributed inference via MPI (#2099)
|
2 anni fa |
JackJollimore
|
18780e0a5e
readme : update Termux instructions (#2147)
|
2 anni fa |
rankaiyx
|
2492a53fd0
readme : add more docs indexes (#2127)
|
2 anni fa |
dylan
|
84525e7962
docker : add support for CUDA in docker (#1461)
|
2 anni fa |
Judd
|
36680f6e40
convert : update for baichuan (#2081)
|
2 anni fa |
Johannes Gäßler
|
924dd22fd3
Quantized dot products for CUDA mul mat vec (#2067)
|
2 anni fa |
Georgi Gerganov
|
b472f3fca5
readme : add link web chat PR
|
2 anni fa |