|
|
@@ -10,6 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
|
|
|
|
|
|
### Hot topics
|
|
|
|
|
|
+- New SOTA quantized models, including pure 2-bits: https://huggingface.co/ikawrakow
|
|
|
- Collecting Apple Silicon performance stats:
|
|
|
- M-series: https://github.com/ggerganov/llama.cpp/discussions/4167
|
|
|
- A-series: https://github.com/ggerganov/llama.cpp/discussions/4508
|