2 лет назад · a9a8c5de3d
--- a/README.md
+++ b/README.md
@@ -10,6 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
 
				 
			
 
				 ### Hot topics
			
 
				 
			
 
				+- New SOTA quantized models, including pure 2-bits: https://huggingface.co/ikawrakow
			
 
				 - Collecting Apple Silicon performance stats:
			
 
				   - M-series: https://github.com/ggerganov/llama.cpp/discussions/4167
			
 
				   - A-series: https://github.com/ggerganov/llama.cpp/discussions/4508