Makarna Test Suite

This directory contains comprehensive tests for the Makarna inference engine.

Test Structure

quant_test.go: Quantization/dequantization correctness tests for all K-quant types
- Q2_K (2-bit, 4 levels)
- Q3_K (3-bit, 8 levels)
- Q4_K (4-bit, 16 levels)
- Q6_K (6-bit, 64 levels)
- Q8_K (8-bit, 256 levels)

unit_quant_test.go: Unit tests for low-level quantization utilities
- FP16 ↔ FP32 conversion
- Individual dequantization functions

backend_test.go: Backend operation tests
- Linear (matrix multiplication)
- Embedding lookup
- RMSNorm
- Softmax

data/: Golden test data (binary files)
- Contains quantized blocks + original float32 data for validation

Run all tests:

cd tests
go test -v

Run a specific test:

cd tests
go test -v -run TestQ3K_GoldenIter

Each quantization type is tested against 4 different data patterns. Typical mean squared error (MSE) on the Normal pattern, by type:

Type   Bits   Levels   Typical MSE (Normal)   Use Case
Q8_K   8      256      < 0.001                High precision
Q6_K   6      64       < 0.002                Embeddings
Q4_K   4      16       < 0.05                 General weights
Q3_K   3      8        < 0.15                 Medium quality mix
Q2_K   2      4        < 0.5                  Aggressive compression

Note: Large-range patterns (±50+) will have higher MSE, because the number of levels is fixed and the quantization step therefore widens with the value range.
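
The effect of range and bit width on MSE can be reproduced with a toy experiment. The sketch below uses a plain uniform quantizer over a fixed range, which is an assumption made for illustration: it is not the K-quant block format, and the helpers mse, roundTrip, and ramp are hypothetical names, not part of the test suite.

```go
package main

import (
	"fmt"
	"math"
)

// mse returns the mean squared error between two equal-length slices.
func mse(a, b []float32) float64 {
	var sum float64
	for i := range a {
		d := float64(a[i]) - float64(b[i])
		sum += d * d
	}
	return sum / float64(len(a))
}

// roundTrip quantizes x onto 2^bits evenly spaced levels spanning [-r, r]
// and dequantizes the result. Simplified stand-in, not a K-quant scheme.
func roundTrip(x []float32, bits int, r float64) []float32 {
	steps := float64(int(1)<<bits - 1) // 2^bits levels give 2^bits - 1 steps
	step := 2 * r / steps
	out := make([]float32, len(x))
	for i, v := range x {
		q := math.Round((float64(v) + r) / step)
		q = math.Max(0, math.Min(steps, q)) // clamp to the valid level range
		out[i] = float32(q*step - r)
	}
	return out
}

// ramp returns n evenly spaced values covering [-r, r].
func ramp(n int, r float64) []float32 {
	out := make([]float32, n)
	for i := range out {
		out[i] = float32(-r + 2*r*float64(i)/float64(n-1))
	}
	return out
}

func main() {
	for _, r := range []float64{1, 50} {
		x := ramp(4096, r)
		for _, bits := range []int{2, 4, 8} {
			fmt.Printf("range ±%g, %d-bit: MSE %.6g\n", r, bits, mse(x, roundTrip(x, bits, r)))
		}
	}
}
```

In this toy setup the step size scales linearly with the range, so the MSE scales with its square: widening the range from ±1 to ±50 raises the error by roughly 2500 times at the same bit width.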

These tests run automatically on:

All tests must pass before merge.