Quantization Benchmark

Measure how int8 and binary quantization affect embedding quality compared to float.

Run

vai benchmark quantization

int8: Very high correlation with float (~0.99+). Minimal quality loss.
binary: Lower correlation (~0.90-0.95). More quality loss but 32× smaller storage.

Scenario	Recommended
Standard production	`float` (default)
Large corpus, storage-sensitive	`int8` (4× smaller, minimal loss)
Coarse first-pass filter	`binary` (32× smaller, rerank with float)