Embedding Benchmark

Measure embedding latency, throughput, and dimensions across Voyage AI models.

Run

vai benchmark embed

What It Measures

Latency per embedding call (p50, p95, p99)
Throughput (tokens per second)
Output dimensions per model
Comparison across multiple models

Customize

# Test specific models
vai benchmark embed --models voyage-4-large,voyage-4-lite

# More iterations for reliable results
vai benchmark embed --iterations 10

# JSON output
vai benchmark embed --json

Sample Output

The benchmark uses built-in sample texts (10 documents about common tech topics) so you don't need any setup. Results show a formatted table comparing each model's performance.

Tips

Run benchmarks multiple times — API latency varies with server load
Network latency dominates for single-text embeds; batch throughput is a better measure of model speed
Use --json to track performance over time

Run​

What It Measures​

Customize​

Sample Output​

Tips​

Run

What It Measures

Customize

Sample Output

Tips