Mercury 2 General Availability

Mercury 2

General Availability

Hi dobry juzer,

Our new Mercury 2 model is now available to all API users! With M2, you can build production applications that operate with unparalleled low latencies.

As a reminder, here is some of the key information about M2.

Speed: 1,009 tokens/sec on standard NVIDIA GPUs
Price: $0.25/1M input tokens · $0.75/1M output tokens · $0.025/1M cached input tokens
Quality: competitive with leading speed-optimized models
Features: tunable reasoning · 128K context · native tool use · schema-aligned JSON output

For now, you can access the model via our API platform. Access through partners like OpenRouter will follow soon.

Best,

The Inception Team

Get Started with Mercury 2

If you wish to unsubscribe from future emails, click here.