The new Mercury 2 AI model uses diffusion reasoning to generate 1,000 tokens per second; it runs about 5x faster than Haiku, speed limits are ...
Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of Mercury 2, the fastest reasoning LLM and first reasoning dLLM. Mercury 2 ...
Key opportunities in the hardware-optimized diffusion model IP market include integration of AI in systems, demand for ...
Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Generative artificial intelligence (AI) ...
Researchers at New York University have developed a new architecture for diffusion models that improves the semantic representation of the images they generate. “Diffusion Transformer with ...
All over the AI field, teams are unlocking new functionality by changing the ways that the models work. Some of this has to do with input compression and changing the memory requirements for LLMs, or ...
Mercury 2 targets structured tasks with schema-aligned JSON output; supports OpenAI API drop-in integration, for simpler deployment.