Together AI Launches AI Native Conf with Major Research Releases


  • March 6, 2026

Together AI, the AI Native Cloud powering fast-growing AI companies, has launched AI Native Conf, its inaugural conference focused on builders developing next-generation AI applications. The event highlights the company's strong momentum, serving thousands of customers, supporting over one million developers, and achieving 10x year-over-year growth in annual contract revenue—including 27 deals over $1 million and one exceeding $1 billion.

Quick Intel

  • Together AI hosts its first AI Native Conf, uniting builders of next-gen AI applications amid rapid industry adoption and company growth.
  • The platform powers leading AI-native firms like Cursor, Decagon, and Cartesia with production-scale inference, pre-training, and model shaping.
  • New releases include FlashAttention 4 (up to 4x faster long-context performance), a distributed Reinforcement Learning API, ThunderAgent (3.6x throughput for agentic workloads), and ATLAS-2 (1.5x immediate inference gains via real-time adaptation).
  • Conference features frontier leaders from Gamma and Cartesia alongside Together AI researchers deploying large-scale AI systems.
  • Together AI bridges frontier research—led by creators of FlashAttention and ThunderKittens—with real-world production deployment.
  • Event underscores the rise of AI-native companies scaling to $100M ARR faster than prior generations, driven by generative AI adoption in 70% of companies.

Together AI, the AI Native Cloud powering some of the world's fastest-growing AI companies, today launched AI Native Conf, its first-ever conference dedicated to builders creating the next generation of AI applications. The event comes amid rapid business momentum for Together AI, which now serves thousands of customers, supports over one million developers, and has achieved 10x year-over-year growth in annual contract revenue (ACR), including 27 customer deals exceeding $1 million and one exceeding $1 billion.

Together AI has emerged as a core infrastructure provider for leading AI-native companies including Cursor, Decagon, and Cartesia, delivering production-scale inference, pre-training, and model shaping. With an industry-leading systems research lab led by the creators of FlashAttention and ThunderKittens, Together AI sits at the intersection of frontier research and real-world deployment.

"AI is moving faster than any technological shift we've seen before, and the companies being built today look fundamentally different," said Vipul Ved Prakash, co-founder and CEO of Together AI. "This event is about bringing together the AI-native builders at the frontlines and sharing what it actually takes to run AI in production at scale. Our advantage is simple: the same researchers who publish foundational work are the ones shipping it into production systems our customers rely on."

Announcing New Research Breakthroughs and Products From The AI-Native Cloud

At AI Native Conf, Together AI unveiled new research-to-production advancements across kernels, reinforcement learning, and inference optimization, underscoring the company's deep research bench and rapid development cadence. These advancements are intended to improve training and inference performance so that businesses of all sizes can capitalize on generative and agentic AI.

Key announcements include FlashAttention 4, the latest evolution of the widely adopted kernel now powering most major language models in production. FlashAttention 4 delivers up to 4x performance improvements at long sequence lengths, narrowing the gap between theoretical and real-world performance for long-context workloads like coding agents and document reasoning.

A new Reinforcement Learning API decouples inference and training, enabling globally distributed reinforcement learning pipelines that were previously only feasible for organizations with massive, co-located GPU clusters.

ThunderAgent, an open-source, program-aware system for serving and training agentic workloads, delivers up to 3.6x throughput improvements and significantly reduced memory overhead.

ATLAS-2 adapts to real-time user data to optimize inference, delivering an immediate 1.5x speedup.

A Gathering for the AI-Native Generation

AI Native Conf was created in response to the rapid emergence of AI-native startups. Generative AI is already being used by 70% of companies, according to McKinsey, making it the fastest-adopted major technology platform in history. As a result, a new class of AI-native companies is scaling at unprecedented speed, with many reaching $100 million in ARR faster than any previous generation of startups.

The conference features leaders building at the frontier of AI, including Grant Lee, co-founder and CEO of Gamma, and Arjun Desai, co-founder of Cartesia, alongside researchers and engineers deploying AI systems at massive scale.

For a complete recap of research and product announcements from AI Native Conf, visit today's post on the Together AI blog.

Together AI's launch of AI Native Conf and simultaneous unveiling of performance breakthroughs reinforce its position as a leading infrastructure provider at the forefront of AI-native innovation, enabling scalable, efficient deployment of advanced models and agentic systems.

About Together AI 

Together AI is the AI Native Cloud, combining state-of-the-art open-source models, high-performance infrastructure, and frontier research in AI efficiency and scalability. Founded in 2022, Together AI powers over a million developers and some of the world's most demanding AI workloads, delivering production-scale inference, training, and reinforcement learning for the next generation of AI-native companies.
