Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

Announcing Tensordyne Napier

AI Inference.
At the speed you want. And the margin you need.

Tensordyne Napier is a new class of AI inference systems designed for blistering speed and unmatched profitability.

Explore the system

Tensordyne Napier is the fastest, most energy-efficient AI inference system ever built. Driven by logarithmic math and the lowest latency scale-up interconnect, it makes AI inference faster and cheaper than ever before.

Tensordyne Rack - TDN Rack - Perspective view with its doors open, showcasing the internal hardware architecture.

the fastest AI inference 
on earth. at the lowest 
possible cost.

Explore the system
TDN Math
Perspective symbol for TDN Math

Napier
Math

TDN AIP
Perspective symbol for TDN AIP

Napier
Chip

TDN ACT
Perspective symbol for TDN ACT

Napier
Compute Tray

TDN LINK
Perspective symbol for TDN Link

Napier
Scale-up Interconnect

Tensordyne Napier system 
development progress

See the development progress of Tensordyne napier, the world's most efficient system to make AI both fast and cheap.

2026
Entering high-volume manufacturing

Entering high-volume manufacturing

Moving from development to commercial reality, Napier silicon is officially transitioning into High-Volume Manufacturing (HVM) at TSMC's world-class facilities to meet the massive, immediate demand for datacenter AI infrastructure.

2026
Next-gen 3nm silicon successfully taped out

Next-gen 3nm silicon successfully taped out

We finalized the complex netlist and completed the tapeout of our highly advanced 3nm Napier chip in partnership with Broadcom and TSMC, purpose-built to handle multi-modal generative AI inference at a fraction of standard energy costs.

2025
Right-Sizing Hardware for MoE and Agentic AI

Right-Sizing Hardware for MoE and Agentic AI

As breakthroughs such as DeepSeek-R1 reshaped the industry, our agile, full-stack co-design approach enabled us to capitalize on these changes in real time. Throughout 2025, we optimized silicon, systems, and software to support the unique demands of Mixture of Experts (MoE) models and agentic workloads while maximizing performance, efficiency, and scalability.

2024
Engineering ultra-efficient systems with Juniper Networks

Engineering ultra-efficient systems with Juniper Networks

To ensure our architecture translates to elite datacenter performance, we entered a strategic technology partnership with Juniper Networks. By integrating their industry-leading scale-up networking components directly with our platform, we unlock the massive fabric scalability required for modern LLM clusters.

2024
Strategic pivot to datacenter scale

Strategic pivot to datacenter scale

Recognizing that our log-math innovation yields massive efficiency advantages specifically for transformer models, we made the deliberate, level-headed decision to halt legacy vision tracks and focus 100% of our engineering on Napier — our next-generation datacenter AI inference chip.

2023
Validating Value in Multi-Modal and Generative AI

Validating Value in Multi-Modal and Generative AI

While developing the architecture for our second-generation chip, we expanded support for transformer-based models and identified a significant opportunity in emerging generative AI workloads like GPT, Stable Diffusion, and many others. Our simulations demonstrated unmatched processing efficiency, opening a compelling new market opportunity.

2022
Proven silicon delivered to customers

Proven silicon delivered to customers

We successfully shipped functional Scorpio samples to our first wave of validation customers. Our Scorpio development system with surround-view object detection ran non-stop 24/7/365 with no downtime, proving our hardware's real-world reliability and performance.

2021
From architecture to functional 7nm silicon

From architecture to functional 7nm silicon

We successfully taped out our first-generation Scorpio chip with Broadcom and received fully functional 7nm silicon from TSMC. This milestone successfully validated our proprietary log-math architecture in physical hardware.

2019
Patented breakthrough in AI compute

Patented breakthrough in AI compute

We filed our first five foundational patents for log-math-based architectures and kicked off design for Scorpio, a first-generation chip proving that logarithmic mathematics could radically accelerate AI workloads at unprecedented speed and efficiency.

Incredible performance.
In any direction.

Realtime 4K Video Generation

Unlocks conversational AI video by generating 4K content at 30FPS, enabled by any-to-any cell-based scale-up interconnect for seamless linear scaling.

Abstract illustration of a person sitting in front of a laptop

Multi-Trillion Parameter Mixture of Experts

Optimized for DeepSeek-V4 and larger MoE models, leveraging EP72 parallelism and TDN Link to serve giant models at 2x the speed of leading solutions.

Abstract illustration of ai models

High-Speed Agentic Coding

Drives 1,000+ tokens per user, providing the ultra-low latency necessary for instantaneous code generation and complex, multi-step agentic reasoning.

Abstract illustration of a person writing code on a laptop

For Hyperscalers

Scalable Inference Factories

Deploy massive-scale AI factories with massive cost and power advantages over specialized hybrid setups. Our system achieves 608 PFLOPS of dense compute in a rack, utilizing disaggregation to eliminate bottlenecks in multi-trillion parameter model serving. Achieve public cloud performance at a fraction of the footprint.

For Neo Clouds

Premium Speed, High Margins

Deepen your position in high-margin segments by offering per-user speeds exceeding 1,000 tokens per second. Our ultra-low latency scale-up interconnect ensures consistent performance for agentic AI and real-time coding. Seamlessly integrate into your existing K8s-managed stack via PyTorch, Triton, and vLLM support for a frictionless multi-vendor experience.

For Enterprise On-Prem

Cloud Performance, Local TCO

Bring frontier-class AI inside your own walls with a system that is far more energy-efficient than GPU architectures. At 30 kW per pod, our air-cooled design eliminates the need for complex liquid cooling infrastructure. Run the world’s largest models in uncompromised 16-bit precision to minimize hallucinations and maximize data authority.

Reach out for more information.

Portrait of Gilles from Tensordyne
application layer companies
foundation model builders
Gilles Backhus
Product
Book a call
Portrait of Nick from Tensordyne
Hyperscalers
Neo clouds
Nick Vitro
Sales
Book a call

Two Ecosystems.
One Pulse.

We are built into the fabric of North America and Europe. This dual presence gives us a unique seat at the table of two of the world's most sophisticated economies, fueling our innovation with the collective power of both.

About Us
Dark illustration of a globe
UNITED STATES
SUNNYVALE, CA 94089
1195 BORDEAUX DR
GERMANY
MUNICH, 80333
BRIENNER STRASSE 59

00:00

/

00:00