.

Research and latest news.

February 26, 2026

Code Review Bench: Towards Billion Dollar Benchmarks

January 30, 2026

ARES: Open-Source Infrastructure for Online RL on Coding Agents

January 15, 2026

Beyond Static Mechanistic Interpretability: Agentic Long-Horizon Tasks as the Next Frontier

December 7, 2025

Martian Interpretability Challenge, Part 2: The Core Problems In Interpretability

October 30, 2025

Beyond Beyond Monoliths: An Exploration of Martian’s Position Paper - Part 1

October 3, 2025

Up and to the left! How Martian Uses Routing to Push the Pareto Frontier

September 30, 2025

Hiring Announcement: Fazl Barez

August 18, 2025

Approximating Human Preferences Using a Multi-Judge Learned System

August 6, 2025

Research Highlight: Guardian Loop

May 13, 2025

Beyond Monolithic AI: The Case for an Expert Orchestration Architecture

December 6, 2024

AI Safety Grant Update: Purging Corrupted Capabilities across Language Models

September 16, 2024

Martian Partners with Accenture, Launches Airlock Compliance for Enterprises

June 25, 2024

Claude Sonnet 3.5 Release: Token Prices and Jevons Paradox

June 13, 2024

Cracking the Code: Automated Prompt Optimization. Insights from Industry Leaders

May 31, 2024

Scaling AI Interpretability

May 24, 2024

AI Safety vs Capitalism

May 17, 2024

The Sustainability Challenge of AI: Tackling the Energy Footprint of LLMs

May 10, 2024

Model Mapping: The Key to AI Alignment and Beyond

May 2, 2024

Expanding Horizons: Embracing the Multi-Model Future with Martian

March 20, 2024

Introducing RouterBench

July 20, 2023

Introducing Martian - Better AI Tools Through Better Understanding