Anthropic
A research company building reliable, interpretable, steerable AI systems.
Stage
Series A
Raising
$124M
Founded
San Francisco, 2021
Contact
founders@anthropic.com
Anthropic
02 · Problem
The problem

AI systems are getting more capable faster than they are getting more honest.

Large language models can now write code, summarize legal contracts, and answer medical questions. They will be deployed in places where being wrong with confidence has real consequences.

The same models still hallucinate citations, follow harmful instructions when asked nicely, and produce outputs no one inside the lab can fully explain.

Capability research has a five-year head start on safety research. Without a deliberate push, the gap widens with every new training run.

Anthropic
03 · Why Now
Why now

Foundation models stopped being a research curiosity. Safety has to catch up before the next training cycle.

01

Capability scaling laws are predictable

Scaling exponents from 2020 to 2021 held within 5% across three independent labs. We can forecast the next jump. We cannot yet forecast its failure modes.

Research curiosity Industrial primitive
02

Deployment is outpacing alignment

GPT-3 sits behind millions of API calls per day. The number of researchers working on its interpretability fits in a single conference room.

Theoretical concern Shipping requirement
03

Constitutional methods now beat RLHF on key axes

Self-critique with a written set of principles reduces harmful outputs by 40% over standard human-feedback training, and explains its choices.

Opaque preferences Written principles
Anthropic
04 · Solution
Our approach

We build frontier models and the safety techniques to deploy them, in the same building, by the same team.

Most labs treat safety as a downstream filter. We treat it as a design constraint on the model itself, written into training, interpretability tooling, and the assistant we ship to customers.

Constitutional AI

Train assistants against a written set of principles instead of opaque human preferences. The model can explain why it refused.

Mechanistic interpretability

Reverse-engineer the circuits inside a transformer the way a chip designer reads a die shot. Understand the model, not just its outputs.

Responsible scaling

Capability thresholds trigger pre-committed safety evaluations. We do not ship the next model until the previous tier's evals pass.

Anthropic
05 · Product
Product

Claude, a safety-trained assistant, plus an API research labs and enterprises build on.

Claude Assistant

A conversational interface tuned for long-form reasoning, refusal transparency, and citation discipline.

  • · 100K-token context
  • · Constitutional refusals
  • · Cited sources, where known

Claude API

One endpoint. Same safety stack. Used today by legal, research, and customer-support teams in production.

  • · Streaming responses
  • · System prompts
  • · Tier pricing by token volume

Safety toolchain

Open-sourced interpretability libraries plus model evaluations the broader research community uses on their own systems.

  • · Circuit tracing
  • · Sycophancy evals
  • · Red-team playbooks
Anthropic
06 · Demo
Inside the product
A representative conversation in Claude.
claude.ai / chat / a-research-question
JD
You
Walk me through how Constitutional AI differs from standard RLHF, in two paragraphs, for a reader who has trained models but not read the paper.
C
Claude
Standard RLHF asks human labelers to rank model outputs, then trains a reward model on those rankings, then fine-tunes the policy against that reward. The signal is dense but opaque, the model learns "this kind of output gets high reward," not "this kind of output is honest."

Constitutional AI replaces the human ranker with the model itself, evaluating its own draft against a written list of principles. The model produces a critique, then revises. The signal is sparser but the rule it learned is now legible, you can read the constitution and predict the refusal.
Anthropic
07 · Market
Market

Foundation-model inference is becoming an operating cost line for every software company.

$1.3T+
Total addressable spend on labor categories foundation models can credibly assist by 2030.
McKinsey Global Institute, generative AI productivity report, 2023
$200B
Annual enterprise spend on software categories where natural-language interfaces will replace point-and-click UI.
$95B
Annual outsourced knowledge-work spend (BPO, contact centers, research support) directly addressable by a safety-trained assistant.
3.4x
Foundation-model API revenue growth, 2020 to 2022, across the three largest providers.
Anthropic
08 · Business Model
How we make money

Usage-based API pricing for developers, seat-based enterprise plans for teams.

Free tier
Research access, rate-limited. On-ramp for academic users and individual builders.
$0per month
Enterprise
Seat licenses, dedicated capacity, SOC2, custom safety policies, named SRE support.
$60per seat month

Unit economics, directional

Gross margin, API tier62%
Gross margin, Enterprise tier78%
Payback period, enterprise8 months
Net revenue retention, charter customers141%
Margin improves as we move inference onto our own training infrastructure. Today we still rent the majority of our compute.
Anthropic
09 · Traction
Traction

Quiet but compounding. The customers we already have keep buying more.

11x
Year-over-year growth in monthly API token volume, second half of 2022.
Anthropic infrastructure logs
141%
Net revenue retention across our 22 charter enterprise customers, trailing six months.
Customer billing system
38
Research papers published or accepted at top-tier venues by the team since founding.
Anthropic publications page
"Claude is the only assistant we have been comfortable putting in front of a medical-records workflow. The refusals are predictable, and we can read why." VP Engineering, Series C health-tech customer
Anthropic
10 · Competition
Competition

Three labs ship frontier models. One is shipping a research program for using them safely.

OpenAI

Capability-first, safety as filter

Significant talent, fast model cadence. Safety work happens, but it is positioned downstream of training. Their organizational incentive is to maximize capability lead time.

Google DeepMind

Research depth, deployment caution

Strong interpretability publications. Product velocity is constrained by Alphabet's brand exposure, which means many of their best safety ideas reach users on someone else's product first.

Open-source labs

Distribution, no safety stack

Weight releases accelerate research and remove guardrails simultaneously. Open-source models are part of the ecosystem. They are not building the assistant an enterprise can deploy without a separate safety team.

Anthropic

Frontier model, plus a safety research program, plus the assistant that proves it

We are the only lab where the same researchers train the model, publish the interpretability paper, and ship the customer-facing product. That feedback loop is the moat. It also dictates what we will and will not build.

Anthropic
11 · Moat
Defensibility

A safety research program compounds. A capability lead does not.

Any large lab can match a single capability benchmark within twelve months. Replicating a research program takes years, the people, the publications, the trust with customers who have already deployed it.

Our moat is the ladder, each rung opens the next one, and each one is something a competitor would have to rebuild from scratch.

It is also why we attract the researchers we attract. Safety-motivated engineers will not work on a product where their work is positioned downstream.

06
Autonomous research assistants
Models that conduct their own alignment experiments
2024+
05
Mechanistic interpretability at scale
Circuit-level explanations for production models
Ours
04
Constitutional training pipelines
Written-principle training in production
Ours
03
Safety-trained assistant in market
Claude API, with refusal transparency
Ours
02
RLHF deployment
Standard preference-trained models
Industry
01
Foundation-model training
Large transformer pre-training
Industry
Anthropic
12 · GTM
Go to market

Start with the customers who get fired for being wrong. Expand from there.

Wedge · 2022 to 2023

Regulated knowledge work

Legal, healthcare, financial services. Buyers who already pay a premium for predictable, auditable behavior and have someone to escalate to when it fails.

22 customers
Beachhead · 2024

Mid-market platforms

Software companies embedding an assistant inside an existing workflow. They ship through us, our safety reputation becomes part of their product story.

+200 expected
Expansion · 2025+

General developer API

By the time Claude is the safety-default, the long tail of API developers comes for the same reason every enterprise did, the model that refuses correctly.

10k+ developers
Channel insight
We do not need a sales-led motion to win. The enterprises buying us today are the ones our beachhead customers want to sell to next.
Anthropic
13 · Team
The team

A founding team built inside the labs whose work made foundation models work in the first place.

DA
Dario Amodei
Co-founder, CEO
Former VP of Research at OpenAI, where he led the team that produced GPT-2 and GPT-3 and authored the scaling laws paper. Earlier, principal scientist at Baidu's deep learning lab. Physics PhD from Princeton.
If anyone has the receipts on what frontier models will do next, it is the person who wrote the math for it.
DA
Daniela Amodei
Co-founder, President
Former VP of Safety and Policy at OpenAI. Built and ran the team that wrote the deployment guidelines for GPT-3. Earlier, Stripe and Skype on global operations. Political science from UCSC.
The person who has actually written the rulebook a frontier lab ships against.
Founding researchers
Tom Brown · GPT-3 lead author Sam McCandlish · scaling laws co-author Jared Kaplan · Johns Hopkins, scaling laws Chris Olah · circuits at Distill, OpenAI Jack Clark · OpenAI policy, AI Index co-founder
Anthropic
14 · Ask
The ask

Capital to train the next generation of safety-first models and the people to evaluate them.

$124M
Series A. Capital that takes us through the next two training generations and the safety program built around them.
Runway
22 months
Lead
In discussion
Closing
Q2 2023

Frontier model training

The next two generations of pre-training, plus the compute to evaluate them at every checkpoint.

62%

Safety research team

Twenty more interpretability and alignment researchers. The bottleneck on the program is people, not compute.

24%

Enterprise GTM

A small, deliberate field team. We do not need many sellers. We need the right ten.

14%
Next step · email founders@anthropic.com · responses within 48 hours