API ready. Haiku 4.5 from $0.25 / M tokens. Get a key →
claude-haiku-4-5 · production-ready

Fast, cheap,
and properly smart.

Haiku 4.5 is the Claude model built to ship in production. Sub-second responses. Twenty-five cents per million input tokens. The same SDK as every other Claude model.

60-second signup · $5 free credit · Drop-in for any Claude model
three_lines_to_haiku.py
import anthropic

response = anthropic.Anthropic().messages.create(
    model="claude-haiku-4-5-20251001",
    max_tokens=512,
    messages=[{"role": "user",
               "content": "Classify this ticket."}]
)

print(response.content[0].text)
# → returns in ~600ms.
# → costs ~$0.0001 per call.
# → ready to scale to millions.
That's it. That's the entire integration. More recipes →
// the numbers

Built for scale, priced for volume.

Haiku 4.5 is what you reach for when latency matters, cost matters, or both — chatbots, classifiers, agents that fan out into thousands of parallel calls.

$0.25 / M in
Input tokens
$1.25 / M out
Output tokens
200K
Context window
50% off
Batch API discount
~10×
Cheaper than Sonnet
3 lines
To first response
0 migration
Same SDK as every Claude
scale
Built for high throughput
// what to build

Six things made for Haiku.

Not every problem needs a flagship model. Here's where Haiku 4.5 quietly outperforms — same quality on the task, fraction of the cost and latency.

Chat

Customer support bots

Answer common tickets, triage to humans, follow tone guidelines. Haiku's speed makes the chat feel native — not "AI is thinking."

~$1 handles 10,000 short replies
Pipeline

Content moderation

Classify text or images at scale: spam, abuse, policy violations, sentiment, intent. Run as a stream, queue, or batch job.

Batch API → $0.125 / M input tokens
Search

Retrieval-augmented Q&A

Use Haiku as the answer-the-question step on top of your vector search. Cheap enough to run on every query, smart enough to be honest.

200K context → entire doc sets per call
Agent

Tool-using agents

Long-running agents fan out into many small Haiku calls — planning, classifying, looking up, formatting. Pair with Opus for hard reasoning steps.

Tool use + MCP → full support
Compress

Summarization at scale

Distill threads, transcripts, articles, support tickets, or product reviews into clean structured output. Run nightly. Run hourly.

10K transcripts → under $5
Extract

Data extraction

Pull structured data out of messy text — invoices, emails, contracts, web pages. Output JSON. Use the batch endpoint when you can wait.

JSON mode + tool use → reliable
俳句

One model. One call.
Returns before your fingers can
leave the keyboard. Done.

— a haiku for Haiku 4.5 · 5 / 7 / 5
// code recipes

Three things, copy-pasteable.

No tutorial. No setup wizard. Just paste these into your editor, set your ANTHROPIC_API_KEY, and run.

Streaming chat

~200ms TTFB
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic();

const stream = client.messages.stream({
  model: "claude-haiku-4-5-20251001",
  max_tokens: 1024,
  messages: [{
    role: "user",
    content: "Help me debug this..."
  }]
});

for await (const event of stream) {
  if (event.type === "content_block_delta" &&
      event.delta.type === "text_delta") {
    process.stdout.write(event.delta.text);
  }
}
typescript · streaming Get key →

Batch classifier

50% off
import anthropic

batch = anthropic.Anthropic().messages.batches.create(
  requests=[
    {"custom_id": f"req_{i}",
     "params": {
       "model": "claude-haiku-4-5-20251001",
       "max_tokens": 128,
       "messages": [{
         "role": "user",
         "content": f"Classify: {text}"
       }]
     }}
    for i, text in enumerate(items)
  ]
)
# 50% cheaper, async, ideal for backfill.
python · batch api Get key →

Tool-using agent

low cost / call
import anthropic

response = anthropic.Anthropic().messages.create(
  model="claude-haiku-4-5-20251001",
  max_tokens=1024,
  tools=[{
    "name": "lookup_order",
    "description": "Get an order by id.",
    "input_schema": {"type": "object",
       "properties": {"id": {"type": "string"}}}
  }],
  messages=[{"role": "user",
             "content": "Where's order #4291?"}]
)
# Haiku picks the tool, fills args, ships fast.
python · tool use Get key →

Spin up an API key.
Ship in the next hour.

Sign in, generate a key, paste one of the recipes above into your editor. The first response will land before your coffee gets cold.

Get API Key $5 free credit · no card to start
// stack-up

When you shouldn't use Haiku.

Haiku is the right tool for a specific shape of problem. Here's where it wins, where it loses, and what to do when you need more.

                     Haiku 4.5                  Sonnet 4.6                 Opus 4.7
Best for             High-volume, simple tasks  Most everyday tasks        Hardest reasoning
Speed                Fastest                    Fast                       Thoughtful
Cost (per M input)   $0.25                      $3.00                      $15.00
Cost (per M output)  $1.25                      $15.00                     $75.00
Reasoning depth      ★★★                        ★★★★                       ★★★★★
Context window       200K                       200K                       200K
Tool use / agents    ✓ Full                     ✓ Full                     ✓ Best
Use it when…         Cost and latency matter    Quality + speed balance    Quality is non-negotiable

Pricing reflects published API rates. Always check docs.claude.com for current numbers.

// pricing

Simple. Per token. Pay-as-you-go.

No seats. No tiers. No annual contracts. Sign up, get $5 in free credit, and burn through it on whichever model fits the job.

Model Input ($ / M tokens) Output ($ / M tokens) Context
Haiku 4.5 $0.25 $1.25 200K
Sonnet 4.6 $3.00 $15.00 200K
Opus 4.7 $15.00 $75.00 200K

Batch API: 50% off across all models · Prompt caching: up to 90% savings on repeated context · See full pricing at anthropic.com/pricing.

// faq

Quick answers about Haiku 4.5.

The questions developers ask most often before they switch a workload over.

What's the model string?

claude-haiku-4-5-20251001. Pass it into any Anthropic SDK call exactly where you'd previously pass a Sonnet or Opus model. Zero migration.

How does Haiku 4.5 compare on quality?

For tasks that don't require deep reasoning — classification, extraction, summarization, retrieval Q&A, simple chat — Haiku 4.5 is genuinely close to Sonnet quality at a fraction of the cost and latency. For hard reasoning, complex code, or long-horizon agents, use Sonnet or Opus. See the comparison →

Does Haiku 4.5 support tool use, vision, and streaming?

Yes — full support. Tool use, JSON mode, vision (image inputs), streaming, the batch API, prompt caching, and MCP integrations all work the same way they do on Sonnet and Opus.

How fast is "ultra fast"?

Time-to-first-token is typically a few hundred milliseconds. For short replies you'll often see end-to-end completion under one second. Exact numbers depend on prompt length, region, and load — measure on your own workload before committing.

Can I use the Batch API?

Yes — Haiku supports the Batch API at a 50% discount, ideal for backfills, periodic jobs, or anything where latency isn't critical. With prompt caching layered on top, costs can drop another 50–90%.

What's the context window?

200,000 tokens — the same as Sonnet and Opus. You can drop in long documents, transcripts, or RAG context and Haiku reasons over it cleanly.

Is there a free tier for the API?

Yes. New API accounts include $5 in free credit, which goes a long way on Haiku — enough to run thousands of short calls before you'd add a payment method. Sign up at console.anthropic.com.

Can I use Haiku 4.5 in Claude.ai or Claude Code?

Yes. Claude.ai paid plans include Haiku as a switchable model in the dropdown. Claude Code accepts --model claude-haiku-4-5-20251001 for fast, cheap routine tasks.

Will my API data be used for training?

By default, Anthropic does not use API data to train models. Enterprise customers can additionally enable zero data retention. Full details at anthropic.com/privacy.

How do I move an existing workload from Sonnet to Haiku?

Change the model string. That's the migration. Test on a sample of your real traffic, compare outputs, watch for any quality regressions on the harder edge cases, and route those back to Sonnet or Opus if needed. Most teams ship the change in under a day.

Build the next thing.
Ship it on Haiku.

Get an API key, paste one of the recipes, and watch a response stream back in under a second. The whole loop takes less time than reading this page.

$ npm install @anthropic-ai/sdk · pip install anthropic · $5 free credit

Get API Key →