Why bulk.run exists

AI is good at reasoning.
Computers are good at structure.

What's missing is a reliable way to apply one to the other.

Today, AI mostly runs:

But real work doesn't happen in prompts.
It happens in datasets.

Spreadsheets. Tables. Lists. Logs.
At scale, everything becomes rows.

The gap

When people try to apply AI to real datasets today, they end up with:

These systems are impressive to demo and painful to maintain.

They optimize for interaction, not execution.

What's missing is a simple execution layer:

No workflows.
No hidden state.
No ceremony.

Just jobs that run and finish.

bulk.run is the execution layer for LLMs on tabular data.

It treats datasets as the unit of work.

You give it:

It gives you:

CSV is the first interface. Not the point.

A true execution layer has three requirements:

Structured outputs. Validation. Automatic retries. When something fails, it fails loudly—no silent errors.

Know exactly how each cell was produced: which model, which prompt, which sources, when it ran.

Works with anything. CSV today. Sheets, Airtable, Notion tomorrow. API and SDK so other products can call it.

Build those three, and you stop being "a tool." You become infrastructure.

We're not building an app.
We're building infrastructure.

That means:

Some parts will be hosted.
Some parts will be open.

What matters is that the primitive exists.

bulk.run exists because AI needs to work on the real world—not just talk about it.