Almanac API

Every outcome verified, structured, and ready to train on

Verified ground truth from prediction markets for AI calibration. Structured records linking questions to manipulation-resistant probabilities and confirmed outcomes.

Coming soon. The Almanac API is currently in development. Sign up for early access to be notified when it launches.

GET /v2/almanac/BWR-GOVT-SHUTDOWN-FED-2024

{
  "ticker": "BWR-GOVT-SHUTDOWN-FED-2024",
  "question": "Will there be a federal government shutdown before Dec 31, 2024?",
  "category": "GOVERNMENT_OPERATIONS",

  "probability_at_close": 0.78,
  "price_label": "6h VWAP, reportable window",

  "outcome": "YES",
  "resolved_at": "2024-12-21T00:00:00Z",
  "resolution_sources": ["omb.gov", "opm.gov"],

  "robustness": {
    "peak_cost_to_move_5c": 112000,
    "reportability": "reportable"
  }
}
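
The API has not launched yet, so the schema may still change. As a minimal sketch, assuming the response keeps the fields shown above and that `outcome` is always "YES" or "NO" (an assumption; the sample shows only "YES"), a client could parse a record into a typed structure like this. `AlmanacRecord` and `parse_record` are hypothetical names, not part of any published SDK.

```python
import json
from dataclasses import dataclass
from datetime import datetime

# The sample response from this page, used as test input.
SAMPLE = """
{
  "ticker": "BWR-GOVT-SHUTDOWN-FED-2024",
  "question": "Will there be a federal government shutdown before Dec 31, 2024?",
  "category": "GOVERNMENT_OPERATIONS",
  "probability_at_close": 0.78,
  "price_label": "6h VWAP, reportable window",
  "outcome": "YES",
  "resolved_at": "2024-12-21T00:00:00Z",
  "resolution_sources": ["omb.gov", "opm.gov"],
  "robustness": {"peak_cost_to_move_5c": 112000, "reportability": "reportable"}
}
"""


@dataclass(frozen=True)
class AlmanacRecord:  # hypothetical type, mirrors the sample fields
    ticker: str
    question: str
    category: str
    probability_at_close: float
    outcome: bool  # True for "YES", False for "NO" (assumed binary)
    resolved_at: datetime
    resolution_sources: tuple
    peak_cost_to_move_5c: float
    reportability: str


def parse_record(payload: str) -> AlmanacRecord:
    """Parse one JSON response body into a typed record."""
    raw = json.loads(payload)
    if raw["outcome"] not in ("YES", "NO"):
        raise ValueError(f"unexpected outcome: {raw['outcome']!r}")
    probability = float(raw["probability_at_close"])
    if not 0.0 <= probability <= 1.0:
        raise ValueError(f"probability out of range: {probability}")
    return AlmanacRecord(
        ticker=raw["ticker"],
        question=raw["question"],
        category=raw["category"],
        probability_at_close=probability,
        outcome=raw["outcome"] == "YES",
        # Replace the trailing "Z" so older Python versions can parse it.
        resolved_at=datetime.fromisoformat(raw["resolved_at"].replace("Z", "+00:00")),
        resolution_sources=tuple(raw["resolution_sources"]),
        peak_cost_to_move_5c=float(raw["robustness"]["peak_cost_to_move_5c"]),
        reportability=raw["robustness"]["reportability"],
    )


record = parse_record(SAMPLE)
print(record.ticker, record.probability_at_close, record.outcome)
```

Validating the outcome and probability at parse time keeps malformed records out of a training set rather than letting them surface later as silent label errors.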

The Problem

AI systems need ground truth.
The training data doesn't exist.

Prediction markets generate millions of probabilistic forecasts, but turning them into calibration data requires verification, filtering, and structure that nobody provides.

Outcomes aren't verified

Platforms self-report resolutions with no independent checks. Disputes go unresolved, edge cases get hand-waved, and there's no audit trail. Training on unverified outcomes means training on noise.

Probabilities aren't filtered

Many market prices used for training come from illiquid markets that are cheap to manipulate. A probability from a market that can be moved for $500 teaches an AI system the wrong lessons about real-world likelihood.

Data isn't structured

There's no standard format linking a question to the probability it produced and the outcome that actually happened. Every research team builds their own fragile pipeline from scratch.

What You Get

Every outcome verified, structured, and ready to train on

Calibration-grade data from prediction markets that actually resisted manipulation.

Reportable-Only Filtering

Only markets whose cost-to-move-5¢ (the capital needed to shift the price by five cents) reached at least $100K during their active window. Real consensus, not noise from thin markets that anyone could move.
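
The threshold check can be sketched as follows. This assumes records shaped like the sample response and that the `peak_cost_to_move_5c` field is the value compared against the $100K threshold; the function names, the extra ticker, and its figure are hypothetical.

```python
REPORTABLE_THRESHOLD_USD = 100_000  # from this page: cost-to-move-5¢ >= $100K


def is_reportable(peak_cost_to_move_5c: float,
                  threshold: float = REPORTABLE_THRESHOLD_USD) -> bool:
    """True when the market's peak cost-to-move-5¢ meets the threshold."""
    return peak_cost_to_move_5c >= threshold


def filter_reportable(records: list) -> list:
    """Keep only records whose robustness metric meets the threshold."""
    return [
        r for r in records
        if is_reportable(r["robustness"]["peak_cost_to_move_5c"])
    ]


markets = [
    # Figure taken from the sample response on this page.
    {"ticker": "BWR-GOVT-SHUTDOWN-FED-2024",
     "robustness": {"peak_cost_to_move_5c": 112000}},
    # Made-up thin market that $500 could move.
    {"ticker": "HYPOTHETICAL-THIN-MARKET",
     "robustness": {"peak_cost_to_move_5c": 500}},
]

for market in filter_reportable(markets):
    print(market["ticker"])
```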

Verified Outcomes

The Resolution Engine checks outcomes against authoritative sources — government databases, official records, wire services. No human discretion, no platform self-reporting.

Structured Triples

(question, probability, outcome) format ready for ML pipelines. Each record links the natural-language question to the manipulation-resistant price and the verified result.
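
One common use of such triples is scoring calibration. The sketch below computes the Brier score (the mean squared error between the stated probability and the 0/1 outcome), which is a standard scoring rule rather than anything specific to the Almanac. The tuple layout and the second record are illustrative assumptions.

```python
from typing import List, Tuple

# (question, probability, outcome) with outcome as a bool; layout is assumed.
Triple = Tuple[str, float, bool]


def brier_score(triples: List[Triple]) -> float:
    """Mean squared error between probability and outcome; lower is better."""
    if not triples:
        raise ValueError("need at least one triple")
    total = 0.0
    for _question, probability, outcome in triples:
        target = 1.0 if outcome else 0.0
        total += (probability - target) ** 2
    return total / len(triples)


triples = [
    # Values from the sample response on this page.
    ("Will there be a federal government shutdown before Dec 31, 2024?", 0.78, True),
    # Made-up record for illustration only.
    ("Hypothetical question that resolved NO", 0.10, False),
]

print(f"Brier score: {brier_score(triples):.4f}")
```

The same function can score a model's own forecasts on the same questions, so the market probability serves as a baseline to compare against.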

Historical VWAP

The volume-weighted average price from the market's reportable window — not a spot price that could reflect a single large trade or manipulation attempt.
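
To illustrate why a volume-weighted average resists a single outlier trade, here is a minimal VWAP sketch. The trade prices and volumes are made-up numbers, and how the Almanac selects its window (the sample response mentions a 6h VWAP) is not specified here.

```python
from typing import Iterable, Tuple


def vwap(trades: Iterable[Tuple[float, float]]) -> float:
    """Volume-weighted average price over (price, volume) pairs."""
    trades = list(trades)  # allow generators; we iterate twice
    total_volume = sum(volume for _, volume in trades)
    if total_volume <= 0:
        raise ValueError("VWAP needs positive total volume")
    return sum(price * volume for price, volume in trades) / total_volume


# Made-up window: three ordinary trades, then one small trade at an outlier price.
window = [(0.78, 1000.0), (0.77, 800.0), (0.79, 1200.0), (0.95, 10.0)]

spot_price = window[-1][0]  # the last trade, which the outlier dominates
print(f"spot={spot_price:.2f} vwap={vwap(window):.3f}")
```

Because the outlier trade carries little volume, it barely shifts the weighted average, while a spot price taken at that moment would report the outlier itself.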

Provenance Metadata

Source markets, platforms, resolution sources, timestamps, and robustness metrics. Full lineage from prediction to outcome for reproducible research.

How It Works

From market to training data

Four steps between a live prediction market and a verified record in the Almanac.

Market reaches threshold

A market achieves reportable status — cost-to-move-5¢ ≥ $100K. Only liquid, manipulation-resistant markets qualify.

Event window closes

The underlying event resolves. The market's final reportable-window VWAP is captured as the probability of record.

Resolution Engine verifies

Automated verification against authoritative sources. What actually happened, confirmed independently of the platform.

Record enters the Almanac

The structured triple — question, probability, verified outcome — is published to the Almanac API with full provenance.
