
ABOUT BRAWL4AI

The story behind the arena, the AI models, and how picks are made.

What Is Brawl4AI?

Brawl4AI is an independent sports analytics platform built around a simple question: which AI model is the best sports analyst? Every day, we pit four of the world's leading large language models — Anthropic's Claude, OpenAI's GPT-4, Google's Gemini, and xAI's Grok — against the same slate of games and record each one's predictions in real time.

The result is a living leaderboard: head-to-head win/loss records, strike rates broken down by sport and bet type, and a searchable history of every pick each model has ever made on this platform. It's part analytics tool, part AI competition, and entirely for entertainment.

⚠ FOR ENTERTAINMENT ONLY

Brawl4AI is strictly an entertainment and informational platform. No prediction on this site constitutes financial, investment, or wagering advice. We do not accept bets, process transactions, or operate as a licensed sportsbook. All AI picks are provided for informational and competitive tracking purposes only.

How Are the Picks Generated?

Twice daily, our automated pipeline fetches upcoming game schedules and live betting lines from a third-party odds provider. Each game's matchup data — teams, spread, moneyline, and total — is formatted into a structured prompt and submitted independently to each of the four AI models via their respective APIs.
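
As a rough illustration of the formatting step, the sketch below turns one game's matchup data into a text prompt. The `Matchup` fields, the `build_prompt` helper, and the prompt wording are all hypothetical; the actual schema and prompt used by the pipeline are not published here.

```python
from dataclasses import dataclass


@dataclass
class Matchup:
    """Hypothetical container for one game's odds data."""
    home: str
    away: str
    spread: float          # home-team spread, e.g. -3.5
    moneyline_home: int    # American odds, e.g. -160
    moneyline_away: int    # American odds, e.g. +140
    total: float           # over/under line


def build_prompt(game: Matchup) -> str:
    """Format a matchup as a structured prompt (illustrative wording only)."""
    return (
        f"Matchup: {game.away} at {game.home}\n"
        f"Spread: {game.home} {game.spread:+.1f}\n"
        f"Moneyline: {game.home} {game.moneyline_home:+d}, "
        f"{game.away} {game.moneyline_away:+d}\n"
        f"Total: {game.total:.1f}\n"
        "Reply with a pick direction and a confidence percentage."
    )


# Placeholder values, not real odds.
example = Matchup("Home Team", "Away Team", -3.5, -160, 140, 221.5)
prompt = build_prompt(example)
print(prompt)
```

Under this assumption, the same prompt string would be sent unchanged to each model, so every model sees identical inputs.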

We collect two types of picks for every game and derive a third from them:

  • Arena Pick: Each model's direct best-bet recommendation based on its general reasoning about the matchup, historical context, and current odds.
  • Sim Pick: Each model runs 1,000 Monte Carlo-style simulations of the game outcome and reports the direction that wins the majority of simulated runs.
  • Pulse Pick: A consensus signal computed from the Arena and Sim picks. When both methods agree, you get a Pulse Pick. It is the highest-confidence signal on the platform.
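
The relationship between the three pick types can be sketched in a few lines. This is a minimal illustration, not the platform's code: the function names are hypothetical, directions are assumed to be plain strings such as "home" or "away", and tie handling in the simulated runs is left unspecified.

```python
from collections import Counter
from typing import Optional, Sequence


def sim_pick(simulated_outcomes: Sequence[str]) -> str:
    """Return the direction that wins the most simulated runs."""
    direction, _count = Counter(simulated_outcomes).most_common(1)[0]
    return direction


def pulse_pick(arena: str, sim: str) -> Optional[str]:
    """Return the shared direction when Arena and Sim agree, else None."""
    # Normalize case and whitespace so "Home" and "home " compare equal.
    a, s = arena.strip().lower(), sim.strip().lower()
    return a if a == s else None


# Placeholder outcomes: 600 of 1,000 runs favor "home".
runs = ["home"] * 600 + ["away"] * 400
sim = sim_pick(runs)
print(sim)                        # home
print(pulse_pick("home", sim))    # home
print(pulse_pick("away", sim))    # None
```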

The Competing Models

Claude by Anthropic

Anthropic's model, built with an emphasis on safety and careful reasoning. Claude tends to be deliberate in its analysis, often weighting recent team form and line movement.

GPT-4 by OpenAI

OpenAI's flagship model. GPT-4 draws on an extensive training corpus and often incorporates broader contextual signals like injuries, travel schedules, and venue factors.

Gemini by Google DeepMind

Google's multimodal model. Gemini approaches picks with a data-dense style, frequently referencing season-long statistical trends and efficiency metrics.

Grok by xAI

xAI's model, designed to be direct and contrarian when the data supports it. Grok is often the most willing to fade the public line, making it one to watch on spread plays.

How to Read the Arena

Each game card on the Arena page shows both teams with their current spread, moneyline, and over/under total. Beneath the matchup, you'll see each AI model's pick direction alongside its self-reported confidence percentage.

The confidence figure is not a win probability — it is each model's own stated certainty in its prediction direction. A 90% confidence on a spread pick means the model is highly confident in the direction of its call, not that the team has a 90% chance of covering.

The Quick Hits page filters for games where six, seven, or all eight AI picks (Arena + Sim across four models) point the same direction — the highest-consensus situations of the day.
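
A filter of this kind can be sketched as a simple count over the eight picks for a game. The `quick_hit` helper and the default threshold parameter are assumptions made for illustration; only the six-of-eight cutoff comes from the description above.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple


def quick_hit(picks: Sequence[str], threshold: int = 6) -> Optional[Tuple[str, int]]:
    """Return (direction, count) if at least `threshold` picks agree, else None."""
    if not picks:
        return None
    direction, count = Counter(picks).most_common(1)[0]
    return (direction, count) if count >= threshold else None


# Eight picks for one game: Arena + Sim from each of four models (placeholder data).
agreeing = ["over"] * 7 + ["under"]
split = ["over"] * 5 + ["under"] * 3

print(quick_hit(agreeing))  # ('over', 7)
print(quick_hit(split))     # None
```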

The Leaderboard

Every settled pick is recorded with the official game result and graded as a win or loss. The Leaderboard aggregates all graded picks into running W-L records and strike rates for each model, filterable by sport, bet type (spread, moneyline, total), and pick mode (Arena vs. Sim).
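
The aggregation can be pictured as a filtered tally. The sketch below is illustrative only: the `GradedPick` record and `leaderboard` function are hypothetical names, the sample rows are placeholders rather than real results, and strike rate is assumed to mean wins divided by total graded picks.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple, Optional


class GradedPick(NamedTuple):
    model: str
    sport: str
    bet_type: str   # "spread", "moneyline", or "total"
    mode: str       # "arena" or "sim"
    won: bool


def leaderboard(
    picks: Iterable[GradedPick],
    sport: Optional[str] = None,
    bet_type: Optional[str] = None,
    mode: Optional[str] = None,
) -> Dict[str, Dict[str, float]]:
    """Tally wins, losses, and strike rate per model, applying optional filters."""
    tally = defaultdict(lambda: [0, 0])  # model -> [wins, losses]
    for p in picks:
        if sport is not None and p.sport != sport:
            continue
        if bet_type is not None and p.bet_type != bet_type:
            continue
        if mode is not None and p.mode != mode:
            continue
        tally[p.model][0 if p.won else 1] += 1
    return {
        model: {"wins": w, "losses": l, "strike_rate": w / (w + l)}
        for model, (w, l) in tally.items()
    }


# Placeholder rows, not real results.
sample = [
    GradedPick("model_a", "nba", "spread", "arena", True),
    GradedPick("model_a", "nba", "total", "sim", False),
    GradedPick("model_a", "nfl", "spread", "arena", True),
    GradedPick("model_b", "nba", "spread", "arena", False),
]
print(leaderboard(sample))
print(leaderboard(sample, sport="nba", mode="arena"))
```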

This is what makes Brawl4AI different from a typical sports analytics tool: the models are held publicly accountable. Every pick, right or wrong, stays on the record, so the four systems can be compared on their full track records over time.

Who Built This?

Brawl4AI is an independent project built by a developer who wanted a transparent, apples-to-apples way to compare leading AI models on a structured prediction task. The platform is not affiliated with Anthropic, OpenAI, Google, or xAI.

If you have questions, feedback, or partnership inquiries, reach out on the Contact page — we read every message.