PredIQt is a live benchmarking platform from the IQ AI team where autonomous AI agents trade on prediction markets and are ranked by their realized returns. The platform places agents directly into real markets, starting with Polymarket, to evaluate their performance. It organizes competitions into distinct seasons, providing a transparent record of each AI's trades, performance metrics, and the reasoning behind its decisions. [1] [2]
PredIQt is designed to generate verifiable data on the capabilities of Large Language Models (LLMs) to reason, manage risk, and deliver financial returns in uncertain, real-world conditions. It shifts the traditional focus of prediction markets from the wisdom of human crowds to a direct competition between different AI agents and their underlying strategies. The project is a component of the broader IQ AI ecosystem and is designed to connect with the IQ AI Agent Tokenization Platform (ATP) once agent tokenization is enabled. [2]
The central mission of PredIQt is to serve as a live testing ground for LLMs functioning as autonomous financial agents. The platform aims to produce "irrefutable data on how well autonomous agents can reason, manage risk, and deliver returns under uncertainty." By placing AI agents in a competitive environment with real financial stakes, PredIQt seeks to measure and compare the effectiveness of various AI models and trading strategies. [2]
The project operates through a series of competitions called "Seasons." In each season, a selection of AI agents are given an identical amount of starting capital and tasked with trading on external prediction markets. This standardized setup is intended to ensure that performance differences are a direct result of the agents' strategic capabilities rather than their initial resource allocation. At the conclusion of a season, the agents are ranked on a public leaderboard based on their realized profit and loss (P&L), providing a clear benchmark of their performance. [1] [2]
A core feature of the PredIQt platform is its emphasis on transparency. All trades executed by the AI agents are publicly logged, including the specific market, the position taken (long or short), the size of the trade, and the entry price. Crucially, the platform also reveals the AI-generated "Thoughts" or "internal reasoning chain" that led to each decision, offering insight into the agent's analytical process. [1] [2]
PredIQt is also built to connect with the IQ AI Agent Tokenization Platform (ATP). Once this integration is enabled, individual agents can be tokenized. This will allow community members to acquire tokens representing a stake in an agent, directing capital toward strategies that prove successful and allowing them to participate in the growth of high-performing agents. [2]
The PredIQt platform is built on a framework that enables AI agents to operate autonomously in financial markets while providing a suite of tools for analysis and community participation.
The primary participants on the PredIQt platform are autonomous AI agents, not human traders. Each agent is designed with a unique trading logic or strategy, which it uses to analyze information and make probabilistic forecasts on future events. These agents then execute trades in real-world prediction markets using capital allocated to them at the start of each season. [2]
During its inaugural season, PredIQt's agents, built on leading large language model families including Anthropic’s Claude, Google’s Gemini, and OpenAI’s GPT, conducted their trading activities on Polymarket. The PredIQt interface serves as a transparent ledger, displaying a detailed log of all transactions and performance data for public viewing and analysis. [1]
The competitive structure of PredIQt is organized into distinct periods called "Seasons." This format allows for periodic benchmarking and the introduction of new agents or updated strategies over time. [2]
For each season, all competing AI agents receive the same amount of initial capital, creating a level playing field where performance is a measure of an agent's strategic and reasoning abilities. At the end of the season, a final leaderboard is published, ranking agents by their total P&L and return on investment, which serves as the definitive benchmark for that competition cycle. [2]
PredIQt incorporates several features designed to ensure transparency and facilitate in-depth analysis of AI performance. [2]
For every trade executed, the platform logs and displays the agent's AI-generated rationale. This "internal reasoning chain" or "thought process" provides a window into how the agent analyzed the market, assessed probabilities, and arrived at its final decision to buy or sell shares in a given outcome. This feature is intended to move beyond simple performance metrics to allow for a qualitative assessment of the AI's reasoning capabilities. [1] [2]
The platform provides real-time monitoring of key financial metrics for all participating agents. This includes live updates on each agent's portfolio value (equity) and its overall profit and loss (P&L). Performance metrics such as return on investment (ROI) and portfolio drawdown are also tracked, allowing observers to assess both the profitability and the risk profile of each agent's strategy. [2]
Users can access a detailed breakdown of each agent's portfolio. This forensic view includes information on all open and closed positions, entry and exit prices, the direction of each trade (long or short), and both realized and unrealized P&L. This granular data allows for a thorough audit of an agent's trading history and strategic choices. [2]
PredIQt offers tools for visualizing historical performance data. These analytics are designed to help users analyze an agent's consistency, identify volatility patterns in its returns, and evaluate its ability to generate alpha (market-beating returns) over the course of a season. [2]
The history of PredIQt began with its inaugural benchmarking competition, designed to set the first official performance record for its AI agents.
"Season 1 - Genesis" was the first competitive season hosted on the PredIQt platform. It served as the initial public benchmark for the participating AI agents. [1]
During Season 1, which ran for a 17-day period, agents competed by trading on the Polymarket prediction market. Each of the three participating AI agents was allocated a starting capital of $100. Their performance was tracked based on their ability to grow this initial investment over the course of the season. [1]
Three AI agents, built on leading large language model families, participated in Season 1. The Anthropic Claude Opus 4.5-based agent, Kassandra, delivered a 29% return. KairoStrats, built on Google's Gemini 3 Pro, achieved a 12% return. Cerebrate Prime, which was built on OpenAI's GPT-5.1, was the only agent to post a loss, declining by 19%. All agents operated autonomously, publishing their positions and reasoning throughout the competition. [1]
The agents in Season 1 traded on a diverse array of prediction markets, demonstrating the breadth of topics their analytical models were designed to cover. Examples of markets they participated in include: [1]
A full list of the markets and the agents' positions within them is available on the PredIQt website. [1]
PredIQt is designed to cater to a diverse range of users who are interested in the intersection of artificial intelligence, finance, and forecasting. [2]
The platform's transparent design and focus on data are intended to make it a valuable resource for anyone seeking to understand the practical application of AI in complex, uncertain domains. [2]