Agent Building
Advanced
Always open

Multimodal Prediction Market Agent

This challenge focuses on building a highly sophisticated agent for analyzing real-time prediction markets. Leveraging Google's Agent Development Kit (ADK) with its strong multimodal capabilities, developers will create an agent that aggregates diverse data sources—market data, news feeds, social media sentiment, and expert opinions—to forecast event outcomes and identify potential arbitrage opportunities. This goes beyond simple data analysis to synthesize complex information for strategic decision-making. The system will use Claude Opus 4.5 for nuanced textual analysis and advanced strategy generation, interpreting qualitative market signals. Grok 4 Heavy will be integrated for high-performance, real-time complex pattern recognition and scenario simulation, enabling rapid assessment of market dynamics. Firebender AI will serve as a specialized service, offering advanced market data aggregation and pattern recognition as a tool that the Google ADK agent orchestrates. The agent will interact with a simulated Prediction Market API to fetch real-time contract data and execute simulated trades. This challenge emphasizes multimodal data fusion, multi-LLM orchestration, and the integration of specialized agentic services to create a powerful, real-time market intelligence system capable of providing actionable insights to 'pro gamblers' and financial strategists.

Challenge brief

What you are building

The core problem, expected build, and operating context for this challenge.

This challenge focuses on building a highly sophisticated agent for analyzing real-time prediction markets. Leveraging Google's Agent Development Kit (ADK) with its strong multimodal capabilities, developers will create an agent that aggregates diverse data sources—market data, news feeds, social media sentiment, and expert opinions—to forecast event outcomes and identify potential arbitrage opportunities. This goes beyond simple data analysis to synthesize complex information for strategic decision-making. The system will use Claude Opus 4.5 for nuanced textual analysis and advanced strategy generation, interpreting qualitative market signals. Grok 4 Heavy will be integrated for high-performance, real-time complex pattern recognition and scenario simulation, enabling rapid assessment of market dynamics. Firebender AI will serve as a specialized service, offering advanced market data aggregation and pattern recognition as a tool that the Google ADK agent orchestrates. The agent will interact with a simulated Prediction Market API to fetch real-time contract data and execute simulated trades. This challenge emphasizes multimodal data fusion, multi-LLM orchestration, and the integration of specialized agentic services to create a powerful, real-time market intelligence system capable of providing actionable insights to 'pro gamblers' and financial strategists.

Datasets

Shared data for this challenge

Review public datasets and any private uploads tied to your build.

Loading datasets...
Evaluation rubric

How submissions are scored

These dimensions define what the evaluator checks, how much each dimension matters, and which criteria separate a passable run from a strong one.

Max Score: 4
Dimensions
4 scoring checks
Binary
4 pass or fail dimensions
Ordinal
0 scaled dimensions
Dimension 1predictionaccuracy

PredictionAccuracy

Agent must correctly predict the market outcome for at least 80% of test cases.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 2arbitrageidentificationrate

ArbitrageIdentificationRate

Agent must correctly identify 90% of intentionally planted arbitrage opportunities.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 3average_prediction_confidence

Average Prediction Confidence

Average confidence score for all market predictions made by the agent. • target: 0.8 • range: 0-1

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 4arbitrage_profitability_average_margin

Arbitrage Profitability (Average Margin)

Average profit margin of detected arbitrage opportunities. • target: 0.05 • range: 0-0.1

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Learning goals

What you should walk away with

Master Google ADK for building multimodal AI agents, focusing on ingesting and processing diverse data formats (text, images, video snippets from news).

Implement multi-LLM orchestration, using Claude Opus 4.5 for sophisticated sentiment analysis from news and social media, and Grok 4 Heavy for high-throughput pattern recognition in numerical market data.

Integrate Firebender AI as a specialized agentic service tool, enabling the Google ADK agent to delegate complex market data aggregation and statistical pattern recognition tasks.

Design and build real-time data pipelines to ingest live prediction market contract data and news feeds from a simulated Prediction Market API.

Develop advanced analytical algorithms within the agent to identify subtle market inefficiencies and potential arbitrage opportunities across different prediction markets.

Implement robust evaluation metrics to assess the accuracy of market predictions and the profitability of identified arbitrage strategies.

Explore ethical considerations and potential biases in using AI for financial prediction markets, ensuring transparency in decision-making.

Start from your terminal
$npx -y @versalist/cli start multimodal-prediction-market-agent

[ok] Wrote CHALLENGE.md

[ok] Wrote .versalist.json

[ok] Wrote eval/examples.json

Requires VERSALIST_API_KEY. Works with any MCP-aware editor.

Docs
Manage API keys
Challenge at a glance
Host and timing
Vera

AI Research & Mentorship

Starts Available now
Evergreen challenge
Your progress

Participation status

You haven't started this challenge yet

Timeline and host

Operating window

Key dates and the organization behind this challenge.

Start date
Available now
Run mode
Evergreen challenge
Explore

Find another challenge

Jump to a random challenge when you want a fresh benchmark or a different problem space.

Useful when you want to pressure-test your workflow on a new dataset, new constraints, or a new evaluation rubric.

Tool Space Recipe

Draft
Evaluation
Rubric: 4 dimensions
·PredictionAccuracy(1%)
·ArbitrageIdentificationRate(1%)
·Average Prediction Confidence(1%)
·Arbitrage Profitability (Average Margin)(1%)
Gold items: 2 (2 public)

Frequently Asked Questions about Multimodal Prediction Market Agent