Agent Building
Advanced
Always open

Autonomous Robotaxi Decision Agent

Design and implement an advanced agentic system capable of making real-time, safety-critical decisions for autonomous robotaxis. This challenge focuses on building a robust, stateful decision-making agent that can handle complex traffic scenarios, unexpected events, and regulatory compliance by integrating real-time sensor data and regulatory knowledge bases. The system will leverage extended thinking capabilities and graph-based workflows to ensure safe and efficient operation within a simulated environment. The core of this challenge involves orchestrating a multi-agent system where a central 'Driver Agent' (powered by Gemini 2.5 Pro in Deep Think mode) uses LangGraph to manage dynamic state and decision paths. This agent will communicate with 'Sensor Agents' and 'Regulatory Agents' via a simplified A2A protocol. Tool integration will simulate vehicle controls (acceleration, braking, steering) and external data feeds (traffic updates, weather forecasts, geo-fencing policies).

Status
Always open
Difficulty
Advanced
Points
500
Start the challenge to track prompts, tools, evaluation progress, and leaderboard position in one workspace.
Challenge at a glance
Host and timing
Vera

AI Research & Mentorship

Starts Available now
Evergreen challenge
Challenge brief

What you are building

The core problem, expected build, and operating context for this challenge.

Design and implement an advanced agentic system capable of making real-time, safety-critical decisions for autonomous robotaxis. This challenge focuses on building a robust, stateful decision-making agent that can handle complex traffic scenarios, unexpected events, and regulatory compliance by integrating real-time sensor data and regulatory knowledge bases. The system will leverage extended thinking capabilities and graph-based workflows to ensure safe and efficient operation within a simulated environment. The core of this challenge involves orchestrating a multi-agent system where a central 'Driver Agent' (powered by Gemini 2.5 Pro in Deep Think mode) uses LangGraph to manage dynamic state and decision paths. This agent will communicate with 'Sensor Agents' and 'Regulatory Agents' via a simplified A2A protocol. Tool integration will simulate vehicle controls (acceleration, braking, steering) and external data feeds (traffic updates, weather forecasts, geo-fencing policies).

Datasets

Shared data for this challenge

Review public datasets and any private uploads tied to your build.

Loading datasets...
Learning goals

What you should walk away with

Master LangGraph for building stateful agent workflows, including persistence and conditional branching for dynamic decision paths.

Implement A2A protocol concepts for secure and efficient agent-to-agent communication between specialized 'Sensor', 'Regulatory', and 'Driver' agents.

Leverage Gemini 2.5 Pro's Deep Think mode to enable advanced reasoning, complex scenario analysis, and dynamic problem-solving within the 'Driver Agent'.

Build MCP-enabled tool integration layers to interface with simulated vehicle control APIs (e.g., accelerate, brake, turn) and external data sources (e.g., traffic, weather).

Design and apply extended thinking techniques with adaptive reasoning budgets to address safety-critical situations, ensuring the agent can 'think longer' for high-stakes decisions.

Develop RAG mechanisms to provide the 'Regulatory Agent' with up-to-date city regulations and geo-fencing policies, retrieved in real-time.

Orchestrate a multi-agent system where roles are clearly defined, and collaboration patterns are optimized for autonomous navigation and safety compliance.

Your progress

Participation status

You haven't started this challenge yet

Timeline and host

Operating window

Key dates and the organization behind this challenge.

Start date
Available now
Run mode
Evergreen challenge
Explore

Find another challenge

Jump to a random challenge when you want a fresh benchmark or a different problem space.

Useful when you want to pressure-test your workflow on a new dataset, new constraints, or a new evaluation rubric.

Tool Space Recipe

Draft
Evaluation

Frequently Asked Questions about Autonomous Robotaxi Decision Agent