Agent Building
Advanced
Always open

Real-time Voice Assistant for Market Intelligence

Develop a sophisticated real-time voice assistant capable of performing competitive market intelligence and product analysis. This challenge focuses on building a responsive, multi-modal agent system that leverages the AI SDK for seamless streaming interactions and sophisticated tool orchestration. The core reasoning will be powered by Google's Gemini 3 Flash, providing rapid and accurate insights based on voice input. The system will integrate Microsoft AutoGen to spin up specialized, scriptable agents that handle deep-dive research tasks, collaborating with the primary voice agent. Security and data integrity are paramount, so LatticeFlow AI will be utilized to implement robust model safety policies and secure data pipelines. For comprehensive monitoring and evaluation, LangFuse will be integrated to trace agent interactions and performance metrics, ensuring the system operates efficiently and reliably. Retell AI will provide the real-time voice-to-text and text-to-speech capabilities, enabling natural language interactions for the end-user. This challenge emphasizes cutting-edge multi-agent orchestration, real-time voice processing, and advanced AI engineering practices to deliver a high-performance, secure, and observable AI application.

Challenge brief

What you are building

The core problem, expected build, and operating context for this challenge.

Develop a sophisticated real-time voice assistant capable of performing competitive market intelligence and product analysis. This challenge focuses on building a responsive, multi-modal agent system that leverages the AI SDK for seamless streaming interactions and sophisticated tool orchestration. The core reasoning will be powered by Google's Gemini 3 Flash, providing rapid and accurate insights based on voice input. The system will integrate Microsoft AutoGen to spin up specialized, scriptable agents that handle deep-dive research tasks, collaborating with the primary voice agent. Security and data integrity are paramount, so LatticeFlow AI will be utilized to implement robust model safety policies and secure data pipelines. For comprehensive monitoring and evaluation, LangFuse will be integrated to trace agent interactions and performance metrics, ensuring the system operates efficiently and reliably. Retell AI will provide the real-time voice-to-text and text-to-speech capabilities, enabling natural language interactions for the end-user. This challenge emphasizes cutting-edge multi-agent orchestration, real-time voice processing, and advanced AI engineering practices to deliver a high-performance, secure, and observable AI application.

Datasets

Shared data for this challenge

Review public datasets and any private uploads tied to your build.

Loading datasets...
Evaluation rubric

How submissions are scored

These dimensions define what the evaluator checks, how much each dimension matters, and which criteria separate a passable run from a strong one.

Max Score: 7
Dimensions
7 scoring checks
Binary
7 pass or fail dimensions
Ordinal
0 scaled dimensions
Dimension 1real_time_voice_latency

Real-time Voice Latency

Ensure average response time from audio input to audio output is under 3 seconds.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 2market_analysis_accuracy

Market Analysis Accuracy

Verify key findings from market analysis queries are factually correct and relevant.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 3autogen_task_completion

AutoGen Task Completion

Confirm AutoGen agents successfully complete their assigned sub-tasks.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 4security_policy_adherence

Security Policy Adherence

Check if LatticeFlow AI correctly identifies and mitigates policy violations.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 5semantic_similarity_score_analysis

Semantic Similarity Score (Analysis)

Cosine similarity between generated analysis text and expert-reviewed benchmark text. • target: 0.85 • range: 0.7-1

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 6tool_invocation_rate

Tool Invocation Rate

Percentage of relevant tools invoked per complex query. • target: 0.95 • range: 0.8-1

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 7langfuse_trace_completeness

LangFuse Trace Completeness

Percentage of agent steps and tool calls captured in LangFuse traces. • target: 1 • range: 0.95-1

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Learning goals

What you should walk away with

Master Vercel's AI SDK for building streaming AI applications in TypeScript, including advanced tool calling and state management.

Implement real-time voice input and output using Retell AI, connecting it seamlessly with the AI SDK and underlying language models.

Orchestrate dynamic agent teams with Microsoft AutoGen, defining specialized roles for information gathering, synthesis, and summarization.

Design secure agent-to-agent communication protocols and data handling practices, integrating LatticeFlow AI for compliance and risk management.

Build extended reasoning pipelines leveraging Gemini 3 Flash for rapid and context-aware analysis of complex market data.

Integrate LangFuse into the AI SDK and AutoGen workflows for end-to-end tracing, monitoring, and performance evaluation of agent interactions.

Develop custom tools for the AI SDK agents to access external market data APIs, web scrapers, and internal knowledge bases.

Start from your terminal
$npx -y @versalist/cli start real-time-voice-assistant-for-market-intelligence

[ok] Wrote CHALLENGE.md

[ok] Wrote .versalist.json

[ok] Wrote eval/examples.json

Requires VERSALIST_API_KEY. Works with any MCP-aware editor.

Docs
Manage API keys
Challenge at a glance
Host and timing
Vera

AI Research & Mentorship

Starts Available now
Evergreen challenge
Your progress

Participation status

You haven't started this challenge yet

Timeline and host

Operating window

Key dates and the organization behind this challenge.

Start date
Available now
Run mode
Evergreen challenge
Explore

Find another challenge

Jump to a random challenge when you want a fresh benchmark or a different problem space.

Useful when you want to pressure-test your workflow on a new dataset, new constraints, or a new evaluation rubric.

Tool Space Recipe

Draft
Evaluation
Rubric: 7 dimensions
·Real-time Voice Latency(1%)
·Market Analysis Accuracy(1%)
·AutoGen Task Completion(1%)
·Security Policy Adherence(1%)
·Semantic Similarity Score (Analysis)(1%)
·Tool Invocation Rate(1%)
·LangFuse Trace Completeness(1%)
Gold items: 3 (3 public)

Frequently Asked Questions about Real-time Voice Assistant for Market Intelligence