Workflow Automation
Advanced
Always open

Ethical Ad Personalization Agent

This challenge focuses on building a cutting-edge, ethically-aware ad personalization and delivery system for conversational AI platforms. Leveraging Vercel's AI SDK, developers will design an agent that dynamically generates and filters advertisements based on user context, preferences, and real-time conversation flow, while strictly adhering to a defined set of ethical guidelines. The system must integrate Google's Gemini 3 Pro for multimodal ad content generation and personalization, and use Fiddler AI for continuous monitoring and evaluation of ad compliance against ethical policies. Real-time inference capabilities will be supported by RunPod for specialized ad rendering models, and LiveKit will enable voice-interface interactions for a seamless user experience. This challenge emphasizes responsive, context-aware ad delivery combined with robust ethical governance in generative AI applications. Developers will master the intricacies of creating reactive AI interfaces with streaming capabilities, orchestrating multiple generative models, and implementing an automated observability pipeline for ethical AI compliance, moving beyond simple content generation to intelligent and responsible content curation.

Challenge brief

What you are building

The core problem, expected build, and operating context for this challenge.

This challenge focuses on building a cutting-edge, ethically-aware ad personalization and delivery system for conversational AI platforms. Leveraging Vercel's AI SDK, developers will design an agent that dynamically generates and filters advertisements based on user context, preferences, and real-time conversation flow, while strictly adhering to a defined set of ethical guidelines. The system must integrate Google's Gemini 3 Pro for multimodal ad content generation and personalization, and use Fiddler AI for continuous monitoring and evaluation of ad compliance against ethical policies. Real-time inference capabilities will be supported by RunPod for specialized ad rendering models, and LiveKit will enable voice-interface interactions for a seamless user experience. This challenge emphasizes responsive, context-aware ad delivery combined with robust ethical governance in generative AI applications. Developers will master the intricacies of creating reactive AI interfaces with streaming capabilities, orchestrating multiple generative models, and implementing an automated observability pipeline for ethical AI compliance, moving beyond simple content generation to intelligent and responsible content curation.

Datasets

Shared data for this challenge

Review public datasets and any private uploads tied to your build.

Loading datasets...
Evaluation rubric

How submissions are scored

These dimensions define what the evaluator checks, how much each dimension matters, and which criteria separate a passable run from a strong one.

Max Score: 4
Dimensions
4 scoring checks
Binary
4 pass or fail dimensions
Ordinal
0 scaled dimensions
Dimension 1adrelevancethreshold

AdRelevanceThreshold

Generated ads must have a relevance_score above 0.7 for at least 80% of test cases.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 2ethicalcomplianceaccuracy

EthicalComplianceAccuracy

Fiddler AI integration must correctly identify 90% of intentionally non-compliant ads and have a false positive rate below 5%.

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 3average_ad_generation_latency_ms

Average Ad Generation Latency (ms)

Average time taken to generate a personalized ad, including multimodal content. • target: 300 • range: 0-1000

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Dimension 4compliance_flag_false_positive_rate

Compliance Flag False Positive Rate (%)

Percentage of ethically compliant ads incorrectly flagged by Fiddler AI. • target: 2 • range: 0-10

binary
Weight: 1
Binary check

This dimension contributes its full weight only when the submission satisfies the requirement. Partial credit is not awarded.

Learning goals

What you should walk away with

Master Vercel's AI SDK for developing scalable, streaming conversational AI agents with tool use and multi-provider support.

Implement multimodal ad generation and contextual personalization using Gemini 3 Pro's advanced capabilities, including image and video ad content.

Design and integrate a real-time ethical compliance monitoring system using Fiddler AI to evaluate ad content against predefined policy rules and flag violations.

Orchestrate real-time inference for specialized ad rendering models deployed on RunPod to ensure low-latency dynamic content delivery.

Build seamless voice-enabled ad interaction experiences within conversational interfaces using LiveKit for high-quality audio processing and streaming.

Develop strategies for continuous learning and adaptation of the ad personalization model based on user feedback and engagement metrics.

Integrate robust error handling and fallback mechanisms to maintain system stability and user experience during ad generation and delivery.

Start from your terminal
$npx -y @versalist/cli start ethical-ad-personalization-agent

[ok] Wrote CHALLENGE.md

[ok] Wrote .versalist.json

[ok] Wrote eval/examples.json

Requires VERSALIST_API_KEY. Works with any MCP-aware editor.

Docs
Manage API keys
Challenge at a glance
Host and timing
Vera

AI Research & Mentorship

Starts Available now
Evergreen challenge
Your progress

Participation status

You haven't started this challenge yet

Timeline and host

Operating window

Key dates and the organization behind this challenge.

Start date
Available now
Run mode
Evergreen challenge
Explore

Find another challenge

Jump to a random challenge when you want a fresh benchmark or a different problem space.

Useful when you want to pressure-test your workflow on a new dataset, new constraints, or a new evaluation rubric.

Tool Space Recipe

Draft
Evaluation
Rubric: 4 dimensions
·AdRelevanceThreshold(1%)
·EthicalComplianceAccuracy(1%)
·Average Ad Generation Latency (ms)(1%)
·Compliance Flag False Positive Rate (%)(1%)
Gold items: 2 (2 public)

Frequently Asked Questions about Ethical Ad Personalization Agent