testing

Testing and Complex Scene Analysis

Inspect the original prompt language first, then copy or adapt it once you know how it fits your workflow.

Linked challenge: Multimodal 3D Object Verification

Format

Text-first

Lines

Sections

Linked challenge

Multimodal 3D Object Verification

Prompt source

Original prompt text with formatting preserved for inspection.

1 lines

1 sections

No variables

0 checklist items

Run the agent with both 'Object_Compliance_Check' and 'Complex_Scene_Analysis' evaluation tasks. Create varied 'scene_description' inputs with deliberate errors or ambiguities. Analyze Gemini 2.5 Pro's reasoning traces and the agent's output. Refine the agent's prompts and plugin usage to improve its accuracy in detecting subtle errors and its ability to provide clear, actionable recommendations for complex scenarios.

Adaptation plan

Keep the source stable, then change the prompt in a predictable order so the next run is easier to evaluate.

Keep stable

Preserve the rubric, target behavior, and pass-fail criteria as the baseline for evaluation.

Tune next

Adjust fixtures, mocks, and thresholds to the system under test instead of weakening the assertions.

Verify after

Make sure the prompt catches regressions instead of just mirroring the happy-path examples.

Prompt diagnostics

Variables

Lists

Code blocks

Purpose

testing

This prompt is mostly narrative and instruction-driven, so adapt examples and output constraints before you rewrite the structure.

Linked challenge

Multimodal 3D Object Verification

Leveraging the advancements in multimodal AI and 3D vision models like Meta's SAM 3D, this challenge tasks you with building a multimodal agent system using Semantic Kernel. Your agent will act as a '3D Model Quality Assurance' specialist. It will receive a natural language request along with a simulated '3D scene description' (derived from SAM 3D output) and verify if objects within the scene meet specified criteria. The Gemini 2.5 Pro model will be at the core, orchestrating visual analysis tools (simulated APIs for SAM 3D) and performing extended reasoning to identify discrepancies or compliance issues.

Open challenge

Related prompts

Browse library

Semantic Kernel Setup & Simulated SAM 3D Plugin

implementation

Gemini 2.5 Pro Agent & Planner Design

planning

Extended Reasoning Implementation for Compliance Checks

implementation