testing

Gradio Interface and Full System Evaluation

Inspect the original prompt language first, then copy or adapt it once you know how it fits your workflow.

Linked challenge: Agent for Complex Policy & Contract Analysis

Format: Text-first

Prompt source

Original prompt text with formatting preserved for inspection.

1 line, 1 section, no variables, 0 checklist items

Create a simple web interface using **Gradio** where a user can upload a policy document (or paste text) and submit analysis goals. Your Claude agent should then process this request, perform the analysis using the RAG system, and display the structured JSON output. Test your full system with the provided sample input for the 'Policy Document Analysis' task. Include the Gradio app code and the final runnable Python script for the agent and interface.
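As a rough illustration of what this prompt asks for, the sketch below wires a file upload, a paste box, and a goals box to a JSON output panel. It is a minimal sketch, not a reference solution: `run_agent` is a hypothetical stand-in for the Claude agent and RAG pipeline, the output keys are invented for illustration, and the upload is assumed to be plain text. Gradio is imported lazily so the callback can be exercised without the UI installed.

```python
from pathlib import Path


def run_agent(document_text: str, goals: list[str]) -> dict:
    """Hypothetical stand-in for the Claude agent + RAG pipeline."""
    return {
        "task": "Policy Document Analysis",
        "goals": goals,
        "document_chars": len(document_text),
        "findings": [],  # a real agent would populate this from retrieval
    }


def analyze(uploaded_file, pasted_text: str, goals_text: str) -> dict:
    """Gradio callback: prefer the uploaded file, fall back to pasted text."""
    if uploaded_file is not None:
        # Depending on the Gradio version, the File component passes either
        # a path string or a temp-file object exposing the path as .name.
        path = getattr(uploaded_file, "name", uploaded_file)
        document_text = Path(path).read_text(encoding="utf-8", errors="replace")
    else:
        document_text = pasted_text or ""

    # One goal per non-empty line.
    goals = [g.strip() for g in (goals_text or "").splitlines() if g.strip()]

    if not document_text.strip():
        return {"error": "Upload a document or paste policy text."}
    if not goals:
        return {"error": "Enter at least one analysis goal."}
    return run_agent(document_text, goals)


def build_interface():
    """Assemble the Gradio UI around analyze()."""
    import gradio as gr  # lazy import keeps analyze() testable without Gradio

    return gr.Interface(
        fn=analyze,
        inputs=[
            gr.File(label="Policy document (plain text)"),
            gr.Textbox(label="Or paste policy text", lines=10),
            gr.Textbox(label="Analysis goals (one per line)", lines=4),
        ],
        outputs=gr.JSON(label="Structured analysis"),
        title="Policy & Contract Analysis",
    )

# To serve the UI locally: build_interface().launch()
```

Returning an `{"error": ...}` dictionary instead of raising keeps the output panel valid JSON for empty submissions; whether that matches the challenge's expected schema is an assumption to check against the sample input.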

Adaptation plan

Keep the source stable, then change the prompt in a predictable order so the next run is easier to evaluate.

Keep stable

Preserve the rubric, target behavior, and pass-fail criteria as the baseline for evaluation.

Tune next

Adjust fixtures, mocks, and thresholds to the system under test instead of weakening the assertions.

Verify after

Make sure the prompt catches regressions instead of just mirroring the happy-path examples.
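The three steps above can be sketched as a small check that keeps the rubric fixed while swapping in a mock for the retrieval layer. Everything named here is hypothetical: `analyze_policy` stands in for the system under test, the required keys are an assumed output schema, and the mocked retriever only imitates a semantic search call.

```python
from unittest.mock import Mock

# Keep stable: the rubric the output must satisfy (assumed schema).
REQUIRED_KEYS = {"task", "goals", "findings"}


def analyze_policy(document_text, goals, retrieve):
    """Hypothetical system under test: gathers evidence per goal."""
    if not document_text.strip():
        raise ValueError("empty document")
    findings = [{"goal": goal, "evidence": retrieve(goal)} for goal in goals]
    return {
        "task": "Policy Document Analysis",
        "goals": list(goals),
        "findings": findings,
    }


def check_output(result, goals):
    """Return a list of rubric violations; an empty list means pass."""
    problems = []
    missing = REQUIRED_KEYS - set(result)
    if missing:
        problems.append(f"missing keys: {sorted(missing)}")
    findings = result.get("findings", [])
    covered = {f["goal"] for f in findings}
    for goal in goals:
        if goal not in covered:
            problems.append(f"no finding for goal: {goal}")
    for f in findings:
        if not f.get("evidence"):
            problems.append(f"finding without evidence: {f['goal']}")
    return problems


# Tune next: replace the real retriever with a mock, not a weaker assertion.
retriever = Mock(
    return_value=["Section 4.2: incidents must be reported within 24 hours."]
)
goals = ["Find reporting deadlines"]
result = analyze_policy("Sample policy text.", goals, retriever)

# Verify after: the same rubric must reject a finding with no evidence.
regressed = {
    "task": "Policy Document Analysis",
    "goals": goals,
    "findings": [{"goal": goals[0], "evidence": []}],
}
violations = check_output(regressed, goals)
```

Here `check_output(result, goals)` returns an empty list for the mocked happy path, while `violations` is non-empty for the regressed output, which is the property the "Verify after" step is looking for.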

Prompt diagnostics

Purpose

testing

This prompt is mostly narrative and instruction-driven, so adapt examples and output constraints before you rewrite the structure.

Linked challenge

Agent for Complex Policy & Contract Analysis

Develop a Claude Agent using the Claude Agents SDK capable of dissecting and analyzing complex legal or business policy documents, drawing inspiration from the startup Ivo's approach to breaking down legal reviews. The agent will focus on reducing 'hallucinations' by performing granular task decomposition and leveraging contextual retrieval. It will use Claude Opus 4.1 for sophisticated reasoning, Weaviate for efficient semantic search over a corpus of policy documents (RAG approach), and Prefect for orchestrating document ingestion workflows. A simple Gradio interface will allow users to submit documents for analysis.

Related prompts

RAG System Setup with Weaviate and Prefect

implementation

Claude Agent with Retrieval Tool Implementation

implementation

Task Decomposition for Analysis and Hallucination Reduction

implementation