Financial Proxy Analyst AI
This challenge focuses on developing an advanced AI system capable of ingesting and analyzing complex corporate proxy statements. The goal is to extract key financial and governance data, synthesize insights, and generate well-justified voting recommendations or executive summaries. This system must demonstrate sophisticated document understanding and output generation, paired with a robust evaluation framework. Developers will focus on precisely extracting structured information from unstructured legal text, using a powerful LLM to reason and synthesize, and employing an MLOps platform to ensure the quality and reliability of the AI's financial analysis outputs.
What you are building
The core problem, expected build, and operating context for this challenge.
This challenge focuses on developing an advanced AI system capable of ingesting and analyzing complex corporate proxy statements. The goal is to extract key financial and governance data, synthesize insights, and generate well-justified voting recommendations or executive summaries. This system must demonstrate sophisticated document understanding and output generation, paired with a robust evaluation framework. Developers will focus on precisely extracting structured information from unstructured legal text, using a powerful LLM to reason and synthesize, and employing an MLOps platform to ensure the quality and reliability of the AI's financial analysis outputs.
Shared data for this challenge
Review public datasets and any private uploads tied to your build.
What you should walk away with
Master parsing and pre-processing of complex PDF documents (e.g., corporate proxy statements) using LlamaIndex's document loaders and advanced parsing tools to prepare content for LLM ingestion.
Implement advanced prompt engineering techniques with Claude Opus 4.5, leveraging its long-context window for structured data extraction, multi-hop reasoning, and sophisticated synthesis from financial and governance text.
Design a system to generate clear, concise executive summaries and justified voting recommendations based on extracted financial data, governance proposals, and predefined policy criteria.
Integrate with enterprise data systems using Paragon to securely retrieve proxy statements from a document repository and publish generated analyses or recommendations to a corporate dashboard.
Build and utilize an MLflow evaluation pipeline to objectively assess the accuracy, completeness, and justification quality of the AI-generated outputs against a curated set of ground truth or expert-annotated examples.
Explore techniques for validating generated outputs against financial benchmarks, regulatory guidelines, and company-specific voting policies to ensure compliance and strategic alignment.
[ok] Wrote CHALLENGE.md
[ok] Wrote .versalist.json
[ok] Wrote eval/examples.json
Requires VERSALIST_API_KEY. Works with any MCP-aware editor.
DocsAI Research & Mentorship
Participation status
You haven't started this challenge yet
Operating window
Key dates and the organization behind this challenge.
Find another challenge
Jump to a random challenge when you want a fresh benchmark or a different problem space.