Train, Evaluate, and Interpret with DeepEval

testing · Challenge · December 5, 2025

Prompt Content

Train your RL agent(s) within the simulated environment. Once trained, use DeepEval to rigorously evaluate the agent's performance. Focus on metrics like net revenue, EV charging satisfaction rate, and market compliance. Utilize DeepEval's interpretability features to understand why the agent makes certain bidding decisions, especially in complex scenarios. Run the `SimulateBiddingAgent` evaluation task for a 7-day period to assess the agent's long-term effectiveness.
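
The three metrics named above can be made concrete with a small sketch. This is only an illustration under stated assumptions: the `StepRecord` fields, the hourly step size, and the `summarize` helper are hypothetical, and it does not use or represent DeepEval's API or the `SimulateBiddingAgent` task. It only shows one plausible way to aggregate a 7-day simulation log into net revenue, EV charging satisfaction rate, and market compliance rate.

```python
from dataclasses import dataclass
from typing import Dict, List

# Assumption: one record per simulated hour over the 7-day evaluation window.
HOURS_PER_EPISODE = 7 * 24


@dataclass
class StepRecord:
    """One simulated hour. Every field here is a hypothetical placeholder."""
    market_revenue: float    # income from cleared bids
    energy_cost: float       # cost of energy purchased in the same hour
    ev_requested_kwh: float  # energy EVs asked for
    ev_delivered_kwh: float  # energy EVs actually received
    bid_compliant: bool      # True if the bid respected the market rules


def summarize(records: List[StepRecord]) -> Dict[str, float]:
    """Aggregate a simulation log into the three headline metrics."""
    net_revenue = sum(r.market_revenue - r.energy_cost for r in records)

    requested = sum(r.ev_requested_kwh for r in records)
    # Cap delivery at the request so over-delivery cannot inflate the rate.
    delivered = sum(min(r.ev_delivered_kwh, r.ev_requested_kwh) for r in records)
    satisfaction = delivered / requested if requested > 0 else 1.0

    compliant_steps = sum(1 for r in records if r.bid_compliant)
    compliance = compliant_steps / len(records) if records else 1.0

    return {
        "net_revenue": net_revenue,
        "ev_satisfaction_rate": satisfaction,
        "market_compliance_rate": compliance,
    }


if __name__ == "__main__":
    # Synthetic log: identical hours, with a single non-compliant bid.
    log = [
        StepRecord(10.0, 4.0, 5.0, 4.0, bid_compliant=(hour != 0))
        for hour in range(HOURS_PER_EPISODE)
    ]
    for name, value in summarize(log).items():
        print(f"{name}: {value:.4f}")
```

Capping delivered energy at the requested amount keeps the satisfaction rate within 0 to 1; whether that matches the intended definition depends on the simulator, so treat it as a design choice to adjust.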

Usage Tips

Copy the prompt and paste it into your preferred AI tool (Claude, ChatGPT, Gemini)

Customize placeholder values with your specific requirements and context

For best results, provide clear examples and test different variations