Skill Bundles

Read the source. Install what you trust.

Each skill bundle packages a reusable agent behavior — a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.

Published bundles
108
Total installs
0
Average quality
70/100

Browse bundles

108 published bundles ready to inspect and install

Skill bundlev1.0.0

Production Monitoring For RL Agents

Monitor deployed RL-trained agents for performance drift, reward hacking in the wild, and distribution shift

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Continual Learning Pipeline

Set up recurring RL training loops that retrain as the workflow or data distribution shifts

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

RL Roi Measurement

Quantify the business impact (time saved, error reduction, cost) of RL-trained agents

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Ab Test RL Policy

Design and run A/B tests comparing RL-trained agent vs. baseline in production

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Environment Reset Engineering

Build reliable, fast environment reset mechanisms for episode boundaries

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Environment Fidelity Validation

Verify that the sandbox environment faithfully reproduces production behavior

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Client Data Onboarding

Ingest, clean, and transform client data into RL-ready formats

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Mock Production System

Build a faithful replica of a client's production system (APIs, DB, auth) for safe RL training

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Data Readiness Assessment

Evaluate whether the client has sufficient trajectory data, or whether collection needs to happen first

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Success Metric Extraction

Work with stakeholders to convert vague "it should work better" into measurable, scorable outcomes

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Baseline Agent Benchmarking

Measure current agent performance on the target workflow before RL intervention

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

RL Feasibility Assessment

Determine whether a workflow is actually amenable to RL improvement (clear rewards, sufficient volume, safe to explore)

0 installs
70/100 quality
Compatibility not listed
Inspect bundle