Skill Bundles

Read the source. Install what you trust.

Each skill bundle packages a reusable agent behavior: a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.

Browse bundles

109 published bundles ready to inspect and install

Skill bundlev1.0.0

Safe Exploration Constraints

Define and enforce hard constraints on what agents can do during training rollouts

Compatibility not listed

Skill bundlev1.0.0

Reward Hacking Red Teaming

Systematically find ways an agent could game the reward function

Compatibility not listed

Skill bundlev1.0.0

Rollback And Versioning

Maintain and switch between agent versions when new RL training degrades performance

0 installs

70/100 quality

Compatibility not listed

Inspect bundle

Skill bundlev1.0.0

Production Monitoring For RL Agents

Monitor deployed RL-trained agents for performance drift, reward hacking in the wild, and distribution shift

Compatibility not listed

Skill bundlev1.0.0

Continual Learning Pipeline

Set up recurring RL training loops that retrain as the workflow or data distribution shifts

Compatibility not listed

Skill bundlev1.0.0

RL Roi Measurement

Quantify the business impact (time saved, error reduction, cost) of RL-trained agents

Compatibility not listed

Skill bundlev1.0.0

Environment Reset Engineering

Build reliable, fast environment reset mechanisms for episode boundaries

Compatibility not listed

Skill bundlev1.0.0

Environment Fidelity Validation

Verify that the sandbox environment faithfully reproduces production behavior

Compatibility not listed

Skill bundlev1.0.0

Client Data Onboarding

Ingest, clean, and transform client data into RL-ready formats

Compatibility not listed

Skill bundlev1.0.0

Mock Production System

Build a faithful replica of a client's production system (APIs, DB, auth) for safe RL training

Compatibility not listed

Skill bundlev1.0.0

Data Readiness Assessment

Evaluate whether the client has sufficient trajectory data, or whether collection needs to happen first

Compatibility not listed

Skill bundlev1.0.0

Success Metric Extraction

Work with stakeholders to convert vague "it should work better" into measurable, scorable outcomes

Compatibility not listed

Page 4 of 10