Skill Bundles

Read the source. Install what you trust.

Each skill bundle packages a reusable agent behavior — a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.

Published bundles
109
Total installs
2
Average quality
70/100

Browse bundles

109 published bundles ready to inspect and install

Skill bundlev1.0.0

Reward Hacking Red Teaming

Systematically find ways an agent could game the reward function

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Rollback And Versioning

Maintain and switch between agent versions when new RL training degrades performance

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Production Monitoring For RL Agents

Monitor deployed RL-trained agents for performance drift, reward hacking in the wild, and distribution shift

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Continual Learning Pipeline

Set up recurring RL training loops that retrain as the workflow or data distribution shifts

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

RL Roi Measurement

Quantify the business impact (time saved, error reduction, cost) of RL-trained agents

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Environment Reset Engineering

Build reliable, fast environment reset mechanisms for episode boundaries

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Environment Fidelity Validation

Verify that the sandbox environment faithfully reproduces production behavior

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Client Data Onboarding

Ingest, clean, and transform client data into RL-ready formats

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Mock Production System

Build a faithful replica of a client's production system (APIs, DB, auth) for safe RL training

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Data Readiness Assessment

Evaluate whether the client has sufficient trajectory data, or whether collection needs to happen first

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Success Metric Extraction

Work with stakeholders to convert vague "it should work better" into measurable, scorable outcomes

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Baseline Agent Benchmarking

Measure current agent performance on the target workflow before RL intervention

0 installs
70/100 quality
Compatibility not listed
Inspect bundle

Page 4 of 10