Skill Bundles

Read the source. Install what you trust.

Each skill bundle packages a reusable agent behavior — a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.

Published bundles
108
Total installs
0
Average quality
70/100

Browse bundles

108 published bundles ready to inspect and install

Skill bundlev1.0.0

Browser Env Construction

Build instrumented browser environments with action logging and state capture

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Repo Level Coding Env

Build environments where agents navigate and modify entire repositories, not just single files

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Test Generation As Reward

Use test pass rates as automatic reward signals for code generation

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Code Review Reward Design

Score code changes on correctness, style, security, and performance

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Code Completion RL Env

Build environments for training code completion models (à la Cursor's online RL)

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Experience Replay Management

Maintain and curate experience replay buffers for continual RL training

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Distribution Shift Detection

Detect when the production task distribution has drifted from the training distribution

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Catastrophic Forgetting Mitigation

Prevent RL training from destroying previously learned capabilities

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Online RL From Production

Set up learning loops where production experience feeds back into training

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Capability Regression Testing

Run broad capability evals before and after RL training to catch degradation

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Overfitting Detection For RL

Detect when RL training narrows capability (great on trained tasks, worse on everything else)

0 installs
70/100 quality
Compatibility not listed
Inspect bundle
Skill bundlev1.0.0

Domain Transfer Measurement

Quantify how much RL training on coding transfers to (say) data analysis or writing

0 installs
70/100 quality
Compatibility not listed
Inspect bundle