Read the source. Install what you trust.
Each skill bundle packages a reusable agent behavior — a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.
Browse bundles
108 published bundles ready to inspect and install
Workflow Audit
Map an enterprise workflow end-to-end: inputs, decisions, tools, outputs, success criteria
Model Versioning For RL
Track and switch between reference model, current policy, and reward model versions during training
vLLM For RL
Configure vLLM or similar engines for RL workloads (batched generation, multiple completions)
High Throughput Rollout Serving
Serve models at high throughput for RL rollout collection (not just user-facing latency)
Compute Budgeting For RL
Estimate and optimize GPU hours needed for RL training runs
Checkpoint Selection
Choose the best model checkpoint based on eval performance, not just training metrics
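A minimal sketch of the idea behind checkpoint selection, using hypothetical held-out eval scores (all names and values here are illustrative): pick the checkpoint that scores best on evaluation, not the latest one or the one with the lowest training loss.

```python
# Hypothetical held-out eval scores per checkpoint (illustrative values).
# Training metrics can keep improving while held-out performance degrades,
# so selection should key on eval score, not step count or training loss.
eval_scores = {
    "step_1000": 0.61,
    "step_2000": 0.74,
    "step_3000": 0.69,  # later checkpoint, but worse on held-out evals
}

# Select the checkpoint with the highest eval score.
best_checkpoint = max(eval_scores, key=eval_scores.get)
```

Here the selection returns `step_2000`, even though `step_3000` is the most recent checkpoint.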
Training Stability Debugging
Diagnose and fix common RL training failures: reward collapse, mode collapse, KL explosion
KL Divergence Management
Control how far the policy drifts from the reference model during training
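A minimal sketch of one common way this is done in PPO-style RLHF, assuming per-token log-probabilities from the current policy and a frozen reference model (the arrays and the `beta` coefficient here are illustrative, not values from any bundle): estimate the per-token KL as the log-prob difference and subtract a scaled penalty from the task reward.

```python
import numpy as np

# Hypothetical per-token log-probs for one sampled completion, under the
# current policy and under the frozen reference model (illustrative values).
policy_logprobs = np.array([-0.9, -1.2, -0.4])
ref_logprobs = np.array([-1.0, -1.3, -0.5])

# Common per-token KL estimate: log pi(a|s) - log pi_ref(a|s).
per_token_kl = policy_logprobs - ref_logprobs

# Subtract a scaled KL penalty from the task reward so the policy is
# discouraged from drifting far from the reference model; beta is tunable.
beta = 0.1
task_reward = 1.0
shaped_reward = task_reward - beta * per_token_kl.sum()
```

Raising `beta` keeps the policy closer to the reference model at the cost of slower reward improvement; lowering it allows more drift.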
Reward Model Training
Train reward models from human preference data, handle label noise and distribution shift
RL Hyperparameter Tuning
Tune learning rates, KL penalties, reward scaling, batch sizes for RL stability
Distributed RL Training
Shard training across multiple GPUs/nodes with proper gradient synchronization
Manage RL Rollouts
Orchestrate parallel agent rollouts across environments at scale