Skill Overview

The swm SKILL.md is a structured document that teaches AI agents how to provision and manage GPU pods end-to-end. It follows the open Agent Skills standard supported by Cursor, Claude Code, Codex, Copilot, Windsurf, and others.

The fastest way to get started:

```shell
mkdir -p .agents/skills/swm-gpu-workflow
curl -sL https://raw.githubusercontent.com/swm-gpu/swm/main/.agents/skills/swm-gpu-workflow/SKILL.md \
  -o .agents/skills/swm-gpu-workflow/SKILL.md
```

See Supported Platforms for platform-specific install instructions.

The skill guides agents through a repeatable workflow:

| Phase | What happens |
|---|---|
| 0. State check | Check for existing pods with swm pod list |
| 1. Clarify | Ask user about VRAM needs, provider preference, lifecycle policy |
| 2. Pick GPU | Search live pricing with swm gpus, select cheapest match |
| 3. Provision | Create pod with swm pod create -y, wait for SSH |
| 4. Install | Use swm setup install for built-in frameworks, or swm run for custom tools |
| 5. Verify | Health-check HTTP endpoints, validate GPU memory, check disk space |
| 6. Hand off | Report URL, lifecycle policy, and debugging commands to user |
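
The command-driven phases above (0, 2, 3, and 4) can be sketched as a dry-run script. This is a minimal sketch, not the skill's actual implementation: it uses only the commands named in the table and omits any further arguments, because the table does not list them. The `run` and `provision_workflow` helpers and the `DRY_RUN` variable are hypothetical names introduced for illustration. By default the script only prints what it would execute.

```shell
#!/bin/sh
# Dry-run walkthrough of the command-driven workflow phases.
# DRY_RUN, run, and provision_workflow are hypothetical names, not part of swm.
DRY_RUN="${DRY_RUN:-1}"

# run CMD...: print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

# provision_workflow FRAMEWORK: run the phases in order, stopping at the
# first failure. Real invocations will likely need more arguments than the
# bare commands shown here; the source does not list them.
provision_workflow() {
    framework="$1"
    run swm pod list || return 1                      # Phase 0: state check
    run swm gpus || return 1                          # Phase 2: live pricing
    run swm pod create -y || return 1                 # Phase 3: provision
    run swm setup install "$framework" || return 1    # Phase 4: install
}

provision_workflow vllm
```

Stopping at the first failed phase keeps a later step from running against a pod that was never created.
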

Along the way, the agent decides:

  • GPU class based on VRAM requirements
  • Cheapest in-stock provider across all 10 clouds
  • Tensor parallelism configuration (auto-detected from GPU count)
  • Whether to use built-in swm setup or custom swm run chains
  • Lifecycle guard policy and idle timeout

The agent asks the user for:

  • VRAM floor needed
  • Provider preference (or auto-select cheapest)
  • Lifecycle policy (auto-down, auto-stop, remind, manual)
  • Idle timeout duration
  • Whether to persist workspace to S3
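
The "cheapest in-stock provider" choice, constrained by the user's VRAM floor, can be illustrated with a small filter. This is a sketch under stated assumptions: the offer rows are made-up placeholder data, the five-column layout is invented for the example, and the real output format of swm gpus is not shown in this document. `offers` and `pick_cheapest` are hypothetical helper names.

```shell
#!/bin/sh
# Hypothetical offer list (placeholder data, not real pricing).
# Columns: provider, GPU, VRAM in GB, price in USD per hour, in stock.
offers() {
    cat <<'EOF'
provider-a gpu-small 24 0.40 yes
provider-b gpu-small 24 0.35 no
provider-c gpu-large 80 1.90 yes
provider-d gpu-mid 48 0.75 yes
EOF
}

# pick_cheapest MIN_VRAM_GB: read offers on stdin and print the cheapest
# in-stock row whose VRAM meets the floor. Exits non-zero if none match.
pick_cheapest() {
    awk -v min_vram="$1" '
        ($3 + 0) >= (min_vram + 0) && $5 == "yes" {
            if (best == "" || ($4 + 0) < best_price) {
                best_price = $4 + 0
                best = $0
            }
        }
        END {
            if (best == "") exit 1
            print best
        }
    '
}

offers | pick_cheapest 40   # prints: provider-d gpu-mid 48 0.75 yes
```

Note that provider-b is the cheapest 24 GB row but is skipped because it is out of stock, which is the behavior the "in-stock" qualifier implies.
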

The skill covers all 7 built-in frameworks:

| Framework | Command | Use case |
|---|---|---|
| vLLM | swm setup install vllm | LLM inference server (OpenAI-compatible API) |
| Open WebUI | swm setup install open-webui | Chat interface for LLMs |
| Ollama | swm setup install ollama | Local model runner |
| ComfyUI | swm setup install comfyui | Node-based image generation |
| SwarmUI | swm setup install swarmui | Image generation UI |
| Axolotl | swm setup install axolotl | LLM fine-tuning |
| H2O LLM Studio | swm setup install llm-studio | No-code fine-tuning |

For unsupported tools, the skill guides agents through swm run command chains.
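
The split between built-in installers and swm run chains can be sketched as a lookup over the seven framework names listed above. `install_command` is a hypothetical helper, not part of swm; it only prints the command an agent would use and does not invoke swm.

```shell
#!/bin/sh
# install_command NAME: print the built-in install command for NAME, or
# fail with a hint when NAME is not one of the seven built-in frameworks.
install_command() {
    case "$1" in
        vllm|open-webui|ollama|comfyui|swarmui|axolotl|llm-studio)
            printf 'swm setup install %s\n' "$1"
            ;;
        *)
            printf 'no built-in installer for "%s"; use a swm run chain\n' "$1" >&2
            return 1
            ;;
    esac
}

install_command vllm   # prints: swm setup install vllm
```
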

Beyond the core workflow, the skill covers:

  • Lifecycle Guard: swm guard enable to auto-terminate idle pods
  • Cost Tracking: swm costs live for real-time spend, swm costs budget set for alerts
  • Model Management: swm models search / pull / set for HuggingFace models
  • Workspace Sync: swm sync watch for continuous auto-push on file changes

The skill also flags common mistakes to avoid:

  • Installing to container disk instead of /workspace (data loss on stop)
  • Using pip install -e . for projects with [tool.uv.sources] (pip ignores uv-specific sources)
  • Forgetting -y on swm pod create (hangs in agent context)
  • Mixing venvs across frameworks with conflicting torch versions
  • Handing off a pod before all health checks pass
  • Skipping the state check (creating duplicate pods)
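
One way to avoid handing off a pod before all health checks pass is to gate the hand-off on a list of checks, retrying each a few times while services start. This is a minimal sketch, not the skill's own logic. `retry`, `ready_for_handoff`, `check_http`, `check_disk`, `RETRY_TRIES`, and `RETRY_DELAY` are hypothetical names, and the URL, port, and 1 GiB disk threshold are assumptions chosen for illustration.

```shell
#!/bin/sh
RETRY_TRIES="${RETRY_TRIES:-3}"   # attempts per check (assumed default)
RETRY_DELAY="${RETRY_DELAY:-5}"   # seconds between attempts (assumed default)

# retry CMD...: run CMD until it succeeds or RETRY_TRIES attempts are used.
retry() {
    attempt=1
    while ! "$@"; do
        if [ "$attempt" -ge "$RETRY_TRIES" ]; then
            return 1
        fi
        attempt=$((attempt + 1))
        sleep "$RETRY_DELAY"
    done
    return 0
}

# ready_for_handoff CHECK...: succeed only if every named check passes.
ready_for_handoff() {
    for check in "$@"; do
        if ! retry "$check"; then
            printf 'FAILED: %s\n' "$check" >&2
            return 1
        fi
    done
    printf 'all %s checks passed\n' "$#"
}

# Example checks (assumed endpoint and threshold, defined but not run here).
check_http() {
    curl -sf http://localhost:8000/health >/dev/null
}
check_disk() {
    free_kb=$(df -Pk /workspace 2>/dev/null | awk 'NR == 2 { print $4 }')
    [ "${free_kb:-0}" -gt 1048576 ]
}
```

On the pod, an agent might run `ready_for_handoff check_http check_disk` and report the URL to the user only when it exits with status 0.
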