Skip to main content

Getting Started with Babysitter Plugin

Get your first orchestrated workflow running in under 20 minutes.

This guide walks you through installing the Babysitter plugin and running your first successful orchestration. By the end, you will understand how to use the Babysitter skill to manage complex, multi-step workflows.


Table of Contents

  1. Prerequisites
  2. Quick Installation (5 minutes)
  3. First Orchestration (10 minutes)
  4. Understanding the Output
  5. Troubleshooting
  6. Next Steps

Prerequisites

Before installing, verify your system has the required dependencies.

Required Software

SoftwareMinimum VersionCheck Command
Node.jsv18+node --version
npmAny recentnpm --version
GitAny recentgit --version
jqAny versionjq --version

Verify Prerequisites

Run the verification script to check all dependencies at once:

# From your project directory containing the plugin

Expected output when all checks pass:

Babysitter Plugin Installation Verification
============================================

=== Environment Checks ===
✓ Environment: Linux/macOS/Windows detected

=== Required Dependencies ===
✓ Node.js: Version 20.x.x (>=18 required)
✓ npm: Version 10.x.x
✓ git: Version 2.x.x
✓ jq: Version 1.x

=== Plugin Checks ===
✓ Plugin Structure: All required directories and files present
✓ SDK CLI: Version x.x.x

=== Summary ===
All checks passed! The babysitter plugin is ready to use.

Install Missing Dependencies

Node.js and npm:

  • Download from nodejs.org (v18 LTS or later recommended)

jq (required for hooks):

# macOS
brew install jq

# Ubuntu/Debian
sudo apt-get install jq

# Windows (using Chocolatey)
choco install jq

Quick Installation (5 minutes)

Choose one of three installation methods:

In Claude Code, run:

/plugin marketplace add a5c-ai/babysitter-claude
/plugin install babysitter@a5c.ai

After installation:

  1. Restart Claude Code if prompted
  2. Verify with /skills - look for "babysitter" in the list

Option B: Manual Installation from Source (Global)

# Clone the source repo
git clone https://github.com/a5c-ai/babysitter.git ~/src/babysitter
cd ~/src/babysitter

# Generate the harness-specific plugin bundles from plugins/babysitter-unified
npm run generate:plugins

# Install the generated Claude Code bundle
mkdir -p ~/.claude/plugins/babysitter
cp -r artifacts/generated-plugins/claude-code/* ~/.claude/plugins/babysitter/

Option C: Manual Installation from Source (Project-Local)

For project-specific plugin usage:

# Generate the plugin bundle from the repo root first
npm run generate:plugins

# Create plugin directory in your project
mkdir -p .claude/plugins

# Copy the generated Claude Code bundle into the project-local plugins dir
mkdir -p .claude/plugins/babysitter
cp -r artifacts/generated-plugins/claude-code/* .claude/plugins/babysitter/

Verify Installation

In Claude Code, verify the skill is loaded:

/skills

You should see babysitter:babysit and babysitter:call in the list.


First Orchestration (10 minutes)

The Babysitter plugin is used through skill invocation - you ask Claude to use the babysitter skill, and Claude handles all the orchestration internally.

Method 1: Direct Skill Invocation

Use the slash command to invoke the skill:

/babysitter:call implement a Hello World function with TDD targeting 80% quality

Other modes available:

ModeCommandBehavior
Interactive/babysitter:callPauses at breakpoints for approval (default)
YOLO/babysitter:yoloFull auto, no breakpoints — ship while you sleep
Forever/babysitter:foreverNever-ending loop for monitoring/periodic tasks
Plan/babysitter:planCreates process without executing — review first

Method 2: Natural Language Request

Simply ask Claude to use the babysitter skill:

Use the babysitter skill to implement a Hello World function.
Requirements:
- Create a function that returns "Hello, World!"
- Function accepts an optional name parameter
- Returns "Hello, {name}!" when name is provided
- Include unit tests
- Use TDD methodology

What Happens Next

When you invoke the skill, Claude will:

  1. Interview Phase - Claude may ask clarifying questions about your requirements, scope, and preferred methodology

  2. Process Creation - Claude creates a custom process tailored to your request

  3. Run Setup - Claude initializes the orchestration run (you'll see a message about the run being activated)

  4. Iteration Loop - Claude executes the workflow step by step:

    • Executes tasks (code generation, testing, refactoring)
    • Posts results to the run
    • Handles quality gates automatically
  5. Breakpoints - If human approval is needed, Claude will ask you directly (in interactive mode) before continuing

  6. Completion - When done, Claude reports the final results

Example Session

Here's what a typical session looks like:

You: /babysitter:call create a calculator module with add and subtract functions using TDD

Claude: I'll orchestrate this using the babysitter skill. Let me first understand your requirements...

Claude: What programming language would you prefer?
- TypeScript (Recommended)
- JavaScript
- Python

You: TypeScript

Claude: 🔄 Babysitter run activated in this session!
Creating process for TDD calculator implementation...

[Claude creates the process and begins orchestration]

Claude: Phase 1: Writing failing tests for add function...
[Shows test code]

Claude: Phase 2: Implementing add function to pass tests...
[Shows implementation]

Claude: Phase 3: Quality gate - Tests passing: ✓
Current quality score: 75%

[Continues iterating until quality threshold is met]

Claude: ✅ Run completed successfully!
- All tests passing
- Quality score: 85%
- Files created: src/calculator.ts, tests/calculator.test.ts

Choosing a Methodology

You can specify which methodology to use:

/babysitter:call use GSD methodology to quickly prototype a REST API

/babysitter:call use TDD Quality Convergence to implement user authentication

/babysitter:call use Spec-Kit for implementing the requirements in spec.md

See the Process Selection Guide for help choosing the right methodology.


Understanding the Output

Run Directory Structure

Babysitter creates a run directory with all artifacts:

~/.a5c/runs/<runId>/ # default; use <repo>/.a5c/runs/<runId>/ when BABYSITTER_RUNS_SCOPE=repo
├── run.json # Run metadata
├── inputs.json # Your inputs
├── journal.jsonl # Event log (append-only)
├── state.json # Current state cache
├── code/ # Process implementation
│ └── main.js # Process entry point
├── tasks/ # Task execution artifacts
│ └── <effectId>/
│ ├── input.json
│ ├── result.json
│ ├── stdout.log
│ └── stderr.log
└── artifacts/ # Human-readable outputs
└── *.md

Iteration Status Values

StatusMeaningWhat Claude Does
executedTasks were executedContinues iterating
waitingBreakpoint or sleep gateAsks for your approval
completedRun finished successfullyReports final results
failedRun encountered an errorReports error and recovery options

Quality Gates

Babysitter uses quality gates to ensure work meets standards:

  • Quality Score: A 0-100 score based on test coverage, code quality, and requirements coverage
  • Quality Threshold: The minimum score required (default: 80%)
  • Convergence: Babysitter iterates until the threshold is met or max iterations reached

Troubleshooting

Common First-Time Issues

Issue: Skill not found

Symptom: /babysitter:call doesn't work or skill not in /skills list

Solution: Verify plugin installation:

  1. Check plugin directory exists: ls ~/.claude/plugins/babysitter/
  2. If you installed from source, re-run npm run generate:plugins before copying the bundle so the Claude Code files are generated from plugins/babysitter-unified
  3. Restart Claude Code
  4. Run /skills again

Issue: "jq: command not found" during execution

Solution: Install jq (required for hooks):

# macOS
brew install jq

# Ubuntu/Debian
sudo apt-get install jq

# Windows
choco install jq

Issue: Run seems stuck

Cause: Usually waiting for your approval at a breakpoint.

Solution: Check if Claude is asking you a question. In interactive mode, breakpoints require your response to continue.

Issue: Quality threshold never reached

Cause: The implementation may need significant changes to meet the threshold.

Solutions:

  • Lower the quality threshold: "targeting 70% quality"
  • Increase max iterations: "with up to 10 iterations"
  • Simplify requirements

Issue: Permission denied errors

Solution: Make hook scripts executable:

chmod +x plugins/babysitter-unified/hooks/*.sh

Health Check

Run the health check script to diagnose issues:

Health status meanings:

StatusExit CodeMeaning
HEALTHY0All checks passed
DEGRADED2Warnings present but functional
UNHEALTHY1Critical failures detected

Next Steps

Now that you understand how to use Babysitter, explore these resources:

Choose Your Methodology

See the Process Selection Guide for a guide on choosing the right methodology:

MethodologyBest For
GSDQuick prototypes, getting things done fast
TDD Quality ConvergenceQuality-focused development with tests
Spec-KitProjects with formal specifications
Scrum/AgileSprint-based team workflows
BDDBehavior-driven development

Example Invocations

# Quick prototype
/babysitter:call use GSD to create a CLI tool for converting CSV to JSON

# Quality-focused
/babysitter:call implement user authentication with TDD, 85% quality threshold

# Spec-driven
/babysitter:call use Spec-Kit to implement the API defined in openapi.yaml

# Feature development
/babysitter:call add dark mode support to the React app using iterative development

Advanced Topics

Utility Scripts


Need help? Just ask Claude: "Use the babysitter skill to help me with..." and describe what you want to build.