Testing

auto-build

Implement a whole DAG plan autonomously — one approved pass, one clean rollback point per task, never building on a broken base.

critique

Evaluate design from a UX perspective, assessing visual hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring,…

mumo

Multi-model deliberation via mumo's MCP server. Best for contested architecture/product decisions, design reviews, pressure-testing a pre-launch spec, resolving tradeoffs with…

mutate

Run mutation testing to verify test effectiveness by analyzing killed, survived, and uncovered mutants.

mk:party

Multi-agent collaboration session. Brings 2-4 agent perspectives into one discussion for architecture decisions and trade-off analysis.

test-harness-auditor

Audit a repo's test, lint, type-check, static analysis, build, and debug infrastructure for AI coding agents.

unikit-memory

Add or update rules in .unikit/memory/: core rules (code style, design principles, testing, performance) and stack rules (framework-specific patterns — Zenject, DOTween,…

work-quick

Quick workflow for trivial changes (single-file fix, rename, typo). Skip the full Explore-Plan-TDD-Audit cycle.

10x-tdd

Drive an approved plan from context/changes//plan.md phase by phase, test-first, through red→green→refactor — only for TDD'able phases not yet implemented; everything…

A/B Test Design

Statistical experiment design and analysis capabilities for product experimentation

acceptance-criteria-verification

Use after implementing features - verifies each acceptance criterion with structured testing and posts verification reports to the GitHub issue — from engineering/testing

act-local-testing

Use when testing GitHub Actions workflows locally with act. Covers act CLI usage, Docker configuration, debugging workflows, and troubleshooting common issues when running…

agent-browser

Browser automation for web testing, form filling, screenshots, and data extraction. Use when navigating websites, interacting with web pages, filling forms, taking screen — from…

ameba-custom-rules

Use when creating custom Ameba rules for Crystal code analysis including rule development, AST traversal, issue reporting, and rule testing.

bdd-collaboration

Use when facilitating BDD collaboration between developers, testers, and business stakeholders. Use when running discovery workshops and example mapping sessions.

bdd-patterns

Use when applying Behavior-Driven Development patterns including Given-When-Then structure, feature files, and acceptance criteria.

bdd-scenarios

Use when writing effective BDD scenarios including acceptance criteria, edge cases, and scenario organization. Use when defining behavior specifications.

bio-comparative-genomics-positive-selection

Detect positive selection using dN/dS (omega) tests with PAML codeml and HyPhy. Identify sites and branches under adaptive evolution through codon models and branch-site tests.

bio-population-genetics-association-testing

Genome-wide association studies (GWAS) with PLINK. Perform case-control and quantitative trait association testing using logistic/linear regression with covariates, gener — from…

bio-workflows-microbiome-pipeline

End-to-end 16S amplicon workflow from FASTQ reads to differential abundance. Orchestrates DADA2 ASV inference, taxonomy assignment, diversity analysis, and compositional — from…

biocompatibility-test-selector

Biocompatibility test selection and protocol recommendation skill based on device categorization

browser

Opens browser at URL for E2E testing via playwright-cli. Provides CLI commands for navigation, screenshots, and interaction.

jutsu-bun:bun-testing

Use when writing tests with Bun's built-in test runner. Covers test organization, assertions, mocking, and snapshot testing using Bun's fast test infrastructure.

cash-flow-forecaster

Daily, weekly, and monthly cash forecasting skill with scenario analysis and liquidity stress testing

catalyst-analyzer

Catalyst performance analysis skill for activity testing, deactivation modeling, and optimization

corder-test-generation

Generate unit tests, integration tests, and test fixtures for code. Supports Jest, Mocha, pytest. Use when writing tests or improving test coverage.

cui-javascript-unit-testing

Jest unit testing standards covering configuration, test structure, testing patterns, and coverage requirements

Cypress E2E Testing

Expert Cypress testing framework integration for browser-based end-to-end testing

symfony:e2e-panther-playwright

Write end-to-end tests with Symfony Panther 2.4 for browser automation or Playwright for complex scenarios

e2e-review

Reviews spec implementation with E2E visual browser validation via Playwright. Triggers on keywords: e2e review, visual review, spec review, browser validation

e2e-template

Template generator for E2E test definitions. Creates structured test files for use with test-e2e skill. Triggers on keywords: e2e template, test template, create test definition

e2e-test-automation

Execute end-to-end tests for Nikita using Telegram MCP, Gmail MCP, Supabase MCP, Chrome DevTools MCP, and gcloud CLI.

e2e-write

Generate e2e tests from spec or code analysis with accessibility-first locators and POM conventions.

effect-testing

Use when testing Effect code including Effect.gen in tests, test layers, mocking services, and testing error scenarios. Use for writing tests for Effect applications.

forge-qa

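QA acceptance and test reporting. Pure acceptance mode: tests and reports, never fixes code. Two invocation modes: Mode A (full QA): test-spec generation → 10-dimension Playwright assertion engine → intelligent analysis. Mode B (single-bug fix regression): invoked together with forge-bugfix's P6 step, reads docs/bugfix/reviews/BF-XX.md, and targets the bug…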

fuzz

Lightweight web fuzzing via ffuf — directory discovery, parameter testing, subdomain enumeration.

fx-hedging-strategy-modeler

Foreign exchange exposure analysis and hedging strategy skill with hedge effectiveness testing

gdunit4-test-runner

Run gdUnit4 tests for Godot projects. Use after implementing features, fixing bugs, or modifying GDScript files. USE PROACTIVELY to verify code changes. — from engineering/testing

hz-xr-simulator-setup

Sets up the Meta XR Simulator for testing Meta Quest and Horizon OS apps without a physical device. Use when configuring device-free testing for Unity or Unreal projects.

iso10993-evaluator

Biological evaluation planning skill implementing ISO 10993-1 for biocompatibility testing strategy

layer-testing

Generate comprehensive tests for architectural layers with coverage-first analysis. Use when testing specific layers (core, domain, application, infrastructure, boundary).

local-service-testing

Use when code changes touch database, cache, queue, or other service-dependent components - enforces testing against real local services instead of mocks

loom-model-evaluation

Evaluates ML models for performance, fairness, and reliability. Use for metric selection, cross-validation strategies, overfitting/underfitting diagnosis, hyperparameter tuning,…

loom-testing

Test implementation across unit, integration, e2e, security, infrastructure, data pipeline, and ML domains.

mutation-testing

Configures mewt or muton mutation testing campaigns — scopes targets, tunes timeouts, and optimizes long-running runs.

notion-fixtures

Provides minimal mock factories for testing Notion API responses (BlockObjectResponse / PageObjectResponse / DataSourceObjectResponse). Complements `.claude/rules/testing.md` and reduces the boilerplate code sent when adding tests.

mk:nyquist

Test-to-requirement coverage mapping. Reads plan acceptance criteria and test files, produces a coverage gap report showing which requirements have no tests.

pair-session

AI pair programming with Claude (builder) and a second model (advisor). The human observes and can intervene.

performance-test-designer

Performance test design skill for test planning, data collection, and acceptance criteria verification

plugin-development

Complete guide to building Claude Code plugins — manifest schema, command/skill/agent/hook authoring, MCP server development, marketplace publishing, and testing

process-simulation-modeler

Discrete event simulation skill for process modeling, scenario testing, and optimization

prompt-eng

Prompt engineering specialist for system prompt optimization. Designs effective prompts, A/B testing, prompt injection detection, AI response quality.

psychometric-assessment

Develop, validate, and adapt measurement instruments including factor analysis, reliability testing, and cross-cultural validation

pytest-ml-tester

ML-specific testing skill using pytest with fixtures for data, models, and predictions.

qa-only

Report-only QA testing. Systematically tests a web application and produces a structured report with health score, screenshots, and repro steps — but never fixes anything — from…

rtk-tdd

Enforces TDD (Red-Green-Refactor) for Rust development. Auto-triggers on implementation, testing, refactoring, and bug fixing tasks.

Selenium WebDriver

Selenium WebDriver expertise for cross-browser automation and legacy system testing

sox-control-tester

SOX Section 404 control testing skill with workpaper generation and deficiency classification

strategy-stress-testing-skill

Strategy robustness testing, scenario-based evaluation, vulnerability identification, and adaptation planning
