Claude Code Skills · Claude Skills · The open SKILL.md registry for Claude

Testing

2,448 Claude Code skills in the Testing sub-category of Engineering.

2,448 skills · updated 2026-06-12 · showing 1–60 of 2,448 by quality score


Evaluate design from a UX perspective, assessing visual hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring,…
Multi-model deliberation via mumo's MCP server. Best for contested architecture/product decisions, design reviews, pressure-testing a pre-launch spec, resolving tradeoffs with…
Audit a repo's test, lint, type-check, static analysis, build, and debug infrastructure for AI coding agents.
Quick workflow for trivial changes (single-file fix, rename, typo). Skip the full Explore-Plan-TDD-Audit cycle.
Drive an approved plan from context/changes//plan.md phase by phase, test-first, through red→green→refactor — only for TDD'able phases not yet implemented; everything…
Statistical experiment design and analysis capabilities for product experimentation
Use after implementing features - verifies each acceptance criterion with structured testing and posts verification reports to the GitHub issue — from engineering/testing
Use when testing GitHub Actions workflows locally with act. Covers act CLI usage, Docker configuration, debugging workflows, and troubleshooting common issues when running…
Browser automation for web testing, form filling, screenshots, and data extraction. Use when navigating websites, interacting with web pages, filling forms, taking screen — from…
Use when creating custom Ameba rules for Crystal code analysis including rule development, AST traversal, issue reporting, and rule testing.
Use when facilitating BDD collaboration between developers, testers, and business stakeholders. Use when running discovery workshops and example mapping sessions.
Use when applying Behavior-Driven Development patterns including Given-When-Then structure, feature files, and acceptance criteria.
Use when writing effective BDD scenarios including acceptance criteria, edge cases, and scenario organization. Use when defining behavior specifications.
Detect positive selection using dN/dS (omega) tests with PAML codeml and HyPhy. Identify sites and branches under adaptive evolution through codon models and branch-site tests.
Genome-wide association studies (GWAS) with PLINK. Perform case-control and quantitative trait association testing using logistic/linear regression with covariates, gener — from…
End-to-end 16S amplicon workflow from FASTQ reads to differential abundance. Orchestrates DADA2 ASV inference, taxonomy assignment, diversity analysis, and compositional — from…
Biocompatibility test selection and protocol recommendation skill based on device categorization
Use when writing tests with Bun's built-in test runner. Covers test organization, assertions, mocking, and snapshot testing using Bun's fast test infrastructure.
Daily, weekly, and monthly cash forecasting skill with scenario analysis and liquidity stress testing
Catalyst performance analysis skill for activity testing, deactivation modeling, and optimization
Generate unit tests, integration tests, and test fixtures for code. Supports Jest, Mocha, pytest. Use when writing tests or improving test coverage.
Jest unit testing standards covering configuration, test structure, testing patterns, and coverage requirements
Expert Cypress testing framework integration for browser-based end-to-end testing
Execute end-to-end tests for Nikita using Telegram MCP, Gmail MCP, Supabase MCP, Chrome DevTools MCP, and gcloud CLI.
Use when testing Effect code including Effect.gen in tests, test layers, mocking services, and testing error scenarios. Use for writing tests for Effect applications.
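QA acceptance and test reporting. Pure acceptance mode: test and report, never fix code. Two invocation modes: Mode A (full QA): test-spec generation → 10-dimension Playwright assertion engine → intelligent analysis. Mode B (single-bug fix regression): invoked together with forge-bugfix's P6, reads docs/bugfix/reviews/BF-XX.md, targeting Bug…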
Lightweight web fuzzing via ffuf — directory discovery, parameter testing, subdomain enumeration.
Foreign exchange exposure analysis and hedging strategy skill with hedge effectiveness testing
Run gdUnit4 tests for Godot projects. Use after implementing features, fixing bugs, or modifying GDScript files. USE PROACTIVELY to verify code changes. — from engineering/testing
Sets up the Meta XR Simulator for testing Meta Quest and Horizon OS apps without a physical device. Use when configuring device-free testing for Unity or Unreal projects.
Biological evaluation planning skill implementing ISO 10993-1 for biocompatibility testing strategy
Generate comprehensive tests for architectural layers with coverage-first analysis. Use when testing specific layers (core, domain, application, infrastructure, boundary).
Use when code changes touch database, cache, queue, or other service-dependent components - enforces testing against real local services instead of mocks
Evaluates ML models for performance, fairness, and reliability. Use for metric selection, cross-validation strategies, overfitting/underfitting diagnosis, hyperparameter tuning,…
Test implementation across unit, integration, e2e, security, infrastructure, data pipeline, and ML domains.
Configures mewt or muton mutation testing campaigns — scopes targets, tunes timeouts, and optimizes long-running runs.
Performance test design skill for test planning, data collection, and acceptance criteria verification
Discrete event simulation skill for process modeling, scenario testing, and optimization
Prompt engineering specialist for system prompt optimization. Designs effective prompts, A/B testing, prompt injection detection, AI response quality.
Develop, validate, and adapt measurement instruments including factor analysis, reliability testing, and cross-cultural validation
ML-specific testing skill using pytest with fixtures for data, models, and predictions.
Report-only QA testing. Systematically tests a web application and produces a structured report with health score, screenshots, and repro steps — but never fixes anything.
Enforces TDD (Red-Green-Refactor) for Rust development. Auto-triggers on implementation, testing, refactoring, and bug fixing tasks.
Selenium WebDriver expertise for cross-browser automation and legacy system testing
SOX Section 404 control testing skill with workpaper generation and deficiency classification
Strategy robustness testing, scenario-based evaluation, vulnerability identification, and adaptation planning
TDD workflow for RTK filter development. Red-Green-Refactor with Rust idioms. Real fixtures, token savings assertions, snapshot tests with insta.
Use when you want to audit test suites for potential issues (declares candidate signals: flaky, orphan, trivial assertions).
Skill for correlating test results with analytical predictions and model validation
Use when a source file needs a corresponding test pair file created (generates skeleton test structure).
Creates tests with Pest PHP, including Feature, Unit, HTTP, and Datasets. Use when you need to write tests, create test suites, or implement TDD in Laravel projects with Pest.
End-to-end testing patterns with Playwright — page objects, AI agent testing, visual regression, accessibility testing with axe-core, and CI integration.
Integration and contract testing patterns — API endpoint tests, component integration, database testing, Pact contract verification, property-based testing, and Zod schema…
LLM and AI testing patterns — mock responses, evaluation with DeepEval/RAGAS, structured output validation, and agentic test patterns (generator, healer, planner).
Performance and load testing patterns — k6 load tests, Locust stress tests, pytest execution optimization (xdist parallel, plugins), test type classification, and performance…
Unit testing patterns for isolated business logic tests — AAA pattern, parametrized tests (test.each, @pytest.mark.parametrize), fixture scoping (function/module/session), mocking…
Comprehensive skill for VA BDD (Vanessa Automation) testing of 1C:Enterprise configurations. Covers the complete workflow: configuration analysis, VA documentation lookup, test…
Stateful, phased test-rollout orchestrator for existing products. Writes context/foundation/test-plan.md, then drives each rollout phase through /10x-new → /10x-research →…
Design rigorous A/B/n experiments — hypothesis, power analysis, MDE, randomisation unit, guardrails, decision criteria — and route to stats-reviewer for peer-review.