Claude Engineering Skills (Page 196 of 336)

For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.

llm-evaluation

Master comprehensive evaluation strategies for LLM applications, from automated metrics to human evaluation and A/B testing.

llm-evaluation

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking.

llm-evaluation

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking.

llm-evaluation-harness

Build automated LLM evaluation pipelines with benchmarks, regression tests, RAGAS, and human eval workflows.

llm_evaluation

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking.

llm-game-development

Comprehensive guide to using LLMs throughout the game development lifecycle - from design to implementation to testingUse when "ai game development, llm game dev, claude game, gpt…

llm-gateway-routing

LLM gateway and routing configuration using OpenRouter and LiteLLM. Invoke when: - Setting up multi-model access (OpenRouter, LiteLLM) - Configuring model fallbacks and…

llm-guardrails-designer

Design input and output guardrails for an LLM app — decide what to check (injection patterns, PII, secrets, policy, schema, leakage, toxicity), place them as input vs.

llm-latency-optimizer

Optimize LLM response latency with caching and architecture. TRIGGERS - Use when user needs help with llm-latency-optimizer related tasks.

llm-load-testing

LLM inference load testing for throughput and concurrency limits. Token/s benchmarks, concurrent request sweeps, latency-vs-throughput curves, and breaking-point identification.

llm-response-caching-layer

Implement semantic and exact-match caching for LLM responses to reduce cost 40-60% and latency. Activate on: LLM caching, semantic cache, reduce API costs, cache AI responses.

llm-serving-patterns

LLM inference infrastructure, serving frameworks (vLLM, TGI, TensorRT-LLM), quantization techniques, batching strategies, and streaming response patterns.

llm-streaming-response-handler

Build production LLM streaming UIs with Server-Sent Events, real-time token display, cancellation, error recovery. Handles OpenAI/Anthropic/Claude streaming APIs.

llmem-setup

Install and configure LLMem for an agent harness. Handles CLI install, plugin deployment, skill registration, and provider setup.

Explain llms-txt-php-cli status/info

Use the CLI info command to summarize what llms-txt-php-cli detects/configures for the current repository.

Initialise llms.txt

Guide the user to generate an initial llms.txt for a repository using llms-txt-php-cli, choosing sensible defaults and verifying output.

Validate llms.txt

Validate an existing llms.txt with llms-txt-php-cli and guide the user through fixing validation errors.

lmcache-mp

LMCache multiprocess (MP) mode — standalone LMCache server in its own pod/process that vLLM connects to over ZMQ.

ln-020-codegraph

Builds and queries code knowledge graph for dependency analysis, references, implementations, and architecture overview.

ln-021-codegraph

Builds and queries code knowledge graph for dependency analysis, references, implementations, and architecture overview.

ln-112-project-core-creator

Creates core project docs (requirements, architecture, tech stack, patterns catalog). Use for any project regardless of type.

ln-115-devops-docs-creator

Creates infrastructure.md and runbook.md (Docker-conditional). Use for DevOps documentation in any project.

ln-120-reference-docs-creator

Creates reference docs (ADRs, guides, manuals) for nontrivial tech stack choices. Use when project needs justified architecture decision records.

ln-140-test-docs-creator

Creates test documentation (testing-strategy.md, tests/README.md) with Risk-Based Testing philosophy. Use when setting up test strategy for a project.

ln-404-test-executor

Executes test tasks (label 'tests') through Todo to To Review with risk-based limits. Use for test task execution. Not for implementation tasks.

ln-501-code-quality-checker

Worker that checks DRY/KISS/YAGNI/architecture compliance with quantitative Code Quality Score. Validates architectural decisions via MCP Ref: (1) Optimality - is chosen approach…

ln-510-test-planner

Orchestrates test planning pipeline (research → manual → auto tests). Coordinates ln-511, ln-512, ln-513. Invoked by ln-500-story-quality-gate.

ln-511-code-quality-checker

Checks DRY/KISS/YAGNI/architecture compliance with quantitative Code Quality Score. Use when implementation tasks are Done and need quality scoring.

ln-512-manual-tester

Performs manual testing of Story AC via executable bash scripts saved to tests/manual/. Creates reusable test suites per Story. Worker for ln-510.

ln-512-tech-debt-cleaner

Auto-fixes low-risk tech debt (unused imports, dead code, commented-out code) with >=90% confidence. Use when audit findings need safe automated cleanup.

ln-513-auto-test-planner

Plans automated tests (E2E/Integration/Unit) using Risk-Based Testing after manual testing. Calculates priorities, delegates to ln-301-task-creator. Worker for ln-510.

ln-514-test-log-analyzer

Analyzes application logs: classifies errors, checks log quality, maps stack traces to source. Use when logs need review after test runs or during development.

ln-520-test-planner

Orchestrates test planning pipeline: research, manual testing, automated test planning. Use when Story needs comprehensive test coverage planning.

ln-522-manual-tester

Performs manual testing of Story AC via executable bash scripts in tests/manual/. Use when Story implementation needs hands-on AC verification.

ln-523-auto-test-planner

Plans automated tests (E2E/Integration/Unit) using Risk-Based Testing after manual testing. Use when Story needs a test task with prioritized scenarios. — from engineering/testing

ln-610-code-comments-auditor

Audit code comments and docstrings quality across 6 categories (WHY-not-WHAT, Density, Forbidden Content, Docstrings, Actuality, Legacy).

ln-623-architecture-auditor

Architecture audit worker (L3). Checks DRY (7 types), KISS/YAGNI, layer breaks, error handling, DI patterns. Returns findings with severity, location, effort, recommendations.

ln-630-test-auditor

Use when auditing the test surface through the evaluation platform with mandatory research, coordinated test audit workers, and structured summaries.

ln-631-test-business-logic-auditor

Detects tests validating framework/library behavior instead of project code. Use when auditing test business logic focus.

ln-632-test-e2e-priority-auditor

Validates E2E coverage for critical paths (money, security, data integrity). Risk-based prioritization. Use when auditing E2E test coverage.

ln-633-test-value-auditor

Scores each test by Impact x Probability, returns KEEP/REVIEW/REMOVE decisions. Use when auditing test value and pruning low-value tests.

ln-634-test-coverage-auditor

Identifies missing tests for critical paths (money, security, data integrity, core flows). Use when auditing test coverage gaps.

ln-635-test-isolation-auditor

Checks test isolation (API/DB/FS/Time/Network), determinism, flaky tests, order-dependency, anti-patterns. Use when auditing test isolation.

ln-636-manual-test-auditor

Checks manual test scripts for harness adoption, golden files, fail-fast, config sourcing, idempotency. Use when auditing manual test quality.

ln-637-test-structure-auditor

Checks test file organization, directory layout, test-to-source mapping, domain grouping, co-location. Use when auditing test structure.

ln-638-test-oracle-effectiveness-auditor

Audits assertion strength and test oracles that prove real defects. Use when finding weak tests that execute code but prove little.

ln-642-layer-boundary-auditor

Checks layer boundary violations, transaction boundaries, session ownership, cross-layer consistency. Use when auditing architecture layers.

ln-642-layer-ownership-boundary-auditor

Checks layer, resource ownership, and orchestration boundaries. Use when auditing architecture boundary enforcement.

ln-644-dependency-topology-auditor

Builds dependency topology, detects cycles, validates import rules, and calculates coupling metrics. Use when auditing architecture topology.

ln-645-architecture-modernization-auditor

Finds architecture-level modernization opportunities: obsolete custom mechanisms, overbuilt extension points, and simplifiable architecture.

ln-647-configuration-boundary-auditor

Audits architecture config boundaries: typed settings, scattered env reads, config leakage, and layer ownership. Use for config architecture.

ln-651-query-efficiency-auditor

Checks redundant fetches, N+1 loops, over-fetching, missing bulk operations, wrong caching scope. Use when auditing query efficiency.

ln-720-structure-migrator

Scaffolds new or restructures existing projects to Clean Architecture. Use when setting up project structure.

ln-721-frontend-restructure

Scaffolds new React projects or restructures monoliths to component-based architecture. Use when setting up frontend structure.

ln-724-artifact-cleaner

Removes platform-specific artifacts from Replit, StackBlitz, CodeSandbox, Glitch. Use when preparing exported projects for production.

ln-730-devops-setup

Sets up Docker, CI/CD, and environment configuration with auto-detection. Use when adding DevOps infrastructure to a project.

ln-743-test-infrastructure

Sets up test infrastructure with Vitest, xUnit, and pytest. Use when adding testing frameworks and sample tests to a project.

ln-771-logging-configurator

Configures structured JSON logging with Serilog (.NET) or structlog (Python). Use when adding logging to backend projects.

ln-772-error-handler-setup

Configures global exception handling middleware. Use when adding centralized error handling to .NET or Python backends.

ln-774-healthcheck-setup

Configures health check endpoints for Kubernetes readiness/liveness/startup probes. Use when deploying to Kubernetes.

Categories

Use cases

Popular tags

Learn

Site

Claude Engineering Skills (Page 196 of 336)

Keep exploring Engineering

Categories

Use cases

Popular tags

Learn

Site