Home › By role › Claude Code skills for data engineers

Claude Code skills for data engineers

ETL/ELT pipelines, warehouse modeling, dbt transforms, streaming ingestion, and SQL across dialects. Skills for moving and shaping data at scale.

Related searches: claude code skills for data engineers, AI data engineering skills, claude ETL pipeline skills, claude code data warehouse skills.

data-pipeline-engineer

Expert data engineer for ETL/ELT pipelines, streaming, data warehousing. Activate on: data pipeline, ETL, ELT, data warehouse, Spark, Kafka, Airflow, dbt, data modeling, star schema, streaming data, b

general

data-pipelines

Use this skill when building data pipelines, ETL/ELT workflows, or data transformation layers. Triggers on Airflow DAG design, dbt model creation, Spark job optimization, streaming vs batch architectu

engineering

senior-data-engineer

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, a — from ricardonevesbraga/flow

engineering

senior-data-engineer

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, a — from bg-szy/TOP-SKILLS

engineering

stream

ETL/ELT pipeline design, data flow visualization, batch/streaming selection, and Kafka/Airflow/dbt design. Use when building data pipelines or managing data quality.

general

agency-data-engineer

Expert data engineer specializing in building reliable data pipelines, lakehouse architectures, and scalable data infrastructure. Masters ETL/ELT, Apache Spark, dbt, streaming systems, and cloud data

engineering

data-eng-sme

데이터 엔지니어링·클라우드 글의 기술 깊이를 만든다. GCP(BigQuery/Dataflow/Pub-Sub/Composer)와 OSS 데이터 스택(Spark/Airflow/Kafka/Flink/dbt/Iceberg)의 아키텍처·실행 가능한 코드·트레이드오프·실제 스키마를 02_sme_notes.md에 작성한다. AWS/Snowflake/Databricks는

general

data-engineer

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms.

engineering

data-pipeline-review

Review or design a data pipeline architecture. Assesses ingestion pattern, transformation design, orchestration, idempotency, freshness SLAs, data contracts at boundaries, dbt test coverage, lineage,

engineering

data-warehouse-integration

Syncing Rails Postgres to a data warehouse (Snowflake, BigQuery, Redshift) — Fivetran / Airbyte / Hightouch / Stitch / Census / CDC via Debezium, when ELT beats ETL, dbt for transformation, reverse-ET

engineering

typescript-data-engineering

Use when building data pipelines, ETL jobs, event processors, message-broker producers/consumers, application caching layers, database migrations, BigQuery queries, or event-sourcing handlers in TypeS

engineering

ingesting-into-data-lake

Import data into the AWS data lake from S3 files, local uploads, JDBC databases (Oracle, SQL Server, PostgreSQL, MySQL, RDS, Aurora), Amazon Redshift, Snowflake, BigQuery, DynamoDB, or existing Glue c

engineering

sales-maestroqa

MaestroQA platform help — conversation data QA platform with customizable scorecards, AI-powered coaching workflows, conversation analytics (AskAI), reverse-ETL to CRM/Slack/data warehouses, CSAT inge

sales

databricks-pipelines

Develop Lakeflow Spark Declarative Pipelines (formerly Delta Live Tables) on Databricks. Use when building batch or streaming data pipelines with Python or SQL. Invoke BEFORE starting implementation.

engineering

dev-data-pipeline-builder

Conception de pipelines de données robustes et scalables. Se déclenche avec "data pipeline", "pipeline de données", "batch processing", "stream processing", "Apache Spark", "Airflow", "dbt", "data eng

general

docker-data-environments

Docker for data engineering environments — multi-stage Dockerfiles for dbt/Spark/Airflow images, BuildKit layer caching (--mount=type=cache), private registries (ghcr.io/Harbor), docker buildx multi-p

engineering

e2e-medallion-architecture

Implement end-to-end Medallion Architecture (Bronze/Silver/Gold) lakehouse patterns in Microsoft Fabric using PySpark, Delta Lake, and Fabric Pipelines. Use when the user wants to: (1) design a Bronze

engineering

fabric-data-engineering

Review Microsoft Fabric data engineering artifacts — Lakehouse and OneLake design, medallion (bronze/silver/gold) architecture, Spark notebooks and Spark job definitions, Data pipelines and Dataflows

engineering

lakehouse-architect

Delta Lake, Apache Iceberg, Hudi for ACID transactions on object storage. Activate on: lakehouse, Delta Lake, Iceberg, Hudi, table format, ACID on S3, time travel, data lake, open table format. NOT fo

general

spark-declarative-pipelines

Creates, configures, and updates Databricks Lakeflow Spark Declarative Pipelines (SDP/LDP) using serverless compute. Handles streaming tables, materialized views, CDC, SCD Type 2, and Auto Loader inge

engineering

streaming-into-data-lake

Stream rows continuously into Apache Iceberg tables on S3 Tables (or standard Iceberg on a general purpose bucket) using Amazon Data Firehose with IcebergDestinationConfiguration. Covers the Firehose

general

airflow

Python DAG workflow orchestration using Apache Airflow for data pipelines, ETL processes, and scheduled task automation

engineering

clari-core-workflow-a

Build a Clari forecast export pipeline to your data warehouse. Use when exporting forecast calls, quota data, and CRM totals from Clari to Snowflake, BigQuery, or a local database. Trigger with phrase

general

clickhouse-webhooks-events

Ingest data into ClickHouse from webhooks, Kafka, and streaming sources with batching, dedup, and exactly-once patterns. Use when building data ingestion pipelines, consuming webhook payloads, or inte

general

dagster-orchestration

ALWAYS USE when working with Dagster assets, resources, IO managers, schedules, sensors, or dbt integration. CRITICAL for: @asset decorators, @dbt_assets, DbtCliResource, ConfigurableResource, IO mana

engineering

data-analytics

Create data pipeline and analytics architecture diagrams using PlantUML syntax with database/analytics stencil icons. Best for ETL pipelines, data lakes, real-time streaming, data warehousing, and BI

engineering

data-pipeline-spec

Design an ETL/ELT data pipeline specification. Use when asked to design a data pipeline, spec an ETL or ELT process, document a data ingestion workflow, or plan a data integration. Produces a complete

general

data-warehouse-experimentation

Running experiments out of the data warehouse instead of via dedicated experiment platforms. SQL-based assignment, exposure logging discipline, metric definitions in dbt models, statistical analysis i

science

data-warehouse-optimizer

Snowflake, BigQuery, clustering, partitioning, and materialized views for warehouse performance. Activate on: Snowflake, BigQuery, Redshift, query optimization, clustering, partitioning, materialized

general

databricks-core-workflow-a

Execute Databricks primary workflow: Delta Lake ETL pipelines. Use when building data ingestion pipelines, implementing medallion architecture, or creating Delta Lake transformations. Trigger with phr

engineering

Claude Code skills for data engineers

data-pipeline-engineer

data-pipelines

senior-data-engineer

senior-data-engineer

stream

agency-data-engineer

data-eng-sme

data-engineer

data-pipeline-review

data-warehouse-integration

typescript-data-engineering

ingesting-into-data-lake

sales-maestroqa

databricks-pipelines

dev-data-pipeline-builder

docker-data-environments

e2e-medallion-architecture

fabric-data-engineering

lakehouse-architect

spark-declarative-pipelines

streaming-into-data-lake

airflow

clari-core-workflow-a

clickhouse-webhooks-events

dagster-orchestration

data-analytics

data-pipeline-spec

data-warehouse-experimentation

data-warehouse-optimizer

databricks-core-workflow-a

Categories

Use cases

Popular tags

Learn

Site