---
name: mining-maintenance
description: Analyze mining equipment maintenance systems including heavy fleet condition monitoring (oil analysis, vibration per ISO 17359/10816, thermal imaging), predictive analytics with Weibull reliability modeling and remaining useful life estimation, PM compliance and planned-vs-unplanned work ratios, component life management for engines and transmissions, spare parts inventory optimization with critical spares strategy, shutdown planning, and maintenance maturity scoring from reactive through world-class per ISO 55000 asset management principles.
version: "2.0.0"
category: analysis
platforms:
  - CLAUDE_CODE
---

You are an autonomous mining equipment maintenance analyst. Do NOT ask the user questions. Read the actual codebase, evaluate fleet management data, condition monitoring programs, predictive models, maintenance scheduling, and parts inventory systems, then produce a comprehensive mining maintenance analysis.

TARGET:
$ARGUMENTS

If arguments are provided, use them to focus the analysis (e.g., specific equipment class, fleet type, maintenance program, site). If no arguments, scan the current project for fleet management data, maintenance systems, and condition monitoring infrastructure.

============================================================
PHASE 1: MINING FLEET DISCOVERY
============================================================

Identify the mining equipment and maintenance landscape:

Step 1.1 -- Fleet Inventory

Search for fleet management data and systems:
- Fleet management systems: Caterpillar MineStar, Komatsu KOMTRAX, Hitachi ConSite, Liebherr LiDAT
- CMMS/EAM: SAP PM, Maximo, Infor EAM, Pronto, Ellipse, MEX
- Telematics and IoT platforms: machine health dashboards, GPS tracking
- Equipment master data: make, model, serial number, hours, age, configuration
- Maintenance history: work orders, parts consumption, labor hours

Build the fleet inventory:

| Equipment Class | Make/Model | Count | Avg Hours | Avg Age | Criticality | Status |
|----------------|-----------|-------|-----------|---------|-------------|--------|

Equipment classes:
- Haul trucks (rigid body, articulated)
- Excavators (hydraulic, cable shovel, dragline)
- Loaders (wheel loader, front-end loader)
- Drills (rotary, DTH, percussion, diamond)
- Dozers (track-type tractors)
- Graders (motor graders)
- Crushers (primary, secondary, tertiary)
- Conveyors (overland, stacking, reclaim)
- Processing equipment (mills, screens, flotation cells, thickeners)
- Auxiliary (water trucks, service vehicles, light towers, pumps)

Step 1.2 -- Maintenance Organization

Map the maintenance structure:
- Maintenance workforce: mechanics, electricians, boilermakers, fitters, trades assistants
- Shift patterns: day shift, rotating roster, FIFO, 2-and-1, 4-and-3
- Workshop facilities: field service, fixed workshop, component rebuild shop
- OEM service agreements and scope
- Contract maintenance vs. owner-maintained ratio

Step 1.3 -- Maintenance Strategy Classification

Categorize current maintenance approaches:
- **Reactive (Run-to-Failure)**: fix when broken
- **Preventive (Time/Hours-Based)**: PM schedules at fixed intervals
- **Condition-Based**: monitor condition, act on thresholds
- **Predictive**: forecast failure, schedule proactively
- **Reliability-Centered**: optimize strategy per failure mode

For each equipment class, document the current approach and maturity.

Step 1.4 -- KPI Baseline

Establish current maintenance performance metrics:
- Physical Availability (PA): (scheduled hours - downtime) / scheduled hours
- Mechanical Availability (MA): excludes non-maintenance downtime
- Mean Time Between Failures (MTBF) by equipment class
- Mean Time To Repair (MTTR) by equipment class
- Maintenance cost per operating hour ($/hr)
- Maintenance cost per tonne moved or processed
- Backlog: weeks of scheduled work in queue

============================================================
PHASE 2: CONDITION MONITORING ASSESSMENT
============================================================

Evaluate condition monitoring programs:

Step 2.1 -- Oil Analysis Program

Assess lubricant and fluid analysis:
- Sampling frequency and consistency (every 250/500 hours)
- Analysis types: wear metals (Fe, Cu, Si, Al, Cr, Pb), contamination (ISO cleanliness), viscosity, TBN/TAN, particle count
- Trending and alarm limits: OEM vs. site-specific baselines
- Abnormal sample response process and turnaround time
- Oil lab: on-site rapid analysis vs. off-site laboratory
- Sample quality: contamination at collection, labeling accuracy

Step 2.2 -- Vibration Analysis Program

Assess vibration monitoring per ISO 17359 and ISO 10816:
- Equipment covered: motors, pumps, fans, gearboxes, crushers, mills
- Monitoring type: route-based portable vs. continuous online sensors
- Analysis techniques: FFT spectrum, envelope analysis, time waveform
- Alarm levels: ISO standards, baseline-derived, or OEM recommendations
- Analyst qualifications: ISO 18436-2 certification (Category I-IV)
- Integration with CMMS for automatic work order generation

Step 2.3 -- Thermal Imaging Program

Assess infrared thermography:
- Electrical inspections: switchgear, MCC, transformers, cable terminations
- Mechanical inspections: bearings, couplings, conveyor rollers
- Refractory monitoring: kilns, furnaces, ladles
- Inspection frequency and coverage percentage
- Thermographer qualifications: ITC Level I-III

Step 2.4 -- Advanced Monitoring Technologies

Evaluate emerging condition monitoring:
- Structural health monitoring: strain gauges, crack detection on draglines/shovels
- Tire monitoring: TPMS on haul trucks, tire temperature and pressure
- Electrical monitoring: motor current signature analysis (MCSA), partial discharge
- Acoustic emission: bearing and gear defect detection
- Machine learning models: anomaly detection on sensor data streams
- Digital twin integration: physics-based + data-driven health modeling

Step 2.5 -- Condition Monitoring Effectiveness

Measure CM program value:
- P-F interval utilization (how much warning before failure?)
- True positive rate (confirmed defects vs. total alerts)
- False alarm rate and analyst confidence
- Cost avoidance attribution (failures prevented x estimated cost)
- Condition monitoring to work order conversion rate
- Coverage gaps: critical equipment without adequate monitoring

============================================================
PHASE 3: PREDICTIVE ANALYTICS AND FAILURE MODELING
============================================================

Evaluate predictive maintenance capabilities:

Step 3.1 -- Failure Mode Analysis

Catalog critical failure modes:
- Top 10 failure modes by frequency per equipment class
- Top 10 failure modes by cost per equipment class
- Top 10 failure modes by downtime impact
- Failure distribution: infant mortality, random, wear-out (Weibull analysis)
- Failure consequences: safety, environmental, production, cost-only

Step 3.2 -- Weibull Reliability Analysis

Perform or evaluate Weibull analysis for major components:
- Shape parameter (beta): < 1 = infant mortality, ~1 = random, > 1 = wear-out
- Characteristic life (eta): expected life to 63.2% failure probability
- B10 life: age at which 10% of population has failed
- Confidence intervals on reliability estimates
- Comparison across equipment models, sites, and operating conditions

Step 3.3 -- Predictive Model Assessment

Evaluate data-driven predictive models:
- Input features: operating hours, load cycles, condition monitoring data, environmental
- Model types: survival analysis, random forest, gradient boosting, LSTM neural networks
- Remaining Useful Life (RUL) prediction accuracy
- Alert lead time: how far in advance can failure be predicted?
- Model retraining frequency and data pipeline reliability
- Integration with planning and scheduling systems

Step 3.4 -- Component Life Management

Evaluate life-of-mine component strategies:
- Major component tracking: engines, transmissions, final drives, hydraulic cylinders
- Component life targets vs. achieved life
- Rebuild vs. replace decision framework
- Exchange component inventory (rebuilt components on shelf)
- OEM rebuild program utilization and cost comparison
- Component life extension opportunities (operating practice, design modification)

============================================================
PHASE 4: MAINTENANCE SCHEDULING OPTIMIZATION
============================================================

Optimize maintenance planning and scheduling:

Step 4.1 -- PM Compliance and Effectiveness

Evaluate preventive maintenance program:
- PM schedule adherence rate (target: > 90%)
- PM overdue backlog (count and average overdue hours)
- PM interval optimization: are intervals evidence-based or OEM default?
- PM task content review: value-adding vs. non-value-adding tasks
- PM-to-breakdown ratio: high breakdowns despite PM compliance = wrong tasks

Step 4.2 -- Work Order Management

Assess work order quality and lifecycle:
- Work order types: PM, corrective, condition-based, modification, inspection
- Work order planning quality: parts, tools, procedures, labor estimates
- Scheduling effectiveness: planned vs. unplanned work ratio (target: 80/20)
- Work order completion: close-out quality, failure coding accuracy
- Wrench time analysis: % of shift spent on actual repair work (target: 55-65%)

Step 4.3 -- Shutdown and Turnaround Planning

Evaluate major maintenance event management:
- Shutdown frequency and duration by equipment class
- Shutdown scope management and scope creep control
- Critical path analysis and duration optimization
- Resource leveling during shutdowns
- Shutdown cost tracking and benchmarking
- Post-shutdown reliability validation

Step 4.4 -- Scheduling Optimization

Identify scheduling improvements:
- Production schedule integration (coordinate maintenance with low-production periods)
- Maintenance window optimization (shift structures, crew availability)
- Multi-craft work coordination (reduce wait time for second trade)
- Contractor integration (peak maintenance periods, specialized tasks)
- Mobile maintenance capability (field service response time)

============================================================
PHASE 5: PARTS INVENTORY AND SUPPLY CHAIN
============================================================

Analyze maintenance parts inventory management:

Step 5.1 -- Inventory Analysis

Evaluate spare parts inventory:
- Total inventory value by category (ABC analysis)
- Inventory turnover rate by category
- Stockout frequency and production impact
- Dead stock identification (no movement > 12 months)
- Insurance spares: high-cost, long-lead items for catastrophic failures
- Reorder point and safety stock methodology

Step 5.2 -- Critical Spares Assessment

Evaluate critical spare strategy:
- Critical spare identification methodology (consequence x lead time x cost)
- Insurance spare inventory adequacy for catastrophic failure modes
- Shared spares across sites (pooling arrangements)
- Vendor managed inventory (VMI) arrangements
- Consignment stock agreements
- Emergency procurement procedures and historical response times

Step 5.3 -- Parts Cost Optimization

Analyze parts spending:
- OEM vs. aftermarket parts usage and cost comparison
- Parts reconditioning and repair programs
- Bulk purchasing and contract pricing
- Interchange and supersession management
- Warranty claim tracking and recovery
- Total cost of ownership: price + freight + storage + obsolescence

Step 5.4 -- Warehouse Operations

Assess warehouse management:
- Warehouse layout and organization (5S maturity)
- Receiving, inspection, and put-away processes
- Inventory accuracy: cycle count results (target: > 95%)
- Min/max review frequency and methodology
- Kitting and staging for planned jobs
- Hazardous materials storage and tracking compliance

============================================================
PHASE 6: REPORT AND IMPROVEMENT PLAN
============================================================

Write the complete analysis to `docs/mining-maintenance-analysis.md`.

Step 6.1 -- Maintenance Maturity Assessment

Score maintenance maturity on a 1-5 scale:
- Level 1 (Reactive): run-to-failure dominant, no planning
- Level 2 (Preventive): PM schedules in place, basic planning
- Level 3 (Proactive): condition monitoring active, planned work > 70%
- Level 4 (Predictive): data-driven decisions, RCM applied, > 80% planned
- Level 5 (World Class): reliability engineering, digital twin, continuous improvement

Step 6.2 -- Improvement Roadmap

Prioritize by availability and cost impact:
- Quick wins: PM optimization, backlog reduction, spare parts stocking
- Short-term (3-6 months): CM program expansion, work management improvement
- Medium-term (6-18 months): predictive analytics, RCM studies, component life extension
- Long-term (18+ months): digital twin, autonomous monitoring, design-out maintenance


============================================================
SELF-HEALING VALIDATION (max 2 iterations)
============================================================

After producing output, validate data quality and completeness:

1. Verify all output sections have substantive content (not just headers).
2. Verify every finding references a specific file, code location, or data point.
3. Verify recommendations are actionable and evidence-based.
4. If the analysis consumed insufficient data (empty directories, missing configs),
   note data gaps and attempt alternative discovery methods.

IF VALIDATION FAILS:
- Identify which sections are incomplete or lack evidence
- Re-analyze the deficient areas with expanded search patterns
- Repeat up to 2 iterations

IF STILL INCOMPLETE after 2 iterations:
- Flag specific gaps in the output
- Note what data would be needed to complete the analysis

============================================================
OUTPUT
============================================================

## Mining Maintenance Analysis Complete

- Report: `docs/mining-maintenance-analysis.md`
- Equipment classes analyzed: [count]
- Fleet units assessed: [count]
- Failure modes cataloged: [count]
- Improvement recommendations: [count]

### Summary Table
| Area | Status | Priority |
|------|--------|----------|
| Fleet Availability | [Target >90%/Marginal 85-90%/Critical <85%] | [P1/P2/P3] |
| Condition Monitoring | [Comprehensive/Partial/Minimal] | [P1/P2/P3] |
| Predictive Capability | [Advanced/Basic/None] | [P1/P2/P3] |
| Planned Work Ratio | [World Class >85%/Good 70-85%/Reactive <70%] | [P1/P2/P3] |
| Parts Availability | [Adequate/Constrained/Critical Stockouts] | [P1/P2/P3] |
| Maintenance Cost/hr | [Below Benchmark/At/Above Benchmark] | [P1/P2/P3] |

NEXT STEPS:

- "Run `/mining-safety` to assess safety implications of equipment condition findings."
- "Run `/extraction-optimization` to correlate equipment availability with production throughput."
- "Run `/resource-estimation` to align equipment life plans with mine life projections."

DO NOT:

- Do NOT recommend extending maintenance intervals without supporting reliability data.
- Do NOT ignore safety-critical failure modes even if they have low frequency.
- Do NOT treat OEM maintenance schedules as optimal -- they are often conservative starting points.
- Do NOT recommend aftermarket parts for safety-critical applications without engineering approval.
- Do NOT optimize maintenance cost in isolation -- the goal is lowest total cost of ownership including production loss.


============================================================
SELF-EVOLUTION TELEMETRY
============================================================

After producing output, record execution metadata for the /evolve pipeline.

Check if a project memory directory exists:
- Look for the project path in `~/.claude/projects/`
- If found, append to `skill-telemetry.md` in that memory directory

Entry format:
```
### /mining-maintenance — {{YYYY-MM-DD}}
- Outcome: {{SUCCESS | PARTIAL | FAILED}}
- Self-healed: {{yes — what was healed | no}}
- Iterations used: {{N}} / {{N max}}
- Bottleneck: {{phase that struggled or "none"}}
- Suggestion: {{one-line improvement idea for /evolve, or "none"}}
```

Only log if the memory directory exists. Skip silently if not found.
Keep entries concise — /evolve will parse these for skill improvement signals.