Computational mineral prospectivity mapping using satellite data. Predict where to dig before anyone else.
GeaSpirit is a remote sensing and exploration intelligence platform built on multi-source fusion and zone-based validation. It identifies zones with a high probability of hosting mineral deposits by combining satellite imagery, geophysics, geochemistry, hydrology, and geological context.
Unlike traditional exploration, GeaSpirit does not require physical access to the target zone. It operates entirely from publicly available remote sensing and geoscientific data.
"There is [MINERAL] at [DEPTH] at [COORDINATES] with [X%] certainty."
GeaSpirit assesses any point on the planet across 4 dimensions. Its current capability on each is scored out of 10:

| Dimension | Score | Goal |
|---|---|---|
| MINERAL | 4.0 / 10 | Identify deposit type (porphyry Cu, orogenic Au, sedimentary Cu, etc.) |
| DEPTH | 4.1 / 10 | Estimate target depth (surface proxy only; deposit-scale geophysics blocked) |
| COORDINATES | 7.0 / 10 | Precise location at 30 m per pixel resolution |
| CERTAINTY | 7.7 / 10 | Calibrated probability (isotonic calibration, Brier score validated) |
Canonical Score: 22.8 / 40 (57%) — Methodology frozen v4
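The canonical score appears to be the plain sum of the four dimension scores over a 40-point maximum. A minimal sketch of that arithmetic, assuming equal weighting (the weighting is not stated here, and `canonical_score` is an illustrative name):

```python
# Dimension scores as listed for the four dimensions (each out of 10).
scores = {"MINERAL": 4.0, "DEPTH": 4.1, "COORDINATES": 7.0, "CERTAINTY": 7.7}


def canonical_score(dimension_scores, max_per_dimension=10.0):
    """Return (total, maximum, percent) for equally weighted dimensions."""
    total = sum(dimension_scores.values())
    maximum = max_per_dimension * len(dimension_scores)
    return total, maximum, 100.0 * total / maximum


total, maximum, percent = canonical_score(scores)
print(f"{total:.1f} / {maximum:.0f} ({percent:.0f}%)")
```

With the scores listed, this reproduces the stated 22.8 / 40 (57%).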
The gap to 10/10 is a DATA problem (gravity, AEM, drill holes), not ML. Architecture is ready; the bottleneck is depth-oriented data access.
| Zone | Country | Deposit Type | Baseline AUC | Fusion AUC | Improvement |
|---|---|---|---|---|---|
| Chuquicamata | Chile | Porphyry Copper | 0.789 | 0.882 | +0.093 |
| Kalgoorlie | Australia | Orogenic Gold | 0.865 | 0.879 | +0.013 |
| Zambia Copperbelt | Zambia | Sedimentary Copper | 0.737 | 0.760 | +0.024 |
Multi-source fusion improves AUC at all 3 validated zones, which span different deposit types and continents. This suggests the approach generalizes, although the size of the gain varies widely (+0.013 to +0.093).
| Capability | Description | Status |
|---|---|---|
| Spectral analysis | Multi-band satellite imagery interpretation | PRODUCTION |
| Thermal anomaly detection | Long-term thermal proxy from 20-year archives | PRODUCTION |
| Hydrological features | Drainage density, watershed analysis | PRODUCTION |
| Neighborhood context | Spatial autocorrelation of deposit occurrence | PRODUCTION |
| Aeromagnetics | National TMI magnetic anomaly data | PRODUCTION |
| Geological context | Lithology classification via Macrostrat | SELECTIVE |
| Probability calibration | Isotonic calibration for honest certainty | PRODUCTION |
| GEE integration | Google Earth Engine operationalized for data pipelines | OPERATIONAL |
The following details are available only in the restricted technical whitepaper:
| Deposit Type | Zone | Labels | AUC |
|---|---|---|---|
| Porphyry Cu | Chuquicamata | 43 | 0.8622 |
| Orogenic Au | Kalgoorlie | 103 Au-only | 0.8063 |
| Sediment-hosted Cu | Zambia | 28 | 0.7626 |
| Version | Labels | Stack | AUC | Improvement |
|---|---|---|---|---|
| v1 (MRDS only) | 16 | 14 bands, 50% valid | 0.5752 | baseline |
| v2 (+OZMIN) | 205 | 12 bands, 50% valid | 0.7219 | +0.1467 (labels) |
| v3 (full stack) | 205 | 12 bands, 100% valid | 0.7690 | +0.0471 (stack) |
| Direction | AUC | Lesson |
|---|---|---|
| Chuquicamata → Zambia | 0.4543 | Porphyry Cu ≠ sediment-hosted Cu |
| Zambia → Chuquicamata | 0.5437 | Same commodity ≠ same geological signal |
| Column | Meaning |
|---|---|
| Experiment | Data source combination tested. Phase 3 baseline: 19 satellite bands, 152 mixed deposits (noisy). A, S2 only: 5 Sentinel-2 mineral indices. B, Full satellite: 19 bands (S2 + SAR + DEM + thermal). C, Geology only: 5 features from Macrostrat geological maps. D, S2 + geology: spectral indices plus geological maps (10 bands). E, Full fusion: all satellite + geology (24 features). |
| Bands | Features per pixel. S2 (5): Iron Oxide, Clay, Ferrous Iron, Laterite, NDVI. SAR (5): VV, VH, VV/VH, GLCM variance, GLCM contrast. DEM (6): Elevation, Slope, sin(Aspect), cos(Aspect), TPI, Ruggedness. Thermal (3): Median temp, P90 temp, Anomaly z-score. Geology (5): Lithology, Rock age, Group, Contact distance, Availability. |
| AUC | Area under the receiver operating characteristic curve (AUC-ROC). 0.50 = random guessing, 0.70 = acceptable, 0.80 = good, 0.90+ = excellent. An AUC of 0.86 means a randomly chosen deposit pixel outranks a randomly chosen barren pixel 86% of the time. Industry context: academic 0.70-0.85, GeaSpirit 0.86, commercial ($50M+) 0.85-0.93. |
| Precision | "When we say dig, how often are we right?" 0.948 means about 95 of every 100 flagged pixels are real deposits, so about 5% are false alarms. |
| Recall | "Of all real deposits, how many do we find?" 0.760 corresponds to finding 33 of 43 known deposits; the 10 missed likely have no surface expression. |
| F1 | Harmonic mean of precision and recall. 0.837 indicates a good balance between finding deposits and avoiding false alarms. |

| Experiment | Bands | AUC | Precision | Recall | F1 |
|---|---|---|---|---|---|
| Phase 3 baseline | 19 | 0.6844 | 0.606 | 0.284 | 0.345 |
| A: S2 only | 5 | 0.7325 | 0.803 | 0.880 | 0.822 |
| B: Full satellite | 19 | 0.8530 | 0.933 | 0.764 | 0.833 |
| C: Geology only | 5 | 0.7356 | 0.751 | 0.738 | 0.685 |
| D: S2 + geology | 10 | 0.8094 | 0.850 | 0.897 | 0.862 |
| E: Full fusion | 24 | 0.8622 | 0.948 | 0.760 | 0.837 |
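To make the metric columns concrete, here is a minimal sketch on made-up toy numbers (not GeaSpirit outputs). `precision_recall_f1` and `pairwise_auc` are illustrative names; the AUC helper uses the pairwise-ranking definition, which is quadratic in sample count and only suitable for small examples.

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall and F1 from true-positive, false-positive, false-negative counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


def pairwise_auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as half."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Toy example: 8 correct detections, 2 false alarms, 2 misses.
precision, recall, f1 = precision_recall_f1(tp=8, fp=2, fn=2)

# Toy scores for 3 deposit pixels and 4 barren pixels.
auc = pairwise_auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.2, 0.1])
print(precision, recall, f1, round(auc, 4))
```

When metrics are averaged across cross-validation folds, the reported F1 need not equal the harmonic mean of the reported precision and recall, which may explain small differences within a row.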
| Metric | Before | After |
|---|---|---|
| Brier Score | 0.1955 | 0.1711 |
| ECE | 0.1446 | 0.0000 |
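The calibration step and its two metrics can be sketched in plain Python. This is a minimal illustration on made-up toy scores, not GeaSpirit's pipeline: `isotonic_fit_transform` is a hypothetical helper implementing pool-adjacent-violators, and the 10-bin ECE is an assumed binning choice.

```python
def brier_score(probs, labels):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(probs)


def expected_calibration_error(probs, labels, n_bins=10):
    """Weighted mean gap between average confidence and positive rate per bin."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        bins[min(int(p * n_bins), n_bins - 1)].append((p, y))
    ece = 0.0
    for b in bins:
        if b:
            mean_p = sum(p for p, _ in b) / len(b)
            frac_pos = sum(y for _, y in b) / len(b)
            ece += len(b) / len(probs) * abs(mean_p - frac_pos)
    return ece


def isotonic_fit_transform(scores, labels):
    """Pool-adjacent-violators fit; returns calibrated values in input order.

    Assumes distinct scores (tied scores are not pooled in this sketch).
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    blocks = []  # each block holds [sum of labels, count]
    for i in order:
        blocks.append([float(labels[i]), 1])
        # Merge backwards while the block means are not non-decreasing.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]
        ):
            s, c = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += c
    fitted = [0.0] * len(scores)
    pos = 0
    for s, c in blocks:
        for _ in range(c):
            fitted[order[pos]] = s / c
            pos += 1
    return fitted


raw = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
truth = [0, 0, 1, 0, 0, 1, 1, 1]
calibrated = isotonic_fit_transform(raw, truth)
print(brier_score(raw, truth), brier_score(calibrated, truth))
print(expected_calibration_error(calibrated, truth))
```

An isotonic fit scored on the same data it was fit to yields an ECE of zero by construction, because every pooled block is assigned its own positive fraction. Scoring calibration on held-out folds gives a more informative number; whether the ECE in the table was measured in-sample is not stated.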
| # | Feature | Importance | Source |
|---|---|---|---|
| 1 | Terrain ruggedness | 18.6% | DEM |
| 2 | Elevation | 12.1% | DEM |
| 3 | SAR VH backscatter | 10.9% | Sentinel-1 |
| 4 | Ferrous iron index | 7.4% | Sentinel-2 |
| 5 | Thermal z-score | 7.0% | Landsat |
| 6 | Thermal P90 | 5.9% | Landsat |
| 7 | Clay/hydroxyl | 4.5% | Sentinel-2 |
| 8 | SAR VV | 3.9% | Sentinel-1 |
| 9 | SAR texture | 3.8% | Sentinel-1 |
| 10 | Iron oxide | 3.3% | Sentinel-2 |
| Parameter | Value |
|---|---|
| Curated deposits (Cu/Au/Ag) | 43 (from 152 raw MRDS) |
| Positive pixels | 33,428 |
| Geology-aware negatives | 14,483 (random + hard + matched) |
| Image resolution | 1856 x 1857 px · 24 bands · 30m |
| Sensors | Sentinel-2 + SAR + DEM + thermal + Macrostrat geology |
| Area covered | ~50 x 50 km |
| Validation | Spatial block CV (10km blocks, 5 folds) |
| Source (feature count) | Features |
|---|---|
| Sentinel-2 (5) | Iron Oxide · Clay/Hydroxyl · Ferrous Iron · Laterite · NDVI |
| Sentinel-1 SAR (5) | VV · VH · VV/VH ratio · GLCM variance · GLCM contrast |
| DEM (6) | Elevation · Slope · sin(Aspect) · cos(Aspect) · TPI · Ruggedness |
| Landsat thermal (3) | Median LST · P90 LST · Thermal z-score anomaly |
| Macrostrat geology (5) | Lithology code · Group · Geological age · Distance to contact · Availability |
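The index formulas are not given here. As a hedged illustration, the sketch below uses band ratios that are commonly applied to Sentinel-2 (B2 blue, B4 red, B8 NIR, B11 and B12 SWIR). These definitions, the function names, and the omission of the laterite index are assumptions and may differ from GeaSpirit's. The sketch also shows why aspect is stored as a sine/cosine pair: aspect is circular, so 359° and 1° should end up close together.

```python
import math


def spectral_indices(b2, b4, b8, b11, b12, eps=1e-9):
    """Common band-ratio proxies from surface reflectance (assumed definitions)."""
    return {
        "iron_oxide": b4 / (b2 + eps),        # red / blue
        "clay_hydroxyl": b11 / (b12 + eps),   # SWIR1 / SWIR2
        "ferrous_iron": b11 / (b8 + eps),     # SWIR1 / NIR
        "ndvi": (b8 - b4) / (b8 + b4 + eps),  # vegetation index
    }


def encode_aspect(aspect_deg):
    """Encode a compass aspect in degrees as a (sin, cos) pair."""
    rad = math.radians(aspect_deg)
    return math.sin(rad), math.cos(rad)


# Toy reflectance values for a single pixel.
idx = spectral_indices(b2=0.05, b4=0.10, b8=0.40, b11=0.30, b12=0.25)
print(round(idx["ndvi"], 3))

# 359 and 1 degrees are nearly identical once encoded.
s1, c1 = encode_aspect(359.0)
s2, c2 = encode_aspect(1.0)
gap = math.hypot(s1 - s2, c1 - c2)
```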
| Commodity | Assessment |
|---|---|
| COPPER (Cu) | Probability: HIGH (primary target) · Model AUC: 0.86 · Surface indicators: iron oxide anomalies, clay alteration halos, thermal anomalies, terrain ruggedness consistent with porphyry Cu · Known deposits matched: 33 of 43 (77%) · Estimated style: porphyry copper ± molybdenum · Detection basis: surface alteration proxy (no direct depth measurement possible from satellite) |
| GOLD (Au) | Probability: MODERATE-HIGH (associated with Cu) · Surface indicators: ferrous iron, silicification zones · Note: Au in this district is typically associated with porphyry Cu systems · Detection basis: surface alteration proxy (no direct depth measurement) |
| SILVER (Ag) | Probability: MODERATE (byproduct of Cu mining) · Surface indicators: clay/hydroxyl anomalies · Note: Ag typically co-occurs with Cu in this district |
| IRON (Fe) | Detection: STRONG surface signal · Ferrous iron index is the #4 feature (7.4%); iron oxide is #10 (3.3%) · Caution: not all iron indicates a valuable deposit; iron oxides also form from normal weathering of any iron-bearing rock |
| LITHIUM (Li) | NOT ASSESSED: Li deposits (salars, pegmatites) have different signatures. Salar de Atacama is 200 km south. Would require a separate model. |
| MOLYBDENUM (Mo) | INDIRECT: Mo is associated with porphyry Cu here. No direct spectral signature from satellites. |
| Class | Extent |
|---|---|
| HIGH mineral probability (>0.7) | ~496 km² (32.9% of valid area) |
| MODERATE probability (0.5-0.7) | ~34 km² (2.2%) |
| LOW probability (<0.5) | ~979 km² (64.9%) |
| Unexplored targets (>0.6, >5km from known) | 50 zones identified |
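The areas in this table follow from pixel counts at 30 m resolution. A minimal sketch of the conversion, using the roughly 1.69 million valid pixels reported for the study area (`pixels_to_km2` and `class_shares` are illustrative names, and the handling of values exactly on 0.5 or 0.7 is an assumption):

```python
def pixels_to_km2(n_pixels, pixel_size_m=30.0):
    """Ground area in km² covered by n square pixels."""
    return n_pixels * pixel_size_m ** 2 / 1_000_000


def class_shares(probabilities):
    """Share of pixels in each probability class (HIGH > 0.7, MODERATE 0.5-0.7)."""
    counts = {"HIGH": 0, "MODERATE": 0, "LOW": 0}
    for p in probabilities:
        if p > 0.7:
            counts["HIGH"] += 1
        elif p >= 0.5:
            counts["MODERATE"] += 1
        else:
            counts["LOW"] += 1
    return {name: count / len(probabilities) for name, count in counts.items()}


valid_area_km2 = pixels_to_km2(1_690_000)
shares = class_shares([0.9, 0.6, 0.2, 0.1])  # toy probabilities
print(valid_area_km2, shares)
```

About 1.69 million valid pixels correspond to roughly 1,521 km², which is consistent with the three classes summing to about 1,509 km².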
| Parameter | Value |
|---|---|
| Study area | 50 x 50 km centered on (-22.3, -68.9) |
| Pixel resolution | 30 meters |
| Total pixels | ~1.69 million (valid) |
| Features per pixel | 24 (satellite + geology) |
| Model | XGBoost gradient boosting |
| Validation | 5-fold spatial block CV (10km blocks) |
| Calibration | Isotonic regression (ECE = 0.000) |
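Spatial block cross-validation keeps all pixels of one block in the same fold, so a model is never tested on pixels adjacent to its training pixels within that block. The sketch below is one way to build such folds with 10 km blocks and 5 folds; the block-to-fold mapping and the function names are hypothetical, not GeaSpirit's implementation.

```python
def block_id(x_m, y_m, block_size_m=10_000.0):
    """Index of the square block containing a point (projected metres)."""
    return int(x_m // block_size_m), int(y_m // block_size_m)


def fold_of_block(block, n_folds=5):
    """Deterministic block-to-fold mapping (a hypothetical scheme)."""
    bx, by = block
    return (bx + 2 * by) % n_folds


def spatial_block_splits(coords, block_size_m=10_000.0, n_folds=5):
    """Yield (train_idx, test_idx); a block never straddles train and test."""
    folds = [fold_of_block(block_id(x, y, block_size_m), n_folds) for x, y in coords]
    for k in range(n_folds):
        test = [i for i, f in enumerate(folds) if f == k]
        train = [i for i, f in enumerate(folds) if f != k]
        yield train, test


# Toy grid: one sample every 2.5 km over a 50 x 50 km area.
coords = [(x * 2500.0, y * 2500.0) for x in range(20) for y in range(20)]
splits = list(spatial_block_splits(coords))
for train, test in splits:
    print(len(train), len(test))
```

Pixels on either side of a block boundary can still be spatially correlated; adding a buffer between train and test blocks is a common refinement that this sketch omits.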
| AOI | Top Score | HIGH km² | Style | Verdict |
|---|---|---|---|---|
| Banos de Mula, Murcia | 0.762 | 10.0 | Hydrothermal | PRIORITY 1 |
| Volcan de Barqueros, Murcia | 0.713 | 5.6 | Volcanic | PRIORITY 2 |
| Salave, Asturias | — | 0.0 | Orogenic Au (known) | PRIORITY 3 |
| Score | Meaning |
|---|---|
| 0.00-0.30 | Weak / background — no strong signal |
| 0.30-0.50 | Low anomaly — minor or noise |
| 0.50-0.60 | Moderate — worth review |
| 0.60-0.70 | Strong target — multi-proxy signal |
| 0.70-0.80 | Very strong — coherent anomaly |
| >0.80 | Exceptional — rare, immediate follow-up |
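The score bands can be applied programmatically. In this minimal sketch, a score exactly on a boundary is assigned to the lower band, which is an assumption (the table only makes this explicit for `>0.80`); `interpret_score` is an illustrative name.

```python
import bisect

# Upper bounds of the first five bands; anything above 0.80 is "Exceptional".
_UPPER_BOUNDS = [0.30, 0.50, 0.60, 0.70, 0.80]
_LABELS = [
    "Weak / background",
    "Low anomaly",
    "Moderate",
    "Strong target",
    "Very strong",
    "Exceptional",
]


def interpret_score(score):
    """Map a prospectivity score in [0, 1] to its interpretation band."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0 and 1")
    return _LABELS[bisect.bisect_left(_UPPER_BOUNDS, score)]


print(interpret_score(0.762))  # Very strong
print(interpret_score(0.55))   # Moderate
```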
| Data Source | Status | Impact |
|---|---|---|
| EMIT Hyperspectral | ACTIVE (Chuquicamata confirmed, Peru 50 granules found) | Porphyry Cu alteration — type-specific |
| GA Aeromagnetics | AVAILABLE (operator checklist ready) | Structural Au at Kalgoorlie |
| GA Radiometrics | PENDING (manual download) | K/Th/U for alteration mapping |
| Geomorphometry (6 bands) | COMPLETE (5 zones) | Curvature, TRI, multi-scale TPI |
| Direct GNN Inference | WORKING | CGCNN forward pass on CIF structures |
| Feature | Cohen's d | p-value | Signal |
|---|---|---|---|
| amplitude | -0.680 | 2.2e-15 | VERY STRONG |
| std_annual | -0.617 | 1.0e-12 | VERY STRONG |
| thermal_range_ratio | -0.565 | 1.3e-07 | VERY STRONG |
| mean_annual | -0.508 | 4.9e-08 | STRONG |
| summer_mean | -0.448 | 1.5e-06 | MODERATE |
| summer_winter_diff | -0.423 | 1.6e-05 | MODERATE |
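Cohen's d expresses the difference between two group means in units of their pooled standard deviation. A minimal sketch, assuming the sign convention is deposit minus background (so a negative d means the feature is lower at deposits) and using made-up toy values:

```python
import math
from statistics import mean, variance


def cohens_d(deposit_values, background_values):
    """Standardized mean difference (deposit minus background), pooled SD."""
    n1, n2 = len(deposit_values), len(background_values)
    pooled_var = (
        (n1 - 1) * variance(deposit_values)
        + (n2 - 1) * variance(background_values)
    ) / (n1 + n2 - 2)
    return (mean(deposit_values) - mean(background_values)) / math.sqrt(pooled_var)


# Toy feature values: deposits sit one pooled SD below background.
d = cohens_d([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
print(d)  # -1.0
```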
| Model | AUC | PR-AUC | Delta AUC |
|---|---|---|---|
| A: Baseline (satellite only) | 0.7971 | 0.5952 | — |
| D: Baseline + robust thermal v2 | 0.8078 | 0.6188 | +0.0107 |
| 4: Baseline + std_annual | 0.8253 | 0.6357 | +0.0133 |
| 5: Baseline + ratio + std | 0.8229 | 0.6416 | +0.0109 |
| E: Thermal only | 0.7565 | 0.5598 | -0.0406 |
| Feature | Kalgoorlie d | Chuquicamata d | Consistent? |
|---|---|---|---|
| amplitude | -0.680 | -0.898 | YES — both lower at deposits |
| thermal_range_ratio | -0.565 | -0.785 | YES — both lower at deposits |
| mean_annual | -0.508 | -1.121 | YES — both lower at deposits |
| summer_winter_diff | -0.423 | -0.898 | YES — both lower at deposits |
| std_annual | -0.617 | -0.174 | Same direction but weak at Chuquicamata |
| Criterion | Score | Evidence |
|---|---|---|
| Cross-zone signal consistency | 3/3 | 4 features same direction + significant at both sites |
| Model improvement at both sites | 1/3 | Kalgoorlie +0.013 AUC; Chuquicamata no AUC gain (baseline already 0.91) |
| EMIT Feature | Cohen's d | p-value | Direction |
|---|---|---|---|
| reflectance_pca_2 | -0.670 | 1.9e-06 | VERY PROMISING |
| hydroxyl_proxy | +0.645 | 1.5e-06 | MORE hydroxyl at deposits |
| mineral_id_count | +0.528 | 7.7e-05 | MORE spectral features |
| clay_proxy | +0.516 | 8.9e-05 | MORE clay at deposits |
| alteration_mineral_presence | +0.383 | 0.014 | MORE alteration |
| Test | Value | Interpretation |
|---|---|---|
| Mann-Whitney p | 0.138 | Not significant |
| Cohen's d | -0.250 | Small effect (below threshold) |
| Bootstrap 95% CI | [-0.009, +0.004] | Includes zero |
| Baseline + residual AUC | 0.724 | -0.016 vs baseline |
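A percentile bootstrap is one common way to obtain a confidence interval like the one in this table. Which quantity the table's interval was computed on is not stated; the sketch below uses the difference in means between two groups as a stand-in, with made-up toy data and a hypothetical function name.

```python
import random
from statistics import mean


def bootstrap_ci_mean_diff(a, b, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for mean(a) - mean(b)."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_boot):
        # Resample each group with replacement, keeping its original size.
        resampled_a = rng.choices(a, k=len(a))
        resampled_b = rng.choices(b, k=len(b))
        diffs.append(mean(resampled_a) - mean(resampled_b))
    diffs.sort()
    lo = diffs[int((alpha / 2) * n_boot)]
    hi = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi


# Toy groups that are clearly separated, so the interval excludes zero.
group_a = [10, 11, 12, 13, 14, 15, 16, 17, 18, 19]
group_b = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
lo, hi = bootstrap_ci_mean_diff(group_a, group_b)
includes_zero = lo <= 0.0 <= hi
print(lo, hi, includes_zero)
```

An interval that includes zero, as reported in the table, means the data cannot distinguish the effect from no effect at the chosen confidence level.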
| Model | AUC | Delta | Assessment |
|---|---|---|---|
| A: Baseline satellite | 0.9110 | — | Reference |
| E: Baseline + PCA embeddings | 0.9374 | +0.0264 | BEST — largest single-family gain |
| H: Baseline + thermal + grad + emb | 0.9287 | +0.0177 | Good fusion |
| B: Baseline + thermal | 0.9166 | +0.0056 | Modest |
| D: Baseline + spatial gradients | 0.9049 | -0.0061 | Negative — gradients hurt |
| Zone | Type | Selected Families | Rejected | AUC |
|---|---|---|---|---|
| Kalgoorlie | Orogenic Au | satellite + thermal + PCA embeddings | EMIT, gradients | 0.937 |
| Chuquicamata | Porphyry Cu | satellite + thermal + EMIT | PCA embeddings | 0.862 |
| Peru | Porphyry Cu | satellite + thermal | PCA embeddings | 0.758 |
| Arizona | Porphyry Cu | satellite + thermal | PCA embeddings | 0.718 |
| Zambia | Sediment Cu | satellite | — | 0.763 |
| Experiment | Zone | AUC Delta | Verdict |
|---|---|---|---|
| Aeromagnetics + Radiometrics | Kalgoorlie | +0.002 | NEUTRAL — below threshold, K/Th ratio promising |
| Foundation Embeddings v1 | Kalgoorlie | +0.004 | NEUTRAL — below threshold in block CV |
| EMIT Peru Replication | Peru | — | BLOCKED — granule truncated, download timed out |
| Full Stack (sat+therm+PCA+mag) | Kalgoorlie | +0.005 | 0.870 — best combination |
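The verdicts refer to a threshold on AUC delta whose value is not given. The sketch below shows the kind of rule implied, with a placeholder threshold of 0.005; both the threshold and the function name are assumptions for illustration only.

```python
def verdict(auc_delta, threshold=0.005):
    """Classify an AUC change against a symmetric threshold (assumed value)."""
    if auc_delta >= threshold:
        return "POSITIVE"
    if auc_delta <= -threshold:
        return "NEGATIVE"
    return "NEUTRAL"


for delta in (0.002, 0.004, 0.026, -0.006):
    print(f"{delta:+.3f} -> {verdict(delta)}")
```

A fixed threshold ignores fold-to-fold variance; comparing the delta against its spread across spatial folds would be a more defensible gate.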
| Dimension | Score | Capability | Gap |
|---|---|---|---|
| MINERAL | 3.3/10 | Neighborhood context: Au vs Ni AUC 0.627 | Geology maps, EMIT type, labels |
| DEPTH | 4.1/10 | Magnetic Euler proxy exists but noisy | AEM, gravity shape, drill holes |
| COORDINATES | 7.0/10 | 30 m resolution, 1 km² zones | Peak finding, GPS validation |
| CERTAINTY | 9.3/10 | AUC 0.882, Brier 0.091 (calibrated) | More labels, ensemble |
| Idea | Physical Basis | Status |
|---|---|---|
| ML residual maps | Strong | TESTED — NEGATIVE |
| Spatial gradient / edge operators | Strong | VIABLE — next experiment |
| Multiscale texture (SAR/DEM/S2) | Strong | VIABLE |
| NDVI ↔ thermal cross-correlation | Moderate | SPECULATIVE |
| Night-time thermal difference | Strong | Partially redundant |
| SAR polarimetric decomposition | Moderate | Limited (dual-pol only) |
| InSAR coherence | Weak | SPECULATIVE |
| Soil moisture anomaly | Moderate | INVIABLE (resolution) |
| Passive microwave downscaling | Weak | INVIABLE |
| Self-potential from satellite | N/A | INVIABLE |
| Family | Verdict | Detail |
|---|---|---|
| Thermal 20yr | USEFUL | Universal modest. d=-0.627, +0.013 AUC. Replicated at 2 zones. |
| EMIT alteration | SELECTIVE | Porphyry Cu only (hydroxyl d=+0.645). Negative at orogenic Au. |
| PCA embeddings | SELECTIVE | +0.026 AUC at Kalgoorlie. Negative at all porphyry zones. |
| Neighborhood context | PROMISING | Mineral AUC 0.507→0.627. Key for mineral discrimination. |
| Magnetics (TMI) | SMALL + | +0.009 AUC with correct GA national data. Prior result was invalid (wrong tiles). |
| Peru EMIT | DEFERRED | 50 granules found. Download truncated. Not scientifically failed. |
| Spatial gradients | REJECTED | -0.006 AUC. No benefit at any tested zone. |
| ML residuals | REJECTED | No independent subsurface signal at Kalgoorlie. |
| Component | Status | Details |
|---|---|---|
| Geology | VALIDATED SELECTIVE | 3-zone evidence (Zambia, Peru, Kalgoorlie) |
| Depth Activation Layer | BUILT | 1 active, 3 ready, 2 regional, 2 future |
| Operator Unlock Checklist | v3 — 11 ITEMS | 4 HIGH priority, all 3 dropzones EMPTY |
| Gating | v6 — 10 RULES | Baseline-aware, type+zone+baseline gating |
| Frontier Track | v4 | spectral_unmixing + NDVI_trend selected for Phase 21 |
| Registry | v16 | 27+ families catalogued |
| Canonical Score | 22.8/40 (57%) | FROZEN v4 — bottleneck is depth DATA, not architecture |
| Component | Status | Details |
|---|---|---|
| Feature Family | 9 FEATURES BUILT | topo_diversity, landform_variety, slope, aspect, TPI, TRI, curvature, SAR_VV, SAR_VH via GEE (CSP/ERGo landform dataset) |
| Top Feature | topo_diversity | Top feature at 3/4 zones (Peru, Kalgoorlie, Chuquicamata) from CSP/ERGo landform dataset |
| Peru Standalone | AUC 0.902 | Strong standalone terrain discrimination |
| Kalgoorlie Standalone | AUC 0.859 | Strong standalone terrain discrimination |
| Chuquicamata Standalone | AUC 0.846 | Strong standalone terrain discrimination |
| Zambia Standalone | AUC 0.682 | Weak — flat Copperbelt terrain provides limited signal |
| Combined with S2 | REDUNDANT | Kalgoorlie +0.001 (NEUTRAL), Zambia -0.068 (NEGATIVE), Chuquicamata -0.021 (NEGATIVE), Peru -0.004 (NEUTRAL) |
| Terrain Reclassification | SURFACE_STRUCTURE | Not true depth — measures surface morphology, not subsurface geology |
| Canonical Score | 22.8/40 (57%) | UNCHANGED — real depth needs GA gravity, GSWA AEM, USGS Earth MRI (manual portals) |
| Pipeline | Status | Details |
|---|---|---|
| Raw S2 Reflectance | PIPELINE_READY | 10 bands (B2-B12), 4 zones, cloud-masked via SCL |
| Multi-Year NDVI | PIPELINE_READY | Landsat 8, 2013-2024, 12 annual composites, 4 zones |
| Full Raster Export | PENDING | Requires async ee.batch.Export.image.toDrive() |
| Spectral Unmixing | VALIDATION_PENDING | Needs exported rasters for real endmember decomposition |
| NDVI Trend | VALIDATION_PENDING | Zambia +0.0043/yr greening — promising, needs full validation |
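An NDVI trend such as the per-year greening figure above is typically the slope of a straight line fitted to the annual composites. A minimal least-squares sketch on a synthetic series built to have a slope of 0.0043 per year (the series and the function name are illustrative, not GeaSpirit data):

```python
def linear_trend(years, values):
    """Ordinary least-squares slope of values against years (units per year)."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    return sxy / sxx


# Synthetic annual NDVI composites for 2013-2024 (12 values).
years = list(range(2013, 2025))
ndvi = [0.30 + 0.0043 * (year - 2013) for year in years]
slope = linear_trend(years, ndvi)
print(round(slope, 4))
```

On real composites the slope would be computed per pixel, and its significance checked against year-to-year noise before using it as a feature.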
| Frontier Candidate | Status | Root Cause |
|---|---|---|
| Spectral Unmixing | BLOCKED_BY_DATA | Existing stacks contain derived indices (iron oxide ratio, clay ratio, NDVI) — not raw Sentinel-2 L2A reflectance bands (B2–B12). Sub-pixel mineral unmixing requires raw reflectance as input. Cannot validate without rebuilding the data pipeline from raw S2 imagery. |
| NDVI Trend (20yr) | BLOCKED_BY_DATA | Stacks contain a single-date NDVI snapshot, not a 20-year Landsat time series. Vegetation stress anomaly detection requires multi-year composite statistics (mean, trend slope, variance). Cannot validate without building a Landsat temporal composite pipeline. |
| Data Source | Status | Details |
|---|---|---|
| GEE Python API | FULLY_ACCESSIBLE | ee.Initialize() succeeds. SRTM data query works. Ready for raw S2 + Landsat pipelines. |
| ECOSTRESS / earthaccess | PARTIALLY_ACCESSIBLE | Library installed, NASA Earthdata auth OK. Search returns 0 granules for test AOI — endpoint or collection issue. |
| Gravity (GA Bouguer) | BLOCKED | GA endpoints return HTML portal, not data. Manual download required. |
| AEM (detailed) | BLOCKED | No public API. Requires state geological survey access. |
| Earth MRI | BLOCKED | USGS Earth MRI portal — manual download only. |
| Pipeline | Source | Purpose |
|---|---|---|
| Raw S2 Reflectance | GEE → Sentinel-2 L2A | B2–B12 raw bands for spectral unmixing (NNLS endmember decomposition) |
| Multi-year NDVI Composite | GEE → Landsat 5/7/8/9 archive | 20-year median, trend slope, variance for vegetation stress anomaly detection |
| GEE → GeaSpirit Integration | ee Python API | Automated download, reproject to zone AOI, align to existing stacks |
| Real Frontier Validation | New raw stacks | Re-run unmixing + NDVI trend experiments with proper data, measure real AUC delta |
| Component | Status | Details |
|---|---|---|
| S2 Reflectance Pipeline | PIPELINE_READY | 10 bands, 4 zones, sample-tested with real data |
| NDVI Trend Pipeline | PIPELINE_READY | 12 years, 4 zones, real trends measured |
| GEE | OPERATIONALIZED | 2 datasets, 4 AOIs, export pathway documented |
| Raster Export | PENDING | Async GEE batch export needed |
| Frontier Validation | PIPELINE_READY | Needs exported rasters for real AUC measurement |
| Blocked Items | 8/11 | Depth items all blocked |
| Gating | v9 | Updated with pipeline-ready gates |
| Registry | v19 | Updated with GEE pipeline results |
| Canonical Score | 22.8/40 (57%) | UNCHANGED — no real AUC improvement yet |
Data from ESA Copernicus, NASA, USGS, and Macrostrat open programs.