← Back to Computer Vision
cs.CV

Combining hyperspectral and standard satellite images in one model

Nassim Ait Ali Braham, Aaron Banze, Conrad M. Albrecht, Julien Mairal, Jocelyn Chanussot, Xiao Xiang Zhu

May 20, 2026

Hyperspectral satellites capture far more spectral detail than standard multispectral ones, but no foundation model has combined them effectively. SpectralEarth-FM uses a hierarchical transformer with spectral tokenization and cross-sensor fusion to handle all five data types simultaneously—hyperspectral, optical, thermal, and radar. The team curated a 40TB dataset linking hyperspectral images from three satellites (EnMAP, EMIT, DESIS) with co-located Sentinel and Landsat data, then pretrained on 25M patches using a contrastive objective. It tops benchmarks for both hyperspectral tasks and standard Earth observation metrics.
Published as SpectralEarth-FM: Bringing Hyperspectral Imagery into Multimodal Earth Observation Pretraining arXiv:2605.21075
Read the original paper →