Frontiers in Remote Sensing (Dec 2024)
Bi-modal contrastive learning for crop classification using Sentinel-2 and PlanetScope
Abstract
Remote sensing has enabled large-scale crop classification for understanding agricultural ecosystems and estimating production yields. In recent years, machine learning has become increasingly relevant for automated crop classification. However, existing algorithms require large amounts of annotated data. Self-supervised learning, which enables training on unlabeled data, has great potential to overcome this annotation bottleneck. Contrastive learning, a self-supervised approach based on instance discrimination, has shown promising results for natural images as well as remote sensing imagery. Crop data often consist of field parcels or sets of pixels from small spatial regions, and temporal patterns must be taken into account to label crops correctly. Hence, the standard approaches for land-cover classification cannot be applied. In this work, we propose two contrastive self-supervised learning approaches to obtain a pre-trained model for crop classification without the need for labeled data. First, we adopt the uni-modal contrastive method SCARF and, second, we use a bi-modal approach based on Sentinel-2 and PlanetScope data instead of the standard transformations developed for natural images, in order to accommodate the spectral characteristics of crop pixels. Evaluation in three regions of Germany and France shows that crop classification with the pre-trained bi-modal model is superior to the pre-trained uni-modal method as well as the supervised baseline models in the majority of test cases.
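The bi-modal setup described above can be illustrated with a symmetric InfoNCE-style contrastive loss, where the Sentinel-2 and PlanetScope embeddings of the same location form the positive pair and all other pairs in the batch serve as negatives. The following is a minimal sketch in plain Python; the function name, the temperature value, and the toy embeddings are illustrative assumptions, not the paper's actual implementation.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def bimodal_info_nce(emb_a, emb_b, temperature=0.1):
    """Symmetric InfoNCE loss over two modalities (hypothetical sketch).

    emb_a[i] and emb_b[i] are embeddings of the same crop pixel/parcel from
    the two sensors; they form the positive pair, all other combinations in
    the batch act as negatives.
    """
    a = [l2_normalize(v) for v in emb_a]
    b = [l2_normalize(v) for v in emb_b]
    n = len(a)
    # Temperature-scaled cosine similarity matrix between the two modalities.
    sims = [[dot(a[i], b[j]) / temperature for j in range(n)] for i in range(n)]
    loss = 0.0
    for i in range(n):
        row = sims[i]                          # a_i contrasted against all b_j
        col = [sims[j][i] for j in range(n)]   # b_i contrasted against all a_j
        loss -= math.log(math.exp(row[i]) / sum(math.exp(s) for s in row))
        loss -= math.log(math.exp(col[i]) / sum(math.exp(s) for s in col))
    return loss / (2 * n)
```

With this loss, embeddings of matching Sentinel-2/PlanetScope pairs are pulled together while mismatched pairs are pushed apart, which is the mechanism that lets the model pre-train without crop labels.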
Keywords