A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection

Abstract

Remote sensing image change detection (CD) is done to identify desired significant changes between bitemporal images. Given two co-registered images taken at different times, the illumination variations and misregistration errors overwhelm the real object changes. Exploring the relationships among different spatial–temporal pixels may improve the performances of CD methods. In our work, we propose a novel Siamese-based spatial–temporal attention neural network. In contrast to previous methods that separately encode the bitemporal images without referring to any useful spatial–temporal dependency, we design a CD self-attention mechanism to model the spatial–temporal relationships. We integrate a new CD self-attention module in the procedure of feature extraction. Our self-attention module calculates the attention weights between any two pixels at different times and positions and uses them to generate more discriminative features. Considering that the object may have different scales, we partition the image into multi-scale subregions and introduce the self-attention in each subregion. In this way, we could capture spatial–temporal dependencies at various scales, thereby generating better representations to accommodate objects of various sizes. We also introduce a CD dataset LEVIR-CD, which is two orders of magnitude larger than other public datasets of this field. LEVIR-CD consists of a large set of bitemporal Google Earth images, with 637 image pairs (1024 × 1024) and over 31 k independently labeled change instances. Our proposed attention module improves the F1-score of our baseline model from 83.9 to 87.3 with acceptable computational overhead. Experimental results on a public remote sensing image CD dataset show our method outperforms several other state-of-the-art methods.

Keywords

Computer scienceDiscriminative modelPixelChange detectionArtificial intelligencePattern recognition (psychology)Image (mathematics)Feature (linguistics)Computer vision

Affiliated Institutions

Beihang University CN

Related Publications

Caltech-256 Object Category Dataset

G. S. Griffin , Alex Holub , Pietro Perona

We introduce a challenging set of 256 object categories containing a total of 30607 images. The original Caltech-101 [1] was collected by choosing a set of object categories, do...

2007 The Caltech Institute Archives (Calif... 2388 citations

CBAM: Convolutional Block Attention Module

Sanghyun Woo , Jongchan Park , Joon‐Young Lee +1 more

2018 Lecture notes in computer science 20102 citations

Dual Attention Network for Scene Segmentation

Jun Fu , Jing Liu , Haijie Tian +4 more

In this paper, we address the scene segmentation task by capturing rich contextual dependencies based on the self-attention mechanism. Unlike previous works that capture context...

2019 6497 citations

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Richard Zhang , Phillip Isola , Alexei A. Efros +2 more

While it is nearly effortless for humans to quickly assess the perceptual similarity between two images, the underlying processes are thought to be quite complex. Despite this, ...

2018 10763 citations

SUN attribute database: Discovering, annotating, and recognizing scene attributes

Geneviève Patterson , James Hays

In this paper we present the first large-scale scene attribute database. First, we perform crowd-sourced human studies to find a taxonomy of 102 discriminative attributes. Next,...

2012 857 citations

Publication Info

Year: 2020
Type: article
Volume: 12
Issue: 10
Pages: 1662-1662
Citations: 1487
Access: Closed

External Links

Download PDF (Free) View on DOI.org Semantic Scholar

Social Impact

Altmetric

A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1487

OpenAlex

281

Influential

1450

CrossRef

Cite This

APA Style

                            
                                    Hao Chen, 
                                
                                    Zhenwei Shi
                                
                            (2020). 
                            A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection. 
                            Remote Sensing
                            , 12
                            (10)
                            , 1662-1662.
                            https://doi.org/10.3390/rs12101662

Identifiers

DOI: 10.3390/rs12101662

Data Quality

Data completeness: 86%