In response to the problem of inadequate utilization of local information in PolSAR image classification using Vision Transformer in existing studies, this paper proposes a Vision Transformer method considering local ...In response to the problem of inadequate utilization of local information in PolSAR image classification using Vision Transformer in existing studies, this paper proposes a Vision Transformer method considering local information, LIViT. The method replaces image patch sequence with polarimetric feature sequence in the feature embedding, and uses convolution for mapping to preserve image spatial detail information. On the other hand, the addition of the wavelet transform branch enables the network to pay more attention to the shape and edge information of the feature target and improves the extraction of local edge information. The results in Wuhan, China and Flevoland, Netherlands show that considering local information when using Vision Transformer for PolSAR image classification effectively improves the image classification accuracy and shows better advantages in PolSAR image classification.展开更多
Speckle effects on classification results can be sup- pressed to some extent by introducing the contextual information. An unsupervised classification algorithm is proposed for polarimetric synthetic aperture radar (...Speckle effects on classification results can be sup- pressed to some extent by introducing the contextual information. An unsupervised classification algorithm is proposed for polarimetric synthetic aperture radar (POLSAR) images based on the mean shift (MS) segmentation and Markov random field (MRF). First, polarimetdc features are exacted by target decomposition for MS segmentation. An initial classification is executed by using the target decomposition and the agglomerative hierarchical clus- tering algorithm. Thereafter, a classification step based on MRF is performed by using the mean coherence matrices obtained for each segment. Under the MRF framework, the smoothness term is defined according to the distance between neighboring areas. By using POLSAR images acquired by the German Aerospace Centre and National Aeronautics and Space Administration/Jet Propulsion Laboratory, the experimental results confirm that the proposed method has higher accuracy and better regional connectivity than other classification methods.展开更多
文摘In response to the problem of inadequate utilization of local information in PolSAR image classification using Vision Transformer in existing studies, this paper proposes a Vision Transformer method considering local information, LIViT. The method replaces image patch sequence with polarimetric feature sequence in the feature embedding, and uses convolution for mapping to preserve image spatial detail information. On the other hand, the addition of the wavelet transform branch enables the network to pay more attention to the shape and edge information of the feature target and improves the extraction of local edge information. The results in Wuhan, China and Flevoland, Netherlands show that considering local information when using Vision Transformer for PolSAR image classification effectively improves the image classification accuracy and shows better advantages in PolSAR image classification.
基金supported by the National Natural Science Foundation of China(6100118741001256+1 种基金40971219)the National High Technology Research and Development Program of China(863 Program)(2013 AA122301)
文摘Speckle effects on classification results can be sup- pressed to some extent by introducing the contextual information. An unsupervised classification algorithm is proposed for polarimetric synthetic aperture radar (POLSAR) images based on the mean shift (MS) segmentation and Markov random field (MRF). First, polarimetdc features are exacted by target decomposition for MS segmentation. An initial classification is executed by using the target decomposition and the agglomerative hierarchical clus- tering algorithm. Thereafter, a classification step based on MRF is performed by using the mean coherence matrices obtained for each segment. Under the MRF framework, the smoothness term is defined according to the distance between neighboring areas. By using POLSAR images acquired by the German Aerospace Centre and National Aeronautics and Space Administration/Jet Propulsion Laboratory, the experimental results confirm that the proposed method has higher accuracy and better regional connectivity than other classification methods.