特定課題報告書印刷(Print out of Special Research Projects)

表題番号：2022C-489 日付：2023/04/03

研究課題Multiscanning Strategy-Based Dynamic One-Hot Positional Encoding with Spectral-Spatial Soft Transformer for Hyperspectral Image Classification

	研究者所属（当時）	資格	氏名
（代表者）	理工学術院情報生産システム研究センター	助手	周　惟廉
（連携研究者）	Graduate School of Information, Production, and Systems (IPS), Waseda University	Professor	Sei-ichiro Kamata

研究成果概要

I published two papers in international conferences - the 26th International Conference on Pattern Recognition (ICPR) and the International Conference on Image Processing (ICIP) in 2022.

The first paper [1], presented a novel approach to designing a unified spectral-spatial Transformer for hyperspectral image classification. Specifically, I proposed a cascaded integration of the spectral vision Transformer with the spatial pyramid vision Transformer, along with a cross-scale fusion module. Moreover, I introduced a local-global encoder in the spatial domain, which validates the effectiveness of incorporating local features into the Transformer model. Overall, my paper contributed to the advancement and practicality of using a pure vision Transformer-based model for hyperspectral image classification.

The second paper [2] proposed a new approach for addressing hyperspectral image classification by leveraging the 3D configuration of a vision Transformer, which enabled simultaneous correlation of spectral and spatial features. To this end, I introduced a novel 3D coordinate positional embedding method that distinguished the relative distances among all hyper-cubes resulting from the 3D partition operation. I also designed a local-global feature combination approach that seamlessly integrates with the 3D configuration of the vision Transformer. Furthermore, we presented our research at two conferences and received positive feedback.