To enhance land use/cover change (LUCC) simulation accuracy, we introduced ViViT-ANN-CA, blending video vision transformer’s spatio-temporal features extraction ability, artificial neural network‘s (ANN) non-linearity computing ability, and CA’s spatial computing. Compared to 3DCNN-ANN-CA, ViViT-ANN-CA showed higher accuracy in simulating water bodies and vegetation, with overall improvements in Hailing District and Wuxi City. ViViT demonstrates comparable spatio-temporal feature extraction ability to three-dimensional convolutional neural network (3DCNN), promising for future ynamic LUCC simulations.
Funding
This work was supported by the National Natural Science Foundation of China [42171088]; State Key Laboratory of Earth Surface Processes and Resource Ecology [2022-ZD-04].