Journal of System Simulation ›› 2026, Vol. 38 ›› Issue (1): 112-124.doi: 10.16182/j.issn1004731x.joss.25-0824

• Papers • Previous Articles     Next Articles

Spatio-temporal Swin Transformer-based Flow-solid Coupling Interaction Sequence Image Prediction Network

Zou Changjun, Ge Zhiyu, Zhong Chenxi   

  1. School of Information and Software Engineering, East China Jiaotong University, Nanchang 330013, China
  • Received:2025-09-01 Revised:2025-10-15 Online:2026-01-18 Published:2026-01-28

Abstract:

To address limitations in modeling long-term dependencies and multi-scale features in fluid-structure interaction scenarios, a spatiotemporal deep learning model (SwinLSTM) integrating ConvLSTM and Swin Transformer is proposed. The model employs a gated spatiotemporal attention mechanism that dynamically embeds Swin Transformer's window-based multi-head self-attention into ConvLSTM's output gate, enabling adaptive temporal-spatial feature coupling, and designs a multi-level ConvLSTM framework to hierarchically capture complex spatiotemporal correlations. Experiments on a self-built fluid-interaction dataset show that our method achieves the highest PSNR and leading SSIM scores, with superior performance in preserving vortex details and boundary consistency. This work provides an efficient solution for fluid dynamics prediction in interactive scenarios.

Key words: deep learning, computational fluid dynamics, image prediction, ConvLSTM, Swin Transformer

CLC Number: