StNet: Local and global spatial-temporal modeling for action recognition. Dongliang He, Zhichao Zhou, et al. AAAI 2019.
Purely Attention Based Local Feature Integration for Video Classification. Xiang Long, Gerard De Melo, et al. IEEE TPAMI, 2022.