Dynamic Segment Aggregation Network for Video-Level Representation Learning
Overview

We release the code of DSANet (Dynamic Segment Aggregation Network). We introduce the DSA module to capture relationships among snippets for video-level representation learning. Equipped with DSA modules, the top-1 accuracy of I3D ResNet-50 improves to 78.2% on Kinetics-400. The core implementation of the Dynamic Segment Aggregation module is in codes/models/modules_maker/DSA.py.

[July 7, 2021] We release the core code of DSANet.
[July 3, 2021] DSANet has been accepted by ACMMM 2021.

Prerequisites

All dependencies can be installed […]