Repository logo
Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

AMS-CNN: Attentive multi-stream CNN for video-based crowd counting

dc.contributor.authorTripathy S.K.; Srivastava R.
dc.date.accessioned2025-05-23T11:26:32Z
dc.description.abstractIn recent years video-based crowd counting and density estimation (CCDE) have become essential for crowd analysis. Current approaches rarely exploit spatial–temporal features for CCDE, and they also usually do not consider measures to minimize the frame's background influence for obtaining crowd density maps, which has resulted in lower performance in terms of Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). Again, attention to individual feature set's response toward crowd counting is also neglected. To this end, we are motivated to design an end-to-end trainable attentive multi-stream convolutional neural network (AMS-CNN) for crowd counting. At first, a multi-stream CNN (MS-CNN) is designed to obtain crowd density maps. The MS-CNN comprises three streams to fuse deep spatial, temporal, and spatial foreground features from different cues of the crowd video dataset, like frames, the volume of frames, and foregrounds of frames. To improve the accuracy, we designed three stream-wise attention modules to generate attentive crowd density maps, and their relative average is obtained using a relative averaged attentive density-map (RAAD) layer. The relative averaged density map is concatenated with the MS-CNN output, followed by two-stage CNN blocks to get the final density map. The experiments are demonstrated on three publicly available crowd density video datasets: Mall, UCSD, and Venice. We obtained promising and better results in terms of MAE and RMSE as compared with state-of-the-art approaches. © 2021, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
dc.identifier.doihttps://doi.org/10.1007/s13735-021-00220-7
dc.identifier.urihttp://172.23.0.11:4000/handle/123456789/10416
dc.relation.ispartofseriesInternational Journal of Multimedia Information Retrieval
dc.titleAMS-CNN: Attentive multi-stream CNN for video-based crowd counting

Files

Collections