Repository logo
Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Multiple-Hand 2D Pose Estimation From a Monocular RGB Image

dc.contributor.authorMishra P.; Sarawadekar K.
dc.date.accessioned2025-05-23T11:13:11Z
dc.description.abstractDeep learning models and algorithms facilitate relatively easier ways of hand pose estimation from monocular RGB images compared to traditional approaches. Despite this, a majority of available algorithms use multiple-stage models to perform hand pose estimation. Moreover, the single-stage methods are mainly limited to a single hand and it is difficult for them to scale to multiple hands. To this end, we propose an approach that takes the features of the saliency map extracted for hand region of interest (ROI) localization. An integrated network uses these features for pose estimation. This arrangement of layers forms an end-to-end pipeline that allows simultaneous pose estimation for multiple hands. The model is designed to run on multiple cores of CPU/GPU to independently perform inference for each detected hands'pose making possible faster inference and hence suitable for real-time applications. In addition, a new approach using grid-based design to estimate hand-keypoints position with high precision is also proposed. Both the proposed designs are validated on multiple datasets to prove their feasibility and effectiveness. The probability of the correct keypoint (PCK) value at threshold value of 0.2 is above 95% on the test sets from Interhand dataset and Rendered HandPose Dataset (RHD). © 2013 IEEE.
dc.identifier.doihttps://doi.org/10.1109/ACCESS.2024.3376426
dc.identifier.urihttp://172.23.0.11:4000/handle/123456789/5528
dc.relation.ispartofseriesIEEE Access
dc.titleMultiple-Hand 2D Pose Estimation From a Monocular RGB Image

Files

Collections