Temporal Pose Analysis for False Positive Reduction in Proactive Video Surveillance

Marko M. Živanović¹ and Milica M. Živanović²

¹ Faculty of Information Technology, Belgrade Metropolitan University, Tadeuša Košćuška 63, Belgrade, 11000, Serbia

² Faculty of Organizational Sciences, University of Belgrade, Jove Ilića 154, Belgrade, 11000, Serbia

marko.zivanovic@metropolitan.ac.rs

ABSTRACT: Proactive video surveillance demands efficient and accurate models for real-time analysis, especially in the context of protecting vulnerable groups such as the elderly and children in public spaces. This paper presents a modular system for fall detection, a critical component of such surveillance that enables rapid emergency response. The system utilizes the YOLO11x-pose model to perform human pose estimation, processing video from diverse sources (local files, webcams, RTSP streams) to identify 17 skeletal keypoints. Its core innovation lies in detecting a fall as a dynamic transition from a standing or sitting posture to a lying state, which significantly reduces false alarms compared to static pose analysis. The methodology employs a PoseTracker for multi-person tracking and a PoseAnalyzer that classifies posture based on biomechanical parameters (e.g., torso angle, knee angle, bounding box aspect ratio). The system generates dual outputs: a visually annotated video for human review and structured JSON data for real-time integration with external alarm systems (e.g., VMS). This configurable and robust solution provides a practical foundation for automated safety monitoring.
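The transition-based idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' PoseAnalyzer or PoseTracker: the function names, the `FallDetector` class, the thresholds (60 degrees, aspect ratio 1.0) and the window sizes are hypothetical choices made for illustration. The sketch assumes COCO-ordered keypoints (17 points, shoulders at indices 5 and 6, hips at 11 and 12), merges standing and sitting into a single "upright" label, and omits the knee angle.

```python
import math
from collections import deque

# Assumed COCO keypoint order (17 points): shoulders 5/6, hips 11/12.
L_SHOULDER, R_SHOULDER, L_HIP, R_HIP = 5, 6, 11, 12


def torso_angle_deg(kpts):
    """Angle in degrees between the hip-to-shoulder line and the vertical axis."""
    sx = (kpts[L_SHOULDER][0] + kpts[R_SHOULDER][0]) / 2
    sy = (kpts[L_SHOULDER][1] + kpts[R_SHOULDER][1]) / 2
    hx = (kpts[L_HIP][0] + kpts[R_HIP][0]) / 2
    hy = (kpts[L_HIP][1] + kpts[R_HIP][1]) / 2
    # 0 degrees for a vertical torso, 90 degrees for a horizontal one.
    return math.degrees(math.atan2(abs(sx - hx), abs(sy - hy)))


def classify_posture(kpts, bbox, angle_thresh=60.0, ratio_thresh=1.0):
    """Label one detection 'lying' or 'upright' (thresholds are illustrative)."""
    x1, y1, x2, y2 = bbox
    height = max(y2 - y1, 1e-6)  # guard against a degenerate box
    aspect = (x2 - x1) / height  # width / height
    if torso_angle_deg(kpts) > angle_thresh and aspect > ratio_thresh:
        return "lying"
    return "upright"


class FallDetector:
    """Flags a fall only when a track goes from 'upright' to sustained 'lying'."""

    def __init__(self, window=30, min_lying=5):
        self.window = window        # frames of history kept per track
        self.min_lying = min_lying  # consecutive lying frames required
        self.history = {}           # track_id -> deque of posture labels

    def update(self, track_id, posture):
        hist = self.history.setdefault(track_id, deque(maxlen=self.window))
        hist.append(posture)
        recent = list(hist)
        if len(recent) <= self.min_lying:
            return False
        tail = recent[-self.min_lying:]
        earlier = recent[:-self.min_lying]
        # The flag stays raised while the upright frames remain in the window.
        return all(p == "lying" for p in tail) and "upright" in earlier
```

Under these assumptions, a person who is already lying when first tracked (for example, resting on a bench) never raises a flag, whereas a purely static check on the "lying" label would. A track that was upright and then stays in the lying state for `min_lying` consecutive frames does raise one.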

KEYWORDS: Pose Estimation, Fall Detection, YOLO11x-pose, Real-Time Tracking, Proactive Surveillance, Public Safety.

ACKNOWLEDGMENT: The authors thank Metropolitan University for providing a stimulating research environment and financial support. They are particularly grateful for the registration fee waiver, which directly enabled the publication and presentation of the results of this research.

SOURCE: Proceedings of the 16th International Conference on Business Information Security BISEC’2025

milicazivanovic2411@gmail.com

DOI:10.46793/BISEC25.257Z
