Full Paper View


Media Control Using Hand Gestures

Mrigank Singh, Sheenu Rizvi
Research Paper | Journal Paper
Volume 2 , Issue 1 , PP 1-11
DOI: https://doi.org/10.54060/JIEEE/002.01.005


The corporate world today basically relies on presentations of ideas and statistics. In the board room, the presenters are highly conscious of depicting confidence in their presentation. This would entail accessibility and mobility to the presenter or media viewer. As the extent of Artificial Intelligence is increasing in all directions, I am utilizing its extreme capabilities to create a software that would help in accessibility and save time and money. This paper describes a software written in Python 3.8 and makes use of Python libraries like OpenCV and PyAutoGUI to receive input from the computer’s Webcam and recognize gestures to control the PowerPoint Presentation and Portable Document Format (PDF) files or the Media Control. The user interface is built with the Python library PyQt5. This paper aims is to help people control their Presentations and Portable Document Format (PDF) files, and many other media through their hand gestures, without using a mouse or any other pointing device. The software would not require any other external hardware; hence it would not burn a hole in people’s pockets.

Key-Words / Index Term

Artificial Intelligence, Deep Learning, Computer Vision, OpenCV, Python


  1. Aamir, Muhammad; Irfan, Muhammad; Ali, Tariq; Ali, Ghulam; Shaf, Ahmad; S, Alqahtani Saeed; Al-Beshri, Ali; Alasbali, Tariq; Mahnashi, Mater H. (2020). An Adoptive Threshold-Based Multi-Level Deep Convolutional Neural Network for Glaucoma Eye Disease Detection and Classification. Diagnostics, 10(8), 602–. doi:10.3390/diagnostics10080602 
  2. Nayan N. Shende; S.P. SyedIbrahim . (2019) Layout detection using computer vision. doi:10.1504/IJCCIA.2019.103752
  3. Bertling, Joy (2013). Exercising the Ecological Imagination: Representing the Future of Place. Art Education, 66(1), 33–39. doi:10.1080/00043125.2013.11519206 
  4. Tsutomu Kawasaki (1970). Theory of chromatography of macromolecules with rigid structures on hydroxyapatite columns. II. Dynamic part. , 9(3), 291–306.doi:10.1002/bip.1970.360090306 
  5. Jones, Nathaniel M.; Paschos, Georgios S.; Shrader, Brooke; Modiano, Eytan (2014). [ACM Press the 15th ACM international symposium - Philadelphia, Pennsylvania, USA (2014.08.11-2014.08.14)] Proceedings of the 15th ACM international symposium on Mobile ad hoc networking and computing - MobiHoc '14 - An overlay architecture for throughput optimal multipath routing. , (), 73–82. doi:10.1145/2632951.2632957 
  6. Allan M. Anderson (2008). A framework for NPD management: doing the right things, doing them right, and measuring the results. , 19(11), 0–561. doi:10.1016/j.tifs.2008.01.015 
  7. McInnis, Schillaci; Agrawal, Ankur (2018). [IEEE 2018 International Conference on Advances in Computing and Communication Engineering (ICACCE) - Paris, France (2018.6.22-2018.6.23)] 2018 International Conference on Advances in Computing and Communication Engineering (ICACCE) - Web-based Visualization and Navigation of the Content of SNOMED CT.,348–353. doi:10.1109/ICACCE.2018.8441707 
  8. Lambani, Matodzi Nancy; Nengome, Zachariah (2017). Group Work Impact on Academic Communication: Female English Student Teachers’ Views. International Journal of Educational Sciences, 18(1-3), 101–109. doi:10.1080/09751122.2017.1317168 
  9. Papageorgiou, C.P.; Oren, M.; Poggio, T. (1998). [Narosa Publishing House IEEE 6th International Conference on Computer Vision - Bombay, India (4-7 Jan. 1998)] Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271) - A general framework for object detection. , (), 555–562. doi:10.1109/ICCV.1998.710772 
  10. Guobo Xie and Wen Lu, "Image Edge Detection Based On Opencv," International Journal of Electronics and Electrical Engineering, Vol. 1, No. 2, pp. 104-106, June 2013. doi: 10.12720/ijeee.1.2.104-106
  11. Rupali, M., & Amit, P. (2017). A Review Paper on General Concepts of “Artificial Intelligence and Machine Learning.” IARJSET, 4(4), 79–82. https://doi.org/10.17148/iarjset/nciarcse.2017.22
  12. Wiley, Victor & Lucas, Thomas. (2018). Computer Vision and Image Processing: A Paper Review. International Journal of Artificial Intelligence Research. 2. 22. 10.29099/ijair.v2i1.42.
  13. Adusumalli, Harish & Kalyani, D. & Sri, R.Krishna & Pratapteja, M. & Rao, P. (2021). Face Mask Detection Using OpenCV. 1304-1309. 10.1109/ICICV50876.2021.9388375.