Full Paper View


A Comprehensive Survey on Machine Learning

Astha Singh, Pawan Singh, Anil Kr Tiwari
Review Paper | Journal Paper
Volume 1 , Issue 1 , PP 1-17


The objective of this briefing is to present an overview of the topic, machine learning techniques currently in use or in consideration at statistical agencies worldwide. It is important to know the main reason why real-world scenario should start exploring the use of machine learning techniques, terminology, approach and about few popular libraries in python, what regression is, by completely throwing light on simple as well as multiple linear and non-linear regression models and their applications, classification techniques, various clustering techniques. The material presented in this paper is the result of a study based on different models and the study of various datasets (analysis and choice of the correct model are important). While Machine Learning involves concepts of automation, it requires human guidance. Machine Learning involves a high level of generalization to get a system that performs well on yet-unseen data instances. Topics like regression, classification and clustering, the report cover the insight of various techniques and their applications.

Key-Words / Index Term

Machine Learning, Regression, Classification, Clustering


[1] S.A. Macskassy and F. Provost. Classification in networked data: A toolkit and a univariate case study, The Journal of Machine Learning Research, 2007, vol.8, pp. 935–983
[2] Understanding deep learning requires rethinking generalization (2017), C. Zhang et al.
[3] Subhashree Subudhu, Ram Narayan Patro, Pradyut Kumar Biswal. “Superpixel Clustering Based Segmentation Algorithm for Hyperspectral Image Classification”, 2019 International Conference on Information Technology (ICIT), 2019
[4] https://towardsdatascience.com/introduction-to-machine-learning-algorithms-linear-regression-14c4e325882a
[5] Programming in Python-3, A complete introduction to the Python language by Mark Summerfield
[6] Python for Data Analysis by Wes McKinney
[7] Artificial Intelligence by M. Trivedi and J. Kumar
[8] Machine Learning by IBM
[9] T. Joachims. Transductive inference for text classification using support vector machines. In Proceedings of the International Conference on Machine Learning (ICML’99), 1999, pp. 200–209.
[10] Meng XF. Big data management: concept, technology and challenge. [J]. Computer research and development, 2013,50 (1): 146-169.
[11] Arel I, Rose DC, Karnowski T P.Deep machine learning-A new frontier in artificial intelligence esearch[J].Computational intelligence Mag-azine,IEEE,2010,5(4):13-18
[12] Low Y, Gonzalez J, Kyrola A, Bickson D, Guestrin C, Hellerstein JM. Graphlab: A new framework for parallel machine learning. ar Xiv preprint ar Xiv:1006.4990. 2010.
[13] Balraj Singh, Parveen Sihag, Siraj Muhammed Pandhiani, Sourav Debnath, Saurabh Gautam. "Estimation of permeability of soil using easy measured soil parameters: assessing the artificial intelligence-based models”, ISH Journal of Hydraulic Engineering, 2019
[14] Abhishek Sharma, Prateek Agrawal, Vishu Madaan, Shubham Goyal. "Prediction on diabetes patient's hospital readmission rates”, Proceedings of the Third International Conference on Advanced Informatics for Computing Research - ICAICR '19, 2019.