To ensure more effectiveness in the learning process in
educational institutions, categorization of students is a very
interesting method to enhance student’s learning capabilities by
identifying the factors that affect their performance and use their
categories to design targeted inventions for improving their
quality. Many research works have been conducted on student
performances, to improve their grades and to stop them from
dropping out from school by using a data driven approach [1] [2].
In this paper, we have proposed a new model to categorize
students into 3 categories to determine their learning capabilities
and to help them to improve their studying techniques. We have
chosen the state of the art of machine learning approach to
classify student’s nature of study by selecting prominent features
of their activity in their academic field. We have chosen a data
driven approach where key factors that determines the base of
student and classify them into high, medium and low ranks. This
process generates a system where we can clearly identify the
crucial factors for which they are categorized. Manual
construction of student labels is a difficult approach. Therefore,
we have come up with a student categorization model on the basis
of selected features which are determined by the preprocessing of
Dataset and implementation of Random Forest Importance; Chi2
algorithm; and Artificial Neural Network algorithm. For the
research we have used Python’s Machine Learning libraries:
Scikit-Learn [3]. For Deep Learning paradigm we have used
Tensor-Flow, Keras. For data processing Pandas library and
Matplotlib and Pyplot has been used for graph visualization
purpose.