Comparison of C4.5 and Naive Bayes Algorithm Methods in Prediction of Student Graduation on Time (Case Study: Information Systems Study Program)
In tertiary institutions, students become one of the important parameters in the evaluation of study program organizers. Prediction of student graduation is a special concern to know, early identification for students is needed as an important action. Information processing to predict student graduation is by implementing data mining. The implementation of data mining can be applied if a university, especially a study program, does not yet have an early classification in achieving student graduation on time. The ITTP Information System study program is one of the study programs that does not have an early identification of student graduation on time. Determination of graduation for SI ITTP Study Program students includes GPA, TOEFL scores, and total credits. The purpose of this research is to find out which attributes have the most influence in predicting graduation of ITTP IS Study Program students. The method used in this prediction is by using the classification of the C4.5 Algorithm and Naïve Bayes. The classification is used to determine which attributes have an effect on predicting student graduation on time and to compare the two classification methods. The results obtained are the training set size 70% which has the best accuracy when compared to other training set sizes. Comparing the accuracy between the two methods, it is known that the C4.5 algorithm has good accuracy when training set size is 70% and Naïve Bayes has higher accuracy when training set size is 75%. Decision tree C4.5 interprets that the most influential attribute is the GPA as the root of the decision tree to predict student graduation on time. The research is expected to be used as a reference for the ITTP IS Study Program in formulating student graduation policies on time and as a reference for further researchers in predicting in the same field.