Implementation of Support Vector Machine Algorithm for Classification of Study Period and Graduation Predicate of Students

  • Sumiyatun Universitas Teknologi Digital Indonesia
  • Yagus Cahyadi Universitas Teknologi Digital Indonesia
  • Edi Faizal Universitas Teknologi Digital Indonesia

Keywords: Educational Data, Graduation Predicates Classification, Machine Learning, Study Duration, SVM Algorithm

Abstract

Introduction: Accurately predicting the duration of study and graduation predicates in higher education is essential for improving academic outcomes and decision-making. This study aims to classify students’ study period and graduation predicates in the Information Systems program at UTDI using the Support Vector Machine (SVM) algorithm. Methods: A dataset of 500 student records containing academic and demographic variables—including GPA, age, semesters, and graduation predicates—was processed through data cleaning, normalization, and feature selection. Study duration was categorized into three classes: short (≤4 years), medium (4–6 years), and long (>6 years). An SVM with a linear kernel was applied, and the model was evaluated using accuracy, precision, recall, and F1-score. Results: The SVM model achieved perfect classification for study duration, with 100% accuracy, precision, recall, and F1-score across all categories. For graduation predicate classification, the model attained 95.18% accuracy. While it performed well overall, it faced some difficulty distinguishing between "Cum Laude" and "Very Satisfactory" due to overlapping GPA ranges. The analysis identified GPA as the most influential feature in both classification tasks, while age and the number of semesters played supporting roles. Conclusions: The SVM model demonstrates strong capability in classifying study duration and graduation predicates, offering valuable insights for academic management. Although performance was high, especially for study period prediction, further refinement is suggested to enhance classification in overlapping categories. Future work may benefit from larger, more balanced datasets and exploration of advanced models to increase prediction reliability.

Downloads

Download data is not yet available.

References

R. A. Raj, “Classification and Prediction of Incipient Faults in Transformer Oil by Supervised Machine Learning using Decision Tree,” 2023 3rd International Conference on Artificial Intelligence and Signal Processing, AISP 2023. 2023, doi: 10.1109/AISP57993.2023.10134566.

A. Singh and V. Gutte, “Classification of Breast Tumor Using Ensemble Learning,” in Mobile Computing and Sustainable Informatics, 2022, pp. 491–507.

L. S. Van Velzen, “Classification of suicidal thoughts and behaviour in children: results from penalised logistic regression analyses in the Adolescent Brain Cognitive Development study,” Br. J. Psychiatry, vol. 220, no. 4, pp. 210–218, 2022, doi: 10.1192/bjp.2022.7.

R. Hazra, “Machine Learning for Breast Cancer Classification with ANN and Decision Tree,” 11th Annual IEEE Information Technology, Electronics and Mobile Communication Conference, IEMCON 2020. pp. 522–527, 2020, doi: 10.1109/IEMCON51383.2020.9284936.

Y. Wang, “Rice diseases detection and classification using attention based neural network and bayesian optimization,” Expert Syst. Appl., vol. 178, 2021, doi: 10.1016/j.eswa.2021.114770.

B. H. Reddy, “Classification of Fire and Smoke Images using Decision Tree Algorithm in Comparison with Logistic Regression to Measure Accuracy, Precision, Recall, F-score,” 14th International Conference on Mathematics, Actuarial Science, Computer Science and Statistics, MACS 2022. 2022, doi: 10.1109/MACS56771.2022.10022449.

S. Markkandan, “SVM-based compliance discrepancies detection using remote sensing for organic farms,” Arab. J. Geosci., vol. 14, no. 14, 2021, doi: 10.1007/s12517-021-07700-4.

Q. Tang, “Classification of Complex Power Quality Disturbances Using Optimized S-Transform and Kernel SVM,” IEEE Trans. Ind. Electron., vol. 67, no. 11, pp. 9715–9723, 2020, doi: 10.1109/TIE.2019.2952823.

P. Purnawansyah, A. P. Wibawa, and ..., “Comparative Study of Herbal Leaves Classification using Hybrid of GLCM-SVM and GLCM-CNN,” Ilk. J. …, 2023, doi: 10.33096/ilkom.v15i2.1759.382-389.

W. Wu, “An Intelligent Diagnosis Method of Brain MRI Tumor Segmentation Using Deep Convolutional Neural Network and SVM Algorithm,” Comput. Math. Methods Med., vol. 2020, 2020, doi: 10.1155/2020/6789306.

P. Sharma, “Performance analysis of deep learning CNN models for disease detection in plants using image segmentation,” Inf. Process. Agric., vol. 7, no. 4, pp. 566–574, 2020, doi: 10.1016/j.inpa.2019.11.001.

A. A. Ewees, “Performance analysis of Chaotic Multi-Verse Harris Hawks Optimization: A case study on solving engineering problems,” Eng. Appl. Artif. Intell., vol. 88, 2020, doi: 10.1016/j.engappai.2019.103370.

A. Tuppad and S. D. Patil, “Data Pre-processing Issues in Medical Data Classification,” 2023 Int. Conf. …, 2023, doi: 10.1109/NMITCON58196.2023.10275855.

K. N. Myint and Y. Y. Hlaing, “Predictive Analytics System for Stock Data: methodology, data pre-processing and case studies,” 2023 IEEE Conf. Comput. …, 2023 doi: 10.1109/ICCA51723.2023.10182047.

S. Buriya, “Malware Detection Using 1d Convolution with Batch Normalization and L2 Regularization for Android,” 2023 International Conference on System, Computation, Automation and Networking, ICSCAN 2023. 2023, doi: 10.1109/ICSCAN58655.2023.10395472.

L. Peng, “Dual-Structure Elements Morphological Filtering and Local Z-Score Normalization for Infrared Small Target Detection against Heavy Clouds,” Remote Sens., vol. 16, no. 13, 2024, doi: 10.3390/rs16132343.

M. Sholeh, “Comparison of Z-score, min-max, and no normalization methods using support vector machine algorithm to predict student’s timely graduation,” AIP Conference Proceedings, vol. 3077, no. 1. 2024, doi: 10.1063/5.0202505.

N. Mamat, “Enhancement of water quality index prediction using support vector machine with sensitivity analysis,” Front. Environ. Sci., vol. 10, 2023, doi: 10.3389/fenvs.2022.1061835.

X. Ning, “Classification of Sulfadimidine and Sulfapyridine in Duck Meat by Surface Enhanced Raman Spectroscopy Combined with Principal Component Analysis and Support Vector Machine,” Anal. Lett., vol. 53, no. 10, pp. 1513–1524, 2020, doi: 10.1080/00032719.2019.1710524.

M. Stusek, “Accuracy Assessment and Cross-Validation of LPWAN Propagation Models in Urban Scenarios,” IEEE Access, vol. 8, pp. 154625–154636, 2020, doi: 10.1109/ACCESS.2020.3016042.

S. Clark and R. Wicentowski, “SwatCS: Combining simple classifiers with estimated accuracy,” 2013.

P. A. Flach and M. Kull, “Precision-Recall-Gain Curves: PR Analysis Done Right.”.

R. Kiruthika, “Predictive Modeling of Poultry Growth Optimization in IoT-Connected Farms Using SVM for Sustainable Farming Practices,” Proceedings of the 2024 10th International Conference on Communication and Signal Processing, ICCSP 2024. pp. 677–682, 2024, doi: 10.1109/ICCSP60870.2024.10543463.

W. Zhuo and A. Ahmad, “HCRF: an improved random forest algorithm based on hierarchical clustering,” Indones. J. Electr. Eng. Comput. Sci., vol. 38, no. 1, p. 578, Apr. 2025, doi: 10.11591/ijeecs.v38.i1.pp578-586.

J. T. Hasić, “Breast Cancer Classification Using Support Vector Machines (SVM),” Lecture Notes in Networks and Systems, vol. 644. pp. 195–205, 2023, doi: 10.1007/978-3-031-43056-5_16.

Published
2025-03-31
How to Cite
Sumiyatun, Cahyadi, Y., & Faizal, E. (2025). Implementation of Support Vector Machine Algorithm for Classification of Study Period and Graduation Predicate of Students. Indonesian Journal of Data and Science, 6(1), 55-63. https://doi.org/10.56705/ijodas.v6i1.214