Optimization of Support Vector Machine Using SMOTE and Grid Search for Kidney Health Data Classification
DOI:
https://doi.org/10.35746/jtim.v8i2.993Keywords:
Support Vector Machine, SMOTE, Grid Search, imbalanced data, machine learningAbstract
Kidney disease is a highly prevalent health problem that can seriously impact the quality of life of those affected. To improve diagnostic accuracy, machine learning methods are widely used to classify patient data. Class imbalance (imbalanced data) is one of the problems that often occurs in the classification process and can affect the performance of machine learning models, especially in detecting minority classes. This study aims to improve the performance of the Support Vector Machine (SVM) algorithm by applying the SMOTE (Synthetic Minority Over-sampling Tech-nique) and Grid Search methods in the data classification process. SMOTE is used to balance the class distribution by adding synthetic data to the minority class, while Grid Search is used to ob-tain optimal model parameters. The results show that the SVM model without handling data im-balance produces relatively low performance with an accuracy value of 51%, precision 17%, re-call 33%, and F1-score 23%. After applying the SMOTE method, the model performance increases significantly to 81% accuracy, 81% precision, 80% recall, and 81% F1-score. Furthermore, the ap-plication of Grid Search to the SVM + SMOTE model provides the best results with an accuracy of 84%, precision 82%, recall 81%, and F1-score 81% with an AUC value of 0,92. The findings of this study indicate that the combination of SMOTE and Grid Search is effective in improving the per-formance of the SVM algorithm in data classification. The novelty of this study demonstrates that data imbalance management and hyperparameter optimization play a crucial role in producing more accurate and optimal classification models.
Downloads
References
S. Pal, “A deep analysis of chronic kidney disease for early detection using machine learning classifiers,” Int. J. Med. Eng. Inform., vol. 17, no. 3, pp. 279–291, 2025, https://doi.org/10.1504/IJMEI.2025.145849.
D. A. Angggoro and S. S. Mukti, “Performance Comparison of Grid Search and Random Search Methods for Hy-perparameter Tuning in Extreme Gradient Boosting Algorithm to Predict Chronic Kidney Failure,” International Journal of Intelligent Engineering and Systems, vol. 14, no. 6, pp. 198–207, Dec. 2021, https://doi.org/10.22266/ijies2021.1231.19.
D. A. Ajalkar, J. Y. Deshmukh, M. V. Shelke, S. V. Wankhade, and S. K. Patil, “Detection of chronic kidney disease based on ensemble approach with optimal feature selection using machine learning,” IAES International Journal of Artificial Intelligence (IJ-AI), vol. 14, no. 5, p. 4017, Oct. 2025, https://doi.org/10.11591/ijai.v14.i5.pp4017-4031.
M. Chitra, A. K. Parveen, M. Elavarasi, J. Sangeetha, and R. Vaittilingame, “Chronic kidney disease prediction model using machine learning approach,” International Journal of Informatics and Communication Technology (IJ-ICT), vol. 12, no. 2, p. 162, Aug. 2023, https://doi.org/10.11591/ijict.v12i2.pp162-170.
A. A. Sutariya and D. B. Rathod, “Early Phase, Multi Diseases Detection, Using AI & Intelligent Hybrid Su-pervised Machine Learning Classifier Model,” International Journal of Electronics and Communication Engineering, vol. 11, no. 11, pp. 45–53, Nov. 2024, https://doi.org/10.14445/23488549/IJECE-V11I11P105.
G. U. Nneji et al., “FFS-IML: fusion-based statistical feature selection for machine learning-driven interpretability of chronic kidney disease,” International Journal of Machine Learning and Cybernetics, vol. 16, no. 9, pp. 6215–6248, Sep. 2025, https://doi.org/10.1007/s13042-025-02621-0.
B. Dhanwanth, B. Vivek, M. Abirami, S. M. Waseem, and C. Manikantaa, “Forecasting Chronic Kidney Disease Using Ensemble Machine Learning Technique,” International Journal on Recent and Innovation Trends in Compu-ting and Communication, vol. 11, no. 5s, pp. 336–344, Jun. 2023, https://doi.org/10.17762/ijritcc.v11i5s.7035.
A. P. Anqui, “Classifying chronic kidney disease using selected machine learning techniques,” International Jour-nal of Advanced And Applied Sciences, vol. 12, no. 2, pp. 72–79, Feb. 2025, https://doi.org/10.21833/ijaas.2025.02.008.
D. A. Otchere, T. O. Arbi Ganat, R. Gholami, and S. Ridha, “Application of supervised machine learning para-digms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models,” J. Pet. Sci. Eng., vol. 200, p. 108182, May 2021, https://doi.org/10.1016/j.petrol.2020.108182.
Q. Shi and H. Zhang, “Fault Diagnosis of an Autonomous Vehicle With an Improved SVM Algorithm Subject to Unbalanced Datasets,” IEEE Transactions on Industrial Electronics, vol. 68, no. 7, pp. 6248–6256, Jul. 2021, https://doi.org/10.1109/TIE.2020.2994868.
C. C. Colmenares-Mejía et al., “Multivariable prediction model of complications derived from diabetes mellitus using machine learning on scarce highly unbalanced data,” Int. J. Diabetes Dev. Ctries., vol. 44, no. 3, pp. 528–538, Sep. 2024, https://doi.org/10.1007/s13410-023-01264-7.
C.-A. Tsai and Y.-J. Chang, “Efficient Selection of Gaussian Kernel SVM Parameters for Imbalanced Data,” Genes (Basel)., vol. 14, no. 3, p. 583, Feb. 2023, https://doi.org/10.3390/genes14030583.
J.-B. Wang, C.-A. Zou, and G.-H. Fu, “AWSMOTE: An SVM-Based Adaptive Weighted SMOTE for Class-Imbalance Learning,” Sci. Program., vol. 2021, pp. 1–18, May 2021, https://doi.org/10.1155/2021/9947621.
W. Cai, M. Cai, Q. Li, and Q. Liu, “Three-way imbalanced learning based on fuzzy twin SVM,” Appl. Soft Comput., vol. 150, p. 111066, Jan. 2024, https://doi.org/10.1016/j.asoc.2023.111066.
F. Kausar and B. Ramamurthy, “Machine Learning Based Optimal Feature Selection for Pediatric Ultrasound Kidney Images Using Binary Coati Optimization,” International Journal of Intelligent Engineering and Systems, vol. 17, no. 6, pp. 1300–1313, Dec. 2024, https://doi.org/10.22266/ijies2024.1231.94.
B. Yamini, T. Saraswathi, P. Radhakrishnan, M. Nalini, M. Shanmuganathan and Siva Subramanian. R, “Ma-chine learning algorithms for predicting of chronic kidney disease and its significance in healthcare,” Interna-tional Journal of Advanced Technology and Engineering Exploration, vol. 11, no. 112, Mar. 2024, https://doi.org/10.19101/IJATEE.2023.10101788.
A. Syarif, O. D. Riana, D. A. Shofiana, and A. Junaidi, “A Comprehensive Comparative Study of Machine Learn-ing Methods for Chronic Kidney Disease Classification: Decision Tree, Support Vector Machine, and Naive Bayes,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 10, 2023, https://doi.org/10.14569/IJACSA.2023.0141063.
B. K. Swain, S. Mohapatra, M. Mishra, and R. Sharma, “A unified approach for Parkinson’s disease recognition: imbalance mitigation and grid search optimized boosting with LightGBM,” Med. Biol. Eng. Comput., vol. 62, no. 11, pp. 3471–3491, Nov. 2024, https://doi.org/10.1007/s11517-024-03139-3.
S. EL Ferouali, Z. Elamrani Abou Elassad, S. Qassimi, and A. Abdali, “From Baseline to Best Practice: An Ad-vanced Feature Selection, Feature Resampling and Grid Search Techniques to Improve Injury Severity Predic-tion,” Applied Artificial Intelligence, vol. 39, no. 1, Dec. 2025, https://doi.org/10.1080/08839514.2025.2452675.
D. Santhadevi and B. Janet, “SDB-RGSO: Swarm-Based Data Balancing and Randomized Grid Search Optimiza-tion for IoT NetFlow Malware Detection with Ensemble Machine Learning Model,” 2023, pp. 615–631. https://doi.org/10.1007/978-981-99-6550-2_46.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Muhammad Maulana, Zulkipli Zulkipli, Tanwir Tanwir, Dading Oktaviadi Resmiranta, Naufal Hanif, Raisul Azhar

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.




