Solution of class imbalance of k-nearest neighbor for data of new student admission selection

Siti Mutrofin(1*), Ainul Mu'alif(2), Raden Venantius Hari Ginardi(3), Chastine Fatichah(4),

(1) Universitas Pesantren Tinggi Darul Ulum, Jombang
(2) Universitas Pesantren Tinggi Darul Ulum, Jombang
(3) Institut Teknologi Sepuluh Nopember, Surabaya
(4) Institut Teknologi Sepuluh Nopember, Surabaya
(*) Corresponding Author


The objective of this research is to correct the inconsistencies associated with the response differences by each examiner with respect to the assessment of each hafiz candidate. To carry out this research, 259 students were selected within a week using 4testers. However, the examiners are also tasked with another essential mandate which must be immediately fulfilled asides testing candidates for hafiz. In order to overcome this problem, the Educational Data Mining (EDM) system is applied during classification. The problems associated with the use of this technique however, is the limited number of attributes and the imbalance data class. This study was proposed to apply the kNN (k-Nearest Neighbor) technique. The results obtained indicates that kNN can provide recommendations to testers who are students and it is suitable for the solving the problem associated with class imbalance as indicated by the application of Shuffled and Stratified sampling techniques which has values of accuracy, precision, recall and AUC > 0.8%.


class balance; class imbalance; EDM; kNN; tahfiz

Full Text:


Article Metrics

Abstract view : 1042 times
PDF - 493 times


A. B. Hakim and V. Ramdhani, "Perancangan dan Pengembangan Prototipe Aplikasi Mobile Untuk Lembaga Penghafal Quran Berbasis Android Menggunakan Metode Rapid Application Development," I-STATEMENT, vol. 3, no. 2, pp. 74-88, 2017.

R. Wulandari, "Rancang Bangun Sistem Informasi Monitoring dan Evaluasi Hafalan Al-Qur'an Program Beasiswa Santri Berprestasi (PBSB) Berbasis Web Pada Universitas Islam Negeri (UIN) Maulana Malik Ibrahim Malang dengan Metode Extreme Programming (XP)," UIN Maulana Malik Ibrahim, Malang, 2018.

D. Iskandar, S. D. Budiwati and R. Budiawan, "Aplikasi Penilaian dan Presensi Siswa untuk Kegiatan Pembelajaran Akademik (Studi Kasus : SD Ar-Rafi’)," in e-Proceeding of Applied Science, 2017.

A. T. R. Saragih, A. S. Sembiring and M. Sayuthi, "Penerapan Metode Clustering K-Means untuk Proses Seleksi Calon Peserta Lomba MTQ," Jurnal Pelita Informatika, vol. 17, no. 2, pp. 117-122, 2018.

D. Thammasiri, D. Delen, P. Meesad and N. Kasap, "A critical assessment of imbalanced class distribution problem: The case of predicting freshmen student attrition," Expert Systems with Applications, vol. 41, no. 2, pp. 321-330, 2014.

I. Brown and C. Mues, "An experimental comparison of classification algorithms for imbalanced credit scoring data sets," Expert Systems with Applications, vol. 39, no. 3, p. 3446–3453, 2012.

S. Mutrofin, A. Izzah, A. Kurniawardhani and M. Masrur, "Optimasi teknik klasifikasi modified k nearest neighbor menggunakan algoritma genetika," Jurnal Gamma, vol. 10, no. 1, 2015.

D. T. Larose and C. D. Larose, Discovering knowledge in data: an introduction to data mining, 2nd ed., Hoboken: John Wiley & Sons, 2014.

R. S. Wahono, N. S. Herman and S. Ahmad, "A comparison framework of classification models for software defect prediction," Advanced Science Letters, vol. 20, no. 10-12, pp. 1945-1950, 2014.


Copyright (c) 2019 International Journal of Artificial Intelligence Research


International Journal Of Artificial Intelligence Research

Organized by: Departemen Teknik Informatika STMIK Dharma Wacana
Published by: STMIK Dharma Wacana
Jl. Kenanga No.03 Mulyojati 16C Metro Barat Kota Metro Lampung
phone. +62725-7850671
Fax. +62725-7850671
Email: | 

View IJAIR Statcounter

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.