Application of Synthetic Minority Over-Sampling Technique (SMOTE) to Outlier Data for Probabilistic Neural Network (PNN)
Keywords: probabilistic neural network, artificial neural network, Gaussian KDE Function, synthetic minority over-sampling technique
Abstract
One common model of Artificial Neural Network (ANN) used in classification tasks is the Probabilistic Neural Network (PNN). PNN is an algorithm that utilizes probability functions, eliminating the necessity for a large dataset during its development process. In this research, the best model parameters were initially determined using the sigma parameter and Kernel Density Estimation (KDE) function on a randomly sampled dataset employing the Stratified Random Sampling (SRS) method. The optimal sigma parameter obtained from this process is 0.075, with a Gaussian KDE function. The data used in this study is related to direct marketing campaigns (phone calls) from Portuguese banking institutions collected by S ́ergio. Subsequently, PNN is applied to this dataset to determine its Accuracy and F1-Score values. The results indicate an accuracy rate of 87.117% and an F1-Score of 92.755%. Following this, Synthetic Minority Over-Sampling Technique (SMOTE) is applied to the dataset to balance the data. PNN is then implemented on the oversampled data, and in this phase, an evaluation of the Accuracy and F1-Score values is conducted, resulting in respective figures of 93.437% and 93.511%.
Downloads
References
Adyati, R. D., Nasution, Y. N., & Wahyuningsih, S. (2019). KLASIFIKASI PROBABILISTIC NEURAL NETWORK (PNN) PADA DATA DIAGNOSA PENYAKIT DEMAM BERDARAH DENGUE (DBD) TAHUN 2018. Prosiding Seminar Nasional Matematika Dan Statistika, 1, pp. 15-21. Retrieved from https://jurnal.fmipa.unmul.ac.id/index.php/SNMSA/article/view/521
Arifiyanti, A. A., & Wahyuni, E. d. (2020). SMOTE: METODE PENYEIMBANG KELAS PADA KLASIFIKASI DATA MINING. Jurnal Teknologi Informasi dan Komunikasi, 34-39.
Azizah, P. D. (2021). Penerapan Probabilistic Neural Network pada Klasifikasi Berat Bayi. Journal Riset Statistika, 1(2), 152-159. doi:doi:10.29313/jrs.v1i2.524.
Chawla, N. V., Bowyer, K., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research, 16(1), 321-357.
Er, O. (2011). An approach based on probabilistic neural network for diagnosis of Mesothelioma’s disease. Computers & Electrical Engineering, 75-81.
Erlin, E., Desnelita, Y., Nasution, N., Suryati, L., & Zoromi, F. (2022). Dampak SMOTE terhadap Kinerja Random Forest Classifier berdasarkan Data Tidak seimbang. Jurnal Manajemen, Teknik Informatika Dan Rekayasa Komputer, 21 (3), 677-690. Retrieved from https://doi.org/https://doi.org/10.30812/matrik.v21i3.1726
Mohebali, B. T.-B. (2020). Probabilistic neural networks: a brief overview of theory, implementation, and application. In P. Samui, D. T. Bui, S. Chakraborty, & R. C. (Eds.), Handbook of Probabilistic Models (pp. 347-367). Retrieved from https://doi.org/10.1016/B978-0-12-816514-0.00014-X.
Moro, S., Cortez, P., & Rita, P. (2014). A data-driven approach to predict the success of bank telemarketing. Decision Support Systems, 62, 22-31. doi:https://doi.org/10.1016/j.dss.2014.03.001
Nurbaiti, Setyaningsih, F. A., & Midyanti, D. (2017). IDENTIFIKASI BIBIT PADA TANAMAN LAHAN GAMBUT BERDASARKAN BENTUK DAUN MENGGUNAKAN METODE PROBABILISTIK NEURAL NETWORK (PNN)BERBASIS WEBSITE (UMUR BIBIT 2 BULAN-1 TAHUN). CODING: Jurnal Komputer dan Aplikasi, 5(1), 14-22. doi:DOI: http://dx.doi.org/10.26418/coding.v5i1.19170
Ramadhani, P., Ispriyanti, D., & Safitri, D. (2018). KAPABILITAS PROSES DENGAN ESTIMASI FUNGSI DENSITAS KERNEL PADA PRODUKSI DENIM DI PT APAC INTI CORPORA. JURNAL GAUSSIAN, 7(1), 326-336.
Sarita, F. T., Setiawan, A., & Parhusip, H. (2019). Analisis Indeks Pembangunan Manusia (IPM) Kabupaten/Kota di Provinsi Maluku Utara Menggunakan Indeks Geary C Berdasarkan Resampling Estimasi Densitas Kernel. JuTISI : Jurnal Teknik Informatika dan Sistem Informasi, 5(1), 62-72. doi:https://doi.org/10.28932/jutisi.v5i1.1582
Shen, L. L. (2016). Relay Backpropagation For Effective Learning Of Deep Convolutional Neural Networks. (B. M. In Leibe, Ed.) Cham: Springer International.
Specht, D. F. (1990). Probabilistic Neural Networks. Neural Networks, 3(1), 109-118. doi:https://doi.org/10.1016/0893-6080(90)90049-Q
Susanti, S. (2019). Klasifikasi Kemampuan Perawatan Diri Anak dengan Disabilitas Menggunakan SMOTE Berbasis Neural Network. JURNAL INFORMATIKA, 6(2), 175-184.
Sutoyo, E., & Fadlurrahman, M. (2020). Penerapan SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Television Advertisement Performance Rating Menggunakan Artificial Neural Network. JEPIN (Jurnal Edukasi dan Penelitian Informatika), 6(3), 379-385. doi:http://dx.doi.org/10.26418/jp.v6i3.42896
Utomo, A., Gumilang , M., & Ahmad, A. (2022). Agricultural Commodity Sales Recommendation System For Farmers Based on Geographic Information Systems and Price Forecasting Using Probabilistic Neural Network Algorithm. The 4th International Conference on Food and Agriculture (pp. 1-9). Jember: IOP Conference Series Earth and Environmental Science.
Wang, S.-C. (2003). Artificial Neural Network In: Interdisciplinary Computing in Java Programming. Boston: Springer, Boston, MA. doi: https://doi.org/10.1007/978-1-4615-0377-4_5.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Copyright (c) 2024 Jurnal Matematika Sains dan Teknologi
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.