Modeling Air Quality Index in Indonesia Using Smoothing Splines and Truncated Splines Regression
Keywords: air quality index (AQI), nonparametric regression, smoothing splines, truncated splines, generalized cross validation (GCV)
Abstract
The Air Quality Index (AQI) is a composite indicator that reflects regional air quality conditions and is influenced by multiple determinants with complex and nonlinear relationships. In such circumstances, parametric regression may be restrictive because it requires a predetermined functional form. This study applies spline based nonparametric regression using smoothing splines and truncated splines to model AQI in Indonesia and to compare the performance of both approaches. AQI is treated as the response variable, while population density, land cover area within and outside forest areas, and the number of motor vehicles are considered as predictor variables. For smoothing splines, the optimal smoothing parameter is selected using Generalized Cross Validation, whereas truncated splines are estimated using Ordinary Least Squares under various knot configurations and selected based on the minimum Generalized Cross Validation value. Model performance is evaluated using Generalized Cross Validation, Mean Squared Error, and Adjusted R squared. The study aims to identify the most appropriate model and to determine key factors influencing AQI variation in Indonesia, thereby providing empirical support for environmental policy making. The results show that the smoothing spline model provides better performance than the truncated spline model, with a lower Mean Squared Error (MSE) of 0.0716 and a higher Adjusted R² of 0.794. These results indicate that smoothing splines are more effective in capturing the nonlinear relationships influencing AQI variation in Indonesia.
Downloads
References
Ariesta, D., Gusriani, N., & Parmikanti, K. (2021). Estimasi parameter model regresi nonparametrik B-splin E pada angka kematian maternal. Jurnal Matematika UNAND, 10(3), 342–354. https://doi.org/10.25077/jmu.10.3.342-354.2021
Centofanti, F., Lepore, A., Menafoglio, A., Palumbo, B., & Vantini, S. (2023). Adaptive smoothing spline estimator for the function-on-function linear regression model. Computational Statistics, 38(1), 191–216. https://doi.org/10.1007/s00180-022-01223-6
Chen, L., Smetanina, E., & Wu, W. B. (2021). Estimation of nonstationary nonparametric regression model with multiplicative structure. The Econometrics Journal, 25(1), 176–214. https://doi.org/10.1093/ectj/utab018
Dani, A. T. R., & Adrianingsih, N. Y. (2021). Pemodelan regresi nonparametrik dengan estimator spline truncated vs deret fourier. Jambura Journal of Mathematics, 3(1), 26–36. https://doi.org/10.34312/jjom.v3i1.7713
Dani, A. T. R., Adrianingsih, N. Y., Ainurrochmah, A., & Sriningsih, R. (2021). Flexibility of nonparametric regression spline truncated on data without a specific pattern. Jurnal Litbang Edusaintech, 2(1), 37–43. https://doi.org/10.51402/jle.v2i1.30
Fatmawati, F., Budiantara, I. N., & Lestari, B. (2019). Comparison of smoothing and truncated spline estimators in estimating blood pressure models. International Journal of Innovation, Creativity and Change, 5(3), 1177–1199.
Handayani, T., Sifriyani, S., & Dani, A. T. R. (2024). Stunting prevalence modeling using nonparametric regression of quadratic splines. Jurnal Varian, 7(2), 149–160. https://doi.org/10.30812/varian.v7i2.2916
Mariati, N. P. A. M., Budiantara, I. N., & Ratnasari, V. (2021). Smoothing spline estimator in nonparametric regression (Application: Poverty in Papua Province). Proceedings of the 7th International Conference on Research, Implementation, and Education of Mathematics and Sciences (ICRIEMS 2020), 309–314. https://doi.org/10.2991/assehr.k.210305.044
Nisa, K., Herawati, N., & Setiawan, E. (2018). Analisis regresi nonnparametrik dengan teknik smoothing. Seminar Dan Rapat Tahunan (Semirata), 1–20.
Nurcahayani, H., Budiantara, I. N., & Zain, I. (2019). Nonparametric truncated spline regression on modelling mean years schooling of regencies in Java. AIP Conference Proceedings 2194, 020073. https://doi.org/10.1063/1.5139805
Pant, A., Joshi, R. C., Sharma, S., & Pant, K. (2023). Predictive modeling for forecasting air quality index (AQI) using time series analysis. Avicenna Journal of Environmental Health Engineering, 10(1), 38–43. https://doi.org/10.34172/ajehe.2023.5376
Sahani, M., Singh, H., Patel, P. K., & Singh, S. (2024). Evaluation of predictive models for air quality index prediction in an Indian urban area. Journal of Indian Association for Environmental Management, 44(3), 31–40.
Sriliana, I., Budiantara, I. N., & Ratnasari, V. (2022). A truncated spline and local linear mixed estimator in nonparametric regression for longitudinal data and its application. Symmetry, 14(12), 2687. https://doi.org/10.3390/sym14122687
Suparti, S., Prahutama, A., & Santoso, R. (2018). Mix local polynomial and spline truncated: the development of nonparametric regression model. Journal of Physics: Conference Series, 1025, 012102. https://doi.org/10.1088/1742-6596/1025/1/012102
Wongkar, D. C., Ruliana, R., & S., M. F. (2023). Analisis regresi nonparametrik spline truncated untuk menganalisis faktor-faktor yang mempengaruhi tingkat pengangguran terbuka di Provinsi Sulawesi Selatan. VARIANSI: Journal of Statistics and Its Application on Teaching and Research, 5(2), 55–63.
Xu, D., & Wang, Y. (2021). Low‐rank approximation for smoothing spline via eigensystem truncation. Stat, 10(1). https://doi.org/10.1002/sta4.355
Copyright (c) 2025 Jurnal Matematika Sains dan Teknologi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.