search for




 

Ridge regression splines with knot selection via 0 penalty
Journal of the Korean Data & Information Science Society 2024;35:889-904
Published online November 30, 2024;  https://doi.org/10.7465/jkdi.2024.35.6.889
© 2024 Korean Data and Information Science Society.

Eun-Ji Lee1 · Jae-Hwan Jhong2

1Department of Statistics, Chungbuk National University
2Department of Information Statistics, Chungbuk National University
Correspondence to: † This research was supported by Chungbuk National University Korea National University Development Project (2023).
1 PhD student, Department of Statistics, Chung Buk National University, Cheongju 28644, Korea
2 Corresponding author: Department of Information Statistics, Chung Buk National University, Cheongju 28644, Korea. E-mail: jjh25@chungbuk.ac.kr
Received July 29, 2024; Revised September 23, 2024; Accepted September 24, 2024.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
The Ridge regression model that considers the 2-norm penalty is a model that can alleviate multicollinearity problems by slightly reducing all coefficients in high-dimensional data. When the 0-norm penalty is considered in a linear regression model, variable selection is explicitly performed to build a model using the fewest variables, and a sparse model is created to facilitate interpretation and prevent overfitting. From a nonparametric regression perspective, the knot selection problem can be solved using a norm penalty with a regression model using a combined truncated power spline. We propose estimator based 2-norm and 0-norm penalties. We estimate the coefficients of the regression model with a coordinate descent algorithm. We evaluate the performance of 2-norm and 0-norm penalty-based estimation through three simulations and we present numerical studies through real data analysis.
Keywords : Best subset selection, coordinate descent algorithm, knot selection, ridge penalty, truncated power splines