Permanent link to Research Commons versionhttps://hdl.handle.net/10289/14679
This paper proposes Maximal Associated Regression (MAR), a novel algorithm that performs forward stage-wise regression by applying nonlinear transformations to fit predictor covariates. For each predictor, MAR selects between a linear or additive fit as determined by the dataset. The proposed algorithm is an adaptation of Least Angle Regression (LARS) and retains its efficiency in building sparse models. Constrained penalized splines are used to generate smooth nonlinear transformations for the additive fits. A monotonically constrained extension of MAR (MARm) is also introduced in this paper to fit isotonic regression problems. The proposed algorithms are validated on both synthetic and real datasets. The performances of MAR and MARm are compared against LARS, Generalized Linear Models (GLM), and Generalized Additive Models (GAM) under the Gaussian assumption with a unity link function. Results indicate that MAR-type algorithms achieve a superior subset selection accuracy, generating sparser models that generalize well to new data. MAR is also able to generate models for sample deficient datasets. Thus, MAR is proposed as a valuable tool for subset selection and data exploration, especially when a priori knowledge of the dataset is unavailable.
Institute of Electrical and Electronics Engineers (IEEE)
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/