A case study employing non-parametric regression to develop a novel triglycerides model based on HDL levels using non-normally distributed data
DOI:
https://doi.org/10.3329/bjms.v25i2.88743Keywords:
Non-parametric regression; Kendall-Theil Sen Siegel; Highdensity lipoprotein (HDL); TriglycerideAbstract
Background This research models high-density lipoprotein (HDL) and triglyceride consumption using non-parametric regression, bootstrap resampling, and data splitting into training and testing sets to improve comprehension of their complex connection and inform evidence-based health promotion efforts. Objective The goal of this study is to develop a non-parametric regression model that connects HDL levels with triglyceride levels. This will help make predictions more accurate for HDL levels in the patients that were studied by using diagnostic tools. Materials and Methods When the assumption of linear regression is not satisfied, the model that is constructed may produce biased estimates. To get around this problem, a non-parametric regression model is used in this study, along with an improved bootstrap method to get the best model estimates. In particular, the study uses the Kendall- Theil Sen Siegel slope, a robust estimator, to find the slope of the regression line. This reduces the effect of outliers or values that are too high or too low. This method makes it easy to explore the relationship between variables without having to make rigid assumptions about the parameters. This method was used to split the dataset into training and testing groups. The training dataset will be used to build the model, and the testing dataset will be used to make sure the model works. Result Based on the statistical analysis conducted using R, it was found that the non-parametric regression method performed superiorly in making predictions, particularly in instances where the data didn’t adhere to the assumption of normality. Notably, both the training and testing datasets yielded high R-squared values of 79.2% and 73.6% respectively under the non-parametric regression model. In contrast, when employing simple parametric regression, the R-squared values for the training and testing datasets were 45.89% and 47.29% respectively. A significantly high level of performance was attained using the proposed methodology, as demonstrated by these results. Conclusion The outcome of the research underscores the exceptional performance exhibited by the hybrid model methodology utilized.
BJMS, Vol. 25 No. 02 April’26 Page: 566-575
Downloads
25
18
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Wan Muhamad Amir W Ahmad, Farah Muna Mohamad Ghazali, Mohamad Nasarudin Adnan, Nor Azlida Aleng, Firdaus Mohamad Hamzah

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish in the Bangladesh Journal of Medical Science agree to the following terms that:
- Authors retain copyright and grant Bangladesh Journal of Medical Science the right of first publication of the work.

Articles in Bangladesh Journal of Medical Science are licensed under a Creative Commons Attribution 4.0 International License CC BY-4.0.This license permits use, distribution and reproduction in any medium, provided the original work is properly cited.- Authors are able to enter into separate, additional contractual arrangements for the distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted to post their work online (e.g., in institutional repositories or on their website) as it can lead to productive exchanges, as well as greater citation of published work.