Identifying Risk Factors of Early Marriage among Women in Bangladesh Using Machine Learning Algorithms
DOI:
https://doi.org/10.3329/ijss.v25i2.85769
Keywords:
Early marriage, Feature selection, Machine learning algorithms, Socio-economic and household decision making factors, Bangladesh
Abstract
Early marriage, defined as marriage before age 18, is a human rights violation with serious consequences for women’s health and well-being, and it remains a major public health issue in South Asia, particularly in Bangladesh. This study uses data from the nationally representative Bangladesh Demographic and Health Survey (BDHS) 2022 to identify the key socio-demographic and household decision-making risk factors associated with early marriage among women in Bangladesh, and to evaluate the predictive performance of several machine learning algorithms in support of policy formulation. Chi-square tests assessed associations between respondent characteristics and early marriage, and three feature selection methods (Boruta, LASSO, and Information Gain) were used to select relevant features. Eight machine learning algorithms (Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, XGBoost, AdaBoost, LogitBoost, and Neural Network) were evaluated using 5-fold cross-validation. Model performance was assessed by sensitivity, specificity, precision, accuracy, false discovery rate (FDR), and area under the ROC curve (AUC). The prevalence of early marriage was 67.02%. Combining the significance tests with the feature selection results, Division, Wealth Index, Reading Newspaper, Religion, Residence, Household Purchases, and Age consistently emerged as the most influential predictors. Among all models, Decision Tree provided the best-balanced performance on the testing set (sensitivity: 0.248, specificity: 0.896, precision: 0.540, accuracy: 0.682, AUC: 0.690), indicating its suitability for generalizable early marriage prediction. Feature importance analysis highlighted Wealth Index and Division as the primary drivers. These findings can help policymakers target interventions at the high-risk regions and socioeconomic groups in which early marriage is concentrated. Strengthening girls’ education, economic support, and community awareness could help reduce its prevalence in Bangladesh.
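As an illustration of the evaluation measures named in the abstract, the following minimal sketch computes sensitivity, specificity, precision, accuracy, and FDR from the four cells of a binary confusion matrix. It is not the authors' code: the `ConfusionMatrix` class, the `classification_metrics` function, and the toy counts are hypothetical and chosen only to make the arithmetic easy to follow. AUC is omitted because it depends on predicted scores across thresholds and cannot be derived from a single confusion matrix.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary classifier (hypothetical helper, not from the paper)."""
    tp: int  # positives correctly predicted as positive
    fn: int  # positives wrongly predicted as negative
    fp: int  # negatives wrongly predicted as positive
    tn: int  # negatives correctly predicted as negative


def classification_metrics(cm: ConfusionMatrix) -> dict:
    """Return the threshold-based measures listed in the abstract."""
    total = cm.tp + cm.fn + cm.fp + cm.tn
    predicted_positive = cm.tp + cm.fp
    return {
        "sensitivity": cm.tp / (cm.tp + cm.fn),   # true positive rate
        "specificity": cm.tn / (cm.tn + cm.fp),   # true negative rate
        "precision": cm.tp / predicted_positive,  # positive predictive value
        "accuracy": (cm.tp + cm.tn) / total,
        "fdr": cm.fp / predicted_positive,        # equals 1 - precision
    }


# Toy counts for illustration only; these are NOT the study's results.
toy = ConfusionMatrix(tp=40, fn=10, fp=20, tn=30)
metrics = classification_metrics(toy)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Because FDR is the complement of precision, the two carry the same information. Sensitivity and specificity are the pair to read together when judging how balanced a model is between the two classes.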
International Journal of Statistical Sciences, Vol. 25(2), November, 2025, pp 81-99
License
Copyright (c) 2025 Department of Statistics, University of Rajshahi, Rajshahi

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.