TY - JOUR
T1 - Validation of social science theories using machine learning models
T2 - a methodological perspective
AU - David, Lemuel Kenneth
AU - Wang, Jianling
AU - Angel, Vanessa
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature B.V. 2025.
PY - 2025/6
Y1 - 2025/6
N2 - There is a critical need to validate Social Trust Theory, Political Participation Theory, and Happiness and Well-being Theory using modern methodologies. This study employs machine learning models—Random Forest (RF) and Support Vector Machine (SVM)—applied to longitudinal data from 1972 to 2023 across six diverse countries. The findings reveal that Social Trust (24.5%) is the most significant predictor of societal cohesion, followed by Happiness Score (19%) and Income (16%), underscoring their central roles in shaping social outcomes. The results demonstrate the models' ability to capture complex, non-linear interactions among variables, surpassing traditional econometric approaches. Specifically, RF identified critical socio-demographic predictors of political participation, while SVM highlighted the interplay between cultural values and economic stability in determining well-being. These insights advance computational social science by enhancing the accuracy of theory validation and offering actionable recommendations for policymakers, such as targeting income inequality and fostering institutional trust. This research bridges computational and traditional methods, presenting a scalable framework for analyzing evolving social phenomena.
AB - There is a critical need to validate Social Trust Theory, Political Participation Theory, and Happiness and Well-being Theory using modern methodologies. This study employs machine learning models—Random Forest (RF) and Support Vector Machine (SVM)—applied to longitudinal data from 1972 to 2023 across six diverse countries. The findings reveal that Social Trust (24.5%) is the most significant predictor of societal cohesion, followed by Happiness Score (19%) and Income (16%), underscoring their central roles in shaping social outcomes. The results demonstrate the models' ability to capture complex, non-linear interactions among variables, surpassing traditional econometric approaches. Specifically, RF identified critical socio-demographic predictors of political participation, while SVM highlighted the interplay between cultural values and economic stability in determining well-being. These insights advance computational social science by enhancing the accuracy of theory validation and offering actionable recommendations for policymakers, such as targeting income inequality and fostering institutional trust. This research bridges computational and traditional methods, presenting a scalable framework for analyzing evolving social phenomena.
KW - Computational social science
KW - Happiness
KW - Machine learning
KW - Policy implications
KW - Political participation
KW - Random forest
KW - Social trust
KW - Support vector machine
KW - Theory validation
UR - https://www.scopus.com/pages/publications/86000354012
U2 - 10.1007/s11135-025-02075-0
DO - 10.1007/s11135-025-02075-0
M3 - 文章
AN - SCOPUS:86000354012
SN - 0033-5177
VL - 59
SP - 2799
EP - 2823
JO - Quality and Quantity
JF - Quality and Quantity
IS - 3
ER -