Mapping the Statistical Significance of Factors Contributing to the World Happiness Report
Karan Bhowmick1, Charuchith Ranji2

1Karan Bhowmick, Department of Information Technology, Vellore Institute of Technology, Vellore (Tamil Nadu), India. 
2Charuchith Ranjit*, Department of Computer Science with Specialization in Bioinformatics, Vellore Institute of Technology, Vellore Tamil Nadu, India.
Manuscript received on July 09, 2021. Revised Manuscript received on July 15, 2021. Manuscript published on August 30, 2021.| PP: 28-37 | Volume-10 Issue-6, August 2021 | Retrieval Number: 100.1/ijeat.F29630810621 | DOI: 10.35940/ijeat.F2963.0810621
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (

Abstract: This paper aims to delineate findings of the statistical significance of the factors contributing to the happiness score. The happiness score also termed as ladder score, is a metric used by the United Nations Sustainable Development Solutions Network to metricize the happiness of the citizens in a country. To tackle this issue, we use regression and data visualization. We perform a survey on the factors affecting ladder score and how these factors can be used for predictive analytics. We use Linear Regression, Polynomial Regression, Lasso Regression with cross-validation, and Ridge Regression with cross-validation. Next, we use evaluation metrics like MSE, RMSE, Adjusted r-squared, and r-squared value for the evaluation of the factors on the predictive model. Then, we plot the countries mentioned in the report on a geographical scale based on their happiness index scores. Furthermore, we plot the statistical significance of these factors on a continental scale, to reveal insightful patterns over a larger geographical domain. We aim to bring to light the trends of the aforementioned factors and produce the significance of these results on a world map. The results will help elucidate the global patterns formed by these metrics. An additional application is an extrapolation of the results procured. To augment the metrics of the Word Happiness Report in a statistically comprehensive way. Furthermore, through this evaluation, the world happiness report can be revised to accommodate more inclusive factors and mitigate the redundancy of the factors.
Keywords: RMSE, MSE, VIF, Ladder Score, GDP, Cross Validation, Overfitting, Underfitting.
Scope of the Article: Big data quality validation