The dataset contains information on happiness scores across different countries worldwide and their correlation with economic, health, and social factors.
β
Python π
β
Pandas (for data manipulation)
β
Seaborn & Matplotlib (for visualization)
β
NumPy (for numerical operations)
β
Jupyter Notebook
βοΈ Handled missing values (used mean/median based on skewness).
βοΈ Fixed string encoding issues (\xa0
) and trimmed unwanted spaces.
βοΈ Converted non-numeric columns to numerical for correlation analysis.
βοΈ Descriptive statistics using .describe()
, .info()
.
βοΈ Checked correlation between happiness score & other factors.
βοΈ Identified data distribution & skewness using histograms.
βοΈ Found that GDP per capita (0.80) and Social Support (0.75) are the strongest predictors of happiness.
βοΈ Discovered that higher corruption (-0.40 correlation) lowers happiness levels.
βοΈ Freedom to make life choices (0.54 correlation) significantly impacts happiness.
π Correlation Matrix Heatmap β Showed strong relationships between variables.
π Scatter Plot (Happiness Score vs GDP per Capita) β Showed a positive correlation.
π Box Plot β Helped identify outliers in happiness scores.
π‘ Next Steps:
-
Exploring interactive dashboards using Tableau/Plotly Dash π.
-
Applying Machine Learning for Happiness Prediction π€.
git clone https://github.com/yourusername/world_happiness_report.git
cd world_happiness_report