Skip to content

yanhui-ma-dev/Data-Analytics-Portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

96 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ Data Analytics & Machine Learning Portfolio (R)

R Tableau Accuracy License

πŸ“Œ Project Overview

This repository showcases an end-to-end data analytics and machine learning workflow. I bridge the gap between complex data orchestration and actionable business intelligence, delivering evidence-based clarity for strategic decision-making.


πŸ“‚ Key Business Modules

1. Retail Analysis: Customer Segmentation & Churn Prediction

  • 🎯 Objective: Translate raw transaction data into retention strategies.
  • πŸ’‘ Achievement: Processed 500k+ records into an RFM framework and developed a Churn Prediction Model with 90.5% accuracy.
  • πŸ“Š Visualization: Executive-level Tableau dashboards serving as a "Source of Truth" for customer health metrics.

2. Big Data Pipeline: Scalable Data Orchestration

  • 🎯 Objective: Process high-velocity spatiotemporal data for urban mobility insights.
  • πŸ›  Tech: Orchestrated a high-performance pipeline for 21.8M+ GPS records using data.table and dplyr in R.
  • πŸ“ˆ Impact: Derived driver activity patterns and operational mobility metrics from unstructured data.

3. Finance: Risk Intelligence & Model Benchmarking

  • 🎯 Objective: Optimize credit risk assessment through algorithmic benchmarking.
  • πŸ”¬ Methodology: Compared SVM (AUC: 0.8524), Random Forest, and Naive Bayes; utilized PCA and SOM Neural Networks for dimensionality reduction.

πŸ›  Tech Stack & Methodologies

  • Languages: R (Advanced), SQL.
  • Tools: Tableau.
  • BA Frameworks: BPMN 2.0, Gap Analysis, SOA Architecture.

Maintained by Yanhui Ma – Business Analyst & Operations Specialist

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages