INCOME Prediction Model
Project Goal: Predict income for new customers to make the process from loan application to loan approval more efficient.
Challenges: Freeform job titles, extreme values in existing income data, legal compliance, and maximizing business opportunities under controllable risk.
Methods & tools: Aggregated similar job titles with a BERT model (converted job titles to vectors).
Segmented customers with similar incomes using a KMeans model so that predictions are less affected by extreme values.
Used LightGBM classification to select customers likely to meet the minimum income requirement.
Applied rules to keep overall risk under control and comply with legal criteria.
– Tools used: Python / PostgreSQL / JupyterLab / VS Code / Azure DevOps
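The segmentation step can be illustrated with a minimal, standard-library-only sketch. It is not the production pipeline: the incomes, the choice of k = 3, the one-dimensional clustering, and the use of each segment's median as its representative value are all assumptions made for illustration. The project itself used a KMeans model, and how new customers were mapped to segments is not described here.

```python
from statistics import median

def assign(values, centers):
    """Put each value into the cluster of its nearest center."""
    clusters = [[] for _ in centers]
    for v in values:
        nearest = min(range(len(centers)), key=lambda j: abs(v - centers[j]))
        clusters[nearest].append(v)
    return clusters

def kmeans_1d(values, k, max_iter=100):
    """Plain 1-D k-means with deterministic, evenly spaced starting centers."""
    ordered = sorted(values)
    n = len(ordered)
    # Start from evenly spaced order statistics so the result is reproducible.
    centers = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(max_iter):
        clusters = assign(values, centers)
        # Keep the old center if a cluster happens to be empty.
        new_centers = [sum(c) / len(c) if c else centers[j]
                       for j, c in enumerate(clusters)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers

# Hypothetical yearly incomes (in thousands), including one extreme value.
incomes = [28, 30, 32, 35, 58, 60, 62, 65, 110, 120, 900]
centers = kmeans_1d(incomes, k=3)
segments = assign(incomes, centers)
segment_medians = [median(s) for s in segments]

print(segments)
print(segment_medians)
```

With this toy data the extreme value 900 ends up alone in its own segment, so the representative values of the other two segments are not pulled upward by it.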
Impact: The model was implemented in a fully automated loan approval process that is now patent protected.
{Patent Link}
On average, 416 person-days were reclaimed per year after the project launched.
CUSTOMER Behavior Prediction Model 1
Project Goal: Predict when customers will travel abroad in the near future, in order to offer timely services or promotions.
Challenges: Communicating with a business team that had no technical background; the true label (Y) is unavailable.
Methods & tools: Used multiple data sources, such as offline transactions in foreign countries, to approximate customers' travel behavior.
Built a classification model with LightGBM. Y = whether the customer travels within the next 90 days.
Designed an A/B test to shape the marketing campaigns.
Analyzed and explained the important factors with Explainable AI.
– Tools used: Python / PostgreSQL / JupyterLab / VS Code / Azure DevOps
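Because the true travel label is unavailable, a proxy label has to be derived from observable data. The following is a minimal sketch of one way to do that, assuming offline foreign transactions are the only signal; the function name `travel_label`, the snapshot date, and the customer records are hypothetical and not taken from the project.

```python
from datetime import date, timedelta

def travel_label(snapshot, foreign_txn_dates, horizon_days=90):
    """Return 1 if any offline foreign transaction falls in the window
    (snapshot, snapshot + horizon_days], otherwise 0."""
    window_end = snapshot + timedelta(days=horizon_days)
    return int(any(snapshot < d <= window_end for d in foreign_txn_dates))

# Hypothetical snapshot date and per-customer foreign transaction dates.
snapshot = date(2023, 1, 1)
customers = {
    "A": [date(2023, 2, 14)],   # inside the 90-day window
    "B": [date(2022, 12, 20)],  # before the snapshot, so not counted
    "C": [date(2023, 6, 1)],    # after the window closes
    "D": [],                    # no foreign transactions at all
}
labels = {cid: travel_label(snapshot, txns) for cid, txns in customers.items()}
print(labels)
```

Only transactions strictly after the snapshot count, so that features computed up to the snapshot date cannot leak the outcome into the label.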
Impact: Decreased marketing costs by 50% while achieving the same marketing objectives.
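An A/B test of a marketing campaign is commonly evaluated by comparing conversion rates between a control group and a treatment group. The sketch below uses a two-proportion z-test for that comparison. This is an assumed evaluation method, not one the project confirms, and the group sizes and conversion counts are made up for illustration.

```python
from math import erf, sqrt

def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for the difference between two conversion rates.
    Returns (z, p_value); z > 0 means group B converts more often."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Standard normal CDF via the error function.
    cdf = 0.5 * (1 + erf(abs(z) / sqrt(2)))
    return z, 2 * (1 - cdf)

# Hypothetical counts: control (A) vs. model-targeted campaign (B).
z, p_value = two_proportion_z(conv_a=40, n_a=1000, conv_b=70, n_b=1000)
print(round(z, 2), round(p_value, 4))
```

A small p-value suggests the difference in conversion rates is unlikely to be due to chance alone; the significance threshold itself is a business decision.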
CAR Price Prediction
Project Goal: 1. Predict a reasonable price when selling or buying a used car.
2. Identify which car features influence the price most, to keep in mind when buying a new car.
Challenges: Non-US dataset; the dataset contains unnamed features.
Methods & tools: EDA
Models used: Linear Regression / Random Forest Regression / XGBoost Regression
Explainable AI: SHAP, to understand the ML black box.
– Source of data: https://www.kaggle.com/lepchenkov/usedcarscatalog
– Tools used: R / RStudio
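SHAP attributes a prediction to individual features using Shapley values. To show the idea behind it, the sketch below computes exact Shapley values by brute force for a tiny, hypothetical price function. It does not use the SHAP package, the Kaggle data, or any of the project's models (the project itself was done in R); the function `toy_price`, its coefficients, and the feature names are invented for illustration.

```python
from itertools import permutations

def shapley_values(predict, instance, baseline):
    """Exact Shapley values: average each feature's marginal contribution
    over every order in which features switch from baseline to instance."""
    features = list(instance)
    totals = {f: 0.0 for f in features}
    orderings = list(permutations(features))
    for order in orderings:
        current = dict(baseline)
        previous = predict(current)
        for f in order:
            current[f] = instance[f]
            updated = predict(current)
            totals[f] += updated - previous
            previous = updated
    return {f: total / len(orderings) for f, total in totals.items()}

def toy_price(car):
    """Hypothetical pricing rule with an age/transmission interaction."""
    price = 20000 - 1000 * car["age_years"] - 50 * car["mileage_thousand_km"]
    if car["automatic"]:
        price += 1500 - 100 * car["age_years"]  # premium fades with age
    return price

baseline = {"age_years": 0, "mileage_thousand_km": 0, "automatic": False}
car = {"age_years": 5, "mileage_thousand_km": 80, "automatic": True}
phi = shapley_values(toy_price, car, baseline)
print(phi)
```

The contributions sum to the difference between the car's predicted price and the baseline price, which is the property that makes Shapley-based explanations easy to read. Brute-force enumeration grows factorially with the number of features, which is why practical tools rely on approximations or model-specific algorithms.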
Impact: Decreased marketing costs by 50% while achieving the same marketing objectives.
UNDERWRITING Model - Avoid the Bad Loan
Project Goal: Provide suggestions for future loan application approvals to avoid bad loans.
Challenges: Limited data points and columns.
Methods & tools: EDA
Models used: Logistic Regression / CART / LightGBM
Explainable AI: SHAP
– Tools used: Python
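CART, one of the models listed above, grows a decision tree by repeatedly choosing the split that minimizes weighted Gini impurity. The sketch below finds a single best split on one numeric feature using only the standard library. The feature (a debt-to-income percentage) and the bad-loan labels are hypothetical, and this is an illustration of the splitting criterion rather than the project's model.

```python
def gini(labels):
    """Gini impurity of a list of binary labels (0 = good, 1 = bad)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(values, labels):
    """Return (threshold, weighted_gini) for the best 'value <= threshold'
    split, or (None, parent_gini) if no split improves on the parent."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best = (None, gini(labels))
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between identical feature values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best[1]:
            best = (threshold, score)
    return best

# Hypothetical applicants: debt-to-income percentage and bad-loan flag.
debt_to_income_pct = [10, 15, 20, 25, 40, 45, 50, 60]
bad_loan = [0, 0, 0, 0, 1, 0, 1, 1]
threshold, score = best_split(debt_to_income_pct, bad_loan)
print(threshold, score)
```

A full tree would apply the same search recursively to each side of the split and across all features. With limited data points, as noted under Challenges, shallow trees like this are less prone to overfitting.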