Skip to content

louissellier/aml_final

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Columbia University

A Machine Learning Approach to Corporate Credit Ratings

Final Project for Applying Machine Learning – Spring 2025
Columbia University, New York City, USA


🧠 Project Overview

Credit ratings influence billions in capital flows — but do raters behave differently in times of crisis?
This project investigates whether human credit rating agencies add value or diverge from machine learning-based predictions during periods of financial instability. Using XGBoost and real-world corporate ratings data, we explore:

  • 📉 Prediction accuracy during crisis vs. normal periods
  • 🤖 Model-human disagreements and their underlying drivers
  • 📊 Feature analysis of override patterns
  • 🔍 Behavioral insights into rating decisions across time

📁 Repository Contents

  • ml_final_project.ipynb – the full analysis notebook
  • corporate_ratings.csv – cleaned dataset used in modeling

🧪 Methods Used

  • XGBoost classifier with hyperparameter tuning
  • Cross-validation with custom metrics
  • Confusion matrix + risk-adjusted scoring
  • Precision-Recall and ROC analysis
  • Time series resampling and crisis segmentation
  • Visualization with matplotlib & seaborn

✍️ Authors

  • Michelle Ren
  • Louis Sellier

📘 Acknowledgements

  • Columbia, Professor Björkegren, and TAs for guidance
  • Original dataset and prior academic literature on credit risk modeling
  • XGBoost and scikit-learn libraries

About

Applying Machine Learning - Prof. Björkegren

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published