-
Wise
- London - United Kingdom
- in/tommaso-ferracci
Stars
A library for feature selection for gradient boosting models using regression on feature Shapley values
This repository contains the Hugging Face Agents Course.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
An extension of XGBoost to probabilistic modelling
Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI (ICLR 2024)
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)
A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.
A simple, extensible library for developing AutoML systems
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the thesis I wrote as part of my MSc in Statistics (Data Science) at Imperial Coll…
Data Shapley: Equitable Valuation of Data for Machine Learning
Simulating argon atoms following the ideal gas law