tommaso-ferracci

A library for feature selection for gradient boosting models using regression on feature Shapley values

Python 38 6 Updated Jul 22, 2025

This repository contains the Hugging Face Agents Course.

MDX 23,343 1,644 Updated Oct 13, 2025

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,058 871 Updated Oct 17, 2025

An extension of XGBoost to probabilistic modelling

Python 663 71 Updated Nov 1, 2025

Synthetic data generation for tabular data

Python 3,270 397 Updated Nov 3, 2025

Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI (ICLR 2024)

Python 4 1 Updated Feb 29, 2024

Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)

Jupyter Notebook 16 6 Updated Mar 20, 2023

A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.

Jupyter Notebook 25 2 Updated Mar 6, 2025

Python 109 27 Updated Jun 20, 2023

A simple, extensible library for developing AutoML systems

Python 175 41 Updated Jul 28, 2023

OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)

Python 99 8 Updated Feb 4, 2025

Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the thesis I wrote as part of my MSc in Statistics (Data Science) at Imperial Coll…

Jupyter Notebook 15 Updated Feb 19, 2024

Data Shapley: Equitable Valuation of Data for Machine Learning

Python 280 71 Updated May 1, 2024

Simulating argon atoms following the ideal gas law

C++ 4 Updated Apr 9, 2023

CSS 4 Updated Sep 13, 2025