-
Hugging Face
- Paris
- https://huggingface.co/lhoestq
- @lhoestq
-
universal_pathlib Public
Forked from fsspec/universal_pathlibpathlib api extended to use fsspec backends
Python MIT License UpdatedNov 9, 2025 -
filesystem_spec Public
Forked from fsspec/filesystem_specA specification that python filesystems should adhere to.
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 30, 2025 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
-
LiveCodeBench Public
Forked from LiveCodeBench/LiveCodeBenchOfficial repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Python MIT License UpdatedJul 16, 2025 -
-
iceberg-python Public
Forked from apache/iceberg-pythonApache PyIceberg
Python Apache License 2.0 UpdatedMay 15, 2025 -
webdataset Public
Forked from webdataset/webdatasetA high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Python BSD 3-Clause "New" or "Revised" License UpdatedApr 8, 2025 -
smollm Public
Forked from huggingface/smollmEverything about the SmolLM2 and SmolVLM family of models
Python Apache License 2.0 UpdatedMar 25, 2025 -
smallpond Public
Forked from definite-app/smallpondA lightweight data processing framework built on DuckDB and 3FS.
-
pandas-audio-methods Public
Audio methods for pandas dataframes using soundfile
-
pandas-image-methods Public
Image methods for pandas dataframes using Pillow
-
pandas Public
Forked from pandas-dev/pandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 31, 2024 -
etils Public
Forked from google/etilsCollection of eclectic utils for python.
Python Apache License 2.0 UpdatedDec 16, 2024 -
presidio Public
Forked from microsoft/presidioContext aware, pluggable and customizable data protection and de-identification SDK for text and images
-
lerobot Public
Forked from huggingface/lerobot🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch
-
Parallel computing with task scheduling
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 19, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJan 18, 2024 -
img2dataset Public
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Python MIT License UpdatedJan 8, 2024 -
pytorch-image-models Public
Forked from huggingface/pytorch-image-modelsPyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Python Apache License 2.0 UpdatedMar 24, 2023 -
libsndfile-binaries Public
Forked from bastibe/libsndfile-binariesPre-compiled shared libraries for libsndfile
Shell GNU Lesser General Public License v2.1 UpdatedFeb 7, 2023 -
datasets Public
Forked from huggingface/datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
-
-
beam Public
Forked from apache/beamApache Beam is a unified programming model for Batch and Streaming data processing.
Java Apache License 2.0 UpdatedOct 24, 2022 -
DeDLOC Public
Forked from yandex-research/DeDLOCOfficial code for "Distributed Deep Learning in Open Collaborations"
Jupyter Notebook Apache License 2.0 UpdatedDec 3, 2021 -
notebooks Public
Forked from patrickvonplaten/notebooksSome notebooks for NLP
-
datasets_readme_generator Public
Forked from madlag/datasets_readme_generatorAutomatically create READMEs for datasets
-
pytorch-lightning Public
Forked from Lightning-AI/pytorch-lightningThe lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
-
datasets-tagging Public
Forked from huggingface/datasets-taggingA Streamlit app to add structured tags to the datasets
-
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.