andimarafioti

Andrés Marafioti andimarafioti

Multimodal Research Lead at Hugging Face.

312 followers · 31 following

Hugging Face
Bern, Switzerland

Achievements

x2 x2 x3

Achievements

x2 x2 x3

Highlights

Organizations

Stars

drbh / uvnote

📓 computational document system build on uv and markdown

Python 8 Updated Oct 2, 2025

huggingface / smol2operator

Python 100 14 Updated Sep 23, 2025

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 20,725 2,979 Updated Oct 10, 2025

huggingface / large-scale-image-deduplication

Python 159 13 Updated Jul 18, 2025

FreedomIntelligence / ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 273 9 Updated Jun 25, 2024

microsoft / Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Python 1,533 200 Updated Sep 24, 2025

open-edge-platform / anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 5,035 824 Updated Oct 10, 2025

facebookresearch / sscd-copy-detection

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python 355 28 Updated Aug 2, 2022

huggingface / trl

Train transformer language models with reinforcement learning.

Python 15,826 2,230 Updated Oct 11, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,155 393 Updated Oct 9, 2025

tang-bd / fuse-dit

[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Python 122 4 Updated May 16, 2025

ngxson / smolvlm-realtime-webcam

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 4,779 762 Updated May 12, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,750 463 Updated May 5, 2025

Blaizzy / mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 2,735 218 Updated Sep 25, 2025