Skip to content
View andimarafioti's full-sized avatar
  • Hugging Face
  • Bern, Switzerland

Highlights

  • Pro

Organizations

@huggingface @tifgan

Block or report andimarafioti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📓 computational document system build on uv and markdown

Python 8 Updated Oct 2, 2025
Python 100 14 Updated Sep 23, 2025

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 20,725 2,979 Updated Oct 10, 2025

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 273 9 Updated Jun 25, 2024

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Python 1,533 200 Updated Sep 24, 2025

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 5,035 824 Updated Oct 10, 2025

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python 355 28 Updated Aug 2, 2022

Train transformer language models with reinforcement learning.

Python 15,826 2,230 Updated Oct 11, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,155 393 Updated Oct 9, 2025

[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Python 122 4 Updated May 16, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 4,779 762 Updated May 12, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,750 463 Updated May 5, 2025

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 2,735 218 Updated Sep 25, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,111 394 Updated Sep 10, 2025

Lightweight coding agent that runs in your terminal

Rust 47,095 5,675 Updated Oct 12, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)

Python 1,669 126 Updated Jul 25, 2025

SmolVLM2 Demo

Swift 174 20 Updated Mar 20, 2025

The python library for real-time communication

JavaScript 4,339 401 Updated Sep 19, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,172 518 Updated Oct 11, 2025

Fully open reproduction of DeepSeek-R1

Python 25,531 2,398 Updated Sep 8, 2025

State-of-the-Art Text Embeddings

Python 17,672 2,694 Updated Oct 9, 2025

Get your documents ready for gen AI

Python 41,296 2,935 Updated Oct 10, 2025

🦄 Serving Platform for Spatial AI and Robotics.

Rust 21 4 Updated Jun 19, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 23,346 2,048 Updated Oct 12, 2025

Recipes to scale inference-time compute of open models

Python 1,109 123 Updated May 22, 2025

A course on aligning smol models.

Jupyter Notebook 6,443 2,288 Updated Oct 1, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,244 205 Updated Oct 6, 2025

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!

Python 464 47 Updated Jun 16, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,304 225 Updated Sep 16, 2025
Next