andimarafioti

Andrés Marafioti andimarafioti

Multimodal Research Lead at Hugging Face.

314 followers · 31 following

Hugging Face
Bern, Switzerland

Achievements

x2 x2 x3

Achievements

x2 x2 x3

Highlights

Organizations

open-r1-multimodal Public
Forked from EvolvingLMMs-Lab/open-r1-multimodal

A fork to add multimodal model training to open-r1

Python Apache License 2.0 Updated Feb 3, 2025
open-r1 Public
Forked from huggingface/open-r1

Fully open reproduction of DeepSeek-R1

Python Apache License 2.0 Updated Jan 28, 2025
VLMEvalKit Public
Forked from open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python Apache License 2.0 Updated Jan 24, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 2 4 Apache License 2.0 Updated Jan 20, 2025
mlx-vlm Public
Forked from Blaizzy/mlx-vlm

MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.

Python 1 MIT License Updated Nov 28, 2024
scripts Public

Some scripts for easy sharing.

Python Updated Nov 27, 2024
smol-vision Public
Forked from merveenoyan/smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 2 1 Apache License 2.0 Updated Nov 22, 2024
smol-tools Public

Python Apache License 2.0 Updated Nov 5, 2024
moonshine Public
Forked from moonshine-ai/moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2 MIT License Updated Oct 23, 2024
MeloTTS Public
Forked from myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 5 6 MIT License Updated Oct 14, 2024
lightning-whisper-mlx Public
Forked from mustafaaljadery/lightning-whisper-mlx

An extremely fast implementation of whisper optimized for Apple Silicon using MLX.

Python 1 Updated Oct 1, 2024
speech-to-speech-inference-toolkit Public
Forked from huggingface/huggingface-inference-toolkit

Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.

Python 5 1 Apache License 2.0 Updated Sep 26, 2024
florence2-finetuning Public

Quick exploration into fine tuning florence 2

Jupyter Notebook 332 30 MIT License Updated Sep 19, 2024
sms-tools Public
Forked from MTG/sms-tools

Sound analysis/synthesis tools for music applications

Python 1 1 GNU Affero General Public License v3.0 Updated Aug 29, 2024
VideoLLaMA2 Public
Forked from DAMO-NLP-SG/VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python Apache License 2.0 Updated Aug 6, 2024
llm-swarm Public
Forked from huggingface/llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Python MIT License Updated Jul 11, 2024
UPD Public
Forked from AtsuMiyai/UPD

[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Python Apache License 2.0 Updated Jun 10, 2024
MetaCLIP Public
Forked from facebookresearch/MetaCLIP

Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.

Python Other Updated Mar 6, 2024
andimarafioti Public

Updated Aug 3, 2023
tifresi Public

STFT transforms suitable for use with PGHI (phase gradient heap integration)

Python 15 1 MIT License Updated Apr 5, 2023
audioContextEncoder Public

A context encoder for audio inpainting

machine-learning paper context-encoder

Jupyter Notebook 25 2 Updated Mar 24, 2023
sagan-models Public

Python MIT License Updated Jun 29, 2021
phaseRetrievalEvaluation Public

Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters

Jupyter Notebook 9 2 MIT License Updated Jun 15, 2021
inflated_convnets_pytorch Public
Forked from hassony2/inflated_convnets_pytorch

Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer

Python 1 MIT License Updated Apr 28, 2021
GACELA Public

Generative adversarial context encoder for audio inpainting

audio music gan music-generation inpainting audio-inpainting

Jupyter Notebook 26 4 MIT License Updated Apr 20, 2021
hpc-docs Public
Forked from hpc-unibe-ch/hpc-unibe-ch.github.io

Guides, tutorials and documentation about the central HPC resources

Shell 1 Updated Apr 1, 2021
audioLIME Public
Forked from CPJKU/audioLIME

audioLIME: Listenable Explanations Using Source Separation

Python 1 Updated Dec 22, 2020
Self-Attention-GAN Public
Forked from heykeetae/Self-Attention-GAN

Pytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)

Python Updated Dec 11, 2020
ConwaysGameOfLife Public

Python Updated Jun 29, 2020
gantools Public
Forked from nperraud/gantools

A set of tools to deal with GANs

Python Updated Oct 22, 2019

Andrés Marafioti andimarafioti

Achievements

Achievements

Highlights

Organizations

open-r1-multimodal Public

Uh oh!

open-r1 Public

Uh oh!

VLMEvalKit Public

Uh oh!

transformers Public

Uh oh!

mlx-vlm Public

Uh oh!

scripts Public

Uh oh!

smol-vision Public

Uh oh!

smol-tools Public

Uh oh!

moonshine Public

Uh oh!

MeloTTS Public

Uh oh!

lightning-whisper-mlx Public

Uh oh!

speech-to-speech-inference-toolkit Public

Uh oh!

florence2-finetuning Public

Uh oh!

sms-tools Public

Uh oh!

VideoLLaMA2 Public

Uh oh!

llm-swarm Public

Uh oh!

UPD Public

Uh oh!

MetaCLIP Public

Uh oh!

andimarafioti Public

Uh oh!

tifresi Public

Uh oh!

audioContextEncoder Public

Uh oh!

sagan-models Public

Uh oh!

phaseRetrievalEvaluation Public

Uh oh!

inflated_convnets_pytorch Public

Uh oh!

GACELA Public

Uh oh!

hpc-docs Public

Uh oh!

audioLIME Public

Uh oh!

Self-Attention-GAN Public

Uh oh!

ConwaysGameOfLife Public

Uh oh!

gantools Public

Uh oh!