- Hugging Face
- Bern, Switzerland
-
open-r1-multimodal Public
Forked from EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Python Apache License 2.0 Updated Feb 3, 2025
-
open-r1 Public
Forked from huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 Updated Jan 28, 2025
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Python Apache License 2.0 Updated Jan 24, 2025
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
-
mlx-vlm Public
Forked from Blaizzy/mlx-vlm
MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.
-
smol-vision Public
Forked from merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
-
moonshine Public
Forked from moonshine-ai/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
-
MeloTTS Public
Forked from myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
-
lightning-whisper-mlx Public
Forked from mustafaaljadery/lightning-whisper-mlx
An extremely fast implementation of Whisper optimized for Apple Silicon using MLX.
-
speech-to-speech-inference-toolkit Public
Forked from huggingface/huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
-
florence2-finetuning Public
Quick exploration into fine-tuning Florence-2
-
sms-tools Public
Forked from MTG/sms-tools
Sound analysis/synthesis tools for music applications
-
VideoLLaMA2 Public
Forked from DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Python Apache License 2.0 Updated Aug 6, 2024
-
llm-swarm Public
Forked from huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
Python MIT License Updated Jul 11, 2024
-
UPD Public
Forked from AtsuMiyai/UPD
[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Python Apache License 2.0 Updated Jun 10, 2024
-
MetaCLIP Public
Forked from facebookresearch/MetaCLIP
Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.
Python Other Updated Mar 6, 2024
-
tifresi Public
STFT transforms suitable for use with PGHI (phase gradient heap integration)
-
audioContextEncoder Public
A context encoder for audio inpainting
-
phaseRetrievalEvaluation Public
Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters
-
inflated_convnets_pytorch Public
Forked from hassony2/inflated_convnets_pytorch
Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer
-
GACELA Public
Generative adversarial context encoder for audio inpainting
-
hpc-docs Public
Forked from hpc-unibe-ch/hpc-unibe-ch.github.io
Guides, tutorials and documentation about the central HPC resources
-
audioLIME Public
Forked from CPJKU/audioLIME
audioLIME: Listenable Explanations Using Source Separation
-
Self-Attention-GAN Public
Forked from heykeetae/Self-Attention-GAN
PyTorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)
Python Updated Dec 11, 2020
-
gantools Public
Forked from nperraud/gantools
A set of tools to deal with GANs
Python Updated Oct 22, 2019