-
lucid
-
FBGEMM Public
Forked from pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other Updated Sep 24, 2025
-
Muon Public
Forked from KellerJordan/Muon
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
-
integrated_dit Public
minibatch ode with block causal transformers :00:
-
LightBall Public
Forked from HomebrewML/HeavyBall
Our balls were too heavy all along
-
diffusion-speedrun Public
Forked from fal-ai/diffusion-speedrun
Focused on fast experimentation and simplicity
Python Updated Dec 24, 2024
-
stochastic_round_cuda Public
Forked from ethansmith2000/stochastic_round_cuda
Cuda MIT License Updated Nov 22, 2024
-
temporal_min-snr Public
MinSNR is super nice, here's an extension of minsnr for the temporal dimension when using it against rolling diffusion or other AR diffusion objectives :3, this was proposed by the DF paper https:/…
-
Hyperspherical-Discrete-Bayesian-Flow-Like-Vibe-But-also-kinda-tangental-discrete-diffusion Public
Forked from Algomancer/Hyperspherical-Discrete-Bayesian-Flow-Like-Vibe-But-also-kinda-tangental-discrete-diff...
It's a hyperspherical thingy for discrete diffusionishness.
Python Updated Oct 29, 2024
-
oracle-head-gpt Public
probe for predicting future hiddenstates on gpt-2 vibes....
-
PufferLib Public
Forked from PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
Python MIT License Updated Oct 23, 2024
-
rope-nd-jax Public
Forked from limefax/rope-nd
N-dimensional Rotary Position Embeddings for Jax too
-
deepspeed-vae Public
perceptual loss, gan, wasserstein-loss, kl/lsq/fsq v(q)ae
1 Updated Sep 24, 2024