knjcode

Kenji Doi knjcode

54 followers · 7 following

Japan
https://blog.knjcode.com

Achievements

x2 x3

Achievements

x2 x3

Stars

SonyResearch / micro_diffusion

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,528 53 Updated Jan 12, 2025

lllyasviel / IC-Light

More relighting!

Python 8,300 521 Updated Feb 20, 2025

line / Human-Interaction-Generation

Official implementation of ICCV 2023 Oral Paper "Role-Aware Interaction Generation from Textual Description"

Python 33 Updated Oct 20, 2023

line / Skeleton-Temporal-Action-Localization

Official implementation of AAAI 2023 Oral Paper "Frame-Level Label Refinement for Skeleton-Based Weakly-Supervised Action Recognition"

Python 13 1 Updated Oct 20, 2023

GaParmar / clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,124 78 Updated Aug 2, 2025

ofsoundof / LSDIR

73 1 Updated Dec 17, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,643 6,156 Updated Sep 18, 2024

tanelp / tiny-diffusion

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 960 76 Updated May 7, 2024

reazon-research / ReazonSpeech

Massive open Japanese speech corpus

Python 337 27 Updated Oct 16, 2025

LuChengTHU / dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)

Python 1,789 134 Updated Feb 6, 2024

DSL-Lab / GRBM

Gaussian-Bernoulli Restricted Boltzmann Machines

Python 105 11 Updated Dec 1, 2022

shuntama / srdd

Python 30 5 Updated Jul 14, 2022

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 91,190 11,443 Updated Sep 8, 2025

rinnakk / japanese-stable-diffusion

Japanese Stable Diffusion is a Japanese specific latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

Jupyter Notebook 282 13 Updated Mar 19, 2023