Skip to content
View tkhe's full-sized avatar

Block or report tkhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AllenAI's post-training codebase

Python 3,521 486 Updated Jan 13, 2026

PyTorch building blocks for the OLMo ecosystem

Python 692 124 Updated Jan 13, 2026

[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

Python 100 4 Updated Oct 29, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,062 71 Updated Jan 10, 2026

Depth Anything 3

Python 3,964 351 Updated Dec 12, 2025

torchcomms: a modern PyTorch communications API

C++ 319 61 Updated Jan 13, 2026

Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"

Python 53 3 Updated Jun 17, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,741 1,511 Updated Jan 4, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,368 4,032 Updated Jan 13, 2026

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,867 588 Updated May 3, 2024

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 784 78 Updated Apr 27, 2025

个人构建MoE大模型:从预训练到DPO的完整实践

Python 2,231 164 Updated Dec 30, 2025

训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。

Python 77 12 Updated Sep 6, 2024

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 583 66 Updated Jul 11, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,659 189 Updated Apr 20, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,269 689 Updated Nov 20, 2025

Nano vLLM

Python 10,726 1,376 Updated Nov 3, 2025

A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.

Python 33 4 Updated Jun 22, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,041 311 Updated Dec 22, 2025

将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调

Python 500 50 Updated Sep 8, 2025
Python 43 3 Updated Jan 4, 2026

[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"

Python 114 5 Updated Jul 25, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,381 94 Updated Jan 12, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,409 12,560 Updated Jan 13, 2026

[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102

Python 602 13 Updated May 22, 2025

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 1,937 131 Updated Nov 22, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,985 640 Updated Dec 27, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,075 2,098 Updated May 19, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,803 377 Updated Oct 21, 2025

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)

Python 52 2 Updated Sep 21, 2025
Next