Stars
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on linear attention
LM engine is a library for pretraining/finetuning LLMs
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
A PyTorch native platform for training generative AI models
Efficient, check-pointed data loading for deep learning with massive data sets.
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
A framework for few-shot evaluation of autoregressive language models.
Reaching LLaMA2 Performance with 0.1M Dollars
Here we will test various linear attention designs.
Triton-based implementation of Sparse Mixture of Experts.
Efficient implementations of state-of-the-art linear attention models
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving a 3x+ generation speedup on reasoning tasks
https://acl2023-retrieval-lm.github.io/
ZheyuAqaZhang / transformers
Forked from huggingface/transformers. Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
speech self-supervised representations
Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4
Awesome Lists for Tenure-Track Assistant Professors and PhD students.
Understanding the Difficulty of Training Transformers
A collection of AWESOME things about mixture-of-experts
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models