Stars
MoBA: Mixture of Block Attention for Long-Context LLMs
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Yelp Simulator for WWW'25 AgentSociety Challenge
Big Five trait scores for 307,313 people from many different countries.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal domains, for both inference and training (a minimal pipeline sketch follows this list).
Official inference framework for 1-bit LLMs
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
Repo for "Smart Word Suggestions" (SWS) task and benchmark
Convert Machine Learning Code Between Frameworks
Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm series of models)
Chinese version of GPT-2 training code, using the BERT tokenizer.
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to correction scenarios and works out of the box (see the usage sketch after this list).
Sequence to Sequence Learning with Keras
Four styles of encoder-decoder model in Python, built with Theano, Keras, and Seq2Seq
Header-only, dependency-free deep learning framework in C++14
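As a quick orientation for the 🤗 Transformers entry above, here is a minimal sketch of its high-level pipeline API, assuming transformers and a backend such as torch are installed; the task name and example sentence are illustrative, not from this list.

```python
# Minimal sketch of the 🤗 Transformers pipeline API.
# Assumes: pip install transformers torch
from transformers import pipeline

# pipeline() wires up a default pretrained model and tokenizer for the task,
# downloading weights on first use.
classifier = pipeline("sentiment-analysis")

# Run inference on a single string; returns a list of {label, score} dicts.
result = classifier("MoBA makes long-context attention much cheaper.")
print(result)  # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```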
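And for the pycorrector entry: a hedged usage sketch, assuming a pycorrector 1.x install where the Kenlm-backed Corrector class and its correct_batch method are available (older releases exposed a module-level API instead, so check the repo's README for the version you install; the example sentence is illustrative).

```python
# Hedged usage sketch for the pycorrector entry above.
# Assumes: pip install pycorrector kenlm  (the 1.x API; may differ in other versions)
from pycorrector import Corrector

# Kenlm-based statistical corrector; downloads its language model on first use.
m = Corrector()

# correct_batch takes a list of sentences and returns per-sentence results
# with the source text, corrected target, and detected errors.
print(m.correct_batch(["少先队员因该为老人让坐"]))
```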