Skip to content
View dawnmsg's full-sized avatar
  • Microsoft
  • Beijing

Block or report dawnmsg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,032 129 Updated Apr 3, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,270 199 Updated Dec 19, 2025

Yelp Simulator for WWW'25 AgentSociety Challenge

Python 89 39 Updated Apr 27, 2025

Big five trait scores for 307,313 people from many different countries.

PLpgSQL 74 18 Updated Jan 29, 2019

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,387 7,946 Updated Jan 9, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,876 31,689 Updated Jan 9, 2026

Official inference framework for 1-bit LLMs

Python 25,636 2,060 Updated Jun 3, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,217 1,951 Updated Dec 29, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,084 730 Updated Jan 4, 2026

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,437 118 Updated Feb 19, 2025
Python 74 11 Updated May 23, 2024

Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"

Python 347 31 Updated May 8, 2024

[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking

Python 267 24 Updated Jun 28, 2024

Repo for "Smart Word Suggestions" (SWS) task and benchmark

Python 20 3 Updated Dec 4, 2023

Convert Machine Learning Code Between Frameworks

Python 14,224 5,563 Updated Oct 17, 2025

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,166 1,394 Updated Jul 15, 2025

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,606 1,699 Updated Apr 25, 2024

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 6,331 1,162 Updated Jan 6, 2026

A C# based MediaPlayer desktop application.

C# 3 Updated Aug 20, 2018

Sequence to Sequence Learning with Keras

Python 3,178 837 Updated Aug 20, 2022

Four styles of encoder decoder model by Python, Theano, Keras and Seq2Seq

Python 279 125 Updated Jun 20, 2017

header only, dependency-free deep learning framework in C++14

C++ 6,008 1,397 Updated Apr 17, 2022