Thekey756
🎯 Focusing
  • Fudan University
  • Shanghai

Highlights

  • Pro


A collection of recent papers on building autonomous agents. Two topics are covered: RL-based and LLM-based agents.

732 58 Updated Dec 24, 2024

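A curated list of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are inexpensive to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.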

21,506 2,046 Updated May 19, 2025

[ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Jupyter Notebook 77 5 Updated Oct 29, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,151 11,205 Updated Oct 22, 2025
Python 2,208 159 Updated Nov 8, 2024

Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

Lua 11,876 2,625 Updated Oct 24, 2023

Official Code for Stable Cascade

Jupyter Notebook 6,584 526 Updated Jul 25, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,946 706 Updated May 31, 2024

An Open-source Toolkit for LLM Development

Python 2,789 178 Updated Jan 13, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,402 520 Updated Oct 8, 2025

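A high-performance distributed server framework in C++: web server, WebSocket server, custom tcp_server (includes logging module, configuration module, thread module, coroutine module, coroutine scheduler module, IO coroutine scheduler module, hook module, socket module, bytearray serialization, HTTP module, TcpServer module, WebSocket module, HTTPS module, SMTP mail module, MySQL, SQLite3, ORM, Red…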

C++ 4,555 1,020 Updated Dec 8, 2023

🌱 This is a MySQL tutorial. In this tutorial, you can learn how to use MySQL and optimize SQL.

514 182 Updated Dec 4, 2023

A Gentle Introduction to SQL Using SQLite

HTML 202 82 Updated Aug 18, 2022

The Python micro framework for building web applications.

Python 70,643 16,596 Updated Oct 14, 2025

Games: Create interesting games in pure python.

Python 5,266 2,304 Updated Jul 25, 2024

Fine-tuning for specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more.

Python 2,772 315 Updated Dec 12, 2023

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Python 3,143 272 Updated Jan 10, 2025

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,860 113 Updated Jan 21, 2024

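A chatbot built on large language models, with support for integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. Supported models include ChatGPT/Claude/DeepSeek/ERNIE Bot (文心一言)/iFlytek Spark (讯飞星火)/Tongyi Qianwen (通义千问)/Gemini/GLM-4/Kimi/LinkAI. It can handle text, voice, and images, access the operating system and the internet, and supports building customized enterprise customer-service bots on your own knowledge base.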

Python 39,486 9,463 Updated Oct 22, 2025

Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm

Jupyter Notebook 169 42 Updated Apr 29, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,517 1,845 Updated Oct 24, 2025

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,679 440 Updated Aug 5, 2025

Example implementation of a discrete 1st order Hidden Markov model with test

Python 35 21 Updated Sep 20, 2016

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,188 4,037 Updated Jul 17, 2024

This is a possible solution to MCM/ICM problem D

Jupyter Notebook 1 Updated Feb 22, 2023

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 22,246 2,300 Updated Apr 29, 2025

Operating Systems: Three Easy Pieces (OSTEP) homework and project solutions

C 833 182 Updated Feb 3, 2024

An implementation of the GBN and SR protocols on top of UDP, with bidirectional data transfer and congestion control

Python 6 1 Updated Jun 22, 2023

This is an implementation of TLS 1.3 by ikun

Python 6 Updated Nov 5, 2022