An Empirical Study of GPT-4o Image Generation Capabilities

28 Updated Apr 16, 2025

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into official pipelines of diffusers.)

Python 262 13 Updated Sep 15, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,325 77 Updated Sep 12, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

723 30 Updated Nov 7, 2025

[ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 175 16 Updated Mar 17, 2025
Python 3 Updated Aug 10, 2025

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,491 82 Updated Nov 10, 2025

[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Python 101 5 Updated Jul 24, 2025

📃 A better UX for chat, writing content, and coding with LLMs.

TypeScript 5,141 818 Updated Aug 15, 2025

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 858 70 Updated Aug 27, 2024
Python 14 Updated Oct 28, 2023

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,518 1,294 Updated Oct 6, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,588 4,019 Updated Nov 11, 2025

The official Meta Llama 3 GitHub site

Python 29,088 3,479 Updated Jan 26, 2025

Code and data for the paper Revealing the structure of language model capabilities

7 1 Updated Jun 14, 2023

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,739 1,604 Updated Jan 13, 2025

XuanYuan: Du Xiaoman's Chinese financial chat large language model

Python 1,268 117 Updated Jan 7, 2025

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,082 874 Updated Nov 10, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,696 1,644 Updated Sep 30, 2025

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,515 113 Updated May 28, 2023

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,943 1,877 Updated Jul 15, 2025

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,684 508 Updated Jul 18, 2024

maximal update parametrization (µP)

Jupyter Notebook 1,622 104 Updated Jul 17, 2024

Rotary Transformer

Python 1,042 59 Updated Mar 21, 2022

CodeXGLUE

C# 1,768 388 Updated Apr 23, 2024

Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,781 82 Updated Jul 27, 2025

Light local website for displaying performances from different chat models.

Python 87 7 Updated Nov 13, 2023

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,006 102 Updated Jul 29, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,428 228 Updated Mar 20, 2024

Simple implementation of using LoRA from the peft library to fine-tune chatglm-6b

Python 84 18 Updated Apr 3, 2023