Highlights
- Pro
Stars
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
The simplest, fastest repository for training/finetuning small-sized VLMs.
2026 AI/ML internship & new graduate job list updated daily
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).
A wrapper script to build whole-program LLVM bitcode files
A collection of out-of-tree LLVM passes for teaching and learning
Curated list of project-based tutorials
Enforce structured output from LLMs 100% of the time
These are the best resources for System Design on the Internet
Learn Blockchain, Solidity, and Full Stack Web3 Development with Javascript
Linux Runtime Security and Forensics using eBPF