Skip to content
View vejaxu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report vejaxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch implementation of some attentions for Deep Learning Researchers.

Python 547 74 Updated Mar 4, 2022

2026 AI/ML internship & new graduate job list updated daily

3,630 146 Updated Oct 13, 2025

A beamer template for LAMDA lab at NJU

TeX 17 10 Updated Oct 17, 2020

My learning notes/codes for ML SYS.

Python 3,858 233 Updated Oct 6, 2025

Writing AI Conference Papers: A Handbook for Beginners

2,902 100 Updated Jul 16, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,600 136 Updated Oct 9, 2025

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,127 977 Updated Oct 8, 2025

📜 Paper list on decoding methods for LLMs and LVLMs

60 1 Updated Jun 30, 2025

An automated data pipeline scaling RL to pretraining levels

Python 43 6 Updated Oct 11, 2025
Python 114 5 Updated May 14, 2025

Methods and Implements of Deep Clustering

3,018 425 Updated Aug 25, 2024

Official repo for paper: "GRACE: Generative Representation Learning via Contrastive Policy Optimization"

Python 10 1 Updated Oct 4, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,720 2,643 Updated Aug 12, 2024

An open source implementation of CLIP.

Python 12,743 1,171 Updated Sep 21, 2025

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,338 58 Updated Mar 14, 2024

[COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"

Python 35 1 Updated Oct 6, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 454 27 Updated Oct 8, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

665 25 Updated Sep 13, 2025

A version of verl to support diverse tool use

Python 591 43 Updated Oct 12, 2025

LIMI: Less is More for Agency

Python 138 7 Updated Oct 8, 2025

[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 132 10 Updated Aug 15, 2025

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 591 56 Updated Oct 10, 2025

Fast and memory-efficient exact kmeans

Python 103 6 Updated Sep 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,990 1,741 Updated Oct 13, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,165 1,075 Updated Oct 13, 2025

Data Synthesis for Deep Research Based on Semi-Structured Data

Python 169 13 Updated Oct 9, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,649 306 Updated Sep 30, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,377 91 Updated Oct 13, 2025

MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.

Python 439 38 Updated Oct 2, 2025
Next