Skip to content
View Ajyy's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Ajyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

2026 AI/ML internship & new graduate job list updated daily

4,112 167 Updated Nov 27, 2025

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 720 41 Updated Nov 19, 2024

A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.

Python 121 8 Updated Sep 19, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 3,088 265 Updated Dec 5, 2024

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 166 9 Updated Nov 27, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,104 1,374 Updated Nov 14, 2025

Official repository for LTX-Video

Python 8,839 820 Updated Oct 25, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,988 176 Updated Oct 9, 2025

Nano vLLM

Python 9,305 1,145 Updated Nov 3, 2025

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 214 20 Updated Nov 11, 2025

A collection of full time roles in SWE, Quant, and PM for new grads.

15,739 1,232 Updated Nov 27, 2025

RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios

Python 73 7 Updated Jul 4, 2025

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

631 58 Updated Jun 16, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

A python module to repair invalid JSON from LLMs

Python 4,074 158 Updated Nov 25, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,953 219 Updated Nov 24, 2025

Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

40 Updated Aug 11, 2025
Python 4,561 366 Updated Jun 12, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,906 197 Updated Nov 18, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 636 45 Updated Jun 5, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,256 110 Updated Sep 6, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

699 34 Updated Oct 20, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,444 200 Updated May 7, 2025

A Conversational Speech Generation Model

Python 14,316 1,450 Updated May 27, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,121 826 Updated Nov 20, 2025

[NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Python 179 4 Updated Jun 6, 2025

🧑‍🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

6,822 648 Updated Nov 27, 2025

Fully open reproduction of DeepSeek-R1

Python 25,688 2,402 Updated Nov 24, 2025

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 406 59 Updated Apr 13, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,472 819 Updated Nov 9, 2025
Next