Ajyy

🎯

Focusing

Junyi Ao Ajyy

🎯

Focusing

CUHK-Shenzhen PhD Student

74 followers · 58 following

CUHK-Shenzhen
Shenzhen
06:47 (UTC +08:00)
https://ajyy.github.io

Achievements

Starred repositories

speedyapply / 2026-AI-College-Jobs

2026 AI/ML internship & new graduate job list updated daily

4,112 167 Updated Nov 27, 2025

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 720 41 Updated Nov 19, 2024

XiaomiMiMo / MiMo-Audio-Tokenizer

A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.

Python 121 8 Updated Sep 19, 2025

zai-org / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 3,088 265 Updated Dec 5, 2024

OpenBMB / UltraEval-Audio

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 166 9 Updated Nov 27, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,104 1,374 Updated Nov 14, 2025

Lightricks / LTX-Video

Official repository for LTX-Video

Python 8,839 820 Updated Oct 25, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,988 176 Updated Oct 9, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 9,305 1,145 Updated Nov 3, 2025

xingchensong / FlashCosyVoice

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 214 20 Updated Nov 11, 2025

SimplifyJobs / New-Grad-Positions

A collection of full time roles in SWE, Quant, and PM for new grads.

15,739 1,232 Updated Nov 27, 2025

byteresearchcla / RealSI

RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios

Python 73 7 Updated Jul 4, 2025

reasoning-survey / Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

631 58 Updated Jun 16, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

mangiucugna / json_repair

A python module to repair invalid JSON from LLMs

Python 4,074 158 Updated Nov 25, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,953 219 Updated Nov 24, 2025

ckyang1124 / LALM-Evaluation-Survey

Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

40 Updated Aug 11, 2025

stepfun-ai / Step-Audio

Python 4,561 366 Updated Jun 12, 2025

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,906 197 Updated Nov 18, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 636 45 Updated Jun 5, 2025

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 1,256 110 Updated Sep 6, 2025

Eclipsess / Awesome-Efficient-Reasoning-LLMs

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

699 34 Updated Oct 20, 2025

atfortes / Awesome-LLM-Reasoning

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,444 200 Updated May 7, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,316 1,450 Updated May 27, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,121 826 Updated Nov 20, 2025

ddlBoJack / MMAR

[NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Python 179 4 Updated Jun 6, 2025

WangRongsheng / awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

6,822 648 Updated Nov 27, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,688 2,402 Updated Nov 24, 2025

shmsw25 / FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 406 59 Updated Apr 13, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Junyi Ao Ajyy

Starred repositories

speech-synthesis

speech-to-text

text-to-speech

speech-recognition

speech