Lists (5)
Sort Name ascending (A-Z)
Stars
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
No fortress, purely open ground. OpenManus is Coming.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A high-throughput and memory-efficient inference and serving engine for LLMs