Tags: xyfgemini/vllm
  
            
          Tags
  [CI Failure] Fix tests with missing TinyLlama-1.1B-Chat-v1.0-FP8-e2e (v… …llm-project#26816) Signed-off-by: mgoin <[email protected]>
Update CUDA architecture list in build pipeline for 12.9.1 wheels (vl… …lm-project#26592) Signed-off-by: Will Eaton <[email protected]> Signed-off-by: simon-mo <[email protected]>
[ci] fix wheel names for arm wheels (vllm-project#24898) Signed-off-by: simon-mo <[email protected]>
[Build/CI] Revert back to Ubuntu 20.04, install python 3.12 with uv (v… …llm-project#26103) Signed-off-by: Tyler Michael Smith <[email protected]> Co-authored-by: Simon Mo <[email protected]> Signed-off-by: simon-mo <[email protected]>
[Deepseek v3.2] Support indexer prefill chunking (vllm-project#25999) Signed-off-by: Chen Zhang <[email protected]> Signed-off-by: simon-mo <[email protected]>
[BugFix] Fix default kv-cache-dtype default for DeepseekV3.2 (vllm-pr… …oject#25988) Signed-off-by: Lucas Wilkinson <[email protected]> Signed-off-by: simon-mo <[email protected]>
[P/D] NIXL Updates (vllm-project#25844) Signed-off-by: Sage Moore <[email protected]> Signed-off-by: simon-mo <[email protected]> Signed-off-by: rentianyue-jk <[email protected]> Signed-off-by: Russell Bryant <[email protected]> Signed-off-by: Isotr0py <[email protected]> Signed-off-by: Chenheli Hua <[email protected]> Signed-off-by: mgoin <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]> Signed-off-by: NickLucche <[email protected]> Signed-off-by: Roger Wang <[email protected]> Signed-off-by: Robert Shaw <[email protected]> Co-authored-by: Sage Moore <[email protected]> Co-authored-by: Russell Bryant <[email protected]> Co-authored-by: rentianyue-jk <[email protected]> Co-authored-by: Isotr0py <[email protected]> Co-authored-by: Chenheli Hua <[email protected]> Co-authored-by: Wentao Ye <[email protected]> Co-authored-by: Michael Goin <[email protected]> Co-authored-by: Tyler Michael Smith <[email protected]> Co-authored-by: Nicolò Lucchesi <[email protected]> Co-authored-by: Roger Wang <[email protected]> Co-authored-by: Robert Shaw <[email protected]> Signed-off-by: simon-mo <[email protected]>
[VLM] Update Qwen3-VL max_num_video_tokens calculation for configurab… …le video profiling (vllm-project#25557) Signed-off-by: Isotr0py <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: simon-mo <[email protected]>
[Doc]: improve CPU(x86) build-wheel-from-source section (vllm-project… …#25617) Signed-off-by: Kosseila (CloudThrill) <[email protected]>
[Doc]: improve CPU(x86) build-wheel-from-source section (vllm-project… …#25617) Signed-off-by: Kosseila (CloudThrill) <[email protected]>
PreviousNext