-
Notifications
You must be signed in to change notification settings - Fork 132
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Enhancement] Add padding for ACL Graph and refactor graph batch size
module:core
#803
opened May 9, 2025 by
yiz-liu
Loading…
add custom ascendc kernel vocabparallelembedding
module:tests
#796
opened May 8, 2025 by
ttanzhiqiang
Loading…
[P/D][DP] Upgrade pd proxy to support both prefill and decode instances in disaggregated-prefill.
module:core
#794
opened May 8, 2025 by
whx-sjtu
Loading…
[Benchmark] Add qwen2.5 and qwen2.5-vl to benchmarks
documentation
Improvements or additions to documentation
#792
opened May 8, 2025 by
Potabk
Loading…
feat: support torchair graph mode in v1 engine
module:core
#789
opened May 8, 2025 by
NeverRaR
Loading…
[CI/UT] Fix the default value of block_size in VllmRunner to None
module:tests
#783
opened May 7, 2025 by
rjg-lyh
Loading…
add workflow to build and release wheel
ci/build
module:tools
#775
opened May 7, 2025 by
celestialli
Loading…
[CI] Add accuracy test for Qwen2.5-VL-3B-Instruct
module:tests
#766
opened May 6, 2025 by
hfadzxy
Loading…
[V1][Structured Output] Enable Speculative Decoding with Structured Outputs
#751
opened May 5, 2025 by
shen-shanshan
•
Draft
[Attention][Kernel]moe support for llama4 and mllama4
module:ops
#740
opened Apr 30, 2025 by
cxcxflying
Loading…
Add qwen2.5 vl multimodal feature for vllm-ascend v1
module:tests
#736
opened Apr 30, 2025 by
RookieChenTaoYu
Loading…
V1 parallel fix: bug fix to enable DP in V1
module:core
module:ops
#710
opened Apr 28, 2025 by
HanlinDu
Loading…
[Feature] Impl the connector based on the llmdatadist for v1
module:core
#684
opened Apr 27, 2025 by
jianzs
Loading…
1 of 5 tasks
update chunk prefill torch
module:ops
module:tests
#679
opened Apr 27, 2025 by
ttanzhiqiang
Loading…
[WIP] Add support for custom DeepSeek modelling in ACL Graph mode
module:core
module:ops
#677
opened Apr 27, 2025 by
yiz-liu
Loading…
[Feature] Enable disaggregated prefill functionality for v0
module:core
module:tests
#658
opened Apr 25, 2025 by
jianzs
Loading…
Adjust KV cache shape for compatibility with updated APIs for graph mode
ci/build
#657
opened Apr 25, 2025 by
linfeng-yuan
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.