-
Notifications
You must be signed in to change notification settings - Fork 204
Pull requests: alibaba/rtp-llm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(omni): add Qwen2.5-Omni multi-stage pipeline support
#1074
opened Jun 7, 2026 by
stmatengss
Loading…
1 of 3 tasks
feat(new_weight_loader): introduce new weight loader framework with FP8 quantization support for Qwen2
#1072
opened Jun 5, 2026 by
Oneydauh
Loading…
feat(xpu): C++ device generalization for Intel XPU support (2/4)
#1071
opened Jun 5, 2026 by
aslanxie
Loading…
perf: ensure torch inference mode throughout engine
#1070
opened Jun 5, 2026 by
zhangjianning-zjn
Collaborator
Loading…
refactor(dash_sc/proxy): switch forward addr config to SERVICE_ROUTE
#1069
opened Jun 5, 2026 by
sunmiaozju
Collaborator
Loading…
feat(p2p): decode_entrance P2P support with lease-based race fix
#1067
opened Jun 4, 2026 by
ZhihanYan
Collaborator
Loading…
7 tasks
feat(deepepv2): 集成 DeepEPv2 ElasticBuffer 路由器
#1065
opened Jun 3, 2026 by
zhijiehou
Collaborator
Loading…
fix(rocm/trt-allreduce): restore fast-path zero-copy IPC + keep exit …
#1050
opened May 28, 2026 by
hxy0118
Collaborator
Loading…
fix(stream): avoid self-deadlock in waitLoadCacheDone via reportError…
#1045
opened May 27, 2026 by
ZhihanYan
Collaborator
Loading…
docs: fix typos ('comming soon', 'not suport')
#1044
opened May 27, 2026 by
daqiege
Loading…
1 task done
fix(openai): streaming spec compliance — SSE chunk split, content preservation, min_new_tokens enforcement
#1040
opened May 26, 2026 by
aslanxie
Loading…
fix(stream): wake nextOutput() waiter when stream is flipped to Error
#1036
opened May 24, 2026 by
ZhihanYan
Collaborator
Loading…
fix(rocm): preserve column-major layout in GDN qkvz+ba fusion to fix swizzle core dump
#1030
opened May 22, 2026 by
chengshu-lcc
Collaborator
Loading…
fix(dash_sc): thinking parameter propagation fixes (max_new_think_tokens, FINISH_REASON_LENGTH, 0-budget)
#1028
opened May 21, 2026 by
jianglan89
Collaborator
Loading…
feat: migrate flashinfer renorm kernel
#1027
opened May 21, 2026 by
Vinkle-hzt
Collaborator
Loading…
feat(ci_gate): delegate fork PR review reruns to workflow_run helper and add pre gate
#1025
opened May 21, 2026 by
guoj14
Collaborator
Loading…
feat: add ROCm aiter custom and quick allreduce support
#1010
opened May 18, 2026 by
chengshu-lcc
Collaborator
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-07.