
Pull requests: ModelTC/LightLLM

Pull requests list

add dynamic fa3
#1334 opened Jun 8, 2026 by shihaobai Collaborator
feat: update disk cache params and benchmark_multiturn.py
#1333 opened Jun 8, 2026 by blueswhen Collaborator
feat: Qwen3.5 / Qwen3.5-MoE MTP speculative decoding
#1330 opened Jun 4, 2026 by sufubao Collaborator
4 tasks done
add Flashinfer sampling backend
#1328 opened Jun 2, 2026 by blueswhen Collaborator
Fa4 support
#1327 opened Jun 2, 2026 by blueswhen Collaborator
add in-process URL pool caching
#1325 opened Jun 1, 2026 by Owleye4 Contributor
update cpu cache load use async way.
#1318 opened May 25, 2026 by hiworldwzj Collaborator
support mtp for gemma4
#1316 opened May 22, 2026 by WANDY666 Contributor
Propagate FINISHED_ERROR from detokenization init failure
#1299 opened May 9, 2026 by sufubao Collaborator
6 tasks
feat(RL): add RL support for verl
#1298 opened May 8, 2026 by shihaobai Collaborator
import flashqla to speedup gdn prefill
#1295 opened May 8, 2026 by WANDY666 Contributor
import flashqla and support cudagraph for gdn
#1292 opened May 6, 2026 by WANDY666 Contributor
ViT/multimodal token-budget admission + max_pixels clamp
#1290 opened May 6, 2026 by sufubao Collaborator
3 of 5 tasks
Logging colorization + access middleware cleanup + windowed cache stats
#1289 opened May 6, 2026 by sufubao Collaborator
6 tasks done
fix(api): forward extra_body.chat_template_kwargs on /v1/messages
#1276 opened Apr 18, 2026 by sufubao Collaborator
2 of 3 tasks
[Feature] Add support for Neo++
#1274 opened Apr 17, 2026 by XHPlus Contributor
【draft】Mtp optimization
#1266 opened Apr 9, 2026 by hiworldwzj Collaborator
fix error response format to match OpenAI API standard
#1259 opened Apr 7, 2026 by sufubao Collaborator
4 tasks
feat: page size > 1 support
#1224 opened Mar 10, 2026 by blueswhen Collaborator
feat: add multi node disk cache
#1218 opened Mar 4, 2026 by blueswhen Collaborator