Pull requests: ModelTC/LightLLM
fix(moe): avoid cross-warp stale read in ep_scatter prefix sum
#1362
opened Jun 16, 2026 by
Anai-Guo
feat: add fused moe shared-expert and add-rmsnorm optimization
#1353
opened Jun 15, 2026 by
blueswhen
Collaborator
test(sampling_params): repair broken test collection and add verify() coverage
#1350
opened Jun 13, 2026 by
SuperMarioYL
Contributor
feat(qwen3_5_mtp): Qwen3.5 / Qwen3.5-MoE MTP speculative decoding
#1338
opened Jun 9, 2026 by
sufubao
Collaborator
feat: add multi-platform support with ascend and maca
#1335
opened Jun 8, 2026 by
zhangts20
feat: update disk cache params and benchmark_multiturn.py
#1333
opened Jun 8, 2026 by
blueswhen
Collaborator
fix: replace pickle deserialization with RestrictedUnpickler in PD WebSocket endpoints (CVE-2026-26220)
#1306
opened May 11, 2026 by
nexadodigital
import flashqla and support cudagraph for gdn
#1292
opened May 6, 2026 by
WANDY666
Contributor
Logging colorization + access middleware cleanup + windowed cache stats
#1289
opened May 6, 2026 by
sufubao
Collaborator
6 tasks done
fix: implement RestrictedUnpickler to mitigate CVE-2026-26220
#1219
opened Mar 5, 2026 by
RinZ27