-
Notifications
You must be signed in to change notification settings - Fork 204
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AMD] Add DSv4-FP4-MI355X atom-disagg MTP
#1855
opened Jun 19, 2026 by
seungrokj
Collaborator
Loading…
2 tasks
[codex] Add all-evals matrix expansion mode
#1854
opened Jun 19, 2026 by
Oseltamivir
Collaborator
Loading…
6 of 8 tasks
[codex] Cover every multinode parallelism in evals
#1850
opened Jun 19, 2026 by
Oseltamivir
Collaborator
•
Draft
[AMD] Optimize MiniMax M3 sparse index scoring on MI300X
sweep-enabled
#1840
opened Jun 18, 2026 by
Oseltamivir
Collaborator
Loading…
[Klaud Cold] MI325X MiniMax-M3 EAGLE3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1838
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
[Klaud Cold] MI325X MiniMax-M3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1836
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
[codex] Update MiniMax M3 B300 FlashInfer image
full-sweep-fail-fast
#1834
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
[codex] Update MiniMax M3 B300 EAGLE3 FlashInfer image
full-sweep-fail-fast
#1835
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
[codex] Update MiniMax M3 B200 FlashInfer image
full-sweep-fail-fast
#1833
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
[codex] Update MiniMax M3 B200 EAGLE3 FlashInfer image
full-sweep-fail-fast
#1832
opened Jun 18, 2026 by
cquil11
Collaborator
Loading…
fix(ci): bound multinode pre-run Slurm cleanup drain loop (unblocks NVIDIA sweeps)
#1820
opened Jun 18, 2026 by
arygupt
Collaborator
Loading…
[AMD] add dsv4 sglang disagg
AMD
full-sweep-enabled
#1818
opened Jun 18, 2026 by
billishyahao
Collaborator
Loading…
Add Qwen3.5-FP8 GB200 SGLang disaggregated benchmark
full-sweep-enabled
#1810
opened Jun 16, 2026 by
RohitNagraj
Collaborator
Loading…
[AMD] [MI300X] minimaxm3-fp8-mi300x-vllm: enable AITER kernels for MXFP8 on MI300X
full-sweep-enabled
#1808
opened Jun 16, 2026 by
JohnQinAMD
Collaborator
Loading…
Fix for https://github.com/sgl-project/sglang/issues/22072
#1806
opened Jun 16, 2026 by
davzhuAMD
Loading…
[NV]Add GLM-5 NVFP4 GB200 disagg non-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1803
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading…
[NV]Add GLM-5 NVFP4 GB200 disagg-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1800
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading…
[NV]Add GLM-5 NVFP4 GB300 disagg-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1799
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading…
[NV]Update Kimi K2.5 NVFP4 GB200 disaggregated TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1797
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading…
[NV]Add Kimi K2.5 NVFP4 GB300 disaggregated TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1796
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading…
chore(runners): add TensorWave MI300X docker runners (mi300x-tw)
#1793
opened Jun 16, 2026 by
cquil11
Collaborator
Loading…
[NV]dsr1-fp4-b200-sglang: add DPA PDL lane
full-sweep-enabled
#1792
opened Jun 15, 2026 by
hshrivastava-droid
Collaborator
Loading…
[DO NOT MERGE] Run-only: gb200 dsr1 measured power+temp (canonical NVIDIA)
sweep-enabled
#1791
opened Jun 15, 2026 by
arygupt
Collaborator
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.