A fork of vLLM used to develop the paper "Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference".