Imarkov/conditional compilation ranges #127
base: imarkov/fused_allreduce_torch_native
Conversation
ProExpertProg left a comment:
A few initial thoughts. Could we also use a dataclass instead of a tuple for a compiled range? We could add utility methods (like is_single_size), give the elements names, and add docstrings to make the code clearer; a sketch follows.
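A minimal sketch of what such a dataclass could look like; the name CompileRange, its fields, and the half-open semantics are illustrative assumptions, not the PR's actual code:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CompileRange:
    """A half-open range [start, end) of sizes compiled together.

    Illustrative only: field and method names are assumptions.
    """

    start: int
    end: int

    def is_single_size(self) -> bool:
        # True when the range covers exactly one size, e.g. [8, 9).
        return self.end == self.start + 1

    def contains(self, size: int) -> bool:
        return self.start <= size < self.end
```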
Also, if there is more than one range above the cudagraph capture size, I think we currently never trigger compilation for it in the GPU model runner, because compilation only happens when the compiled model is actually invoked with a shape in that range (via _dummy_run). We should make sure to dummy-run once per compile range; see the sketch below.
I also think we should give Inductor hints about the range; that can be done as a follow-up.
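A rough sketch of the per-range warm-up idea, assuming a model-runner object with the `_dummy_run(num_tokens=...)` method mentioned above and the hypothetical CompileRange from the previous sketch; the real GPU model runner API may differ:

```python
def warm_up_compile_ranges(runner, compile_ranges: list[CompileRange]) -> None:
    """Invoke the compiled model once per compile range so Inductor
    compilation is triggered eagerly instead of on the first real request.

    Illustrative only: `runner._dummy_run` is assumed from the discussion above.
    """
    for r in compile_ranges:
        # Any size inside the range triggers compilation for that range;
        # the start of the range is a convenient representative.
        runner._dummy_run(num_tokens=r.start)
```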
Quoted hunk (compilation timing):

    elapsed = now - compilation_start_time
    - compilation_config.compilation_time += elapsed
    - if runtime_shape is None:
    + if compile_range is None:
Lost the compilation time update
Suggested change:

    - if compile_range is None:
    + compilation_config.compilation_time += elapsed
    + if compile_range is None:
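For context, a sketch of the intended accounting with the update kept; variable names come from the quoted hunk above, and the branch bodies are placeholders:

```python
elapsed = now - compilation_start_time
# Keep the aggregate compilation-time accounting regardless of which
# branch is taken below.
compilation_config.compilation_time += elapsed
if compile_range is None:
    ...  # range-agnostic compilation path
else:
    ...  # range-specific compilation path
```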
| """Sizes to compile for inductor. In addition | ||
| to integers, it also supports "cudagraph_capture_sizes" to | ||
| specify the sizes for cudagraph capture.""" | ||
| compile_ranges_split_points: list[int] | None = None |
This comment implies the ranges are inclusive-exclusive, but in the code you use inclusive-inclusive. Can we standardize on inclusive-exclusive? See the sketch below.
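To illustrate the inclusive-exclusive convention being requested, a hedged sketch of how split points could map to half-open ranges; the function name, the starting size of 1, and the upper-bound handling are assumptions, not the PR's code:

```python
def split_points_to_ranges(
    split_points: list[int], max_size: int
) -> list[tuple[int, int]]:
    """Turn sorted split points into half-open [start, end) ranges.

    Example: split_points=[8, 64], max_size=256
    -> [(1, 8), (8, 64), (64, 257)]   # each end is exclusive
    """
    bounds = [1, *sorted(split_points), max_size + 1]
    return list(zip(bounds[:-1], bounds[1:]))
```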
Quoted hunk (pass cache key):

    state = {
        "ranges": self.ranges,
    }
    return InductorPass.hash_dict(state)
Add the current range to the cache key, and check the number of times the manager gets called (to make sure the bug you found doesn't manifest). A sketch of the cache-key change is below.
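A minimal sketch of folding the current range into the cache key, assuming the quoted snippet lives in the pass's uuid() method and that the pass keeps the range it is compiling for in a `self.compile_range` attribute (both are assumptions):

```python
def uuid(self) -> str:
    state = {
        "ranges": self.ranges,
        # Assumed attribute; it may need converting to a JSON-serializable
        # form (e.g. a [start, end] pair) before hashing.
        "current_range": self.compile_range,
    }
    return InductorPass.hash_dict(state)
```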
Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.