spark-vllm-docker/tests/test_recipes.sh at 2923fe6ea5977d4838a9dbc6e5690b2feebefcf0


Raphael Amorim b7c3cdcfcb Enhancement: add -- pass-through for arbitrary vLLM arguments

Implements Unix-style pass-through: any vLLM argument can be passed
after the `--` separator. These arguments are appended verbatim to the
generated vLLM command.
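
A minimal sketch of the pass-through idea, not the actual run-recipe.py
code. The commit states the script uses parse_known_args(); this sketch
swaps in a plain split on the first `--` to keep the behavior easy to
follow. The helper names `split_passthrough` and `build_command`, and the
`vllm serve model` base command, are assumptions made for illustration.

```python
import shlex


def split_passthrough(argv):
    """Split argv at the first '--' into (own_args, extra_args).

    Hypothetical helper: everything before '--' belongs to the recipe
    runner, everything after it is forwarded untouched.
    """
    if "--" in argv:
        idx = argv.index("--")
        return argv[:idx], argv[idx + 1:]
    return list(argv), []


def build_command(base_cmd, extra_args):
    """Append the pass-through arguments verbatim to the base command."""
    return list(base_cmd) + list(extra_args)


if __name__ == "__main__":
    own, extra = split_passthrough(
        ["model", "--solo", "--", "--load-format", "safetensors"]
    )
    # The base command here is an assumed placeholder, not the real recipe output.
    cmd = build_command(["vllm", "serve", "model"], extra)
    print(shlex.join(cmd))
```

An empty `--` yields an empty list of extra arguments, so the generated
command is left unchanged in that case.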

Examples:
  ./run-recipe.py model --solo -- --load-format safetensors
  ./run-recipe.py model --solo -- --served-model-name my-api
  ./run-recipe.py model --solo -- -cc.cudagraph_mode=PIECEWISE

Features:
- Uses parse_known_args() to capture arguments after --
- Warns when extra args duplicate CLI overrides (--port, --tp, etc.)
- Works in both solo and cluster modes
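
The duplicate warning could be sketched roughly as follows. This is an
illustration under assumptions, not the code from this commit: the
mapping `OVERRIDE_TO_VLLM_FLAGS`, the override name `--gpu-mem`, and the
function names are hypothetical, and the warning text is made up.

```python
import sys

# Assumed mapping from a recipe CLI override to the vLLM flags it sets.
# The real script may use different names or a different structure.
OVERRIDE_TO_VLLM_FLAGS = {
    "--port": ("--port",),
    "--tp": ("--tensor-parallel-size", "-tp"),
    "--gpu-mem": ("--gpu-memory-utilization",),
}


def find_duplicates(overrides_used, extra_args):
    """Return (override, vllm_flag) pairs set both via CLI override and after '--'."""
    duplicates = []
    for arg in extra_args:
        # Handle both "--flag value" and "--flag=value" spellings.
        flag = arg.split("=", 1)[0]
        for override, vllm_flags in OVERRIDE_TO_VLLM_FLAGS.items():
            if override in overrides_used and flag in vllm_flags:
                duplicates.append((override, flag))
    return duplicates


def warn_duplicates(overrides_used, extra_args):
    """Print one warning per duplicate to stderr; the extra args are still passed on."""
    for override, flag in find_duplicates(overrides_used, extra_args):
        print(
            f"Warning: {flag} after '--' duplicates CLI override {override}",
            file=sys.stderr,
        )


if __name__ == "__main__":
    warn_duplicates({"--port"}, ["--port", "9000", "--load-format", "safetensors"])
```

The sketch only warns and does not drop anything, which matches the
commit's description of appending the extra arguments verbatim.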

Adds 10 integration tests covering:
- --load-format, --served-model-name, equals syntax
- Multiple arguments, empty --, cluster mode
- Duplicate detection warnings for port/tp/gpu-mem

Closes #30

2026-02-08 02:36:49 -05:00

36 KiB

Executable File
