spark-vllm-docker

Author	SHA1	Message	Date
Raphael Amorim	6943a51ced	Adding tests and refactoring repeated methods	2026-02-09 17:21:32 -05:00
Raphael Amorim	b7c3cdcfcb	Enhancement: add -- pass-through for arbitrary vLLM arguments Implements Unix-style pass-through allowing any vLLM argument to be passed after `--` separator. Arguments are appended verbatim to the generated vLLM command. Examples: ./run-recipe.py model --solo -- --load-format safetensors ./run-recipe.py model --solo -- --served-model-name my-api ./run-recipe.py model --solo -- -cc.cudagraph_mode=PIECEWISE Features: - Uses parse_known_args() to capture arguments after -- - Warns when extra args duplicate CLI overrides (--port, --tp, etc.) - Works in both solo and cluster modes Adds 10 integration tests covering: - --load-format, --served-model-name, equals syntax - Multiple arguments, empty --, cluster mode - Duplicate detection warnings for port/tp/gpu-mem Closes #30	2026-02-08 02:36:49 -05:00
Eugene Rakhmatulin	f139c4b55d	Updated tests	2026-02-04 12:06:30 -08:00
Raphael Amorim	28ba6090fc	Adding suggestions from Eugr and unit tests	2026-02-03 17:32:59 -05:00