Files
spark-vllm-docker/recipes/qwen35-35b-a3b-fp8.yaml
Erik Vullings 163f23d85b Update qwen35-35b-a3b-fp8.yaml
--max_num_batched_tokens is a default variable now, which can be overriden via the CLI
2026-03-03 12:46:12 +01:00

1.3 KiB