spark-vllm-docker/recipes/minimax-m2.7-awq.yaml
L.B.R. caa28c8e12 Add recipe for MiniMax-M2.7-AWQ
Add a vLLM serving recipe for the MiniMax M2.7 model using
the cyankiwi/MiniMax-M2.7-AWQ-4bit quantization. Uses the
same minimax_m2 tool-call and reasoning parsers as the
existing M2 recipe, with the Ray distributed backend on 2 GPUs.
2026-04-18 22:44:26 +01:00

1.1 KiB
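
The recipe file itself is not shown here. As a rough sketch of what the commit message describes, the snippet below assembles an equivalent `vllm serve` command line. The helper name `build_vllm_command` is hypothetical. Splitting the 2 GPUs via `--tensor-parallel-size` and passing `--enable-auto-tool-choice` are both assumptions, so the actual recipe may use different options.

```python
import shlex


def build_vllm_command(model, parser="minimax_m2", num_gpus=2):
    """Return an argv list for `vllm serve` (hypothetical helper, not from the recipe)."""
    return [
        "vllm", "serve", model,
        # Same parser name for tool calls and reasoning, per the commit message.
        "--tool-call-parser", parser,
        "--reasoning-parser", parser,
        # Assumption: automatic tool choice is enabled alongside the tool-call parser.
        "--enable-auto-tool-choice",
        # Ray as the distributed executor backend, per the commit message.
        "--distributed-executor-backend", "ray",
        # Assumption: the 2 GPUs are used as a tensor-parallel group.
        "--tensor-parallel-size", str(num_gpus),
    ]


if __name__ == "__main__":
    # Print a shell-quoted command line for inspection; nothing is launched.
    print(shlex.join(build_vllm_command("cyankiwi/MiniMax-M2.7-AWQ-4bit")))
```

Building the command as a list keeps each flag and its value separate, so the result can be passed directly to `subprocess.run` without shell quoting concerns.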