spark-vllm-docker/recipes/minimax-m2.7-awq.yaml at ba9dde963f2824a5d41ed7059084e1622d54b27d

Files

L.B.R. caa28c8e12 Add recipe for MiniMax-M2.7-AWQ

Add a vLLM serving recipe for the MiniMax M2.7 model using
the cyankiwi/MiniMax-M2.7-AWQ-4bit quantization. Uses the
same minimax_m2 tool-call and reasoning parsers as the
existing M2 recipe, with Ray distributed backend on 2 GPUs.

2026-04-18 22:44:26 +01:00

1.1 KiB

Raw Blame History

View Raw

1.1 KiB Raw Blame History

1.1 KiB

Raw Blame History