Re-enable flashinfer_cutlass
@@ -55,6 +55,7 @@ command: |
--max-num-batched-tokens {max_num_batched_tokens} \
--trust-remote-code \
--chat-template unsloth.jinja \
--load-format instanttensor \
-tp {tensor_parallel} \
--distributed-executor-backend ray