Logo
Explore Help
Register Sign In
software-engineering/spark-vllm-docker
2
0
Fork 0
You've already forked spark-vllm-docker
Code Issues Actions Packages Projects Releases Wiki Activity
Files
7c198b1cebd37fbad10bd7ae9cc0b1e54ed0cbe1
spark-vllm-docker/mods
History
eugr 7c198b1ceb Merge pull request #90 from sonusflow/pr/qwen35-397b-tp4
Add Qwen3.5-397B INT4-AutoRound TP=4 recipe (37 tok/s)
2026-03-12 15:04:23 -07:00
..
fix-glm-4.7-flash-AWQ
Now using an opened PR for glm-4.7-flash crash fix in the mod
2026-02-17 12:45:17 -08:00
fix-qwen3-coder-next
Another fix for the Qwen mod as the slow PR was reversed in main
2026-02-13 13:46:00 -08:00
fix-qwen3-next-autoround
Mod for Intel/Qwen3-Coder-Next-INT4-Autoround model
2026-02-24 18:24:42 -08:00
fix-qwen3.5-autoround
Intel/Qwen3.5-122B-A10B-int4-AutoRound support via mods/fix-qwen3.5-autoround
2026-02-27 10:55:42 -08:00
fix-qwen3.5-chat-template
Unsloth chat template for qwen3.5
2026-03-06 23:35:18 -08:00
fix-qwen35-tp4-marlin
Add Qwen3.5-397B INT4-AutoRound TP=4 recipe and Marlin fix
2026-03-09 21:30:28 +00:00
fix-Salyut1-GLM-4.7-NVFP4
initial mod implementation
2025-12-23 13:38:10 -08:00
nemotron-nano
Added ability to launch NGC container in the cluster
2026-02-02 16:57:04 -08:00
nemotron-super
super nemotron mod & recipe for nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
2026-03-11 20:53:44 +01:00
use-ngc-vllm
Added ability to launch NGC container in the cluster
2026-02-02 16:57:04 -08:00
Powered by Gitea Version: 1.25.4 Page: 52ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API