Logo
Explore Help
Register Sign In
software-engineering/spark-vllm-docker
2
0
Fork 0
You've already forked spark-vllm-docker
Code Issues Actions Packages Projects Releases Wiki Activity
Files
ad2cd3373f7d0ef115fc526453494c07756b613a
spark-vllm-docker/mods
History
Eugene Rakhmatulin 03b055d7f0 Major cluster orchestration refactoring to support running without Ray
2026-03-13 11:55:18 -07:00
..
fix-glm-4.7-flash-AWQ
Now using an opened PR for glm-4.7-flash crash fix in the mod
2026-02-17 12:45:17 -08:00
fix-qwen3-coder-next
Another fix for the Qwen mod as the slow PR was reversed in main
2026-02-13 13:46:00 -08:00
fix-qwen3-next-autoround
Mod for Intel/Qwen3-Coder-Next-INT4-Autoround model
2026-02-24 18:24:42 -08:00
fix-qwen3.5-autoround
Intel/Qwen3.5-122B-A10B-int4-AutoRound support via mods/fix-qwen3.5-autoround
2026-02-27 10:55:42 -08:00
fix-qwen3.5-chat-template
Unsloth chat template for qwen3.5
2026-03-06 23:35:18 -08:00
fix-qwen35-tp4-marlin
Add Qwen3.5-397B INT4-AutoRound TP=4 recipe and Marlin fix
2026-03-09 21:30:28 +00:00
fix-Salyut1-GLM-4.7-NVFP4
initial mod implementation
2025-12-23 13:38:10 -08:00
gpu-mem-util-gb
Experimental mod to support gpu-memory-utilization-gb
2026-03-12 13:37:44 -07:00
nemotron-nano
Added ability to launch NGC container in the cluster
2026-02-02 16:57:04 -08:00
nemotron-super
super nemotron mod & recipe for nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
2026-03-11 20:53:44 +01:00
use-ngc-vllm
Major cluster orchestration refactoring to support running without Ray
2026-03-13 11:55:18 -07:00
Powered by Gitea Version: 1.25.4 Page: 54ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API