Logo
Explore Help
Register Sign In
software-engineering/spark-vllm-docker
2
0
Fork 0
You've already forked spark-vllm-docker
Code Issues Actions Packages Projects Releases Wiki Activity
Files
2c13e1ce25a3a3295475b987afb44a59194b1f3e
spark-vllm-docker/mods
History
Eugene Rakhmatulin 1ad85442ac Added a helper mod for Qwen3.5-397B recipe
2026-04-12 19:14:23 -07:00
..
drop-caches
Added a helper mod for Qwen3.5-397B recipe
2026-04-12 19:14:23 -07:00
fix-gemma4-tool-parser
Mod to fix Gemma4 tool parser
2026-04-04 16:48:07 -07:00
fix-glm-4.7-flash-AWQ
Now using an opened PR for glm-4.7-flash crash fix in the mod
2026-02-17 12:45:17 -08:00
fix-qwen3-coder-next
Another fix for the Qwen mod as the slow PR was reversed in main
2026-02-13 13:46:00 -08:00
fix-qwen3-next-autoround
Mod for Intel/Qwen3-Coder-Next-INT4-Autoround model
2026-02-24 18:24:42 -08:00
fix-qwen3.5-autoround
Made autoround mod optional to support latest version of vLLM. Fixes #144.
2026-03-27 09:00:50 -07:00
fix-qwen3.5-chat-template
Unsloth chat template for qwen3.5
2026-03-06 23:35:18 -08:00
fix-qwen35-tp4-marlin
Add Qwen3.5-397B INT4-AutoRound TP=4 recipe and Marlin fix
2026-03-09 21:30:28 +00:00
fix-Salyut1-GLM-4.7-NVFP4
initial mod implementation
2025-12-23 13:38:10 -08:00
gpu-mem-util-gb
Experimental mod to support gpu-memory-utilization-gb
2026-03-12 13:37:44 -07:00
nemotron-nano
Added ability to launch NGC container in the cluster
2026-02-02 16:57:04 -08:00
nemotron-super
super nemotron mod & recipe for nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
2026-03-11 20:53:44 +01:00
use-ngc-vllm
Major cluster orchestration refactoring to support running without Ray
2026-03-13 11:55:18 -07:00
Powered by Gitea Version: 1.25.4 Page: 106ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API