Commit Graph

  • 23cca2a11a Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 23:17:25 -07:00
  • c2fe579ccc Enhance .env file handling and validation in scripts Eugene Rakhmatulin 2026-03-25 23:16:56 -07:00
  • 8b7c02aa25 add .env support to build-and-copy.sh Eugene Rakhmatulin 2026-03-25 22:47:02 -07:00
  • 73fec1bdf8 bugfix Eugene Rakhmatulin 2026-03-25 15:40:09 -07:00
  • 2f5ff0211e Cleanup in build script Eugene Rakhmatulin 2026-03-25 15:39:23 -07:00
  • 63ee72e729 Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 15:36:31 -07:00
  • 4a0feea6c3 Added --cleanup option to build script Eugene Rakhmatulin 2026-03-25 15:35:32 -07:00
  • 429042b7dc Revert "Added --cleanup option" Eugene Rakhmatulin 2026-03-25 15:35:15 -07:00
  • ef95336937 Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 15:25:19 -07:00
  • b8930b05a1 Added --cleanup option Eugene Rakhmatulin 2026-03-25 15:24:59 -07:00
  • 49d505ad14 Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 15:16:47 -07:00
  • 1755dfd114 Added LOCAL_IP support Eugene Rakhmatulin 2026-03-25 15:16:06 -07:00
  • 3d4dc4c82e Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 14:42:37 -07:00
  • 07fac71dac Fixed bug with CONTAINER_NAME variable Eugene Rakhmatulin 2026-03-25 14:42:01 -07:00
  • 1702f47df6 Merge branch '3-node' of gitlab.home.eugr.net:ai/spark-vllm into 3-node Eugene Rakhmatulin 2026-03-25 14:18:32 -07:00
  • ad2cd3373f .env configuration support for launch-cluster.sh Eugene Rakhmatulin 2026-03-25 14:18:00 -07:00
  • 1fd8c7afc3 Merge branch 'main' into 3-node Eugene Rakhmatulin 2026-03-25 12:45:40 -07:00
  • 3dcd2a90c1 Updated Nemotron-3-Super recipe Eugene Rakhmatulin 2026-03-25 12:44:44 -07:00
  • efacbd69f2 Updated Nemotron3-Super recipe Eugene Rakhmatulin 2026-03-25 12:43:12 -07:00
  • c4b078b868 Merge branch 'main' into 3-node Eugene Rakhmatulin 2026-03-24 22:21:25 -07:00
  • 3be2fb24a8 Merge pull request #122 Eugene Rakhmatulin 2026-03-24 22:18:52 -07:00
  • 7fa69187df metadata changes Eugene Rakhmatulin 2026-03-24 22:18:07 -07:00
  • 8298c3d7f8 Merge remote-tracking branch 'upstream/main' Drew Botwinick 2026-03-24 15:41:09 -05:00
  • f8c2653fd3 Quick fix for NCCL dependency Eugene Rakhmatulin 2026-03-23 23:20:59 -07:00
  • 990a7b3837 Use mesh-optimized NCCL Eugene Rakhmatulin 2026-03-23 15:43:18 -07:00
  • 9e089acf2b Updated Nemotron recipes to use VLLM CUTLASS Eugene Rakhmatulin 2026-03-22 23:03:24 -07:00
  • 2d749742e4 Changed base image back to base CUDA development one Eugene Rakhmatulin 2026-03-21 18:11:20 -07:00
  • 7a54657abf Revert "cuda 13.2 torch" Eugene Rakhmatulin 2026-03-21 15:36:17 -07:00
  • 926dd57a87 cuda 13.2 torch Eugene Rakhmatulin 2026-03-21 15:15:01 -07:00
  • 6e8d85c914 cleanup Eugene Rakhmatulin 2026-03-21 15:12:12 -07:00
  • d6e76f8e2f add build metadata generation and include in Dockerfiles Drew Botwinick 2026-03-21 16:10:04 -05:00
  • 8385506c5e Fixes Eugene Rakhmatulin 2026-03-20 23:51:21 -07:00
  • 8caebe3155 Reverting back to CUDA image + pytorch from wheels Eugene Rakhmatulin 2026-03-20 17:03:18 -07:00
  • 919a881cb1 Merge branch 'main' of gitlab.home.eugr.net:ai/spark-vllm Eugene Rakhmatulin 2026-03-18 22:03:25 -07:00
  • 8ddc259619 Fixed #111 Eugene Rakhmatulin 2026-03-18 22:03:04 -07:00
  • 22f3fa6c21 Merge pull request #103 from apairmont/network_arg eugr 2026-03-18 21:48:48 -07:00
  • 15d295887c Updated README to reflect --master-port parameter Eugene Rakhmatulin 2026-03-18 21:23:28 -07:00
  • 7e4150feed Added master-port argument Eugene Rakhmatulin 2026-03-18 16:57:55 -07:00
  • 7b752c31c5 Merge pull request #110 from voloszad/patch-1 eugr 2026-03-18 14:54:11 -07:00
  • bdd2b10f54 Remove script copy and permission commands from Dockerfile Andrej V. 2026-03-18 21:57:56 +01:00
  • 2755b62d12 Fixes #108 Eugene Rakhmatulin 2026-03-18 13:26:39 -07:00
  • f327b92abe Fixes #106 and #108 Eugene Rakhmatulin 2026-03-18 13:06:44 -07:00
  • 57b458570e Added experimental Qwen3.5-397B support for dual Spark configuration Eugene Rakhmatulin 2026-03-17 19:05:36 -07:00
  • 57ed099465 Updated README file to reflect new launch-cluster options. Eugene Rakhmatulin 2026-03-17 16:16:04 -07:00
  • fb0687cd1b Updated README to describe no-ray mode Eugene Rakhmatulin 2026-03-17 15:27:22 -07:00
  • ccea2ba861 Bugfixes Eugene Rakhmatulin 2026-03-17 13:54:42 -07:00
  • 957605498c Added extra passthrough variables to run-recipe Eugene Rakhmatulin 2026-03-17 13:41:40 -07:00
  • b1eeefc0eb Changed Nemotron-3-Nano-NVFP4 to Marlin backend Eugene Rakhmatulin 2026-03-17 13:10:48 -07:00
  • b879b7748f add network arg to common build flags Alan Pairmont 2026-03-16 12:09:59 -04:00
  • fa645f3e4b bugfixes Eugene Rakhmatulin 2026-03-13 13:39:30 -07:00
  • dedbd0a01d bugfixes Eugene Rakhmatulin 2026-03-13 12:41:48 -07:00
  • caa83d9e5b Bugfixes Eugene Rakhmatulin 2026-03-13 12:32:43 -07:00
  • 4bcbbaa25a Bugfixes Eugene Rakhmatulin 2026-03-13 12:23:41 -07:00
  • d08266a123 Bugfixes Eugene Rakhmatulin 2026-03-13 12:18:22 -07:00
  • 03b055d7f0 Major cluster orchestration refactoring to support running without Ray Eugene Rakhmatulin 2026-03-13 11:55:18 -07:00
  • d609fecef3 Merge branch 'main' of github.com:eugr/spark-vllm-docker Eugene Rakhmatulin 2026-03-12 15:04:41 -07:00
  • 7c198b1ceb Merge pull request #90 from sonusflow/pr/qwen35-397b-tp4 eugr 2026-03-12 15:04:23 -07:00
  • 8ae51192e5 Experimental mod to support gpu-memory-utilization-gb Eugene Rakhmatulin 2026-03-12 13:37:44 -07:00
  • 8fec9bed06 Updated Nemotron to support dual sparks Eugene Rakhmatulin 2026-03-12 13:30:15 -07:00
  • 6a323cc6f5 Merge pull request #93 Eugene Rakhmatulin 2026-03-12 13:00:13 -07:00
  • 6f9a2f981c Adjusted model parameters Eugene Rakhmatulin 2026-03-12 12:59:05 -07:00
  • 122edc8229 super nemotron mod & recipe for nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 remi 2026-03-11 20:53:44 +01:00
  • 7ceea85647 Fixed qwen3-coder-next-int4-autoround to exclude Ray Eugene Rakhmatulin 2026-03-11 11:20:56 -07:00
  • 45066e2b16 Updated README Eugene Rakhmatulin 2026-03-11 09:57:34 -07:00
  • f2cf11b047 Added a recipe for qwen3-coder-next-int4-autoround Eugene Rakhmatulin 2026-03-11 09:23:23 -07:00
  • 3baca14eb1 Move recipe to 4x-spark-cluster/ and add UMA memory optimizations sonusflow 2026-03-11 07:29:45 +00:00
  • 66b5c85907 Merge branch 'main' of github.com:eugr/spark-vllm-docker Eugene Rakhmatulin 2026-03-10 10:29:10 -07:00
  • 0019bdf5ed Merge pull request #85 from saladinomario/feat/recipe-env-passthrough eugr 2026-03-10 10:28:29 -07:00
  • 006734910c Add Qwen3.5-397B INT4-AutoRound TP=4 recipe and Marlin fix sonusflow 2026-03-09 21:30:28 +00:00
  • e225c709fb Revert "fix: add temporary patch for CUDA graphs estimation" as it has been merged to main Eugene Rakhmatulin 2026-03-09 09:46:50 -07:00
  • 63b2a8dbed fix: add temporary patch for CUDA graphs estimation Eugene Rakhmatulin 2026-03-08 22:43:41 -07:00
  • 9724619dbd Merge pull request #87 from SeraphimSerapis/fix_wheels_download eugr 2026-03-07 09:34:31 -08:00
  • d42c4199fa Unsloth chat template for qwen3.5 (staging-current-1772875976) Eugene Rakhmatulin 2026-03-06 23:35:18 -08:00
  • b9fc32ec34 fix: skip empty lines in wheel download read loop Tim Messerschmidt 2026-03-07 05:06:12 +01:00
  • 9dc09bd04b Renamed recipe for qwen3.5-35b-a3b-fp8 to match others Eugene Rakhmatulin 2026-03-06 13:56:06 -08:00
  • e88426646b Merge pull request #76 from mmonad/fix-exec-arg-quoting eugr 2026-03-06 13:45:53 -08:00
  • f95beba566 Add -e/--env passthrough to run-recipe.py mariosaladino 2026-03-06 21:50:29 +01:00
  • eb8abcca7f Prevent 169.254.x.x fallback when setting fix IP address (#84) Olivier Paroz 2026-03-06 20:47:47 +01:00
  • d148d95a19 Merge pull request #80 from oliverjohnwilson/recipe-add_minimax-m2.5_qwen3.5-397b-a17B-fp8 eugr 2026-03-06 11:46:37 -08:00
  • 5346372f14 More robust wheels check before download Eugene Rakhmatulin 2026-03-05 17:06:57 -08:00
  • 5f8f988d91 Merge branch 'main' of github.com:eugr/spark-vllm-docker Eugene Rakhmatulin 2026-03-05 16:29:00 -08:00
  • 3fabd3fb1c Merge pull request #72 from erikvullings/main eugr 2026-03-05 16:27:50 -08:00
  • 2d03bc138d saving flashinfer and vllm commits in wheels directories Eugene Rakhmatulin 2026-03-05 14:41:25 -08:00
  • a749fcce87 Added a recipe for qwen3.5-122B-FP8 (staging-current-1772696532, staging-current-1772696417) Eugene Rakhmatulin 2026-03-04 16:49:39 -08:00
  • 505a060a7d vLLM prebuilt wheels support Eugene Rakhmatulin 2026-03-04 16:01:50 -08:00
  • ca34ebcffc Merge branch 'main' into vllm-wheels Eugene Rakhmatulin 2026-03-04 15:59:16 -08:00
  • 4303f8b6d0 added minimax-m2.5 and qwen3.5-397b-a17B-fp8 recipes to a recipes/4x-spark-cluster/ subdirectory oliverjohnwilson 2026-03-04 16:01:37 -06:00
  • 2152ef127d Now can use prebuilt vLLM wheels Eugene Rakhmatulin 2026-03-04 13:33:32 -08:00
  • 19f06a0d16 Fixed a bug with checking whether we need to download remote wheels (staging-current-1772668553, staging-current-1772668424) Eugene Rakhmatulin 2026-03-04 13:00:40 -08:00
  • bbd7db2813 revert bumping up base image (staging-current-1772642791, staging-current-1772642670) Eugene Rakhmatulin 2026-03-04 07:29:53 -08:00
  • 50b3ca60f3 Fix shell quoting for exec command arguments L.B.R. 2026-03-04 15:22:42 +00:00
  • fff1a24982 Rolling back base image Eugene Rakhmatulin 2026-03-04 07:19:43 -08:00
  • ae19b66fdd Bumped base image version Eugene Rakhmatulin 2026-03-03 23:27:47 -08:00
  • 163f23d85b Update qwen35-35b-a3b-fp8.yaml Erik Vullings 2026-03-03 12:46:12 +01:00
  • 7d8465fd9c Added recipe for qwen3.5-122b-int4-autoround, updated README (staging-current-1772609005, staging-current-1772608894, staging-current-1772608818) Eugene Rakhmatulin 2026-03-02 12:18:16 -08:00
  • 8f11e7e5ed Intel/Qwen3.5-122B-A10B-int4-AutoRound support via mods/fix-qwen3.5-autoround Eugene Rakhmatulin 2026-02-27 10:55:42 -08:00
  • e8f94d6b8b Add Qwen35-35B-A3B recipe in FP8 format Erik Vullings 2026-02-27 17:46:06 +01:00
  • df88997449 piping exec command to docker logs when running in the daemon mode. Eugene Rakhmatulin 2026-02-26 18:19:38 -08:00
  • 15888c407a Merge pull request #62 Eugene Rakhmatulin 2026-02-26 15:24:42 -08:00
  • c1c3b9d66a support for daemon mode with exec command Eugene Rakhmatulin 2026-02-26 15:23:08 -08:00