Eugene Rakhmatulin
f886505436
Added --non-privileged flag to launch-cluster.sh
2026-02-15 00:12:06 -08:00
Eugene Rakhmatulin
701147b1eb
Qwen3-Coder-Next fixes and updated recipe
2026-02-12 15:56:32 -08:00
Eugene Rakhmatulin
3b1e49dcb0
Supporting other CUDA archs via --gpu-arch flag
2026-02-11 13:10:41 -08:00
Eugene Rakhmatulin
6d3f5dfd5c
map flashinfer/torch/triton cache directories by default
2026-02-10 16:36:02 -08:00
Eugene Rakhmatulin
ace16f3a8f
Applied new fastsafetensors fix to mxfp4 build; disabled wheel builds by default
2026-02-09 23:47:06 -08:00
Eugene Rakhmatulin
2923fe6ea5
Removed temp fastsafetensors patch
2026-02-09 10:21:14 -08:00
Eugene Rakhmatulin
ec987259a0
Recipes and Launch Script support
2026-02-04 12:01:53 -08:00
Eugene Rakhmatulin
ef6a5eca29
Merge branch 'main' into pr-19
2026-02-04 11:36:59 -08:00
Eugene Rakhmatulin
1e5aa060b8
Updated README to include networking guide
2026-02-03 14:14:05 -08:00
Eugene Rakhmatulin
f8eb294c58
Updated README.md and added Networking Guide.
2026-02-03 12:54:38 -08:00
Eugene Rakhmatulin
4b9ab0de7c
Added ability to launch NGC container in the cluster
2026-02-02 16:57:04 -08:00
Eugene Rakhmatulin
4634ee92a2
Added a mod for Nemotron Nano
2026-02-02 11:58:07 -08:00
Raphael Amorim
751bc5a47a
Adding sample profile and profile loader
2026-02-02 10:25:53 -05:00
Eugene Rakhmatulin
ace61c2d55
added new mod for glm4.7-flash-awq, solo model support.
2026-01-29 18:18:00 -08:00
Eugene Rakhmatulin
7a81e90cd2
added -e parameter
2026-01-29 13:06:22 -08:00
Eugene Rakhmatulin
a3afb6f313
Merge branch 'main' into mxfp4
2026-01-28 13:25:26 -08:00
Eugene Rakhmatulin
74c02c37c2
warning message about wheel builds
2026-01-28 13:25:02 -08:00
Eugene Rakhmatulin
6b11902cc8
Updated README
2026-01-26 23:18:27 -08:00
Eugene Rakhmatulin
18a25c8382
Updated README
2026-01-08 14:38:12 -08:00
Eugene Rakhmatulin
4ee090f632
Updated README re: hf-download option
2025-12-24 08:37:33 -08:00
Eugene Rakhmatulin
04e6d27b84
Updated README re: mods functionality
2025-12-23 18:09:59 -08:00
Eugene Rakhmatulin
786a50c5c7
Updated README
2025-12-21 22:41:48 -08:00
Eugene Rakhmatulin
1139a37324
Added transformers v5 support
2025-12-21 22:41:03 -08:00
Eugene Rakhmatulin
c37053adf6
Updated README
2025-12-21 14:57:35 -08:00
Eugene Rakhmatulin
82802f0cad
Added Quickstart section to README
2025-12-21 14:53:05 -08:00
Eugene Rakhmatulin
bbd3469549
Support vLLM release wheels
2025-12-21 11:15:52 -08:00
Eugene Rakhmatulin
2aa545a810
Added PSA about build cache
2025-12-21 00:49:59 -08:00
Eugene Rakhmatulin
63a1a6a97c
Update README to reflect reduced build time and container size for vLLM
2025-12-20 23:16:12 -08:00
Eugene Rakhmatulin
dfe426e912
Add support for pre-release FlashInfer packages in Docker builds
2025-12-20 23:13:26 -08:00
Eugene Rakhmatulin
76988e0c75
Added --use-wheels to use precompiled vLLM wheels instead of compiling from the source
2025-12-20 20:25:07 -08:00
Eugene Rakhmatulin
0cac77c286
Fixed contributor username
2025-12-19 10:41:03 -08:00
Eugene Rakhmatulin
3eb57a6d49
Updated README - autodiscovery in copy ops
2025-12-19 10:39:28 -08:00
Eugene Rakhmatulin
244ad758d2
Updated README
2025-12-19 09:56:24 -08:00
Eugene Rakhmatulin
23858a3c7f
Merge branch 'main' into pr-2
2025-12-19 08:51:52 -08:00
Eugene Rakhmatulin
de055928b8
Update CHANGELOG: Document --nccl-debug option for NCCL debug level control
2025-12-18 23:29:03 -08:00
Eugene Rakhmatulin
294d155532
Add NCCL debug level option to launch-cluster.sh
2025-12-18 23:28:12 -08:00
Eugene Rakhmatulin
8c53179cc2
changed extra docker args variable to VLLM_SPARK_EXTRA_DOCKER_ARGS for consistency
2025-12-18 22:27:27 -08:00
Eugene Rakhmatulin
cf9da89545
Updated README
2025-12-18 22:03:46 -08:00
Eugene Rakhmatulin
e6efd668cd
Added Table of Contents to README
2025-12-18 15:43:09 -08:00
Eugene Rakhmatulin
8be691e806
Fixed issue with argument passing
2025-12-18 15:31:53 -08:00
Eugene Rakhmatulin
369283f655
Updated README.md with launch-cluster details.
2025-12-18 15:25:22 -08:00
Eric Lewis
11355677f6
Add parallel copy option to build-and-copy.sh
Introduced the --copy-parallel flag to enable concurrent copying of Docker images to multiple hosts. Updated the README with usage instructions and details about the new option. Refactored the script to support both serial and parallel copy modes for improved efficiency.
2025-12-18 01:24:48 -05:00
Eric Lewis
e67abd5e6e
Add multi-host copy support to build-and-copy.sh
Updated build-and-copy.sh to support copying Docker images to multiple hosts using the new -c/--copy-to flag, which accepts space- or comma-separated host lists. The old --copy-to-host flag is retained as an alias for backward compatibility, and -h is now used for help. The README was updated to document these changes and provide new usage examples.
2025-12-18 00:32:45 -05:00
Eugene Rakhmatulin
79f6a204d1
Update README.md
2025-12-15 09:51:49 -08:00
eugr
02f842e1fd
Updated README
2025-12-14 00:39:15 -08:00
eugr
295e1f2266
Removed MiniMax M2 temporary patch from Dockerfile; updated README.md
2025-12-11 13:24:57 -08:00
eugr
5fba205db4
Implemented a temporary patch for recently broken MiniMax-M2 (in builds after 12/10) for some quants.
2025-12-11 11:13:05 -08:00
eugr
9d351cd6d5
Updated README
2025-12-05 11:32:02 -08:00
eugr
270446be27
Add build-and-copy script for automated image building and deployment
2025-12-05 11:28:43 -08:00
eugr
6a66a4b66f
Added patch to allow fastsafetensors in cluster config
2025-11-26 21:25:04 -08:00