Commit Graph

57 Commits

Author SHA1 Message Date
Eugene Rakhmatulin
ec987259a0 Recipes and Launch Script support 2026-02-04 12:01:53 -08:00
Eugene Rakhmatulin
ef6a5eca29 Merge branch 'main' into pr-19 2026-02-04 11:36:59 -08:00
Eugene Rakhmatulin
1e5aa060b8 Updated README to include networking guide 2026-02-03 14:14:05 -08:00
Eugene Rakhmatulin
f8eb294c58 Updated README.md and added Networking Guide. 2026-02-03 12:54:38 -08:00
Eugene Rakhmatulin
4b9ab0de7c Added ability to launch NGC container in the cluster 2026-02-02 16:57:04 -08:00
Eugene Rakhmatulin
4634ee92a2 Added a mod for Nemotron Nano 2026-02-02 11:58:07 -08:00
Raphael Amorim
751bc5a47a Adding sample profile and profile loader 2026-02-02 10:25:53 -05:00
Eugene Rakhmatulin
ace61c2d55 added new mod for glm4.7-flash-awq, solo model support. 2026-01-29 18:18:00 -08:00
Eugene Rakhmatulin
7a81e90cd2 added -e parameter 2026-01-29 13:06:22 -08:00
Eugene Rakhmatulin
a3afb6f313 Merge branch 'main' into mxfp4 2026-01-28 13:25:26 -08:00
Eugene Rakhmatulin
74c02c37c2 warning message about wheel builds 2026-01-28 13:25:02 -08:00
Eugene Rakhmatulin
6b11902cc8 Updated README 2026-01-26 23:18:27 -08:00
Eugene Rakhmatulin
18a25c8382 Updated README 2026-01-08 14:38:12 -08:00
Eugene Rakhmatulin
4ee090f632 Updated README re: hf-download option 2025-12-24 08:37:33 -08:00
Eugene Rakhmatulin
04e6d27b84 Updated README re: mods functionality 2025-12-23 18:09:59 -08:00
Eugene Rakhmatulin
786a50c5c7 Updated README 2025-12-21 22:41:48 -08:00
Eugene Rakhmatulin
1139a37324 Added transformers v5 support 2025-12-21 22:41:03 -08:00
Eugene Rakhmatulin
c37053adf6 Updated README 2025-12-21 14:57:35 -08:00
Eugene Rakhmatulin
82802f0cad Added Quickstart section to README 2025-12-21 14:53:05 -08:00
Eugene Rakhmatulin
bbd3469549 Support vLLM release wheels 2025-12-21 11:15:52 -08:00
Eugene Rakhmatulin
2aa545a810 Added PSA about build cache 2025-12-21 00:49:59 -08:00
Eugene Rakhmatulin
63a1a6a97c Update README to reflect reduced build time and container size for vLLM 2025-12-20 23:16:12 -08:00
Eugene Rakhmatulin
dfe426e912 Add support for pre-release FlashInfer packages in Docker builds 2025-12-20 23:13:26 -08:00
Eugene Rakhmatulin
76988e0c75 Added --use-wheels to use precompiled vLLM wheels instead of compiling from the source 2025-12-20 20:25:07 -08:00
Eugene Rakhmatulin
0cac77c286 Fixed contributor username 2025-12-19 10:41:03 -08:00
Eugene Rakhmatulin
3eb57a6d49 Updated README - autodiscovery in copy ops 2025-12-19 10:39:28 -08:00
Eugene Rakhmatulin
244ad758d2 Updated README 2025-12-19 09:56:24 -08:00
Eugene Rakhmatulin
23858a3c7f Merge branch 'main' into pr-2 2025-12-19 08:51:52 -08:00
Eugene Rakhmatulin
de055928b8 Update CHANGELOG: Document --nccl-debug option for NCCL debug level control 2025-12-18 23:29:03 -08:00
Eugene Rakhmatulin
294d155532 Add NCCL debug level option to launch-cluster.sh 2025-12-18 23:28:12 -08:00
Eugene Rakhmatulin
8c53179cc2 changed extra docker args variable to VLLM_SPARK_EXTRA_DOCKER_ARGS for consistency 2025-12-18 22:27:27 -08:00
Eugene Rakhmatulin
cf9da89545 Updated README 2025-12-18 22:03:46 -08:00
Eugene Rakhmatulin
e6efd668cd Added Table of Contents to README 2025-12-18 15:43:09 -08:00
Eugene Rakhmatulin
8be691e806 Fixed issue with argument passing 2025-12-18 15:31:53 -08:00
Eugene Rakhmatulin
369283f655 Updated README.md with launch-cluster details. 2025-12-18 15:25:22 -08:00
Eric Lewis
11355677f6 Add parallel copy option to build-and-copy.sh
Introduced the --copy-parallel flag to enable concurrent copying of Docker images to multiple hosts. Updated the README with usage instructions and details about the new option. Refactored the script to support both serial and parallel copy modes for improved efficiency.
2025-12-18 01:24:48 -05:00
Eric Lewis
e67abd5e6e Add multi-host copy support to build-and-copy.sh
Updated build-and-copy.sh to support copying Docker images to multiple hosts using the new -c/--copy-to flag, which accepts space- or comma-separated host lists. The old --copy-to-host flag is retained as an alias for backward compatibility, and -h is now used for help. The README was updated to document these changes and provide new usage examples.
2025-12-18 00:32:45 -05:00
Eugene Rakhmatulin
79f6a204d1 Update README.md 2025-12-15 09:51:49 -08:00
eugr
02f842e1fd Updated README 2025-12-14 00:39:15 -08:00
eugr
295e1f2266 Removed MiniMax M2 temporary patch from Dockerfile; updated README.md 2025-12-11 13:24:57 -08:00
eugr
5fba205db4 Implemented a temporary patch for recently broken MiniMax-M2 (in builds after 12/10) for some quants. 2025-12-11 11:13:05 -08:00
eugr
9d351cd6d5 Updated README 2025-12-05 11:32:02 -08:00
eugr
270446be27 Add build-and-copy script for automated image building and deployment 2025-12-05 11:28:43 -08:00
eugr
6a66a4b66f Added patch to allow fastsafetensors in cluster config 2025-11-26 21:25:04 -08:00
eugr
712637a348 Added second RoCE interface to examples 2025-11-26 19:53:37 -08:00
eugr
bdf16a0a34 Formatting 2025-11-26 14:02:15 -08:00
eugr
cf8e411ad2 Added benchmarking 2025-11-26 14:01:04 -08:00
eugr
676fa2ace9 Formatting fix 2025-11-26 13:52:30 -08:00
eugr
4f27899939 Added some details on networking 2025-11-26 13:50:39 -08:00
eugr
1a4bc1d7aa Typo 2025-11-26 13:44:34 -08:00