Added --non-privileged flag to launch-cluster.sh

This commit is contained in:
Eugene Rakhmatulin
2026-02-15 00:12:06 -08:00
parent 4214d4fefe
commit f886505436
2 changed files with 104 additions and 6 deletions

View File

@@ -164,6 +164,23 @@ Don't do it every time you rebuild, because it will slow down compilation times.
For periodic maintenance, I recommend using a filter: `docker builder prune --filter until=72h`
### 2026-02-14
#### Non-Privileged Mode Support
Added `--non-privileged` flag to `launch-cluster.sh` for running containers without full privileged access while maintaining RDMA/InfiniBand functionality:
- Replaces `--privileged` with `--cap-add=IPC_LOCK`
- Replaces `--ipc=host` with `--shm-size=64g` (configurable via `--shm-size-gb`)
- Exposes RDMA devices via `--device=/dev/infiniband`
- Adds resource limits: memory (110GB), memory+swap (120GB), pids (4096)
Example usage:
```bash
./launch-cluster.sh --non-privileged exec vllm serve ...
./launch-cluster.sh --non-privileged --mem-limit-gb 120 --shm-size-gb 64 exec vllm serve ...
```
### 2026-02-12
Added a mod for Qwen3-Coder-Next-FP8 that fixes:
@@ -772,6 +789,34 @@ You can override the auto-detected values if needed:
| `--no-cache-dirs` | Do not mount default cache directories (~/.cache/vllm, ~/.cache/flashinfer, ~/.triton). |
| `--launch-script` | Path to bash script to execute in the container (from examples/ directory or absolute path). If launch script is specified, action should be omitted. |
| `-d` | Run in daemon mode (detached). |
| `--non-privileged` | Run in non-privileged mode (removes `--privileged` and `--ipc=host`). |
| `--mem-limit-gb` | Memory limit in GB (default: 110, only with `--non-privileged`). |
| `--mem-swap-limit-gb` | Memory+swap limit in GB (default: mem-limit + 10, only with `--non-privileged`). |
| `--pids-limit` | Process limit (default: 4096, only with `--non-privileged`). |
| `--shm-size-gb` | Shared memory size in GB (default: 64, only with `--non-privileged`). |
### Non-Privileged Mode
The `--non-privileged` flag allows running containers without full privileged access while maintaining RDMA/InfiniBand functionality:
```bash
./launch-cluster.sh --non-privileged exec vllm serve ...
```
When `--non-privileged` is specified:
- `--privileged` is replaced with `--cap-add=IPC_LOCK`
- `--ipc=host` is replaced with `--shm-size=64g` (configurable via `--shm-size-gb`)
- RDMA devices are exposed via `--device=/dev/infiniband`
- Resource limits are applied: memory (110GB), memory+swap (120GB), pids (4096)
These resource limits can be customized:
```bash
./launch-cluster.sh --non-privileged \
--mem-limit-gb 120 \
--mem-swap-limit-gb 130 \
--shm-size-gb 64 \
exec vllm serve ...
```
## 3\. Running the Container (Manual)