Updated README
This commit is contained in:
12
README.md
12
README.md
@@ -135,6 +135,18 @@ For periodic maintenance, I recommend using a filter: `docker builder prune --fi
|
|||||||
|
|
||||||
## CHANGELOG
|
## CHANGELOG
|
||||||
|
|
||||||
|
### 2026-04-12
|
||||||
|
|
||||||
|
#### Drop-caches mod for Qwen3.5-397B
|
||||||
|
|
||||||
|
Updated Qwen3.5-397B recipe (for dual node configuration) to use the new mod `mods/drop-caches` which clears filesystem caches every minute while the container is running, resolving fastsafetensors getting stuck during loading and a few other bugs when operating close to max memory limit.
|
||||||
|
|
||||||
|
### 2026-04-11
|
||||||
|
|
||||||
|
#### Pinned PyTorch Version
|
||||||
|
|
||||||
|
Pinned PyTorch to version 2.11.0 (previously using nightly builds) to fix incompatibility with transformers 5.x and avoid torch version mismatch in builds.
|
||||||
|
|
||||||
### 2026-04-02
|
### 2026-04-02
|
||||||
|
|
||||||
A new recipe for Gemma4-26B-A4B in "on-the-fly" FP8 quantization:
|
A new recipe for Gemma4-26B-A4B in "on-the-fly" FP8 quantization:
|
||||||
|
|||||||
Reference in New Issue
Block a user