Commit Graph

81 Commits

SHA1 Message Date
eca316efa4 add timeouts 2026-05-07 11:42:59 -05:00
e56d29528f set cache reuse to 1 2026-05-07 11:28:09 -05:00
ebcc0cf045 enable cache reuse 2026-05-07 11:26:30 -05:00
0997aa48b7 add max model len 2026-05-07 11:19:11 -05:00
923679cb29 set env 2026-05-07 11:13:23 -05:00
1e7630efe8 remove args for now 2026-05-06 18:00:06 -05:00
af8c9a1254 set profiles 2026-05-06 17:52:14 -05:00
4287242035 set profiles 2026-05-06 17:51:29 -05:00
59af501a82 set default profile 2026-05-06 17:51:00 -05:00
f602e1aec9 set string 2026-05-06 17:48:20 -05:00
da57ec24ee use qwen 2026-05-06 17:46:54 -05:00
066554aa36 use 32b 2026-05-06 17:42:18 -05:00
98f39b7c68 add qwen 2026-05-06 17:33:57 -05:00
e90c7eeaa8 add qwen cache 2026-05-06 17:33:31 -05:00
7269275809 add toolchoice 2026-05-06 17:22:36 -05:00
3989021efc use correct cert 2026-05-06 17:05:16 -05:00
4af01f2d0a add issuer 2026-05-06 17:03:36 -05:00
11cd367f94 allow all 2026-05-06 16:49:25 -05:00
12020fb753 update hostname 2026-05-06 16:46:17 -05:00
8bf88ebf2f set kustomization 2026-05-06 16:37:50 -05:00
a5416a75f3 add gateway and http route 2026-05-06 16:37:39 -05:00
1c8cca8db0 add address pool 2026-05-06 16:34:18 -05:00
b268e01870 add metallb 2026-05-06 16:32:36 -05:00
10ea7005d4 remove readonly 2026-05-06 16:23:19 -05:00
2d6c9bbdfd fix model name 2026-05-06 12:59:02 -05:00
df29d77652 fix image name 2026-05-06 12:53:46 -05:00
5e57329067 use nwer model 2026-05-06 12:52:03 -05:00
314e730995 fix yaml 2026-05-06 12:11:26 -05:00
df78657374 add test model 2026-05-06 12:07:59 -05:00
93fc5e52d7 fix chart name 2026-05-06 12:00:43 -05:00
2c831b04da remove 2026-05-06 11:59:24 -05:00
b238eec199 fix typo 2026-05-06 11:59:15 -05:00
cb1ff576f1 add nim-operaotr 2026-05-06 11:58:30 -05:00
4300d7e3cb add nim-operator 2026-05-06 11:56:50 -05:00
d877450fc0 remove granite 2026-05-06 11:50:46 -05:00
0e50ac4e0e remove kserve 2026-05-06 11:50:29 -05:00
cb56f3838d no lb 2026-05-05 15:21:10 -05:00
0bef50c896 use correct format 2026-05-05 15:15:37 -05:00
778083b9f6 use correctly 2026-05-05 15:13:59 -05:00
87f98fbca6 use alpha1 2026-05-05 15:11:49 -05:00
00f78688e5 set version 2026-05-05 15:10:56 -05:00
fa935586dc try v1 beta 2026-05-05 15:06:11 -05:00
ca6f6f4afc change name 2026-05-05 15:04:38 -05:00
f1306836f2 use v1 2026-05-05 15:01:07 -05:00
a0c6b57628 add inference model 2026-05-05 14:58:23 -05:00
973ffc99d1 change interval 2026-05-05 14:56:33 -05:00
0d4b42a2a8 add class back 2026-05-05 14:53:11 -05:00
40fe171ae4 dont add class yet 2026-05-05 14:50:30 -05:00
fd3e05184f remove kserve 2026-05-05 14:49:36 -05:00
541caca9e9 depend on infra 2026-05-05 14:46:46 -05:00