Optimize PPO coverage and recharge strategy

This commit is contained in:
2026-04-26 19:25:05 +08:00
parent 220de372e0
commit 5b6133db13
4 changed files with 399 additions and 108 deletions

View File

@@ -13,11 +13,13 @@ Configuration for Robot Vacuum PPO agent.
class Config:
# Feature dimensions (157D)
# 特征维度157D
# Feature dimensions: 21x21x6 local map + scalar planning features + last action.
# 特征维度21x21x6 多通道局部地图 + 标量规划特征 + 上一步动作。
VIEW_SIZE = 21
MAP_CHANNELS = 6
FEATURES = [
11 * 11, # wider local map view / 更大的局部地图视野
28, # global, charger, NPC, and map-stat features / 全局、充电桩、NPC、地图统计特征
VIEW_SIZE * VIEW_SIZE * MAP_CHANNELS,
66, # global memory, charger, NPC, and action-improvement features
8, # last action one-hot / 上一步动作 one-hot
]
FEATURE_SPLIT_SHAPE = FEATURES