Solving Large-Scale POMDP Problems Via Belief State Analysis
Xin Li, William K. Cheung, Jiming Liu

Where am I on earth?
How to get to the room with ?
Which is the next better action?
Observation: Wall's combination!

MDP + random sampling to collect beliefs
Temporal difference
Spatial difference

[Diagram labels: High-dimensional belief space; Clustering; Dimension reduction with EPCA per cluster; Low-dimensional space; Discretization; Sub-policy 1; Sub-policy 2; Policy]

[Figure: Policy quality comparison between the conventional belief compression and the proposed method with belief clustering]

X. Li, W. K. Cheung, J. Liu, "Towards Solving Large-Scale POMDP Problems via Spatio-Temporal Belief State Clustering," Proceedings of IJCAI-05 Workshop on Reasoning with Uncertainty in Robotics (RUR-05), July 2005
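The belief-collection step and the two "difference" labels can be illustrated with a minimal sketch. It is not the authors' implementation. The toy model (`T`, `O`), the uniformly random action choice, and the function `spatio_temporal_distance` with its `weight` parameter are all assumptions made for illustration; the poster does not give the actual sampling policy or distance formula. Only the Bayes belief update is the standard POMDP rule.

```python
import random

def belief_update(belief, action, obs, T, O):
    """Standard Bayes filter: b'(s') is proportional to
    O[a][s'][o] * sum_s T[a][s][s'] * b(s)."""
    n = len(belief)
    new = [O[action][s2][obs] * sum(T[action][s][s2] * belief[s] for s in range(n))
           for s2 in range(n)]
    total = sum(new)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return [p / total for p in new]

def sample_index(probs, rng):
    """Draw an index from a discrete probability distribution."""
    r, acc = rng.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1

def collect_beliefs(b0, T, O, n_steps, rng):
    """Simulate one trajectory and record (time step, belief) pairs.
    Assumption: actions are chosen uniformly at random."""
    state = sample_index(b0, rng)
    belief = list(b0)
    trajectory = [(0, belief)]
    for t in range(1, n_steps + 1):
        action = rng.randrange(len(T))
        state = sample_index(T[action][state], rng)
        obs = sample_index(O[action][state], rng)
        belief = belief_update(belief, action, obs, T, O)
        trajectory.append((t, belief))
    return trajectory

def spatio_temporal_distance(x, y, weight=0.1):
    """Hypothetical distance: L1 gap between beliefs (spatial part)
    plus a weighted gap between time steps (temporal part)."""
    (t1, b1), (t2, b2) = x, y
    spatial = sum(abs(p - q) for p, q in zip(b1, b2))
    return spatial + weight * abs(t1 - t2)

# Toy model with 2 states, 2 actions, 2 observations (made-up numbers).
# T[a][s][s'] is the transition probability, O[a][s'][o] the observation probability.
T = [[[0.9, 0.1], [0.1, 0.9]],
     [[0.5, 0.5], [0.5, 0.5]]]
O = [[[0.8, 0.2], [0.2, 0.8]],
     [[0.8, 0.2], [0.2, 0.8]]]
beliefs = collect_beliefs([0.5, 0.5], T, O, n_steps=20, rng=random.Random(0))
```

The recorded `(time step, belief)` pairs could then be passed to any clustering routine that accepts a custom distance, with each cluster compressed and solved separately as the diagram labels suggest.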