Title of invention: APPARATUS AND ALGORITHMIC PROCESS FOR AN ADAPTIVE NAVIGATION POLICY IN PARTIALLY OBSERVABLE ENVIRONMENTS
Abstract: An apparatus and method for automatically learning high-level navigation in partially observable environments with landmarks use the full state information available at the landmark positions to determine a navigation policy. Landmark Markov Decision Processes (MDPs) can be generated only for the parts of an environment encountered while navigating from a starting state to a goal state, thereby requiring fewer computational resources than a navigation solution that uses a fully modeled environment. An MDP policy is calculated using the SarsaLandmark algorithm, and the policy is transformed into a navigation solution based on the current position and connectivity information.
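The idea in the abstract of learning a policy over landmarks, with values created only for the parts of the environment actually encountered, can be illustrated with a minimal sketch. This is plain tabular Sarsa on a small landmark graph, not the SarsaLandmark algorithm named in the record, whose details are not given here. The graph `GRAPH`, the landmark names, the reward of -1 per move, the learning parameters, and the helper names `epsilon_greedy`, `sarsa`, and `greedy_path` are all assumptions made for illustration.

```python
import random

# Hypothetical connectivity information: landmark -> directly reachable landmarks.
GRAPH = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D"],
    "D": ["B", "C", "GOAL"],
    "GOAL": [],
}


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a neighbor at random with probability epsilon, else the best-valued one."""
    actions = GRAPH[state]
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q.get((state, a), 0.0))


def sarsa(start, goal, episodes=2000, alpha=0.1, gamma=0.95,
          epsilon=0.1, max_steps=50, seed=0):
    """Tabular Sarsa; an action is 'move to this neighboring landmark'."""
    rng = random.Random(seed)
    q = {}  # entries are written only for (landmark, action) pairs actually taken
    for _ in range(episodes):
        state = start
        action = epsilon_greedy(q, state, epsilon, rng)
        for _ in range(max_steps):
            next_state = action   # assumed deterministic transitions between landmarks
            reward = -1.0         # assumed cost of one move
            old = q.get((state, action), 0.0)
            if next_state == goal:
                # Terminal step: no bootstrapped value beyond the goal.
                q[(state, action)] = old + alpha * (reward - old)
                break
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            target = reward + gamma * q.get((next_state, next_action), 0.0)
            q[(state, action)] = old + alpha * (target - old)
            state, action = next_state, next_action
    return q


def greedy_path(q, start, goal, max_steps=20):
    """Turn the learned values into a route by following the best neighbor."""
    path = [start]
    state = start
    while state != goal and len(path) <= max_steps:
        state = max(GRAPH[state], key=lambda a: q.get((state, a), 0.0))
        path.append(state)
    return path


if __name__ == "__main__":
    q_values = sarsa("A", "GOAL")
    print(greedy_path(q_values, "A", "GOAL"))
```

In this sketch the value table stays empty for any landmark pair the agent never traverses, which loosely mirrors the abstract's point about modeling only the encountered parts of the environment. The final step, `greedy_path`, corresponds to turning the policy into a route using the current position and the connectivity information.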
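Application publication number: US2012233102(A1) | Application publication date: 2012.09.13
Application number: US201113046474 | Filing date: 2011.03.11
Applicant: JAMES MICHAEL ROBERT;TOYOTA MOTOR ENGIN. & MANUFACT. N.A.(TEMA) | Inventor: JAMES MICHAEL ROBERT
Classification: G06F15/18 | Main classification: G06F15/18
Agency: | Agent:
Main claim:
Address: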