摘要 |
Vehicle dispatch system includes upper stage unit, lower stage unit and interface communication unit. The upper stage unit, configured to generate vehicle schedules, is communicatively connected to the interface communication unit. The lower stage unit, communicatively connected to the upper stage unit and the interface communication unit, has two storage units and a control unit. The first storage unit stores in a state representation multiple possible states having multiple possible actions. The control unit, which receives the schedule as a state representation, is configured to simulate states during an episode by selecting a state action and determining a reward value. The second storage unit stores the reward value and has a policy linked to one possible action for each state. The interface communication unit, operable to receive and transmit vehicle communications, is configured to access the policy and its associated action and communicate the action to a vehicle. |