摘要 |
A parameter setting apparatus includes a memory, and a processor that executes a procedure in the memory, the procedure including, selecting and executes one of a plurality of optimization operations to optimize a control parameter of a mobile communication network in accordance with a common value function, in response to a state variable in each of a plurality of different areas in the mobile communication network, the common value function determining an action value of each optimization operation responsive to the state variable of the mobile communication network, determining a reward responsive to the state variable in each of the plurality of areas, and performing reinforcement learning to update the common value function in response to the reward determined on each area.
|