The present invention discloses a device of an energy management system using multi-object reinforcement learning (RL), which is coupled with a renewable energy device, an energy storage device, and a plurality of energy-using appliances. According to the present invention, the device is provided with a first RL unit and a second RL unit. The first RL unit is configured for conducting a first reinforcement learning based on an electricity tariff provided by a power utility and a demand request provided by a user, so as to generate a first scheduling policy for controlling the energy-using appliances. The second RL unit is configured for conducting a second reinforcement learning based on the electricity tariff and at least one state parameter of the energy storage device, so as to generate a second scheduling policy for controlling the charging process and the discharging process of the energy storage device. Consequently, the level of the user’s satisfaction rises while the electricity cost is significantly lowered.
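
The two-unit arrangement described above can be illustrated with a minimal sketch. The abstract does not name a learning algorithm, so tabular Q-learning is swapped in here purely for illustration. The two-level tariff, the prices, the storage capacity, the discomfort penalty, the demand pattern, and all names (`QLearningUnit`, `appliance_step`, `storage_step`, `train`) are hypothetical assumptions, not details taken from the invention.

```python
import random
from collections import defaultdict

# All constants below are illustrative assumptions, not values from the source.
PRICE = {"low": 1.0, "high": 3.0}  # hypothetical two-level electricity tariff
MAX_SOC = 2                         # hypothetical storage capacity (discrete units)
DISCOMFORT = 1.5                    # hypothetical penalty per step a request waits


class QLearningUnit:
    """Tabular Q-learning agent, used here as a stand-in for an RL unit."""

    def __init__(self, actions, alpha=0.2, gamma=0.9, epsilon=0.2, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # maps (state, action) to estimated value
        self.rng = random.Random(seed)

    def greedy(self, state):
        """Return the action with the highest estimated value in this state."""
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def act(self, state):
        """Epsilon-greedy action selection."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return self.greedy(state)

    def update(self, state, action, reward, next_state):
        """Standard one-step Q-learning update."""
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def next_tariff(tariff):
    """Assumed tariff pattern: low and high hours simply alternate."""
    return "high" if tariff == "low" else "low"


def appliance_step(state, action):
    """Toy environment for the first unit: state = (tariff, request_pending)."""
    tariff, pending = state
    reward = 0.0
    if pending and action == "run":
        reward, pending = -PRICE[tariff], False  # pay for energy, request served
    elif pending:
        reward = -DISCOMFORT                     # request deferred, user waits
    upcoming = next_tariff(tariff)
    # Assumed demand pattern: a new request arrives at every high-tariff hour.
    return (upcoming, pending or upcoming == "high"), reward


def storage_step(state, action):
    """Toy environment for the second unit: state = (tariff, state_of_charge)."""
    tariff, soc = state
    reward = 0.0
    if action == "charge" and soc < MAX_SOC:
        soc, reward = soc + 1, -PRICE[tariff]    # buy energy at current tariff
    elif action == "discharge" and soc > 0:
        soc, reward = soc - 1, PRICE[tariff]     # offset a purchase at current tariff
    return (next_tariff(tariff), soc), reward


def train(unit, step, state, steps=20000):
    """Run the agent in its environment and learn online."""
    for _ in range(steps):
        action = unit.act(state)
        next_state, reward = step(state, action)
        unit.update(state, action, reward, next_state)
        state = next_state
    return unit


# First unit schedules appliances; second unit schedules charging and discharging.
first_unit = train(QLearningUnit(["run", "defer"]), appliance_step, ("low", False))
second_unit = train(QLearningUnit(["charge", "idle", "discharge"]),
                    storage_step, ("low", 0))
```

After training, `first_unit.greedy(("high", True))` and `second_unit.greedy(("low", 0))` return each unit's preferred action for a given state, which plays the role of the first and second scheduling policies in this toy setting. The two units are trained independently here for simplicity; how the actual device coordinates them is not specified in the text above.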