A distributed warehouse management system and a distributed warehouse management method are provided. In the method, a central server receives orders and defines system states by using factory information of a warehouse, order information of the orders, and item information of shelves in the warehouse, so as to establish a reinforcement learning agent. The central server arranges an autonomous mobile robot (AMR) to perform a handling action on items in the orders, and calculates rewards according to an order completion time and a pickup volume so as to train the reinforcement learning agent. Upon receiving a current order, the central server uses the reinforcement learning agent to assign, under a current system state, the AMR suitable for handling the current order and the handling action to be performed by that AMR. The AMR calculates a travel path from its own position to a target position of the handling action by using a path planning algorithm, and executes the handling action.
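By way of a non-limiting illustration, the division of work described above can be sketched in Python: a central agent that assigns an AMR and a handling action for a given system state and learns from a reward based on order completion time and pickup volume, and an AMR-side routine that plans its own travel path. The abstract does not name a specific learning algorithm, reward formula, or path planning algorithm, so the tabular Q-learning update, the weighted linear reward, and the breadth-first search on a grid used here are assumptions chosen for brevity. All names (`reward`, `AssignmentAgent`, `plan_path`, the weights, and the grid encoding) are hypothetical.

```python
import random
from collections import defaultdict, deque


def reward(completion_time, pickup_volume, time_weight=1.0, volume_weight=1.0):
    """Assumed reward: larger pickup volume and shorter completion time score higher."""
    return volume_weight * pickup_volume - time_weight * completion_time


class AssignmentAgent:
    """Central-server side: tabular Q-learning (an assumed algorithm).

    A state is any hashable summary of factory, order, and shelf item
    information; an action is an (amr_id, handling_action) pair.
    """

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)      # Q-values default to 0.0
        self.rng = random.Random(seed)   # seeded for reproducible exploration

    def assign(self, state, explore=True):
        """Pick an (AMR, handling action) pair, epsilon-greedy over Q-values."""
        if explore and self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, r, next_state):
        """One Q-learning step toward r + gamma * max_a Q(next_state, a)."""
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = r + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])


def plan_path(grid, start, goal):
    """AMR side: breadth-first search on a 4-connected grid (assumed planner).

    grid[r][c] == 1 marks a blocked cell (for example a shelf), 0 a free cell.
    Returns the list of cells from start to goal, or None if unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    parent = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:      # walk parents back to the start
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if (0 <= nr < rows and 0 <= nc < cols
                    and grid[nr][nc] == 0 and nxt not in parent):
                parent[nxt] = cell
                queue.append(nxt)
    return None
```

In such a sketch, the central server would call `assign` when a current order arrives, the selected AMR would call `plan_path` from its own position to the target position of the handling action, and once the order completes the server would compute `reward` and call `update`. A practical system would likely replace the Q-table with a function approximator and the grid search with a planner suited to the actual warehouse layout.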