A hardware/software co-compressed computing method for a SRAM Computing-in-memory-based (CIM-based) processing unit is proposed. The hardware/software co-compressed computing method for the SRAM CIM-based processing unit includes a data dividing step, a sparsity step, an address assigning step and a hardware decoding and calculating step. The data dividing step is performed to divide a plurality of kernels corresponding to an input feature map into a plurality of weight groups. The sparsity step includes a weight setting step. The weight setting step is performed to set each of the weight groups to one of a zero weight group and a non-zero weight group according to a sparsity aware computing method. The address assigning step is performed to assign a plurality of index codes to a plurality of the non-zero weight groups of the kernels, respectively. The hardware decoding and calculating step is performed to execute an inner product to the non-zero weight group and the input feature data group corresponding to the non-zero weight group, and generate an output feature data group. Thus, the hardware/software co-compressed computing method and system for the SRAM CIM-based processing unit compress the computing data volume computed by the SRAM CIM-based accelerators, and increase the computing speed. |