US 11,755,549 B2
Method, electronic device, and computer program product for data processing
Tao Chen, Beijing (CN); and Geng Han, Beijing (CN)
Assigned to EMC IP HOLDING COMPANY LLC, Hopkinton, MA (US)
Filed by EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed on Feb. 24, 2021, as Appl. No. 17/183,736.
Claims priority of application No. 202110090766.3 (CN), filed on Jan. 22, 2021.
Prior Publication US 2022/0237158 A1, Jul. 28, 2022
Int. Cl. G06F 16/215 (2019.01); H03M 7/40 (2006.01)
CPC G06F 16/215 (2019.01) [H03M 7/40 (2013.01)]
18 Claims
 
1. A method for data processing, including:
determining, based on sizes of multiple data segments included in data to be processed, a first time required to perform a matching operation for each data segment, wherein the matching operation is used to determine non-duplicate data segments;
determining, based on the size of each data segment and a compression level for the data to be processed, a second time required to perform a compression operation for each data segment, wherein a different compression level corresponds to a different compression ratio, different storage space that compressed data requires, and different time requirements to process;
determining, based on the first time, the second time, and a de-duplication rate for the data to be processed, a target mode for processing the multiple data segments from a first mode and a second mode,
wherein in the first mode, a compression operation is performed only on the non-duplicate data segments in the multiple data segments based on a time required to process the data using the first mode, the first time, the second time, and the de-duplication rate of the data to be processed, and
wherein in the second mode, a compression operation is performed on each of the multiple data segments; and
performing the matching operation and the compression operation based on the target mode.
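
The mode selection recited in claim 1 can be illustrated with a minimal sketch. The claim does not give formulas, so everything numeric here is an assumption: the per-byte cost constants are hypothetical, the times are modeled as linear in segment size, the first mode is assumed to run matching and then compress only the non-duplicate fraction, and the second mode is assumed to run matching and compression concurrently. The function names are illustrative and do not come from the patent.

```python
# Hypothetical per-byte costs in seconds. The claim only says the times depend
# on segment size and compression level; the linear model is an assumption.
MATCH_SECONDS_PER_BYTE = 2e-9
COMPRESS_SECONDS_PER_BYTE = {1: 4e-9, 5: 1e-8, 9: 3e-8}  # keyed by compression level


def first_time(segment_sizes):
    """Estimated total time of the matching operation over all segments."""
    return sum(size * MATCH_SECONDS_PER_BYTE for size in segment_sizes)


def second_time(segment_sizes, level):
    """Estimated total time of the compression operation at the given level."""
    per_byte = COMPRESS_SECONDS_PER_BYTE[level]
    return sum(size * per_byte for size in segment_sizes)


def choose_mode(segment_sizes, level, dedup_rate):
    """Pick the mode with the smaller estimated processing time.

    Assumed cost model (not stated in the claim):
      first mode  = matching, then compression of non-duplicates only
      second mode = matching and compression of every segment, run concurrently
    """
    t_match = first_time(segment_sizes)
    t_compress = second_time(segment_sizes, level)
    t_first_mode = t_match + (1.0 - dedup_rate) * t_compress
    t_second_mode = max(t_match, t_compress)
    return "first" if t_first_mode <= t_second_mode else "second"


if __name__ == "__main__":
    sizes = [4096] * 1000
    # Highly redundant data at a slow compression level favors the first mode.
    print(choose_mode(sizes, level=9, dedup_rate=0.9))
    # Data with no duplicates at a fast level favors the second mode.
    print(choose_mode(sizes, level=1, dedup_rate=0.0))
```

Under these assumptions, a high de-duplication rate or a slow compression level makes skipping compression of duplicates worthwhile, while a low de-duplication rate makes compressing every segment alongside matching the cheaper choice.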