US 9,811,424 B2
Optimizing restoration of deduplicated data
Kavita Chavda, Roswell, GA (US); Nagapramod S. Mandagere, San Jose, CA (US); Ramani R. Routray, San Jose, CA (US); and Pin Zhou, San Jose, CA (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Mar. 18, 2016, as Appl. No. 15/73,703.
Application 15/073,703 is a continuation of application No. 14/334,814, filed on Jul. 18, 2014, granted, now 9,329,942.
Application 14/334,814 is a continuation of application No. 13/610,929, filed on Sep. 12, 2012, granted, now 8,849,851, issued on Sep. 30, 2014.
Prior Publication US 2016/0203058 A1, Jul. 14, 2016
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 17/30 (2006.01); G06F 11/14 (2006.01)
CPC G06F 11/1458 (2013.01) [G06F 11/1453 (2013.01); G06F 11/1469 (2013.01); G06F 17/3015 (2013.01); G06F 17/30194 (2013.01); G06F 17/30958 (2013.01); G06F 2201/84 (2013.01)] 12 Claims
OG exemplary drawing
 
1. A method for restoring deduplicated data, the method comprising the steps of:
a first computing device identifying a plurality of data retrieval requests for servicing with a plurality of unique data chunks, wherein each data retrieval request is serviced utilizing at least one of the plurality of unique data chunks, and wherein at least one of the plurality of unique data chunks is utilized for the servicing of two or more of the data retrieval requests;
the first computing device determining a servicing sequence of the plurality of data retrieval requests such that the two or more of the data retrieval requests that are serviced utilizing the at least one of the plurality of unique data chunks are serviced consecutively;
the first computing device servicing the plurality of data retrieval requests according to the servicing sequence; and
restoring deduplicated data using the plurality of unique data chunks;
wherein the determining a servicing sequence of the plurality of data retrieval requests further includes the step of:
the first computing device mapping the plurality of data retrieval requests into a relationship graph, wherein each node of the relationship graph corresponds to a data retrieval request, and wherein each edge of the relationship graph has an edge weight associated with a number of unique data chunks shared between a pair of data retrieval requests connected by the edge.