US 7,523,123 B2
Map-reduce with merge to process multiple relational datasets
Hung-Chih Yang, Sunnyvale, Calif. (US); Ali Dasdan, San Jose, Calif. (US); and Ruey-Lung Hsiao, Los Angeles, Calif. (US)
Assigned to Yahoo! Inc., Sunnyvale, Calif. (US)
Filed on Nov. 16, 2006, as Appl. No. 11/560,523.
Prior Publication US 2008/0120314 A1, May 22, 2008
Int. Cl. G06F 17/00 (2006.01)
U.S. Cl. 707—101  [707/102; 707/104.1] 55 Claims
OG exemplary drawing
 
1. A method of processing data and data relationships of at least two datasets, comprising:
for the data of each one of the datasets,
mapping the data of that dataset to corresponding intermediate data for that dataset; and
reducing the intermediate data for that dataset to a set of reduced intermediate data for that dataset; and
merging data corresponding to the sets of reduced intermediate data, in accordance with a merge condition.