US 9,811,604 B2
Method and system for defining an extension taxonomy
Itzhak Gleicher, Thornhill (CA); Yi Jian Ren, Scarborough (CA); and Christopher Phillip St. John, Toronto (CA)
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US)
Filed by Itzhak Gleicher, Thornhill (CA); Yi Jian Ren, Scarborough (CA); and Christopher Phillip St. John, Toronto (CA)
Filed on Jul. 9, 2010, as Appl. No. 12/833,370.
Prior Publication US 2012/0011118 A1, Jan. 12, 2012
Int. Cl. G06F 17/00 (2006.01); G06F 17/30 (2006.01); G06F 17/22 (2006.01); G06Q 10/10 (2012.01); G06Q 40/02 (2012.01)
CPC G06F 17/30908 (2013.01) [G06F 17/2235 (2013.01); G06Q 10/10 (2013.01); G06Q 40/02 (2013.01)]
34 Claims
 
1. A computer system comprising:
at least one processor to analyze a portion of a source document selected by a user and determine an association between an element of a taxonomy and each location of a set of locations present within the selected portion of the source document, the at least one processor including:
a selection module to analyze the selected portion of the source document and identify the set of locations present in the selected portion of the source document based on a location map included with said source document that indicates each location within the source document;
an extension module to process each location of the identified set of locations in the selected portion of the source document and configured to:
retrieve a location from the identified set of locations, content of said retrieved location, and metadata associated with said retrieved location;
determine the presence of a pre-existing element of the taxonomy associated with the retrieved location based on at least one of said content of said retrieved location and said metadata associated with said retrieved location;
in response to determining that said taxonomy lacks said pre-existing taxonomy element, generate a new taxonomy element, determine and assign to said new taxonomy element a plurality of corresponding attributes based on an analysis of at least one of said content of said retrieved location and said metadata associated with said retrieved location, and link said new taxonomy element to said retrieved location of said source document; and
in response to determining that a pre-existing taxonomy element associated with a corresponding retrieved location is corrupt, generate an associated new taxonomy element, determine and assign to said associated new taxonomy element a plurality of corresponding attributes based on an analysis of at least one of content of the corresponding retrieved location and metadata associated with the corresponding retrieved location, and link said associated new taxonomy element to the corresponding retrieved location;
a connection module to link said associated pre-existing taxonomy element to said retrieved location in response to said extension module determining said presence of said pre-existing taxonomy element;
a relationship module to define one or more relationship links between taxonomy elements; and
a validation module to validate each of the relationship links.
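
The control flow recited in claim 1 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the claim does not specify data structures, matching rules, or validation criteria, so every name here (Location, TaxonomyElement, select_locations, derive_attributes, extend_taxonomy, validate_links) is hypothetical. The sketch assumes the location map is a dict from character offset to location, that a pre-existing element is found by a key taken from the metadata or, failing that, from the content, that corruption is signalled by a boolean flag, and that a relationship link is valid when it joins two distinct elements present in the taxonomy.

```python
from dataclasses import dataclass, field


@dataclass
class Location:
    loc_id: str
    content: str
    metadata: dict


@dataclass
class TaxonomyElement:
    name: str
    attributes: dict
    corrupt: bool = False                      # assumed corruption flag
    locations: list = field(default_factory=list)


def select_locations(location_map, start, end):
    """Selection module: return mapped locations whose offset lies in [start, end)."""
    return [loc for offset, loc in sorted(location_map.items(), key=lambda kv: kv[0])
            if start <= offset < end]


def derive_attributes(loc):
    """Assumed analysis: derive attributes from the content and metadata."""
    return {"label": loc.content.strip(),
            "data_type": loc.metadata.get("type", "string")}


def extend_taxonomy(taxonomy, locations):
    """Process each selected location against a taxonomy dict keyed by element name."""
    for loc in locations:
        # Assumed lookup key: metadata first, then normalized content.
        key = loc.metadata.get("concept") or loc.content.strip().lower()
        existing = taxonomy.get(key)
        if existing is not None and not existing.corrupt:
            # Connection module: link the pre-existing element to the location.
            existing.locations.append(loc.loc_id)
            continue
        # Extension module: the element is missing or corrupt, so generate a
        # new one, assign attributes, and link it to the location. Replacing
        # the corrupt entry under the same key is a choice made by this sketch.
        element = TaxonomyElement(name=key, attributes=derive_attributes(loc))
        element.locations.append(loc.loc_id)
        taxonomy[key] = element
    return taxonomy


def validate_links(taxonomy, links):
    """Validation module (assumed rule): both ends exist and are distinct."""
    return {(a, b): (a in taxonomy and b in taxonomy and a != b)
            for a, b in links}
```

In this sketch the relationship links are plain (parent, child) name pairs supplied by the caller; the claim leaves open how the relationship module defines them.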