US 9,811,584 B2
Information retrieval system, method, and program
Noritaka Adachi, Kanagawa (JP); Hiroshi Kurokawa, Tokyo (JP); Kensuke Matsuoka, Chiba (JP); and Yosuke Murakami, Kanagawa (JP)
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US)
Appl. No. 14/235,151
Filed by Noritaka Adachi, Kanagawa (JP); Hiroshi Kurokawa, Tokyo (JP); Kensuke Matsuoka, Chiba (JP); and Yosuke Murakami, Kanagawa (JP)
PCT Filed May 1, 2012, PCT No. PCT/JP2012/061526
§ 371(c)(1), (2), (4) Date May 19, 2014,
PCT Pub. No. WO2013/021696, PCT Pub. Date Feb. 14, 2013.
Claims priority of application No. 2011-171639 (JP), filed on Aug. 5, 2011.
Prior Publication US 2014/0250118 A1, Sep. 4, 2014
Int. Cl. G06F 17/30 (2006.01)
CPC G06F 17/30648 (2013.01) [G06F 17/30386 (2013.01); G06F 17/30675 (2013.01)]
18 Claims
 
1. An information retrieval system having a storage device, a display device and a processor, wherein the processor is configured to:
receive from a user a search query including a plurality of keywords;
calculate a relevance to a plurality of documents on the basis of the plurality of keywords and an influence set for each keyword (t), wherein the relevance is calculated based on:
a coefficient (coord(q,d)) determined by the number of keywords of the search query (q) that appear in a document (d);
a coefficient (tf(t,d)) determined by a frequency of a keyword (t) appearing in the document (d);
a coefficient (idf(t)) determined by a reciprocal of a proportion of documents containing the keyword t;
a weight (boost(t)) of the keyword t;
a coefficient (norm(t,d)) indicative of a weight set when a search index was created;
a coefficient (date(d)) determined by a date of the document d; and
a weight for the date (dateBoost); and
wherein the relevance (score) is calculated by:
Score(q,d) = coord(q,d) × queryNorm(q) × (Σ tf(t,d) × (idf(t))² × boost(t) × norm(t,d) + date(d) × dateBoost);
display on the display device documents in the order of relevance;
display on the display device the influence set for each keyword;
receive changes to the displayed influence set from the user; and
recalculate the relevance on the basis of a change to the influence set and display on the display device documents in the order of relevance.
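
The scoring and re-ranking steps recited above can be illustrated with a minimal Python sketch. It is not the claimed implementation. The concrete forms chosen for tf (square root of the term count), idf (smoothed logarithm of the inverse document proportion), norm (inverse square root of document length) and queryNorm are assumptions borrowed from common TF-IDF practice, as is reading the date term as added once per document outside the sum. The names `Doc`, `recency`, `score` and `rank` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Doc:
    name: str
    terms: list      # tokenized document body
    recency: float   # date(d); assumed to be pre-scaled to [0, 1]


def idf(term, docs):
    # Assumed form: smoothed log of the reciprocal of the document proportion.
    df = sum(1 for d in docs if term in d.terms)
    return 1.0 + math.log(len(docs) / (df + 1))


def score(query, doc, docs, boost, date_boost):
    matched = [t for t in query if t in doc.terms]
    coord = len(matched) / len(query)            # coord(q,d)
    sum_sq = sum((idf(t, docs) * boost[t]) ** 2 for t in query)
    query_norm = 1.0 / math.sqrt(sum_sq) if sum_sq > 0 else 1.0
    norm = 1.0 / math.sqrt(len(doc.terms))       # assumed length normalization
    text = sum(
        math.sqrt(doc.terms.count(t)) * idf(t, docs) ** 2 * boost[t] * norm
        for t in matched
    )
    # Date term added once per document (one reading of the formula).
    return coord * query_norm * (text + doc.recency * date_boost)


def rank(query, docs, boost, date_boost=0.0):
    # Documents in descending order of relevance.
    ordered = sorted(
        docs, key=lambda d: score(query, d, docs, boost, date_boost), reverse=True
    )
    return [d.name for d in ordered]


docs = [
    Doc("A", ["lucene", "lucene", "index"], recency=0.1),
    Doc("B", ["patent", "patent", "claim"], recency=0.9),
]
query = ["lucene", "patent"]

# Initial influence per keyword, then a user change followed by recalculation.
print(rank(query, docs, {"lucene": 2.0, "patent": 1.0}))   # ['A', 'B']
print(rank(query, docs, {"lucene": 1.0, "patent": 3.0}))   # ['B', 'A']
```

In this sketch the per-keyword influence is the `boost` mapping, so changing one value and calling `rank` again corresponds to the recalculation step. Raising `date_boost` similarly shifts the order toward documents with higher `recency`.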