US 11,810,381 B2
Automatic rule prediction and generation for document classification and validation
Jayanth Gangadhar, Karnataka (IN); Karthick Ramanujam, Chennai (IN); Vivek Venkatanarasaiah, Bangalore (IN); Ullas M. Basavaraj, Bangalore (IN); and Ankur Bharatkumar Shah, Surat (IN)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US)
Filed on Jun. 10, 2021, as Appl. No. 17/303,914.
Prior Publication US 2022/0398397 A1, Dec. 15, 2022
Int. Cl. G06F 18/21 (2023.01); G06F 18/24 (2023.01); G06V 30/19 (2022.01); G06V 10/764 (2022.01); G06V 30/413 (2022.01)
CPC G06V 30/413 (2022.01) [G06F 18/2178 (2023.01); G06F 18/24765 (2023.01); G06V 10/765 (2022.01); G06V 30/19153 (2022.01)]
20 Claims
 
1. A computer-implemented method comprising:
in response to electronically receiving a document, automatically classifying the document, wherein automatically classifying the document comprises electronically identifying a document type associated with the document;
based on the document type associated with the document, automatically classifying different parts of the document and electronically tagging data associated with the different parts of the document based on one or more classification rules pertaining to the identified document type and identified data type in the document, wherein automatically classifying the document and automatically classifying the different parts of the document further comprises automatically generating and applying one or more classification rule suggestions for classifying the document and the different parts of the document;
in response to detecting first feedback associated with the one or more classification rule suggestions, updating the one or more classification rules corresponding to the one or more classification rule suggestions based on the first feedback, and applying the updated one or more classification rules;
automatically extracting the tagged data associated with the automatically classified document based on one or more data extraction rules associated with the identified document type and the identified data type, wherein automatically extracting the tagged data associated with the automatically classified document based on one or more data extraction rules further comprises automatically generating and applying one or more data extraction rule suggestions for extracting the tagged data;
in response to detecting second feedback associated with the one or more data extraction rule suggestions, updating the one or more data extraction rules corresponding to the one or more data extraction rule suggestions based on the second feedback, and applying the updated one or more data extraction rules; and
automatically generating, updating, and applying validation rules based on the identified document type, the detected first feedback, and the detected second feedback to validate the automatically classified document and the automatically tagged and extracted data, wherein automatically generating and applying the validation rules further comprises automatically generating and applying one or more validation rule suggestions for validating the automatically classified document and the automatically tagged and extracted data.
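
For illustration only, the flow recited in claim 1 (classify, extract, update a suggested rule from feedback, validate) might be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the claim does not specify how rules are represented, so the regex-based `Rule` and `RuleSet` types, the function names, and the sample invoice text are all hypothetical, and the automatic generation of rule suggestions is reduced here to hand-written starting rules.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Rule:
    """Hypothetical rule: a name, a regex pattern, and an approval flag."""
    name: str
    pattern: str
    approved: bool = False  # a suggestion stays unapproved until feedback arrives


@dataclass
class RuleSet:
    """Hypothetical container for the three rule families named in the claim."""
    classification: dict = field(default_factory=dict)  # document type -> Rule
    extraction: dict = field(default_factory=dict)      # field name -> Rule
    validation: dict = field(default_factory=dict)      # field name -> Rule


def classify_document(text, rules):
    """Return the first document type whose classification rule matches."""
    for doc_type, rule in rules.classification.items():
        if re.search(rule.pattern, text, re.IGNORECASE):
            return doc_type
    return "unknown"


def extract_fields(text, rules):
    """Apply each extraction rule and keep its first capture group."""
    extracted = {}
    for name, rule in rules.extraction.items():
        match = re.search(rule.pattern, text)
        if match:
            extracted[name] = match.group(1)
    return extracted


def apply_feedback(rule, accepted, corrected_pattern=None):
    """Update a suggested rule from feedback: accept it, reject it, or correct it."""
    if corrected_pattern is not None:
        rule.pattern = corrected_pattern
    rule.approved = accepted or corrected_pattern is not None
    return rule


def validate(fields, rules):
    """Check each extracted value against the validation rule for its field."""
    return {
        name: bool(re.fullmatch(rule.pattern, fields.get(name, "")))
        for name, rule in rules.validation.items()
    }


# Assumed starting rules and sample document (both invented for this sketch).
rules = RuleSet(
    classification={"invoice": Rule("is_invoice", r"\binvoice\b")},
    extraction={"total": Rule("total", r"Total:\s*(\S+)")},
    validation={"total": Rule("total_format", r"\d+\.\d{2}")},
)
text = "INVOICE #17\nTotal: $120.50"

doc_type = classify_document(text, rules)
first_pass = validate(extract_fields(text, rules), rules)

# Feedback on the extraction suggestion: the reviewer supplies a corrected pattern.
apply_feedback(rules.extraction["total"], accepted=False,
               corrected_pattern=r"Total:\s*\$?(\d+\.\d{2})")
second_fields = extract_fields(text, rules)
second_pass = validate(second_fields, rules)
```

In this sketch the first extraction pass captures the currency symbol and fails validation. After the corrected pattern is applied as feedback, the same document yields a value that passes. A real system would also derive the validation rules from the document type and the accumulated feedback, which the sketch does not attempt.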