CPC G06F 40/30 (2020.01) [G06F 40/279 (2020.01)] | 22 Claims |
13. A method comprising:
identifying terminology that is the same for all subdomains in a domain to define a common terminology;
subtracting the common terminology from a language model to identify subdomain-specific terminology for each subdomain;
calculating a priori appearance probability of a series of words using the subdomain terminology for each subdomain; and
automatically determining similarity of meanings of terms across subdomains by identifying terms with the similar appearance probability within a series of words.
|