CPC G06F 18/214 (2023.01) [G06V 30/287 (2022.01)] | 20 Claims |
1. A computer system, comprising:
one or more processors; and
one or more machine-readable medium coupled to the one or more processors and storing computer program code comprising sets of instructions executable by the one or more processors to:
determine a plurality of character encodings mapping sets of numbers to a plurality of characters in a language character set, the plurality of characters composed of a plurality of graphical units, different values of numbers in the sets of numbers corresponding to a different graphical unit of the plurality of graphical units, each particular character of the plurality of characters being mapped to a particular set of the sets of numbers having values corresponding to a set of graphical units used in composing the particular character; and
train a machine learning model using the plurality of character encodings, the machine learning model configured to perform optical character recognition of the language character set.
|