US 7,467,086 B2
Methodology for generating enhanced demiphone acoustic models for speech recognition
Xavier Menendez-Pidal, Los Gatos, Calif. (US); Lex S. Olorenshaw, Half Moon Bay, Calif. (US); and Gustavo Hernandez Abrego, San Jose, Calif. (US)
Assigned to Sony Corporation, Tokyo (Japan); and Sony Electronics Inc., Park Ridge, N.J. (US)
Filed on Dec. 16, 2004, as Appl. No. 11/13,888.
Prior Publication US 2006/0136209 A1, Jun. 22, 2006
Int. Cl. G10L 15/28 (2006.01); G10O 15/06 (2006.01)
U.S. Cl. 704—255  [704/243; 704/244; 704/254] 14 Claims
OG exemplary drawing
 
1. A system for implementing a speech recognition engine, comprising:
demiphone acoustic models that said speech recognition engine utilizes to perform speech recognition procedures, said demiphone acoustic models each having three states that collectively form a preceding demiphone and a succeeding demiphone; and
an acoustic model generator that analyzes speech context information to configure each of said demiphone acoustic models as either a succeeding-dominant demiphone acoustic model or a preceding-dominant dominant demiphone acoustic model, a contextual dominance for each demiphone state from a given one of said demiphone acoustic models being determined by analyzing predominant contextual information in a triphone decision tree corresponding to said each demiphone state.