US 7,552,052 B2
Voice synthesis apparatus and method
Hideki Kemmochi, Hamamatsu (Japan)
Assigned to Yamaha Corporation, (Japan)
Filed on Jul. 13, 2005, as Appl. No. 11/180,108.
Claims priority of application No. 2004-209033 (JP), filed on Jul. 15, 2004.
Prior Publication US 2006/0015344 A1, Jan. 19, 2006
Int. Cl. G10L 13/02 (2006.01)
U.S. Cl. 704—258  [704/260; 704/265; 704/267; 704/268] 9 Claims
OG exemplary drawing
 
1. A voice synthesis apparatus comprising:
a voice segment acquisition section that acquires a voice segment including one or more phonemes;
a boundary designation section that designates a boundary intermediate between start and end positions of a vowel phoneme included in the voice segment acquired by the voice segment acquisition section,
wherein when the acquired voice segment where a region including an end point is a vowel phoneme, the boundary designation section designates, as the boundary, a time point earlier than a stationary point, which is a boundary point between a region where a waveform amplitude of the voice segment is substantially constant and a region where the waveform amplitude of the voice segment varies, and
wherein when the acquired voice segment where a region including a start point is a vowel phoneme, the boundary designation section designates, as the boundary, a time point later than the stationary point; and
a voice synthesis section that synthesizes a voice based on a region of the vowel phoneme that precedes the designated boundary of the vowel phoneme, or a region of the vowel phoneme that succeeds the designated boundary of the vowel phoneme,
wherein the start point and the end point of the vowel phoneme and the designated boundary of the vowel phoneme are time points on a time axis of the acquired voice segment,
wherein when the acquired voice segment where the region including the end point is a vowel phoneme, the voice synthesis section synthesizes the voice based on the region of the voice segment preceding the boundary designated by the boundary designation section, and
wherein when the acquires voice segment where the region including the start point is a vowel phoneme, the voice synthesis section synthesizes the voice based on the region of the voice segment succeeding the boundary designated by the boundary designation section.