| US 7,472,065 B2 | ||
| Generating paralinguistic phenomena via markup in text-to-speech synthesis | ||
| Andrew S. Aaron, Ardsley, N.Y. (US); Raimo Bakis, Briarcliff Manor, N.Y. (US); Ellen M. Eide, Bedford Hills, N.Y. (US); and Wael Hamza, Tarrytown, N.Y. (US) | ||
| Assigned to International Business Machines Corporation, Armonk, N.Y. (US) | ||
| Filed on Jun. 04, 2004, as Appl. No. 10/861,055. | ||
| Prior Publication US 2005/0273338 A1, Dec. 08, 2005 | ||
| Int. Cl. G10L 13/00 (2006.01); G10L 13/08 (2006.01) | ||
| U.S. Cl. 704—258 [704/260; 704/266] | 25 Claims |

| 1. A method of converting marked-up text into a synthesized stream, comprising:
providing marked-up text to a processor-based system;
converting the marked-up text into a text stream comprising a plurality of vocabulary items;
retrieving a plurality audio segments corresponding to the plurality of vocabulary items;
concatenating the plurality of audio segments to form a synthesized stream; and
audibly outputting the synthesized stream;
wherein the marked-up text comprises a normal text and a paralinguistic text;
wherein the normal text is differentiated from the paralinguistic text by using a grammar constraint; and
wherein the paralinguistic text is associated with more than one audio segment, wherein the retrieving of the plurality audio
segments comprises selecting one audio segment associated with the paralinguistic text.
|