US 7,580,834 B2
Fixed sound source vector generation method and fixed sound source codebook
Hiroyuki Ehara, Yokohama (Japan); Kazutoshi Yasunaga, Kyoto (Japan); Kazunori Mano, Nerima-ku (Japan); and Yusuke Hiwasaki, Higashiyamato (Japan)
Assigned to Panasonic Corporation, Osaka (Japan); and Nippon Telegraph and Telephone Corporation, Tokyo (Japan)
Appl. No. 10/505,100
PCT Filed Feb. 20, 2003, PCT No. PCT/JP03/01882
§ 371(c)(1), (2), (4) Date Aug. 19, 2004,
PCT Pub. No. WO03/071522, PCT Pub. Date Aug. 28, 2003.
Claims priority of application No. 2002-043878 (JP), filed on Feb. 20, 2002.
Prior Publication US 2005/0228652 A1, Oct. 13, 2005
Int. Cl. G10L 19/00 (2006.01)
U.S. Cl. 704—221 2 Claims
OG exemplary drawing
 
1. A CELP type speech decoder that receives an excitation gain code, an adaptive excitation vector code, and a fixed excitation vector code associated with encoded speech transmitted from a CELP type speech encoder and decodes the encoded speech, said CELP type speech decoder comprising:
a quantized gain generating section that receives the excitation gain code from the CELP type speech encoder and decodes an adaptive excitation vector gain and a fixed excitation vector gain specified by the excitation gain code;
an adaptive excitation codebook that receives the adaptive excitation vector code from the CELP type speech encoder and takes one frame of samples as an adaptive excitation vector from past excitation signal samples specified by the adaptive excitation vector code;
a fixed excitation codebook that receives the fixed excitation vector code from the CELP type speech encoder and generates a fixed excitation vector specified by the fixed excitation vector code;
an excitation vector generating section that generates an excitation vector by adding a vector obtained by multiplying the adaptive excitation vector gain and the adaptive excitation vector, and a vector obtained by multiplying the fixed excitation vector gain and the fixed excitation vector;
a high-frequency emphasis section that performs high-frequency emphasis processing on the excitation vector generated by the excitation vector generating section; and
a synthesis filter that performs filter synthesis of the excitation vector output from the high-frequency emphasis section employing a set of filter coefficients to output decoded speech data,
wherein said fixed excitation codebook comprises:
a comparing section that compares the shape of a pulse excitation vector with predetermined shapes to determine a predetermined shape which matches the shape of said pulse excitation vector;
a storing section that stores sets of dispersion vectors that are designed exclusively for each of said predetermined shapes;
a selecting section that selects a set of said dispersion vectors that are associated with the predetermined shape which matches the shape of said pulse excitation vector; and
a convolving section that convolves said pulse excitation vector with one of the dispersion vectors in the selected set to obtain the fixed excitation vector.