CPC G06T 13/205 (2013.01) [G06T 13/40 (2013.01); G10L 13/02 (2013.01); G10L 13/08 (2013.01); G10L 15/187 (2013.01)] | 18 Claims |
1. An end-to-end virtual object animation generation method, comprising:
receiving input information, wherein the input information comprises text information or audio information of a virtual object animation to be generated;
converting the input information into a pronunciation unit sequence;
performing a feature analysis of the pronunciation unit sequence to obtain a corresponding linguistic feature sequence; and
inputting the linguistic feature sequence into a preset timing mapping model to generate the virtual object animation based on the linguistic feature sequence;
wherein performing the feature analysis of the pronunciation unit sequence to obtain the corresponding linguistic feature sequence comprises:
performing a feature analysis of each pronunciation unit in the pronunciation unit sequence to obtain a linguistic feature of each pronunciation unit; and
generating the corresponding linguistic feature sequence based on the linguistic feature of each pronunciation unit; and
wherein performing the feature analysis of each pronunciation unit in the pronunciation unit sequence to obtain the linguistic feature of each pronunciation unit comprises:
analyzing a pronunciation feature of each pronunciation unit to obtain an independent linguistic feature of the pronunciation unit;
analyzing a pronunciation feature of a pronunciation unit adjacent to each pronunciation unit to obtain an adjacent linguistic feature of the pronunciation unit; and
generating the linguistic feature based on the independent linguistic feature and the adjacent linguistic feature.
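The claimed pipeline can be sketched as follows. This is a minimal, hypothetical illustration only: the lexicon, feature table, and `timing_mapping_model` stand-in below are assumptions introduced for clarity, not the patent's actual model. It shows the claimed flow of converting input text into a pronunciation unit sequence, forming each unit's linguistic feature from its own (independent) pronunciation feature plus the (adjacent) features of its neighbors, and feeding the resulting sequence into a timing mapping model that emits animation frames.

```python
# Hypothetical sketch of the claimed method; all names and values are
# illustrative assumptions, not the patented implementation.
from typing import List

# Toy lexicon: maps a word to its pronunciation unit sequence (phonemes).
LEXICON = {"hello": ["HH", "AH", "L", "OW"], "world": ["W", "ER", "L", "D"]}

# Toy pronunciation-feature table (e.g. manner/voicing per unit).
FEATURES = {
    "HH": [1.0, 0.0], "AH": [0.0, 1.0], "L": [0.5, 1.0], "OW": [0.0, 1.0],
    "W": [0.5, 1.0], "ER": [0.0, 1.0], "D": [1.0, 1.0],
}
PAD = [0.0, 0.0]  # feature vector used when a unit has no neighbor


def to_pronunciation_units(text: str) -> List[str]:
    """Convert input text information into a pronunciation unit sequence."""
    units: List[str] = []
    for word in text.lower().split():
        units.extend(LEXICON.get(word, []))
    return units


def linguistic_feature_sequence(units: List[str]) -> List[List[float]]:
    """For each unit, concatenate its independent linguistic feature with
    the adjacent linguistic features of its left and right neighbors."""
    seq = []
    for i, unit in enumerate(units):
        independent = FEATURES[unit]
        left = FEATURES[units[i - 1]] if i > 0 else PAD
        right = FEATURES[units[i + 1]] if i < len(units) - 1 else PAD
        seq.append(left + independent + right)
    return seq


def timing_mapping_model(feature_seq: List[List[float]]) -> List[dict]:
    """Stand-in for the preset timing mapping model: maps each linguistic
    feature vector to one animation frame (here, a dummy mouth parameter)."""
    return [{"frame": i, "mouth_open": sum(f) / len(f)}
            for i, f in enumerate(feature_seq)]


units = to_pronunciation_units("hello world")
features = linguistic_feature_sequence(units)
animation = timing_mapping_model(features)
```

In practice the timing mapping model would be a learned sequence-to-sequence network rather than the per-frame stand-in above; the sketch only fixes the data flow the claim recites.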