National Institute of Advanced Industrial Science and Technology (AIST)
Research resultsPublications > AIST TODAY > 2010-3 No.37
AIST TODAYNo.37 2010-3 [ PDF:4.5MB ]


Singing synthesis technology by mimicking user's singing
- VocaListener: Synthesis of more natural singing by mimicking pitch and dynamics of a user's singing voice -

[ PDF:472KB ]
Tomoyasu Nakano
e-mail address
Masataka Goto
e-mail address
Information Technology Research Institute

We have developed a singing synthesis system named VocaListener that automatically estimates parameters (pitch and dynamics) for singing synthesis by mimicking a user's singing voice with the help of song lyrics. Since a natural voice is provided by the user, the synthesized singing voice mimicking it can be human-like and natural without time-consuming manual adjustments.

VocaListener iteratively estimates singing synthesis parameters so that the synthesized singing can become more similar to the user's singing in terms of pitch and dynamics. The iterative estimation provides robustness with respect to different singing synthesis systems and their singer databases. Moreover, VocaListener has a highly accurate lyrics-to-singing synchronization function, and we also provide an interface that lets a user easily correct synchronization errors just by pointing them out. In addition, VocaListener also has a function to improve synthesized singing as if the user's singing skills were improved.

Demonstration videos including examples of synthesized singing are available at http://staff.aist.go.jp/t.nakano/VocaListener/.

Figure
Overview of VocaListener that automatically estimates parameters for singing synthesis from user's singing voice and its song lyrics

Relational Information
AIST TODAY Vol.10, No.6, p.12 (2010)


 back