information

Type
Ensemble de conférences, symposium, congrès
performance location
Ircam, Salle Igor-Stravinsky (Paris)
duration
45 min
date
March 23, 2022

This session will present recent results of the Analysis/Synthesis team. The session will start with a short presentation of the new features of the Version 1.3.0 of the Ircam Singing Synthesis Software ISiS and will continue with more prospective results demonstrating the use of Deep Neural Networks for singing and spoken voice manipulation using the mel spectrogram as a parametric speech represetnation.

ISiS Version 1.3.0 (Guillaume Doras)
Accessing and manipulating the intermediate singing voice parameters representation.

Neural Vocoder (Axel Roebel)
Multi-band Excited WaveNet vocoder for resynthesis of spoken and singing voice from mel spectrograms.

Pitch Manipulation (Frederik Bous)
A deep auto-encoder with bottleneck that disentangles the F0 from the mel spectrogram.

Manipulation of perceived speech attitude (Clément Le Moine Veillon)
A deep neural network for manipulating the perceived attitude in speech.

speakers


share


Do you notice a mistake?

IRCAM

1, place Igor-Stravinsky
75004 Paris
+33 1 44 78 48 43

opening times

Monday through Friday 9:30am-7pm
Closed Saturday and Sunday

subway access

Hôtel de Ville, Rambuteau, Châtelet, Les Halles

Institut de Recherche et de Coordination Acoustique/Musique

Copyright © 2022 Ircam. All rights reserved.