information

Type
Soutenance de thèse/HDR
performance location
Ircam, Salle Igor-Stravinsky (Paris)
duration
01 h 33 min
date
July 15, 2015

La soutenance de thèse se fera devant un jury composé de :

Prof. Josh McDermott - Lab. for Computational Audition, MIT (Rapporteur, via Skype)
Prof. Shlomo Dubnov - Dep. of Music, UCSD - (Rapporteur, via Skype)
Prof. Laurent Daudet - Institut Langevin, Diderot University Paris - Examiner
Prof. Bruno Gas - ISIR, UPMC - Examiner
Prof. Alvin Su - SCREAM Lab, NCKU - Thesis director
Dr. Axel Roebel - IRCAM - Thesis director

In this thesis, we propose a new analysis-synthesis framework for environmental sounds and sound textures. It uses a parametric representation of sound textures by means of perceptually important statistics and an efficient mechanism to adapt statistics in the time-frequency domain. The statistic description is based on the short-time-Fourier-transform. The adaptation of statistics is achieved by utilizing the connection between the statistics on time-frequency representation and the spectra of time-frequency domain coefficients. If the order of statistics is not greater than two, feasible signals can directly be generated from statistical descriptions without iterative steps. When the order of statistics is greater than two, the algorithm can still adapt all the statistics within a reasonable amount of iterations.

The proposed framework allows easily extracting the statistical description of a sound texture then resynthesizes arbitrary long samples of the original sound texture from the statistical description.
A perceptual evaluation has shown that the quality of resynthesised sounds is at least as good as state-of-the-art but more efficient in terms of computation time.

speakers


share


Do you notice a mistake?

IRCAM

1, place Igor-Stravinsky
75004 Paris
+33 1 44 78 48 43

opening times

Monday through Friday 9:30am-7pm
Closed Saturday and Sunday

subway access

Hôtel de Ville, Rambuteau, Châtelet, Les Halles

Institut de Recherche et de Coordination Acoustique/Musique

Copyright © 2022 Ircam. All rights reserved.