From psychoacoustics to deep learning: learning low-level processing of sound with neural networks

Information

Series: Ateliers du Forum
Event: Ateliers du Forum 2021
Type: Conference series, symposium, congress
Venue: Ircam, Salle Igor-Stravinsky (Paris)
Duration: 18 min
Date: March 19, 2021

Mel-filterbanks are fixed, engineered audio features that emulate human perception and have been used throughout the history of audio understanding, up to the present day. However, their undeniable qualities are counterbalanced by the fundamental limitations of handcrafted representations. In this talk, I will present LEAF, a new, lightweight, fully learnable neural network that can be used as a drop-in replacement for mel-filterbanks. LEAF learns all operations of audio feature extraction, from filtering to pooling, compression and normalization, and can be integrated into any neural network at a negligible parameter cost, to adapt to the task at hand. I will show how LEAF outperforms mel-filterbanks on a wide range of audio signals, including speech, music, audio events and animal sounds, providing a general-purpose learned frontend for audio classification.
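The four stages named above (filtering, pooling, compression, normalization) can be illustrated with a minimal NumPy sketch. It assumes Gabor band-pass filters, Gaussian low-pass pooling and a PCEN-style step that combines compression and normalization. These specific choices, every function name and every default value are assumptions made for illustration and are not taken from the abstract. The parameters here are plain fixed arrays; in a learnable frontend they would be updated by gradient descent together with the rest of the network.

```python
import numpy as np

def gabor_filters(center_hz, sigma, size, sample_rate):
    """Complex Gabor filters: a Gaussian envelope (std `sigma`, in samples)
    modulated by a complex sinusoid at each center frequency."""
    t = np.arange(size) - size // 2                      # symmetric time axis
    omega = 2.0 * np.pi * center_hz / sample_rate        # radians per sample
    envelope = np.exp(-0.5 * (t[None, :] / sigma[:, None]) ** 2)
    carrier = np.exp(1j * omega[:, None] * t[None, :])
    return envelope * carrier / (np.sqrt(2.0 * np.pi) * sigma[:, None])

def gaussian_pool(energy, pool_size, stride, rel_sigma):
    """Low-pass each channel with a normalized Gaussian window, then subsample."""
    half = 0.5 * (pool_size - 1)
    t = np.arange(pool_size)
    window = np.exp(-0.5 * ((t - half) / (rel_sigma * half)) ** 2)
    window /= window.sum()
    return np.stack([np.convolve(e, window, mode="same")[::stride] for e in energy])

def pcen(pooled, alpha, delta, root, smooth, eps=1e-6):
    """PCEN-style step: divide by a smoothed running average of the signal
    (normalization), then apply a root (compression)."""
    m = np.zeros_like(pooled)
    m[:, 0] = pooled[:, 0]
    for i in range(1, pooled.shape[1]):
        m[:, i] = (1.0 - smooth) * m[:, i - 1] + smooth * pooled[:, i]
    return (pooled / (eps + m) ** alpha + delta) ** (1.0 / root) - delta ** (1.0 / root)

def frontend(wave, center_hz, sigma, sample_rate=16000, filter_size=401,
             pool_size=401, stride=160, rel_sigma=0.4,
             alpha=0.96, delta=2.0, root=2.0, smooth=0.04):
    """Waveform -> (channels, frames) features. All defaults are illustrative."""
    # 1. Filtering: band-pass the waveform, keep the squared modulus.
    filters = gabor_filters(center_hz, sigma, filter_size, sample_rate)
    energy = np.stack([np.abs(np.convolve(wave, f, mode="same")) ** 2 for f in filters])
    # 2. Pooling: smooth and downsample each channel.
    pooled = gaussian_pool(energy, pool_size, stride, rel_sigma)
    # 3 and 4. Compression and normalization.
    return pcen(pooled, alpha, delta, root, smooth)
```

Calling `frontend` on a one-second waveform sampled at 16 kHz with five center frequencies returns an array of shape `(5, 100)`, one row per filter and one column per 10 ms frame. Because the result has the same channels-by-frames layout as a mel-spectrogram, such a frontend can sit in front of a classifier that previously consumed mel-filterbank features.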

Media related to this event

Deep Learning for Voice processing

Towards helpful, customer-specific Text-To-Speech synthesis

Tools for creative AI and noise

Xtextures - Convolutional neural networks for texture synthesis and cross synthesis

Round Table IA : Questions/discussions

Melodic Scale and Virtual Choir, Max ISiS

Greg Beller, David Guennec, Nicolas Obin, Axel Roebel, Hugues Vinet. Table ronde

Session IA - An overview of AI for Music and Audio Generation

Interaction with musical generative agents
