Do you notice a mistake?
NaN:NaN
00:00
Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limitations of handmade representations. In this talk, I will present LEAF, a new, lightweight, fully learnable neural network that can be used as a drop-in replacement of mel-filterbanks. LEAF learns all operations of audio features extraction, from filtering to pooling, compression and normalization, and can be integrated into any neural network at a negligible parameter cost, to adapt to the task at hand. I will show how LEAF outperforms mel-filterbanks on a wide range of audio signals, including speech, music, audio events and animal sounds, providing a general-purpose learned frontend for audio classification.
October 28, 2024 00:32:44
October 28, 2024 00:29:57
October 28, 2024 00:20:33
October 28, 2024 00:20:36
April 29, 2021 00:30:04
October 28, 2024 00:26:39
May 17, 2021 00:20:20
October 28, 2024 00:47:23
October 28, 2024 00:21:30
Do you notice a mistake?