April 14, 2005 01 h 01 min
April 14, 2005 24 min
May 12, 2005 52 min
February 4, 2005 01 h 18 min
October 17, 2007 49 min
June 27, 2007 01 h 12 min
July 11, 2007 48 min
September 12, 2007 01 h 07 min
September 19, 2007 01 h 13 min
September 26, 2007 01 h 00 min
October 3, 2007 01 h 12 min
October 10, 2007 01 h 10 min
October 24, 2007 50 min
November 21, 2007 57 min
0:00/0:00
Among diverse research fields within computer music, synthesis and generation of audio signals epitomize the cross-disciplinarity of this domain, as it jointly nourishes both scientific and artistic practices since its creation. Some processes naturally handle both pathways, hence providing invertible representations of given sounds. On the top of that, recent trends in machine learning gave rise to powerful data-centered methods, raising several epistemological questions amongst researchers about their possible uses and concrete significations. Particularly, generative models focus on the generation of original content from automatically extracted features, not only questioning previous approaches in generation but also how these processes could be exploited for artistic purposes. Particularly, a specific family of generative models called variational methods are based on both unsupervised inference of features and direct generation. The interest of such methods is twofold : first, they resort to Bayesian inference to extract continuous low-dimensional representations, aiming to reflect the underlying structure of a data corpus. Secondly, these continuous representations, called latent spaces, can be inverted to generate the data back, providing powerful synthesis and in-domain interpolation capabilities.
Hence, such bijective systems can be interestingly used for sound synthesis, providing data-centered generation methods whose controls are automatically extracted from the data. Furthermore, the flexibility of such systems allow numerous ways of influencing the construction of these representations, by example with external information or perceptual constraints, such that the training process can also be embedded in their creative use. We will review the generative abilities of these methods when applied to the audio domain, and how the extracted spaces can be used as high-level features for audio analysis. We will also introduce diverse ways of using it for creative purposes, and also how these methods can be extended to integrate the temporal dimension of audio information. Finally, we will present how these generative processes can be embedded in musical and composition tools, and how they can be used to figure a novel use of synthesis algorithms.
Among diverse research fields within computer music, synthesis and generation of audio signals epitomize the cross-disciplinarity of this domain, as it jointly nourishes both scientific and artistic practices since its creation. Some processes naturally handle both pathways, hence providing invertible representations of given sounds. On the top of that, recent trends in machine learning gave rise to powerful data-centered methods, raising several epistemological questions amongst researchers about their possible uses and concrete significations. Particularly, generative models focus on the generation of original content from automatically extracted features, not only questioning previous approaches in generation but also how these processes could be exploited for artistic purposes. Particularly, a specific family of generative models called variational methods are based on both unsupervised inference of features and direct generation. The interest of such methods is twofold : first, they resort to Bayesian inference to extract continuous low-dimensional representations, aiming to reflect the underlying structure of a data corpus. Secondly, these continuous representations, called latent spaces, can be inverted to generate the data back, providing powerful synthesis and in-domain interpolation capabilities.
Hence, such bijective systems can be interestingly used for sound synthesis, providing data-centered generation methods whose controls are automatically extracted from the data. Furthermore, the flexibility of such systems allow numerous ways of influencing the construction of these representations, by example with external information or perceptual constraints, such that the training process can also be embedded in their creative use. We will review the generative abilities of these methods when applied to the audio domain, and how the extracted spaces can be used as high-level features for audio analysis. We will also introduce diverse ways of using it for creative purposes, and also how these methods can be extended to integrate the temporal dimension of audio information. Finally, we will present how these generative processes can be embedded in musical and composition tools, and how they can be used to figure a novel use of synthesis algorithms.