Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml
and set future: false
.
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Short description of portfolio item number 1
Published in Speech Communication, 2023
We show that the source-filter model of speech production naturally emerges in the latent space of an unsupervised VAE and we propose a weakly-supervised method to control the pitch and formant frequencies of speech signals in the VAE latent space.
Recommended citation: Learning and controlling the source-filter representation of speech with a variational autoencoder Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier Speech Communication, vol. 148, 2023. https://www-sciencedirect-com.ezproxy.universite-paris-saclay.fr/science/article/pii/S0167639323000304
Published in Workshop ICASSP (SASB), 2023
Combined VQ-VAE (unsupervised) with MAE (self-supervised) for speech emotion recognition.
Recommended citation: Sadok Samir, Simon Leglaive and Renaud Séguier. “A vector quantized masked autoencoder for speech emotion recognition.” (2023). https://arxiv.org/pdf/2304.11117.pdf
Published in Neural Networks (Elsevier), 2024
We present a multimodal and dynamical VAE (MDVAE) applied to unsupervised audio-visual speech representation learning.
Recommended citation: Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier. A multimodal dynamical variational autoencoder for audiovisual speech representation learning. Neural Networks (Elsevier), 2024 https://www.sciencedirect.com/science/article/pii/S0893608024000340
Published:
The objective of this day is to bring together researchers from the written, oral and sign language processing communities to study the representations extracted by deep neural models from massive data.
Published:
Presentation on my work: learning and control of variation factors for speech with variational autoencoders
Published:
Methods and models in signal processing.
Workshop, University de Rennes 1, 2021
I was supervising students during their practical work:
Workshop, CentralesSupelec, 2022
Supervision of two student projects for machine learning and deep learning.