Posts by Collection

portfolio

Primary-Ambient Source Separation

Published:

My first research project, spanning my M.Sc. at Nile University and an internship at Sony Stuttgart. The goal: automatically separate the direct sound from the diffuse ambience in a stereo recording, to enable surround sound upmixing.

Singing Voice Intelligibility

Published:

For my M.Sc. at NUS, I studied what makes song lyrics easy or hard to understand, and built systems to measure it automatically — motivated by a real application: recommending music for language learning.

Contextual Music Recommendation

Published:

My PhD project at Télécom Paris and Deezer, studying how listening context — activity, mood, device, time of day — shapes what music people want to hear, and building systems that learn to predict it automatically.

Speech Emotion Recognition

Published:

At Emobot, I led research on automatic speech emotion recognition — pushing accuracy significantly through synthetic data augmentation via emotion conversion, with direct impact on a real-time healthcare application.

publications

Intelligibility of Sung Lyrics: A Pilot Study

Published in The 19th International Society for Music Information Retreival Conference ISMIR, 2017

This paper is about estimating the intelligibility of the singing voice in a given song. We propose a set of acoustic features that are relevant for estimating the intelligibility. We also propose an approach for labeling songs with an intelligibility score accroding to human perception

Primary-Ambient Source Separation for Upmixing to Surround Sound Systems

Published in The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

This paper is about separting the primary and ambient sources from a sounds mixture to be used in surround sound upmixing. We propose a neural-network-based approach to apply the separation

Audio-Based Auto-Tagging With Contextual Tags for Music

Published in The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

This paper is about auto-tagging music tracks with context-related tags. The paper also presents a dataset of ∼50k tracks labelled with 15 different contexts.

talks

teaching