Posts by Collection

portfolio

2016

Primary-Ambient Source Separation

My first research project, spanning my M.Sc. at Nile University and an internship at Sony Stuttgart. The goal: automatically separate the direct sound from the diffuse ambience in a stereo recording, to enable surround sound upmixing.

2018

Singing Voice Intelligibility

For my M.Sc. at NUS, I studied what makes song lyrics easy or hard to understand, and built systems to measure it automatically — motivated by a real application: recommending music for language learning.

2022

Contextual Music Recommendation

My PhD project at Télécom Paris and Deezer, studying how listening context — activity, mood, device, time of day — shapes what music people want to hear, and building systems that learn to predict it automatically.

2024

Mood Monitoring with Speech Emotion Recognition

At Emobot, I led research on automatic speech emotion recognition — pushing accuracy significantly through synthetic data augmentation via emotion conversion, with direct impact on a real-time healthcare application.

publications

Published in The 19th International Society for Music Information Retreival Conference ISMIR, 2017

Intelligibility of Sung Lyrics: A Pilot Study

This paper is about estimating the intelligibility of the singing voice in a given song. We propose a set of acoustic features that are relevant for estimating the intelligibility. We also propose an approach for labeling songs with an intelligibility score accroding to human perception

Published in The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

Primary-Ambient Source Separation for Upmixing to Surround Sound Systems

This paper is about separting the primary and ambient sources from a sounds mixture to be used in surround sound upmixing. We propose a neural-network-based approach to apply the separation

Published in The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

Audio-Based Auto-Tagging With Contextual Tags for Music

This paper is about auto-tagging music tracks with context-related tags. The paper also presents a dataset of ∼50k tracks labelled with 15 different contexts.

talks

teaching