Debottam Dutta

PhD Student @ SiNRG | ECE, University of Illinois at Urbana-Champaign

dd_portrait.png

I am a 4th-year Ph.D. student in Electrical and Computer Engineering at the University of Illinois Urbana-Champaign, advised by Prof. Romit Roy Choudhury in the Signals and Inference Research Group (SiNRG). My research centers on generative and diffusion-based models, with an emphasis on compositional generation, controllable synthesis, and robustness. I explore how diffusion models represent and combine multiple visual or semantic concepts, and how corrective or guided sampling can improve their alignment and reliability.

Before my Ph.D., I worked as a Research Fellow at the LEAP Lab at the Indian Institute of Science, advised by Prof. Sriram Ganapathy, where I studied representation learning and interpretability for audio and speech.

Across both domains, my broader goal is to develop robust and controllable generative models that can compose multiple concepts coherently and generalize across modalities.


News

Jan 18, 2025 Our paper on Curvature Guided Monte-Carlo got accepted at ICASSP 2025
Sep 30, 2024 A paper got accepted at NeurIPS 2024 Workshop on AI-Driven Speech, Music, and Sound Generation.
Jan 1, 2024 Journal paper got accepted for TASLP.
Jun 22, 2023 Coswara dataset paper got accept at Nature Scientific data!
Jun 15, 2022 Two papers got accepted at Interspeech 2022.

Selected Publications

  1. ICASSP
    Estimating Multi-chirp Parameters using Curvature-guided Langevin Monte Carlo
    Sattwik Basu,  Debottam Dutta, Yu-Lin Wei, and Romit Roy Choudhury
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025
  2. Neurips
    Multi-Source Music Generation with Latent Diffusion
    Zhongweiyang Xu,  Debottam Dutta, Yu-Lin Wei, and Romit Roy Choudhury
    In Audio Imagination: NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound Generation 2024
  3. Nature Sci. Data
    Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection
    Debarpan Bhattacharya, Neeraj Kumar Sharma,  Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, C Chandrakiran, Sahiti Nori, K K Suhail, Sadhana Gonuguntla, and Murali Alagesan
    Sci. Data Jun 2023
  4. Interspeech
    Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection
    Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir Hossein Poorjam, Deepak Mittal, and Maneesh Singh
    In Proc. Interspeech 2022 Jun 2022
  5. WASPAA
    A Multi-Head Relevance Weighting Framework for Learning Raw Waveform Audio Representations
    Debottam Dutta, Purvi Agrawal, and Sriram Ganapathy
    In 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) Jun 2021