I am a PhD student at the University of Melbourne, Australia. My research is fully supported by the Melbourne Research Scholarship, and I’m incredibly fortunate to be supervised by Dr. Ting Dang and Prof. Eun-Jung Holden. Before joining Unimelb, I was an AI engineer at Fortemedia, working with Dr. Rohan Kumar Das. I earned my master's degree in CS at NTU, advised by Prof. Chng Eng Siong, and my B.Eng. degree from Jilin University.

I am working toward “adaptive”, “efficient”, and “robust” next-generation speech AI systems. At the moment, I mainly work on post-training paradigms for speech learning systems. Specifically, by combining continual learning, domain adaptation, knowledge editing, and reinforcement fine-tuning, we pave the way for speech models that continuously adapt, specialize efficiently, and self-correct in real-world environments. I have published more than 20 papers at top international AI conferences and journals such as ACL, SPL, ICME, ICASSP, and INTERSPEECH.

🔥 News

  • 2025.09:  🎉🎉 Our ICASSP grand challenge ESDD 2026 has been launched.
  • 2025.08:  🎉🎉 I joined the University of Melbourne as a PhD student in Australia!
  • 2025.05:  🎉🎉 Four papers have been accepted to Interspeech 2025!

🔍 Research Area

Speech and Audio Processing: Sound Event Detection, Spoken Keyword Spotting, Speech Foundation Models, Deepfake Detection

Algorithms: Continual Learning, Test-Time Adaptation, Knowledge Editing

🎓 Education

  • 08.2025 - Now, Doctor of Philosophy - Engineering and IT, The University of Melbourne, Australia
  • 08.2021 - 01.2023, Master of Science (Artificial Intelligence), Nanyang Technological University, Singapore
  • 08.2016 - 07.2020, B.E. in Internet of Things Engineering, Jilin University, Changchun, China

💼 Work Experience

  • 01.2023 - 08.2025, AI Engineer, Fortemedia Singapore
  • 07.2020 - 05.2021, Software Engineer, China Mobile (Chengdu) Industrial Research Institute

📝 Publications

INTERSPEECH 2025
EnvSDD: Benchmarking Environmental Sound Deepfake Detection

Han Yin, Yang Xiao, Rohan Kumar Das, Jisheng Bai, Haohe Liu, Wenwu Wang, Mark D Plumbley

Project

  • The first large-scale curated dataset designed for Environmental Sound Deepfake Detection.

Continual Learning for Speech / Audio

Domain adaptation for Speech / Audio

Others

2025

Before 2024

🎖 Honors and Awards

  • 2025.07 ISCA (International Speech Communication Association) Grant, Interspeech, Rotterdam
  • 2025.03 Melbourne Research Scholarship, University of Melbourne