Deep Learning for Audio Classification-湖大信息科学与工程学院

我的位置在：首页 > 学术报告 > 正文

Deep Learning for Audio Classification

浏览次数:日期：2019-12-08编辑：信科院科研办

时间:2019.12.9 15:00

地点:学院220教室（原106）

报告简介:

Audio classification (e.g. audio scene analysis, audio event detection and audio tagging) have a variety of potential applications in security surveillance, intelligent sensing for smart homes and cities, multimedia search and retrieval, and healthcare. This research area is under rapid development recently, having attracted increasing interest from bothacademiaandindustrialists.In this talk, wewill present some recent and new development for several challenges related to this topic, including data challenges (e.g. DCASE challenges), acoustic modelling, feature learning, dealing with weakly labelled data, and learning with noisy labels. We will show some latest results of our proposed algorithms, such as the attention neural network algorithms for learning with weakly labelled data,and their resultson AudioSet – a large scale dataset provided by Google,as compared with several baselinemethods. We will also use some sound demos to illustrate the potentials of our proposed algorithms.

报告人介绍:

Wenwu Wang is a Professor in Signal Processing and Machine Learning, and a Co-Director of the Machine Audition Labwithin the Centre for Vision Speech and Signal Processing,University of Surrey,UK.He is also a Guest Professor at Qingdao University of Science and Technology, China.

He received the B.Sc. degree in 1997, the M.E. degree in 2000, and the Ph.D. degreein2002, all fromthe College of Automation,Harbin Engineering University, China. Heworked inKing’s College London (2002-2003), Cardiff University (2004-2005), Tao Group Ltd. (now Antix Labs Ltd.) (2005-2006), Creative Labs (2006-2007), andUniversity of Surrey(since May 2007). He was a Visiting Scholar at Ohio State University, USA, in 2008.His current research interests include blind signal processing, sparse signal processing, audio-visual signal processing, machine learning and perception, artificial intelligence, machine audition (listening), and statistical anomaly detection. He has(co)-authored over 250 publications in these areas.

He and his team have wonthe Reproducible System Award on DCASE 2019, Best Student Paper Award on LVA/ICA 2018, the Best Oral Presentation on FSDM 2016, the Top-Quality Paper Award in IEEE ICME 2015, Best Student Paper Award finalists on ICASSP 2019 and LVA/ICA 2010. He and his teamhaveachieved the 1st place (among 35 submitted systems) in the 2017 DCASE Challenge on "Large-scale weakly supervised sound event detection for smart cars", the 3^rdplace (among 558 submissions) on the 2018 Kaggle Challenge on"Freesound General-Purpose Audio Tagging",the TVB Europe Award for Best Achievement in Sound in 2016, the finalist for GooglePlay Best VR Experience in 2017, and the Best Solution Award on the Dstl Challenge "Under-sampled Signal Recognition" in 2012.

He has been a Senior Area Editor (2019-) and Associate Editor (2014-2018) for IEEE Transactions on Signal Processing.He is an Associate Editor (2019-) for EURASIP Journal on Audio Speech and Music Processing. He was a Publication Co-Chair for ICASSP 2019,Brighton, UK, and will serve as Tutorial Chair for ICASSP 2024, Seoul, South Korea. Healso serves as a Member (2019-) of the International Steering Committee of Latent Variable Analysis and Signal Separation.

More information on his personal page:http://personal.ee.surrey.ac.uk/Personal/W.Wang/

下一篇：: 从内容安全到行为安全-历史辩证论的新视角看网络空间安全