Student Supervision
Research Areas
Courses Taught
Research Projects
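[1] Response Generation for Intelligent Spoken Dialogue Systems Oriented to Affective Computing. National Natural Science Foundation of China, Principal Investigator. 2021-2024
[2] Research and Application Demonstration of Integrated Network-Collaborative Manufacturing Technologies for New-Energy Vehicle Manufacturing Industry Cluster Regions. National Key R&D Program of China, Task (Sub-project) Leader. 2021-2023
[3] Research on Intelligent Handling Technologies for Multi-Source Litigation-Related Petitions. National Key R&D Program of China, Key Member. 2018-2021
[4] Research on Affective-Computing-Based Dynamic Recommender System Models for Speech Environments. National Natural Science Foundation of China, Principal Investigator. 2018-2021
[5] Research on Several Key Technologies of Intelligent Language Processing in Next-Generation Automobiles. National Natural Science Foundation of China, Principal Investigator. 2012-2015
[6] Development of a Classified-Information Detection System for Images, Text, and Fixed-Layout Documents on the Internet. Changsha Science and Technology Program Key Project, Principal Investigator. 2014-2015
[7] Research on Key Front-End Processing Technologies for Automatic Speech Recognition Systems on Embedded Platforms. Hunan Provincial Natural Science Foundation Key Project, Principal Investigator. 2010-2012
[8] Research and Implementation of Task Scheduling Mechanisms for Embedded Heterogeneous Multi-Core Architectures. Hunan Provincial Science and Technology Program, Principal Investigator. 2009-2010
[9] Research on Speech Synthesis Methods Based on Data Mining Techniques. Hunan Provincial Natural Science Foundation, Principal Investigator. 2004-2005
[10] Intelligent Measurement and Control System for Industrial Test Benches. Hunan Provincial Science and Technology Program, Principal Investigator. 2003-2005
[11] Research on the Application of Speech Synthesis in Embedded Systems. Hunan Provincial Science and Technology Program, Principal Investigator. 2005-2007
[12] Research on Embedded Speech Processing Algorithms and Design of Embedded Speech Systems. Project funded by the Hunan Provincial Department of Finance, Principal Investigator. 2006-2008
[13] Development of a Wireless-Network-Based Dynamic Audio/Video Migration System. Hunan Provincial Science and Technology Program Key Project, Participant. 2010-2012
[14] Development of the Higher-Education Subsystem of the China Online Education Platform Pilot Project. Major Project of the National Development and Reform Commission, Participant. 2004-2005
[15] Industry-University-Research Cooperation Project between Hunan Lushan Cloud Data Technology Service Co., Ltd. (湖南麓山云数据科技服务有限公司) and Hunan University. Technical services, RMB 3 million, Project Leader. 2017-2019
[16] Computer System Organization and Architecture. National Quality Resource-Sharing Course, Course Leader. 2013
[17] Computer System Organization and Architecture. National Quality Course, Course Leader. 2009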
Science and Technology Awards
Representative Recent Publications
Papers published in 2026:
Guanghui Ye, Huan Zhao*, Zhixue Zhao*, Tengfei Ma, Kehan Wang, Steffen Eger, Zhihua Jiang, SCIEval: Evaluating and Benchmarking the Faithfulness of Scientific Image Generation and Interpretation with Large Multimodal Models, CVPR 2026 (CCF A)
Kehan Wang, Huan Zhao*, Yong Wei, Xupeng Zha, Guanghui Ye, Cheng Zhu, Yiming Liu, Zixing Zhang*, PLUM-Net: Prototype-Induced Label Structuring for Disentangled Multimodal Representation Network, AAAI, 2026 (CCF A)
Guanghui Ye, Huan Zhao*, Yingxue Gao, Zhixue Zhao, Kehan Wang, Xupeng Zha, Zhihua Jiang*, Making Visual Dialogue More Engaging: A New Task, Method, and Metric, AAAI, 2026 (CCF A)
Bo Li, Huan Zhao*, Yingxue Gao, Guanghui Ye, Yiming Liu, Zixing Zhang, Emodiffusion: Modeling Emotion Evolution with Diffusion for Diverse and Coherent Dialogue Generation, ICASSP, 2026 (CCF B)
Huan Zhao, Zexin Zhou, Guanghui Ye*, Focus before Reasoning: A Bidirectional Selection Framework for Noise-Mitigation in Knowledge-Based Vision Question Answering, ICASSP, 2026 (CCF B)
Huan Zhao, Zhijie Yu, Yong Wei, Bo Li, Yingxue Gao*, DSSR: Decoupling Salient and Subtle Representations under Missing Modalities for Multimodal Emotion Recognition, ICASSP, 2026 (CCF B)
Huan Zhao, Ling Xiong, Kehan Wang*, Selective Hub Fusion with Modality-Heterogeneous Experts for Multimodal Emotion Recognition, ICASSP, 2026 (CCF B)
Huan Zhao, Gong Chen, Zhijie Yu, Yingxue Gao*, Graph-Based Emotion Consensus Perception Learning for Multimodal Emotion Recognition in Conversation, ICASSP, 2026 (CCF B)
Papers published in 2025:
Xupeng Zha, Huan Zhao*, Guanghui Ye, Zixing Zhang. Dual-View Learning for Conversational Emotion Recognition through Context and Emotion-Shift Modeling. AAAI 2025 (CCF A)
Guanghui Ye, Huan Zhao*, Zhixue Zhao, Yang Liu, Xupeng Zha, Zhihua Jiang. Knowledge Image Matters: Improving Knowledge-Based Visual Reasoning with Multi-Image Large Language Models. ACL 2025 (CCF A)
Guanghui Ye, Huan Zhao*, Zixing Zhang, Zhihua Jiang. UniDE: A Multi-level and Low-resource Framework for Automatic Dialogue Evaluation via LLM-based Data Augmentation and Multitask Learning. Information Processing and Management, 2025 (CAS Zone 1, Top)
Huan Zhao, Zeyi Li, Song Wang*, Zixing Zhang, Keqin Li. Robust Hashing with Bilinear Drift for Image-Text Retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 2025 (CAS Zone 1, Top)
Haijiao Chen, Huan Zhao*, Zixing Zhang*, Keqin Li. Federal Parameter-Efficient Fine-Tuning for Speech Emotion Recognition. Expert Systems With Applications, 2025 (CAS Zone 1, Top)
Guanghui Ye, Huan Zhao*, Bo Li, Haijiao Chen, Zhixue Zhao, Zhihua Jiang. CCDE: A Compact and Competitive Dialogue Evaluation Framework via Knowledge Distillation of Large Language Models. IEEE Transactions on Computational Social Systems, 2025 (CAS Zone 3)
Haijiao Chen, Huan Zhao*, Yingxue Gao, Yiming Liu, Zixing Zhang. Parameter-Efficient Federal-Tuning Enhances Privacy Preserving for Speech Emotion Recognition[C]. ICASSP 2025 (CCF B)
Huan Zhao, Yingxue Gao*, Haijiao Chen, Bo Li, Guanghui Ye, Zixing Zhang. Enhanced Multimodal Emotion Recognition in Conversations via Contextual Filtering and Multi-Frequency Graph Propagation[C]. ICASSP 2025 (CCF B)
Yiming Liu, Huan Zhao*, Yaqian Liu, Haijiao Chen, Bo Li, Guanghui Ye. DSSM: Dual State Space Model for Human Motions Generation[C]. ICASSP 2025 (CCF B)
Zhongren Dong#, Haotian Guo#, Weixiang Xu, Huan Zhao, Zixing Zhang, Foundation Model-based Evaluation of Neuropsychiatric Disorders: A Lifespan-Inclusive, Multi-Modal, and Multi-Lingual Study, IEEE Journal of Selected Topics in Signal Processing, 2025 (CCF B)
Bin Wang, Yang Xu, Huan Zhao, Hao Zhang, Zixing Zhang*, PTalker: Personalized Speech-Driven 3D Talking Head Animation via Style Disentanglement and Modality Alignment, ACM Multimedia, 2025 (CCF A)
Cheng Zhu, Jing Han*, Qianshuai Xue, Kehang Wang, Huan Zhao, Zixing Zhang*, AudioFab: Building A General and Intelligent Audio Factory through Tool Learning, ACM Multimedia, 2025 (CCF A)
Zixing Zhang, Jiajun Li, Bin Wang, Yiming Liu, Huan Zhao, Björn W. Schuller, XDGesture: an xLSTM-based diffusion model for co-speech gesture generation, ICASSP, 2025 (CCF B)
Papers published in 2024:
Huan Zhao, Xupeng Zha*, and Zixing Zhang. 2024. EmoTransKG: An Innovative Emotion Knowledge Graph to Reveal Emotion Transformation. ACL 2024 (CCF A)
Song Wang, Huan Zhao*, Zixing Zhang, Keqin Li. Individual mapping and asymmetric dual supervision for discrete cross-modal hashing. Expert Systems With Applications, 2024 (CAS Zone 1, Top)
Haijiao Chen, Huan Zhao, Zixing Zhang, Keqin Li. Discriminative Feature Learning-Based Federated Lightweight Distillation Against Multiple Attacks. IEEE Internet of Things Journal, 2024 (CAS Zone 1, Top)
Haijiao Chen, Huan Zhao*, Zixing Zhang*. Gradient-Level Differential Privacy against Attribute Inference Attack for Speech Emotion Recognition, IEEE Signal Processing Letters, 2024 (CAS Zone 2)
Xupeng Zha, Huan Zhao*, Zixing Zhang*. Esingnn: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition. ICASSP, 2024 (CCF B)
Yingxue Gao, Huan Zhao*, Zixing Zhang*. Adaptive Speech Emotion Representation Learning Based on Dynamic Graph. ICASSP, 2024 (CCF B)
Guanghui Ye, Huan Zhao*, Zixing Zhang, Xupeng Zha, Zhihua Jiang. LSTDial: Enhancing Dialogue Generation via Long- and Short-Term Measurement Feedback. NAACL 2024 (CCF B)
Huan Zhao, Yi Ju and Yingxue Gao. Bilevel Relational Graph Representation Learning-based Multimodal Emotion Recognition in Conversation. ICME 2024 (CCF B)
Zixing Zhang, Liyizhe Peng, Tao Pang, Jing Han*, Huan Zhao, Bjoern W. Schuller, Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models, IEEE Trans. on Computational Social Systems, 2024 (CAS Zone 2)
Zixing Zhang, Zhongren Dong, Zhiqiang Gao, Shihao Gao, Donghao Wang, Ciqiang Chen, Yuhan Nie, Huan Zhao, Open Vocabulary Emotion Prediction Based on Large Multimodal Models, Proc. 2nd International Workshop on Multimodal and Responsible Affective Computing (MRAC), Melbourne, Australia, 2024. (2nd place in MER challenge)
Liyizhe Peng, Zixing Zhang*, Tao Pang, Jing Han, Huan Zhao, Hao Chen, Björn W. Schuller, Customising General Large Language Models for Specialised Emotion Recognition Tasks, ICASSP, 2024 (CCF B)
Selected papers published in 2023 and earlier:
Song Wang, Zhao Huan, Nai Kei. Learning a maximized shared latent factor for cross-modal hashing. Knowledge-Based Systems, 2021 (CAS Zone 1)
Song Wang, Huan Zhao, Yunbo Wang, Jing Huang, Keqin Li. Cross-modal image-text search via efficient discrete class alignment hashing. Information Processing and Management, 2022 (CAS Zone 1, Top)
Song Wang, Huan Zhao* and Keqin Li. Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search. IEEE Transactions on Circuits and Systems for Video Technology, 2022 (CAS Zone 1, Top)
Tingting Li, Huan Zhao, Jing Huang, Keqin Li. Cross-domain image translation with a novel style-guided diversity loss design. Knowledge-Based Systems, 2022 (CAS Zone 1, Top)
Huan Zhao, Haijiao Chen*, Yufeng Xiao, Zixing Zhang. Privacy-enhanced Federated Learning Against Attribute Inference Attack for Speech Emotion Recognition. ICASSP 2023 (CCF B)
Tingting Li, Huan Zhao, Song Wang, and Jing Huang. Style-Guided Image-to-Image Translation for Multiple Domains. ICMR, 2021 (CCF B)
Huan Zhao*, Yufeng Xiao, Jing Han, Zixing Zhang. Compact Convolutional Recurrent Neural Networks Via Binarization for Speech Emotion Recognition. ICASSP 2019 (CCF B)
For the complete list of publications, see:
https://www.researchgate.net/profile/Huan-Zhao-42
https://orcid.org/my-orcid?orcid=0000-0001-6286-5868
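Patents and Software Copyrights
(I) Invention Patents
[1] Huan Zhao, Song Wang, Zuo Chen, Biao Tan. A speech emotion recognition method combining feature evaluation and a multilayer perceptron, 2017. Invention patent, granted, Patent No. 201710607479.9.
[2] Huan Zhao, Tingting Li, Yiying Li. A text summarization method based on subject enhancement, 2020. Invention patent, under substantive examination.
[3] Huan Zhao, Bo Li, Yiying Li. An enhanced text summarization method based on a copy mechanism and variational neural inference, 2020. Invention patent, under substantive examination.
[4] Huan Zhao, Xiaoxiao Zhou, Yufeng Xiao, Zuo Chen. A speech emotion recognition method based on DIS-NV features, 2017. Invention patent, under substantive examination.
[5] Huan Zhao, Xixiang Zhang, Biao Tan. A recommendation method and system based on real-time speech content detection, 2015. Invention patent, granted, Patent No. ZL 2015 1 0662383.3.
[6] Huan Zhao, Rui Zheng, Zuo Chen, Zeying Yang, Qian Zhang. A wireless voice control method and system for mobile platforms, 2014. Invention patent, granted, Patent No. ZL 2014 1 0285216.7.
[7] Huan Zhao, Rui Zheng, Zuo Chen, Xixiang Zhang, Zeying Yang. A voice imitation method and device, 2013. Invention patent, granted, Patent No. ZL 2013 1 0423715.3.
[8] Huan Zhao, Fei Wang, Zuo Chen, Wenjie Gan. A multimedia playback method and device with voice control and query-by-humming functions, 2013. Invention patent, granted, Patent No. ZL 2013 1 0298771.9.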
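(II) Utility Model Patents
[1] Huan Zhao, Fei Wang, Zuo Chen, Wenjie Gan. A multimedia playback device with voice control and query-by-humming functions. Utility model patent, granted, Patent No. ZL 2013 2 0422658.2.
[2] Huan Zhao, Lu Feng, Zuo Chen, Fei Wang. A mobile communication device with selectable speech and text output. Utility model patent, granted, Patent No. ZL 2013 2 0461679.5.
[3] Huan Zhao, Zuo Chen. A subtitle overlay system for real-time audio/video streams. Utility model patent, granted, Patent No. ZL 2011 2 0558388.9.
[4] Cheng Xu, Renfa Li, Yan Liu, Yunchuan Qin, Zhengqin Luo, Chunyi Huang, Manman Peng, Ronghui Wu, Huan Zhao. A wireless multimedia real-time learning system and method. Utility model patent, granted (Application No. 200610032349.9, Publication No. CN101155089).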
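(III) Software Copyrights
[1] Huan Zhao (Hunan University). Automatic text summarization software. Computer software copyright, Registration No. 2020SR0391470.
[2] Huan Zhao (Hunan University). Chit-chat dialogue system based on the Seq2Seq framework, V1.0, 2019.12. Computer software copyright, Registration No. 2019R11L1950501.
[3] Huan Zhao (Hunan University). Multi-class sentiment classification system based on Weibo topics, V1.0, 2018.11. Computer software copyright, Registration No. 2019SR0216837.
[4] Huan Zhao (Hunan University). Collaborative filtering recommender system based on trust relationships, V1.0, 2018.1. Computer software copyright, Registration No. 2018SR329655.
[5] Huan Zhao (Hunan University). Speech-based personality trait assessment system, 2016.3. Computer software copyright, Registration No. 2016SR108509.
[6] Fei Peng (Hunan University). Classified-information detection system for images, text, and fixed-layout documents on the Internet, 2015.12. Computer software copyright, Registration No. 2016SR031982.
[7] Huan Zhao (Hunan University). Media player software supporting dynamic audio/video migration, 2010.9. Computer software copyright, Registration No. 2011SR007494.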