视觉与语言:利用深度学习沟通视觉与语言
Vision and Language: Bridging Vision and Language with Deep Learning
梅涛   
报告人照片   Tao Mei is a Senior Researcher with Microsoft Research Asia. His current research interests include multimedia analysis and computer vision. He has authored over 100 papers, with 5,000+ citations and 10 best paper awards. He holds 17 U.S. granted patents and has shipped a dozen inventions and technologies to Microsoft products and services. He is an Editorial Board Member of IEEE Trans. on Multimedia and ACM Trans. on Multimedia Computing, Communications, and Applications. He is the General Co-chair of ACM ICIMCS 2013, the Program Co-chair of ACM Multimedia 2018, IEEE ICME 2015, IEEE MMSP 2015 and MMM 2013, and the Area Chair for a dozen international conferences. Tao received B.E. and Ph.D. degrees from the University of Science and Technology of China, Hefei, China, in 2001 and 2006, respectively. He is a Fellow of IAPR and a Distinguished Scientist of ACM.
  Visual recognition has been a fundamental challenge in computer vision for decades. Thanks to the recent development of deep learning techniques, researchers are striving to bridge vision (image and video) and natural language, which has become an emerging research area. We will present a few recent advances bridging vision and language with deep learning techniques, including image and video captioning, image and video chatting, storytelling, vision and language grounding, datasets, grand challenges, and open issues. In particular, we will introduce our recently developed approaches which investigate semantic attributes for image and video captioning.
报告时间:2016年12月13日15时00分    报告地点:西区科技实验楼西楼1213会议室
报名截止日期:2016年12月13日    可选人数:40