Speaker: Jianfei Cai, IEEE Fellow, Professor at Monash University
Time: Monday, January 13, 2025, 10:00 AM
Venue: Lecture Hall, Yifu Building, Hunan University
Abstract: In recent years, the 3D vision community has shifted its focus significantly towards Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS). These innovations have fundamentally transformed how we represent 3D space and how we learn these representations through deep learning with 2D image supervision. Existing research on NeRF and 3DGS can be broadly categorized into per-scene and generalizable solutions. Per-scene approaches concentrate on optimizing a 3D representation from numerous multi-view images of a single scene, while generalizable solutions aim to learn from datasets containing diverse scenes, enabling models to generalize to new scenes from sparse views without retraining. This talk will present a series of advances from our research group on generalizable 3D view synthesis, including MatchNeRF, MVSplat, and MVSplat360, which improve generalizability in diverse scenarios.
Speaker bio: Jianfei Cai is a Professor in the Faculty of IT, Monash University, where he served as the inaugural Head of the Data Science & AI Department. Before that, he was Head of the Visual and Interactive Computing Division and Head of the Computer Communications Division at Nanyang Technological University (NTU). His major research interests include computer vision, deep learning and multimedia. He has successfully trained 40+ PhD students, three of whom received the NTU SCSE Outstanding PhD Thesis Award and one of whom received the Monash FIT Graduate Research Student Excellence Award. Many of his PhD students have joined leading IT companies such as Meta, Apple, Amazon, Adobe and TikTok, or have become faculty members at reputable universities. He is a co-recipient of paper awards at ACCV, ICCM, IEEE ICIP and MMSP, and a winner of the Monash FIT Dean's Researcher of the Year Award. He serves or has served as an Associate Editor for TPAMI, IJCV, IEEE T-IP, T-MM, and T-CSVT, and as an Area Chair for CVPR, ICCV, ECCV, IJCAI, ACM Multimedia, ICME, ICIP and ISCAS. He was the Chair of IEEE CAS VSPC-TC during 2016-2018. He served as the leading TPC Chair for IEEE ICME 2012, the best paper award committee chair and co-chair for IEEE T-MM 2020 and 2019, and the leading General Chair for ACM Multimedia 2024. He is a Fellow of IEEE.