一、 个人简介
石恒璨,博士,副教授,王耀南院士团队成员,机器人视觉感知与控制技术国家工程研究中心骨干成员。现为美国电气与电子工程师协会会员,美国计算机学会会员,中国图象图形学学会,中国人工智能学会会员。2021年获中国图象图形学学会优秀博士学位论文奖(全国仅10人)。2024年人才引进加入湖南大学电气与信息工程学院。
主要研究方向为人工智能、计算机与机器人视觉感知、视觉-语言多模态学习、弱监督学习与无监督学习等。在多媒体与计算机视觉顶级期刊会议IJCV、TMM、CVPR、ECCV、ACMMM、SIGIR等发表论文30余篇。长期担任顶级期刊IEEE TPAMI、IJCV、TMM、TCSVT、TIE等审稿人。并长期担任多媒体顶级会议ACMMM、计算机视觉顶级会议CVPR、ICCV、ECCV、机器学习顶级会议ICML、NeurIPS、ICLR、人工智能顶级会议AAAI、IJCAI等网络主席、领域主席,程序委员会委员。
联系方式:shihengcan@hnu.edu.cn
二、 招生信息
欢迎对人工智能、计算机与机器人视觉感知、视觉-语言多模态学习感兴趣的学生加入我们团队。
优秀学生可推荐至澳大利亚悉尼大学、莫纳什大学、阿德莱德大学、香港中文大学、香港城市大学等QS前100大学深造或访问交流,腾讯、阿里、百度、字节、商汤等知名企业位于中国、美国、澳大利亚、新加坡等地研发部门工作或实习。
三、 教育与工作经历
2024-至今,湖南大学,电气与信息工程学院,副教授
2020-2024,澳大利亚莫纳什大学,数据科学与人工智能,研究员
2014-2019,电子科技大学,信息与通信工程,博士
2010-2014, 电子科技大学,电子信息工程, 学士
四、 部分代表性科研成果
[1]. Hengcan Shi, Son Duy Dao, and Jianfei Cai. " LLMFormer: Large Language Model for Open-Vocabulary Semantic Segmentation." International Journal Of Computer Vision (IJCV), 2024. (计算机视觉顶级期刊,CCF-A)
[2]. Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi. "DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation." European Conference on Computer Vision (ECCV), 2024. (计算机视觉顶级会议)
[3]. Duy Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi. "JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. (计算机视觉顶级会议,CCF-A)
[4]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Unified open-vocabulary dense visual prediction." IEEE Transactions on Multimedia (TMM), 2024. (多媒体顶级期刊,一区TOP)
[5]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Transformer Scale Gate for Semantic Segmentation”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (计算机视觉顶级会议,CCF-A)
[6]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Open-vocabulary object detection via scene graph discovery." Proceedings of the 31st ACM International Conference on Multimedia (ACMMM), 2023. (多媒体顶级会议,CCF-A)
[7]. Son Duy Dao, Hengcan Shi, Dinh Phung, Jianfei Cai. "Class Enhancement Losses with Pseudo Labels for Open-Vocabulary Semantic Segmentation." IEEE Transactions on Multimedia (TMM), 2023. (多媒体顶级期刊,一区TOP)
[8]. Yicheng Wu, Zhonghua Wu, Hengcan Shi, Bjoern Picker, Winston Chong, Jianfei Cai. "Coactseg: Learning from heterogeneous data for new multiple sclerosis lesion segmentation." International conference on medical image computing and computer-assisted intervention (MICCAI), 2023. (医学图像处理顶级会议)
[9]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Unpaired referring expression grounding via bidirectional cross-modal matching”, Neurocomputing, 2023 (二区TOP)
[10]. Hengcan Shi, Munawar Hayat, Yicheng Wu, Jianfei Cai, “ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022. (计算机视觉顶级会议,CCF-A)
[11]. Duy Tho Le, Hengcan Shi, Hamid Rezatofighi, Jianfei Cai, “Accurate and real-time 3D pedestrian detection using an efficient attentive pillar network”, IEEE Robotics and Automation Letters (RA-L), 2022 (二区TOP)
[12]. Tingtian Li, Zixun Sun, Haoruo Zhang, Jin Li, Ziming Wu, Hui Zhan, Yipeng Yu, Hengcan Shi, “Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs”, Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) 2021. (信息检索顶级会议,CCF-A)
[13]. Hengcan Shi, Hongliang Li, Qingbo Wu, and King Ngi Ngan, “Query Reconstruction Network for Referring Expression Image Segmentation”, IEEE Transactions on Multimedia (TMM), 2020. (多媒体顶级期刊,一区TOP)
[14]. Heqian Qiu, Hongliang Li, Qingbo Wu, and Hengcan Shi, “Offset Bin Classification Network for Accurate Object Detection”, IEEE Conference on ComputerVision and Pattern Recognition (CVPR), 2020. (计算机视觉顶级会议,CCF-A)
[15]. Heqian Qiu, Hongliang Li, Qingbo Wu, FanmanMeng, Hengcan Shi, Taijin Zhao, and King Ngi Ngan, “Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension”, ACMinternational conference on Multimedia (ACM MM), 2020. (多媒体顶级会议,CCF-A)
[16]. Heqian Qiu, Hongliang Li, Qingbo Wu, Fanman Meng, Linfeng Xu, King Ngi Ngan, and Hengcan Shi, “Hierarchical Context Features Embedding for Object Detection”, IEEE Transactions on Multimedia (TMM), 2020. (多媒体顶级期刊,一区TOP)
[17]. Hengcan Shi, Hongliang Li, Qingbo Wu, and Zichen Song, “Scene Parsing via Integrated Classification Model and Variance-Based Regularization”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. (计算机视觉顶级会议,CCF-A)
[18]. Hengcan Shi, Hongliang Li, Qingbo Wu, FanmanMeng, and King Ngi Ngan, “Boosting scene parsing performance via reliable scale prediction”, ACMinternational conference on Multimedia (ACM MM), 2018. (多媒体顶级会议,CCF-A, Oral)
[19]. Hengcan Shi, Hongliang Li, Fanman Meng, and Qingbo Wu, “Key-Word-Aware Network for Referring Expression Image Segmentation”, European Conference on Computer Vision (ECCV), 2018. (计算机视觉顶级会议)
[20]. Hengcan Shi, Hongliang Li, Fanman Meng, Qingbo Wu, Linfeng Xu, and King N. Ngan,“Hierarchical Parsing Net: Semantic Scene Parsing from Global Scene to Objects”, IEEE Transactions on Multimedia (TMM), 2018 (多媒体顶级期刊,一区TOP)