石恒璨

一、 个人简介

    石恒璨,博士,副教授,王耀南院士团队成员,机器人视觉感知与控制技术国家工程研究中心骨干成员。现为美国电气与电子工程师协会会员,美国计算机学会会员,中国图象图形学学会,中国人工智能学会会员。2021年获中国图象图形学学会优秀博士学位论文奖(全国仅10人)。2024年人才引进加入湖南大学电气与信息工程学院。


    主要研究方向为人工智能、计算机与机器人视觉感知、视觉-语言多模态学习、弱监督学习与无监督学习等。在多媒体与计算机视觉顶级期刊会议IJCV、TMM、CVPR、ECCVACMMMSIGIR等发表论文30余篇。长期担任顶级期刊IEEE TPAMI、IJCVTMMTCSVTTIE等审稿人。并长期担任多媒体顶级会议ACMMM、计算机视觉顶级会议CVPRICCVECCV、机器学习顶级会议ICMLNeurIPSICLR、人工智能顶级会议AAAIIJCAI等网络主席、领域主席,程序委员会委员。


联系方式:shihengcan@hnu.edu.cn



二、 招生信息

   欢迎对人工智能计算机与机器人视觉感知视觉-语言多模态学习感兴趣的学生加入我们团队。


   优秀学生可推荐至澳大利亚悉尼大学、莫纳什大学、阿德莱德大学、香港中文大学、香港城市大学等QS100大学深造或访问交流,腾讯、阿里、百度、字节、商汤等知名企业位于中国、美国、澳大利亚、新加坡等地研发部门工作或实习。


三、 教育与工作经历

2024-至今,湖南大学,电气与信息工程学院,副教授

2020-2024,澳大利亚莫纳什大学,数据科学与人工智能,研究员

2014-2019,电子科技大学,信息与通信工程,博士

2010-2014, 电子科技大学,电子信息工程, 学士


四、 部分代表性科研成果

[1]. Hengcan Shi, Son Duy Dao, and Jianfei Cai. " LLMFormer: Large Language Model for Open-Vocabulary Semantic Segmentation." International Journal Of Computer Vision (IJCV), 2024. 计算机视觉顶级期刊,CCF-A

[2]. Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi. "DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation." European Conference on Computer Vision (ECCV), 2024. 计算机视觉顶级会议

[3]. Duy Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi. "JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. 计算机视觉顶级会议,CCF-A

[4]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Unified open-vocabulary dense visual prediction." IEEE Transactions on Multimedia (TMM), 2024. 多媒体顶级期刊,一区TOP

[5]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Transformer Scale Gate for Semantic Segmentation”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. 计算机视觉顶级会议,CCF-A

[6]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Open-vocabulary object detection via scene graph discovery." Proceedings of the 31st ACM International Conference on Multimedia (ACMMM), 2023. 多媒体顶级会议,CCF-A

[7]. Son Duy Dao, Hengcan Shi, Dinh Phung, Jianfei Cai. "Class Enhancement Losses with Pseudo Labels for Open-Vocabulary Semantic Segmentation." IEEE Transactions on Multimedia (TMM), 2023. 多媒体顶级期刊,一区TOP

[8]. Yicheng Wu, Zhonghua Wu, Hengcan Shi, Bjoern Picker, Winston Chong, Jianfei Cai. "Coactseg: Learning from heterogeneous data for new multiple sclerosis lesion segmentation." International conference on medical image computing and computer-assisted intervention (MICCAI), 2023. 医学图像处理顶级会议

[9]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Unpaired referring expression grounding via bidirectional cross-modal matching”, Neurocomputing, 2023 二区TOP

[10]. Hengcan Shi, Munawar Hayat, Yicheng Wu, Jianfei Cai, “ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022. 计算机视觉顶级会议,CCF-A

[11]. Duy Tho Le, Hengcan Shi, Hamid Rezatofighi, Jianfei Cai, “Accurate and real-time 3D pedestrian detection using an efficient attentive pillar network”, IEEE Robotics and Automation Letters (RA-L), 2022 二区TOP

[12]. Tingtian Li, Zixun Sun, Haoruo Zhang, Jin Li, Ziming Wu, Hui Zhan, Yipeng Yu, Hengcan Shi, “Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs”, Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) 2021. 信息检索顶级会议,CCF-A

[13]. Hengcan Shi, Hongliang Li, Qingbo Wu, and King Ngi Ngan, “Query Reconstruction Network for Referring Expression Image Segmentation”, IEEE Transactions on Multimedia (TMM), 2020. 多媒体顶级期刊,一区TOP

[14]. Heqian Qiu, Hongliang Li, Qingbo Wu, and Hengcan Shi, “Offset Bin Classification Network for Accurate Object Detection”, IEEE Conference on ComputerVision and Pattern Recognition (CVPR), 2020. 计算机视觉顶级会议,CCF-A

[15]. Heqian Qiu, Hongliang Li, Qingbo Wu, FanmanMeng, Hengcan Shi, Taijin Zhao, and King Ngi Ngan, “Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension”, ACMinternational conference on Multimedia (ACM MM), 2020. 多媒体顶级会议,CCF-A

[16]. Heqian Qiu, Hongliang Li, Qingbo Wu, Fanman Meng, Linfeng Xu, King Ngi Ngan, and Hengcan Shi, “Hierarchical Context Features Embedding for Object Detection”, IEEE Transactions on Multimedia (TMM), 2020. 多媒体顶级期刊,一区TOP

[17]. Hengcan Shi, Hongliang Li, Qingbo Wu, and Zichen Song, “Scene Parsing via Integrated Classification Model and Variance-Based Regularization”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. 计算机视觉顶级会议,CCF-A

[18]. Hengcan Shi, Hongliang Li, Qingbo Wu, FanmanMeng, and King Ngi Ngan, “Boosting scene parsing performance via reliable scale prediction”, ACMinternational conference on Multimedia (ACM MM), 2018. 多媒体顶级会议,CCF-A, Oral

[19]. Hengcan Shi, Hongliang Li, Fanman Meng, and Qingbo Wu, “Key-Word-Aware Network for Referring Expression Image Segmentation”, European Conference on Computer Vision (ECCV), 2018. 计算机视觉顶级会议

[20]. Hengcan Shi, Hongliang Li, Fanman Meng, Qingbo Wu, Linfeng Xu, and King N. Ngan,Hierarchical Parsing Net: Semantic Scene Parsing from Global Scene to Objects”, IEEE Transactions on Multimedia (TMM), 2018 多媒体顶级期刊,一区TOP