Dr. Guo-Sen Xie is currently a full Professor at Nanjing University of Science and Technology (NJUST). He received his Ph.D. degree in Pattern Recognition and Intelligent System from Institute of Automation, Chinese Academy of Sciences (CASIA) under the supervision of Prof. Cheng-Lin Liu, in 2016. From 2020 to 2022, he was a researcher at Mohamed bin Zayed University of Artificial Intelligence. From 2018 to 2020, he was a research scientist at Inception Institute of Artificial Intelligence. He was a visiting researcher at National University of Singapore (NUS) from 2014 to 2015, working with Prof. Shuicheng Yan. He has authored/co-authored over 80 papers in top conferences/journals such as IEEE TPAMI, IJCV, CVPR, ICCV, and ECCV. His current research interests mainly focus on computer vision, pattern recognition, and multimodal large language models (MLLMs).
Our research focuses on data-efficient visual learning, multimodal intelligent perception, and self-evolving vision-language agents, aiming to develop robust, generalizable, and adaptive AI systems for complex real-world environments. By bridging visual understanding, industrial intelligence, and autonomous decision-making, we seek to advance trustworthy AI technologies from perception to reasoning and action.
1. Data-Efficient Visual Understanding
Developing robust and generalizable visual learning methods for complex image and video data under limited supervision, with a focus on representation learning, perception reliability, and open-world visual understanding.
2. Multimodal Perception and Industrial Intelligence
Exploring multimodal learning frameworks for intelligent perception in real-world industrial scenarios, aiming to improve defect detection, anomaly understanding, and trustworthy visual decision-making.
3. Self-Evolving Agents and Vision-Language Intelligence
Building adaptive vision-language agents with reasoning, navigation, and continual self-improvement capabilities, towards autonomous learning and decision-making in open environments.
News
🔥 招生信息 招收 2027 年入学(推免)硕士研究生多名,以及 2027 年入学博士研究生(各类)。 欢迎对人工智能、计算机视觉、多模态大模型等方向感兴趣的同学踊跃联系报名: 📧 guosen.xie@njust.edu.cn
🎓 长期招生 每年可招收 硕士生 3–4 名、博士生 1–2 名。欢迎有志于科研的同学报考硕士、博士研究生。课题组将在 论文发表、科研训练、就业发展和继续深造 等方面予以悉心指导;优秀学生可推荐至国内外相关单位进行交流访问。 同时欢迎优秀本科生联系参与科研项目。
Publications
100+ highly-influenced papers have been published, including IEEE TPAMI (IF: 23.6), IJCV (IF: 19.5), Proceedings of the IEEE (20.6), IEEE TIP (IF: 10.6), IEEE TNNLS (IF: 10.4), IEEE TCYB (IF: 11.8), IEEE TMM (IF: 7.3), IEEE TCSVT (IF: 8.4), PR (IF: 8.0), NeurIPS (CCF A), CVPR (CCF A), ICCV (CCF A), ECCV (CCF B), AAAI (CCF A), IJCAI (CCF A), ACM MM (CCF A), etc. Selected Papers:
|
G.-S. Xie, X.-Y. Zhang, S. Yan, and C.-L. Liu. SDE: A novel selective, discriminative and equalizing feature representation for visual recognition, International Journal of Computer Vision (IJCV), vol. 124, no.2, pp. 145-168, 2017. (CCF A) |
|
G.-S. Xie, X.-Y. Zhang, Y. Yao, Z. Zhang, F. Zhao, and L. Shao. VMAN: A Virtual Mainstay Alignment Network for Transductive Zero-Shot Learning, IEEE Transactions on Image Processing (TIP), vol. 30, pp. 4316-4329, 2021. (CCF A) |
|
G.-S. Xie, L. Liu, X. Jin, F. Zhu, Z. Zhang, J. Qin, Y. Yao, and L. Shao. Attentive region embedding network for zero-shot learning. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9384-9393, 2019. (CCF A) |
|
G.-S. Xie, H. Xiong, J. Liu, Y. Yao, and L. Shao. Few-Shot Semantic Segmentation with Cyclic Memory Network, IEEE International Conference on Computer Vision (ICCV), pp. 7293-7302, 2021. (CCF A) |
|
G.-S. Xie, L. Liu, F. Zhu, F. Zhao, Z. Zhang, Y. Yao, J. Qin, and L. Shao. Region graph embedding network for zero-shot learning. European Conference on Computer Vision (ECCV), pp. 562-580, 2020. (CCF B) |
|
G.-S. Xie, X.-Y. Zhang, X. Shu, S. Yan, C.-L. Liu. Task-driven Feature Pooling for Image Classification, IEEE International Conference on Computer Vision (ICCV), pp. 1179-1187, 2015. (CCF A) |
|
G.-S. Xie, T.-Z. Xiang, X.-Y. Zhang, Z. Zhang, F. Zhao, L. Shao, X. Li. Leveraging Balanced Semantic Embedding for Generative Zero-Shot Learning, IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 34, no. 11, pp. 9575-9582, 2023. (CCF B) |
|
G.-S. Xie, Z. Zhang, G. Liu, F. Zhu, L. Liu, L. Shao, and X. Li. Generalized zero-shot learning with multiple graph adaptive generative networks, IEEE Trans. on Neural Networks and Learning Systems (TNNLS), vol. 33, no. 7, pp. 2903-2915, 2022. (CCF B) |
|
G.-S. Xie, X.-Y. Zhang, S. Yan, and C.-L. Liu. Hybrid CNN and Dictionary-Based Models for Scene Recognition and Domain Adaptation, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol.27, no.6, pp. 1263-1274, 2017. (CCF B) |
|
X. Shu, L. Zhang, J.Tang, G.-S. Xie, S. Yan. Computational face reader, International Conference on Multimedia Modeling (MMM), pp. 114-126, 2016. (Best Student Paper Award) (CCF C) |
|
S. Chen, Z. Hong, G.-S. Xie, Y. Song, J. Zhao, X. You, S. Yan, L. Shao, TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 36, no. 1, pp. 330-338, 2022. (CCF A) |
|
W. Li, G. Chu, J. Chen, G.-S. Xie, C. Shan, F. Zhao, Lad-reasoner: Tiny multimodal models are good reasoners for logical anomaly detection, arXiv preprint arXiv:2504.12749(2025), under ACM MM submission. (CCF A) |
|
X.-Y. Zhang, G.-S. Xie, X. Li, T. Mei, C.-L. Liu, A survey on learning to reject, Proceedings of the IEEE(Proc. IEEE), vol. 111, no. 2, pp. 185-215, 2023. (CCF A) |
|
H. Xiong, L. Huang, W.J.T. Zang, X. Zhen, G.-S. Xie, B. Gu, L. Song, On the Number of Linear Regions of Convolutional Neural Networks With Piecewise Linear Activations, IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI), vol. 46, no. 7, pp. 5131-5148, 2024. (CCF A) |
- For more papers, please kindly refer to my Google Scholar page.
Group
-
Positions for Interns/Master/Ph.D's Programme
- We are looking for students, who are self-motivated and have a solid foundation in mathematics and programming. Please feel free to drop me an email (guosen.xie@njust.edu.cn) if you are interested.
Service
- Associate Editor:
- IEEE Transactions on Image Processing (CCF A, IF=13.7)
- Pattern Recognition (IF =7.6)
- Artificial Intelligence and Applications (AIA) Journal
- Area Chair:
- ICLR 2025-2026
- NeurIPS 2026
- (Senior) PC Member:
- CVPR 2019-2026, ICCV 2019-2026, ECCV 2020-2026, WACV 2021-2026
- NeurIPS 2022-2025, ICML 2023-2026, AAAI 2020-2026, IJCAI 2020-2025
- ICLR 2022-2024
- Reviewer:
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- International Journal of Computer Vision (IJCV)
- IEEE Transactions on Image Processing (TIP)
- IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
- IEEE Transactions on Cybernetics (TCYB)
- IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
- IEEE Transactions on Multimedia (TMM)
- IEEE Transactions on Geoscience and Remote Sensing (TGRS)
- Pattern Recognition Journal (PR)
- Neural Networks (NN)