Xubin Zhong

I am now a Ph.D candidate student at South China University of Technology (SCUT), under the supervision of Prof. Dacheng Tao and Prof. Changxing Ding at present. I was an intern at Baidu NLP Group from 2023.07 to 2023.10. Before this, I got my bachelor's degree in 2019 and was recommended to study for my doctor's degree at SCUT.

Email / Google Scholar / Github

Research

I'm interested in computer vision, multi-modality learning, large language model, multi-task learning and human-object interaction detection. Representative papers are listed below.

	Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection Xubin Zhong, Changxing Ding, Yupeng Hu, Dacheng Tao Preprint, 2023 To the best of our knowledge, DIR is the first approach that enables the one-stage HOI detection models to extract disentangled interaction representations.
	Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection Xubin Zhong, Changxing Ding, Zijian Li, Shaoli Huang ECCV, 2022 project page / arXiv To the best of our knowledge, HQM is the first approach that promotes the robustness of DETR-based models from the perspective of hard example mining.
	Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection Xian Qu, Changxing Ding, Xingao Li, Xubin Zhong, Dacheng Tao CVPR, 2022 project page / We propose an efficient knowledge distillation model, named Distillation using Oracle Queries (DOQ), which shares parameters between teacher and student networks
	Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection Xubin Zhong, Xian Qu, Changxing Ding, Dacheng Tao CVPR, 2021 project page / arXiv / We propose a novel one-stage method, namely Glance and Gaze Network (GGNet), which adaptively models a set of action-aware points (ActPoints) via glance and gaze steps.
	Polysemy Deciphering Network for Robust Human-Object Interaction Detection Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao IJCV, 2021 project page / arXiv / Through deciphering the visual polysemy of verbs, our approach is demonstrated to outperform state-of-the-art methods by significant margins on the HICO-DET, V-COCO, and HOIVP databases
	Polysemy Deciphering Network for Human-Object Interaction Detection Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao ECCV, 2020 project page / We propose a novel Polysemy Deciphering Network (PD-Net), which decodes the visual polysemy of verbs for HOI detection

Honors and Awards

GAC Enterprise Scholarship at SCUT, 2021

Third prize of National Advanced Mathematics Competition, 2018

International Special Prize in Asia Pacific Mathematical Modeling Contest, 2017

National Special Prize in National Energy Conservation and Emission Reduction Competition, 2017