Xubin Zhong

I am a Ph.D. candidate at South China University of Technology (SCUT), supervised by Prof. Dacheng Tao and Prof. Changxing Ding. I was an intern at Baidu NLP Group from 2023.07 to 2023.10. Before this, I received my bachelor's degree in 2019 and was recommended to pursue a doctoral degree at SCUT.

Email  /  Google Scholar  /  Github

Research

I'm interested in computer vision, multi-modality learning, large language models, multi-task learning, and human-object interaction detection. Representative papers are listed below.

Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong, Changxing Ding, Yupeng Hu, Dacheng Tao
Preprint, 2023

To the best of our knowledge, DIR is the first approach that enables one-stage HOI detection models to extract disentangled interaction representations.

Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection
Xubin Zhong, Changxing Ding, Zijian Li, Shaoli Huang
ECCV, 2022
project page / arXiv

To the best of our knowledge, HQM is the first approach that promotes the robustness of DETR-based models from the perspective of hard example mining.

Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection
Xian Qu, Changxing Ding, Xingao Li, Xubin Zhong, Dacheng Tao
CVPR, 2022  
project page

We propose an efficient knowledge distillation model, named Distillation using Oracle Queries (DOQ), which shares parameters between the teacher and student networks.

Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection
Xubin Zhong, Xian Qu, Changxing Ding, Dacheng Tao
CVPR, 2021  
project page / arXiv

We propose a novel one-stage method, namely Glance and Gaze Network (GGNet), which adaptively models a set of action-aware points (ActPoints) via glance and gaze steps.

Polysemy Deciphering Network for Robust Human-Object Interaction Detection
Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao
IJCV, 2021  
project page / arXiv

By deciphering the visual polysemy of verbs, our approach outperforms state-of-the-art methods by significant margins on the HICO-DET, V-COCO, and HOIVP databases.

Polysemy Deciphering Network for Human-Object Interaction Detection
Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao
ECCV, 2020  
project page

We propose a novel Polysemy Deciphering Network (PD-Net), which decodes the visual polysemy of verbs for HOI detection.

Honors and Awards

GAC Enterprise Scholarship at SCUT, 2021

Third Prize in the National Advanced Mathematics Competition, 2018

International Special Prize in Asia Pacific Mathematical Modeling Contest, 2017

National Special Prize in National Energy Conservation and Emission Reduction Competition, 2017


This page is built upon Jon Barron's website.
Also, consider using Leonid Keselman's Jekyll fork of this page.