Research
I'm interested in computer vision, multi-modality learning, scene understanding, large language model and multi-task learning. Representative papers are listed below.
|
|
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong,
Changxing Ding,
Yupeng Hu,
Dacheng Tao
Preprint, 2023
To the best of our knowledge, DIR is the first approach that enables the one-stage HOI detection models to extract disentangled interaction representations.
|
|
Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection
Xubin Zhong,
Changxing Ding,
Zijian Li,
Shaoli Huang
ECCV, 2022
project page
/
arXiv
To the best of our knowledge, HQM is the first approach that promotes the robustness of DETR-based models from the perspective of hard example mining.
|
|
Distillation Using Oracle Queries for Transformer-based Human-Object
Interaction Detection
Xian Qu,
Changxing Ding,
Xingao Li,
Xubin Zhong,
Dacheng Tao
CVPR, 2022  
project page
/
We propose an
efficient knowledge distillation model, named Distillation
using Oracle Queries (DOQ), which shares parameters between teacher and student networks
|
|
Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object
Interaction Detection
Xubin Zhong,
Xian Qu,
Changxing Ding,
Dacheng Tao
CVPR, 2021  
project page
/
arXiv
/
We propose a novel one-stage method, namely Glance and Gaze
Network (GGNet), which adaptively models a set of action-aware points (ActPoints) via glance and gaze steps.
|
|
Polysemy Deciphering Network for Robust Human-Object Interaction Detection
Xubin Zhong,
Changxing Ding,
Xian Qu,
Dacheng Tao
IJCV, 2021  
project page
/
arXiv
/
Through deciphering the visual polysemy of verbs, our approach is
demonstrated to outperform state-of-the-art methods by significant margins on the HICO-DET, V-COCO, and HOIVP databases
|
|
Polysemy Deciphering Network for Human-Object Interaction Detection
Xubin Zhong,
Changxing Ding,
Xian Qu,
Dacheng Tao
ECCV, 2020  
project page
/
We propose a novel Polysemy Deciphering Network (PD-Net), which decodes the visual polysemy of verbs
for HOI detection
|
Honors and Awards
GAC Enterprise Scholarship at SCUT, 2021
Third prize of National Advanced Mathematics Competition, 2018
International Special Prize in Asia Pacific Mathematical Modeling Contest, 2017
National Special Prize in National Energy Conservation and Emission Reduction Competition, 2017
|
|