Publications

SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding. ECCV, 2022.

PDF

Multi-query Video Retrieval. ECCV, 2022.

PDF

Quantized GAN for Complex Music Generation from Dance Videos. ECCV, 2022.

PDF

Learning to Learn by Jointly Optimizing Neural Architecture and Weights. CVPR, 2022.

PDF Code

Large-scale Video Panoptic Segmentation in the Wild: A Benchmark. CVPR, 2022.

PDF Code Dataset Supplement

Switchable Novel Object Captioner. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.

PDF Bibtex

Saying the Unseen: Video Descriptions via Dialog Agents. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

PDF Code

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild. CVPR, 2021.

PDF Code Dataset Video Bibtex

Learning to Anticipate Egocentric Actions by Imagination. IEEE Transactions on Image Processing (TIP), 2021.

PDF Bibtex

Progressive Transfer Learning for Face Anti-Spoofing. IEEE Transactions on Image Processing (TIP), 2021.

PDF

Identifying Visible Parts via Pose Estimation for Occluded Person Re-Identification. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.

PDF Bibtex

Learning Audio-Visual Correlations from Variational Cross-Modal Generation. ICASSP, 2021.

PDF

Learning with Noisy Labels via Self-Reweighting from Class Centroids. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.

PDF Bibtex

Holistic LSTM for Pedestrian Trajectory Prediction. IEEE Transactions on Image Processing (TIP), 2021.

PDF

Symbiotic Attention for Egocentric Action Recognition with Object-centric Alignment. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.

PDF

Unsupervised Person Re-identification via Softened Similarity Learning. CVPR, 2020.

PDF

Gated Channel Transformation for Visual Recognition. CVPR, 2020.

PDF Code

Unsupervised Person Re-identification via Cross-camera Similarity Exploration. IEEE Transactions on Image Processing (TIP), 2020.

PDF Bibtex

Revisiting EmbodiedQA: A Simple Baseline and Beyond. IEEE Transactions on Image Processing (TIP), 2020.

PDF Bibtex

Symbiotic Attention with Privileged Information for Egocentric Action Recognition. AAAI (Oral), 2020.

PDF

Cascaded Revision Network for Novel Object Captioning. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2020.

PDF Code

Dual Attention Matching for Audio-Visual Event Localization. ICCV (Oral), 2019.

PDF Bibtex

Pose-Guided Feature Alignment for Occluded Person Re-identification. ICCV, 2019.

PDF Code Dataset

Auto-ReID: Searching for a Part-aware ConvNet for Person Re-Identification. ICCV, 2019.

PDF

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019. We achieved 1st place in the EPIC-Kitchens Action Recognition Challenge at CVPR 2019; this is our report from the CVPR Workshop, 2019.

PDF Official Site

Improving Person Re-identification by Attribute and Identity Learning. Pattern Recognition, 2019.

PDF Code Dataset Bibtex

Progressive Learning for Person Re-Identification with One Example. IEEE Transactions on Image Processing (TIP), 2019.

PDF Code Dataset Bibtex IEEE

Decoupled Novel Object Captioner. ACM-MM, 2018.

PDF Code Bibtex