Publications
Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling
International Conference on Learning Representations (ICLR), 2026
UniPercept: A Unified Diffusion Model for Generalizable Visual Perception
Computer Vision and Pattern Recognition (CVPR), 2026
Revisiting Visual Corruptions in LVLMs: A Shape–Texture Perspective on Model Failures
Computer Vision and Pattern Recognition (CVPR), 2026
Collaborative Map-Based and Route-Based Policy Learning for Continuous Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RAL), 2026
Patching the visual ability of large multimodal models by collaborating with small models
Frontiers of Computer Science (FCS), 2026
Task-Oriented Token Pruning for Efficient Object Detection and Segmentation
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025
Benchmarking Multimodal Large Language Models Against Image Corruptions
International Conference on Computer Vision (ICCV), 2025
Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning
International Conference on Computer Vision (ICCV), 2025
eLabrador: A wearable navigation system for visually impaired individuals
IEEE Transactions on Automation Science and Engineering (TASE), 2025
RAP: Role-Aware Joint Prediction and Planning in Autonomous Driving
IEEE Robotics and Automation Letters (RAL), 2025
Walking World Model for Visually Impaired Path Following
IEEE Robotics and Automation Letters (RAL), 2025
Outstanding Paper Award, Frontiers of Computer Science
VIPLFaceNet: an open source deep face recognition SDK, 2024
A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction
International Conference on Learning Representations (ICLR), 2024
HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
PreLAR: World Model Pre-training with Learnable Action Representation
European Conference on Computer Vision (ECCV), 2024
Function-Consistent Feature Distillation
International Conference on Learning Representations (ICLR), 2023
DandelionNet: Domain Composition with Instance Adaptive Classification for Domain Generalization
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
BLPSeg: Balance the Label Preference in Scribble-Supervised Semantic Segmentation
IEEE Transactions on Image Processing (TIP), 2023
Mutual Learning of Joint and Separate Domain Alignments for Multi-Source Domain Adaptation
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022
GAN with Multivariate Disentangling for Controllable Hair Editing
European Conference on Computer Vision (ECCV), 2022
Personalized Convolution for Face Recognition
International Journal of Computer Vision (IJCV), 2022
EigenGAN: Layer-Wise Eigen-Learning for GANs
International Conference on Computer Vision (ICCV), 2021
Image style disentangling for instance-level facial attribute transfer
Computer Vision and Image Understanding (CVIU), 2021
Self-Supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Unsupervised Domain Adaptation With Hierarchical Gradient Synchronization
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Learning deep face representation with long-tail data: An aggregate-and-disperse approach
Pattern Recognition Letters, 2020
Deformable Face Net for Pose Invariant Face Recognition
Pattern Recognition (PR), 2020
Learning to Learn Adaptive Classifier-Predictor for Few Shot Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2020
Fully Learnable Group Convolution for Acceleration of Deep Neural Networks
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Weakly Supervised Object Detection with Segmentation Collaboration
IEEE International Conference on Computer Vision (ICCV), 2019
S^2GAN: Share Aging Factors across Ages and Share Aging Trends among Individuals
IEEE International Conference on Computer Vision (ICCV), 2019
AttGAN: Facial Attribute Editing by Only Changing What You Want
IEEE Transactions on Image Processing (TIP), 2019
Hierarchical Attention for Part-Aware Face Detection
International Journal of Computer Vision (IJCV), 2019
Face Anti-Spoofing with Multi-Scale Information
International Conference on Pattern Recognition (ICPR), 2018
Hierarchical Training for Large Scale Face Recognition with Few Samples Per Subject
IEEE International Conference on Image Processing (ICIP), 2018
Task-adaptive Feature Reweighting for Few Shot Classification
Asian Conference on Computer Vision (ACCV), 2018
Generative Adversarial Network with Spatial Attention for Face Attribute Editing
European Conference on Computer Vision (ECCV), 2018
Face Recognition with Contrastive Convolution
European Conference on Computer Vision (ECCV), 2018
Real-Time Rotation-Invariant Face Detection With Progressive Calibration Networks
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Duplex Generative Adversarial Network for Unsupervised Domain Adaptation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
ACM MM 'Kinship Recognition Track of Recognizing Families In the Wild Data Challenge', 1st Place
Award, 2017
IEEE CVPR 'Landmark Detection Track of Faces in-the-wild Challenge', 2nd Place
Award, 2017
Noisy Face Image Sets Refining Collaborated with Discriminant Feature Space Learning
IEEE Conference on Automatic Face and Gesture Recognition (FG), 2017
Self-Error-Correcting Convolutional Neural Network for Learning with Noisy Labels
IEEE Conference on Automatic Face and Gesture Recognition (FG), 2017
A Fully End-to-End Cascaded CNN for Facial Landmark Detection
IEEE Conference on Automatic Face and Gesture Recognition (FG), 2017
LDF-Net: Learning a Displacement Field Network for Face Recognition Across Pose
IEEE Conference on Automatic Face and Gesture Recognition (FG), 2017
Robust FEC-CNN: A High Accuracy Facial Landmark Detection System
IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) on Faces in-the-wild Challenge, 2017
KinNet: Fine-to-Coarse Deep Metric Learning for Kinship
ACM Multimedia Conference Workshop on Recognizing Families In the Wild (FIW) Data Challenge, 2017
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
IEEE International Conference on Computer Vision (ICCV), 2017
Occlusion-free Face Alignment: Deep Regression Networks Coupled with De-corrupt AutoEncoders
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
Multi-view Deep Network for Cross-view Classification
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
VIPLFaceNet: An Open Source Deep Face Recognition SDK
Frontiers of Computer Science, 2016
Funnel-structured cascade for multi-view face detection with alignment-awareness
Neurocomputing, 2016
Multi-view Discriminant Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2016
IEEE ICCV 'Apparent Age Estimation Track of ChaLearn Looking at People', 2nd Place
Award, 2015
IEEE FG 'Point and Shoot Face Recognition Challenge (PaSC)', 1st Place
Award, 2015
Report on the FG 2015 Video Person Recognition Evaluation
IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2015
AgeNet: Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation
ChaLearn Looking at People Workshop at ICCV, 2015
Leveraging Datasets with Varying Annotations for Face Alignment via Deep Regression Network
International Conference on Computer Vision (ICCV), 2015
Bi-shifting Auto-Encoder for Unsupervised Domain Adaptation
International Conference on Computer Vision (ICCV), 2015
Topic-aware Deep Auto-encoders (TDA) for Face Alignment
Asian Conference on Computer Vision (ACCV), 2014
Coarse-to-Fine Auto-encoder Networks (CFAN) for Real-time Face Alignment
European Conference on Computer Vision (ECCV), 2014
Stacked Progressive Auto-Encoders (SPAE) for Face Recognition Across Poses
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
Semi-supervised Hashing via Kernel Hyperplane Learning for Scalable Image Search
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2014
Domain Adaptation for Face Recognition: Targetize Source Domain Bridged by Common Subspace
International Journal of Computer Vision (IJCV), 2014
Adaptive Discriminant Learning for Face Recognition
Pattern Recognition (PR), 2013
Learning Prototype Hyperplanes for Face Verification in the Wild
IEEE Transactions on Image Processing (TIP), 2013
Multi-view Discriminant Analysis
European Conference on Computer Vision (ECCV), 2012
Adaptive Discriminant Analysis for face recognition from Single Sample per Person
International Conference on Automatic Face and Gesture Recognition (FG), 2011
Side-Information based Linear Discriminant Analysis for Face Recognition
British Machine Vision Conference (BMVC), 2011