I am currently a postdoctoral researcher at Tongji University, jointly supervised by Prof. Guang Chen and Prof. Changjun Jiang. I earned my Ph.D. from Tongji University in 2024.
My research focuses on transfer learning and its applications in Embodied AI and AI4Science. I am particularly interested in developing robust and efficient algorithms for transfer learning and their applications in computer vision, robotics, and drug discovery.
* indicates equal contribution
We introduce RCP-Bench, the first comprehensive benchmark designed to evaluate the robustness of collaborative detection models under a wide range of real-world corruptions. RCPBench includes three new datasets (i.e., OPV2V-C, V2XSet-C, and DAIR-V2X-C) that simulate six collaborative cases and 14 types of camera corruption resulting from external environmental factors, sensor failures, and temporal misalignments.
We promote the vanilla BMD to BMD-v2 by incorporating a consistency-guided reweighting strategy to improve inter-class balanced sampling, and leveraging the silhouettes metric to realize adaptive intra-class multicentric clustering.
Despite the simple global and local clustering (GLC) technique achieving commendable performance in separating "known" and "unknown" data, its reliance on pseudo-labeling supervision limits its capacity to discriminate among different "unknown" categories. To alleviate this, we promote GLC to GLC++ by developing a new contrastive affinity learning strategy.
In this paper, we delve into TTA in 3D point cloud segmentation and propose a novel Hierarchical Geometry Learning (HGL) framework. HGL comprises three complementary modules from local, global to temporal learning in a bottom-up manner.
For Universal Domain Adaptation (UniDA), in this paper, we propose a new idea of LEArning Decomposition (LEAD), which decouples features into source-known and -unknown components to identify target-private data.
In this paper, we target a practical setting in IP protection, i.e., Source-free Model IP protection. To achieve this, we propose a novel MAsk Pruning (MAP) framework.
In this paper, we explore the Source-free Universal Domain Adaptation (SF-UniDA). SF-UniDA is appealing in view that universal model adaptation can be resolved only on the basis of a standard pre-trained closed-set model.
Existing single-DG techniques commonly devise various data-augmentation algorithms. In contrast, we target a versatile Modality-Agnostic Debiasing (MAD) framework for single-DG, that enables generalization for different modalities.
In this paper, we proposed a bio-inspired event-camera based falls temporal localization framework. Specifically, we propose a event density-based action proposal generation scheme, and introduce a temporal-spatial attention mechanism for action modeling.
In this paper, we propose an action-context modeling network termed ACM-Net, which integrates a three-branch attention module to measure the likelihood of each temporal point being action instance, context, or non-action background, simultaneously.
IEEE TPAMI, IJCV, IEEE TIP, IEEE TMM, IEEE TCSVT, ACM TOMM, etc.
CVPR, ICCV, ECCV, ICRA, IROS, NeurIPS, etc.