Cross-domain Few-Shot Learning (CD-FSL) aims at transferring knowledge from a source dataset to new target domains with few labeled data. However, most of the exitsing CD-FSL works focus on the classification task, overlooking object detection. Thus, this paper delves into object detection tasks in CD-FSL, also known as cross-domain few-shot object detection (CD-FSOD). Previous traditional FSOD methods could roughtly grouped into meta-learning based ones and finetuning based ones, while a recent transformer-based open-set detector, DE-ViT, shows exceptional performance in FSOD, surpassing other methods as depicted in the below figure. This inspired us to study:
a) Our motivation: The DE-ViT open-set detector excels in FSOD but strug-gles in CD-FSOD, inspiring our creation of CD-ViTO. (b) Technical motivation: FSOD models face challenges when dealing with cross-domain targets, such as small inter-class variance (ICV), indefinable boundaries (IB), and varying appearances (styles).
>> To answer the first question:
>> To answer the second question:
We build a new CD-ViTO method via enhancing the existing DE-ViT with the following novel modules:
In addition to the visual examples as shown in the benchmark figure, we further provide more infomations here. All the target datasets could be found on our github repo.
We build our method upon the base open-set detector (DE-ViT) and finetune our method using few labeled instances of target domain. Modules in blue are inherited from DE-ViT while modules in orange are proposed by us.
New improvements include learnable instance features, instance reweighting, domain prompter, and finetuning; More details about the modules please refer to our paper.
We hightlight that all the modules are very lightwight causing very negligible or even no cost.
@article{fu2024cross,
title={Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector},
author={Fu, Yuqian and Wang, Yu and Pan, Yixuan and Huai, Lian and Qiu, Xingyu and Shangguan, Zeyu and Liu, Tong and Kong, Lingjie and Fu, Yanwei and Van Gool, Luc and others},
journal={arXiv preprint arXiv:2402.03094},
year={2024}
}