1 code implementation • 17 Oct 2024 • Rongyao Fang, Chengqi Duan, Kun Wang, Hao Li, Hao Tian, Xingyu Zeng, Rui Zhao, Jifeng Dai, Hongsheng Li, Xihui Liu
This work represents a significant step towards a truly unified MLLM capable of adapting to the granularity demands of various visual tasks.
1 code implementation • 24 Jun 2024 • Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu
Causal reasoning is a cornerstone of how humans interpret the world.
2 code implementations • 1 May 2024 • Sirui Chen, Bo Peng, Meiqi Chen, Ruiqi Wang, Mengying Xu, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Yu Qiao, Chaochao Lu
Recent advances in language models have expanded the horizons of artificial intelligence across various domains, sparking inquiries into their potential for causal reasoning.
no code implementations • 19 Nov 2023 • Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, Tianpeng Bao, Shiwei Shi, Guoqing Du, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao
Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage of external tools that require a blend of task planning and the utilization of external tools, such as APIs.
no code implementations • 16 Nov 2023 • Yuhan Sun, Mukai Li, Yixin Cao, Kun Wang, Wenxiao Wang, Xingyu Zeng, Rui Zhao
In response, we introduce ControlPE (Continuously Controllable Prompt Engineering).
no code implementations • 12 Oct 2023 • Zhixuan Liang, Xingyu Zeng, Rui Zhao, Ping Luo
Active learning strategies aim to train high-performance models with minimal labeled data by selecting the most informative instances for labeling.
no code implementations • 7 Aug 2023 • Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao
With recent advancements in natural language processing, Large Language Models (LLMs) have emerged as powerful tools for various real-world applications.
no code implementations • 23 Mar 2023 • Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao
To construct a representative synthetic training dataset, we maximize the diversity of the selected images via a sample-based and cluster-based method.
no code implementations • 15 Mar 2023 • Guoqiang Jin, Fan Yang, Mingshan Sun, Ruyi Zhao, Yakun Liu, Wei Li, Tianpeng Bao, Liwei Wu, Xingyu Zeng, Rui Zhao
To this end, we propose SeqCo-DETR, a novel Sequence Consistency-based self-supervised method for object DEtection with TRansformers.
no code implementations • 28 Feb 2023 • Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao
Specifically, we first discover the base images which contain the FP of novel categories and select a certain amount of samples from them for the base and novel categories balance.
no code implementations • 26 Jan 2023 • Shaobo Lin, Xingyu Zeng, Rui Zhao
The generalization power of the pre-trained model is the key for few-shot deep learning.
no code implementations • 12 Oct 2022 • Shaobo Lin, Xingyu Zeng, Rui Zhao
Conventional training of deep neural networks usually requires a substantial amount of data with expensive human annotations.
no code implementations • 29 Sep 2021 • Shaobo Lin, Xingyu Zeng, Rui Zhao
Conventional training of deep neural networks usually requires a substantial amount of data with expensive human annotations.
1 code implementation • ECCV 2020 • Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang
Based on this observation, we design an image based CNN detector named Patch-Net, which is more generalized and can be instantiated as pseudo-LiDAR based 3D detectors.
no code implementations • ECCV 2020 • Peng Su, Kun Wang, Xingyu Zeng, Shixiang Tang, Dapeng Chen, Di Qiu, Xiaogang Wang
Then this domain-vector is used to encode the features from another domain through a conditional normalization, resulting in different domains' features carrying the same domain attribute.
Ranked #1 on
Unsupervised Domain Adaptation
on SIM10K to BDD100K
no code implementations • 5 Feb 2020 • Yingjie Cai, Buyu Li, Zeyu Jiao, Hongsheng Li, Xingyu Zeng, Xiaogang Wang
Monocular 3D object detection task aims to predict the 3D bounding boxes of objects based on monocular RGB images.
no code implementations • CVPR 2019 • Buyu Li, Wanli Ouyang, Lu Sheng, Xingyu Zeng, Xiaogang Wang
We present an efficient 3D object detection framework based on a single RGB image in the scenario of autonomous driving.
Ranked #18 on
Vehicle Pose Estimation
on KITTI Cars Hard
1 code implementation • 8 Oct 2016 • Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang
The effectiveness of GBD-Net is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO.
1 code implementation • 9 Apr 2016 • Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, Wanli Ouyang
Temporal and contextual information of videos are not fully investigated and utilized.
no code implementations • 9 Dec 2015 • Xingyu Zeng, Wanli Ouyang, Xiaogang Wang
We propose a representation learning pipeline to use the relationship as supervision for improving the learned representation in object detection.
no code implementations • ICCV 2015 • Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang
Experimental results show that the attributes are helpful in learning better features and improving the object detection accuracy by 2. 6% in mAP on the ILSVRC 2014 object detection dataset and 2. 4% in mAP on PASCAL VOC 2007 object detection dataset.
no code implementations • CVPR 2015 • Wanli Ouyang, Xiaogang Wang, Xingyu Zeng, Shi Qiu, Ping Luo, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Chen-Change Loy, Xiaoou Tang
In this paper, we propose deformable deep convolutional neural networks for generic object detection.
no code implementations • 11 Sep 2014 • Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Yuanjun Xiong, Chen Qian, Zhenyao Zhu, Ruohui Wang, Chen-Change Loy, Xiaogang Wang, Xiaoou Tang
In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty.
no code implementations • CVPR 2013 • Wanli Ouyang, Xingyu Zeng, Xiaogang Wang
In this paper, we propose a mutual visibility deep model that jointly estimates the visibility statuses of overlapping pedestrians.