Search Results for author: Xu Cao

Found 35 papers, 15 papers with code

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations • 10 Apr 2024 • Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

3D Object Detection Autonomous Driving +1

140

Paper
Code

Spurious Correlations in Machine Learning: A Survey

no code implementations • 20 Feb 2024 • Wenqian Ye, Guangtao Zheng, Xu Cao, Yunsheng Ma, Xia Hu, Aidong Zhang

Machine learning systems are known to be sensitive to spurious correlations between biased features of the inputs (e. g., background, texture, and secondary objects) and the corresponding labels.

Paper
Add Code

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

no code implementations • 8 Feb 2024 • QiPeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu

Additionally, we noticed that in-browser inference increases the time it takes for graphical user interface (GUI) components to load in web browsers by a significant 67. 2\%, which severely impacts the overall QoE for users of web applications that depend on this technology.

Paper
Add Code

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

no code implementations • 1 Jan 2024 • Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, Heng Ji, ChengXiang Zhai

The prominent large language models (LLMs) of today differ from past language models not only in size, but also in the fact that they are trained on a combination of natural language and formal language (code).

Code Generation

Paper
Add Code

SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration

no code implementations • 8 Dec 2023 • Xu Cao, Takafumi Taketomi

We present SuperNormal, a fast, high-fidelity approach to multi-view 3D reconstruction using surface normal maps.

3D Reconstruction Multi-View 3D Reconstruction +1

Paper
Add Code

LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

1 code implementation • 7 Dec 2023 • Yunsheng Ma, Can Cui, Xu Cao, Wenqian Ye, Peiran Liu, Juanwu Lu, Amr Abdelraouf, Rohit Gupta, Kyungtae Han, Aniket Bera, James M. Rehg, Ziran Wang

Autonomous driving (AD) has made significant strides in recent years.

Autonomous Driving Code Generation +1

Paper
Code

A Survey on Multimodal Large Language Models for Autonomous Driving

1 code implementation • 21 Nov 2023 • Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

We first introduce the background of Multimodal Large Language Models (MLLMs), the multimodal models development using LLMs, and the history of autonomous driving.

Autonomous Driving

131

Paper
Code

MACP: Efficient Model Adaptation for Cooperative Perception

1 code implementation • 25 Oct 2023 • Yunsheng Ma, Juanwu Lu, Can Cui, Sicheng Zhao, Xu Cao, Wenqian Ye, Ziran Wang

We approach this objective by identifying the key challenges of shifting from single-agent to cooperative settings, adapting the model by freezing most of its parameters and adding a few lightweight modules.

Paper
Code

Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles

no code implementations • 12 Oct 2023 • Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Ziran Wang

The fusion of human-centric design and artificial intelligence (AI) capabilities has opened up new possibilities for next-generation autonomous vehicles that go beyond transportation.

Autonomous Driving Decision Making

Paper
Add Code

PIE: Simulating Disease Progression via Progressive Image Editing

1 code implementation • 21 Sep 2023 • Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun

Disease progression simulation is a crucial area of research that has significant implications for clinical diagnosis, prognosis, and treatment.

Paper
Code

Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles

no code implementations • 19 Sep 2023 • Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Ziran Wang

The future of autonomous vehicles lies in the convergence of human-centric design and advanced AI capabilities.

Autonomous Driving Decision Making

Paper
Add Code

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations

no code implementations • 16 Sep 2023 • Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang

Web applications are increasingly becoming the primary platform for AI service delivery, making in-browser deep learning (DL) inference more prominent.

Paper
Add Code

Mitigating Transformer Overconfidence via Lipschitz Regularization

1 code implementation • 12 Jun 2023 • Wenqian Ye, Yunsheng Ma, Xu Cao, Kun Tang

Though Transformers have achieved promising results in many computer vision tasks, they tend to be over-confident in predictions, as the standard Dot Product Self-Attention (DPSA) can barely preserve distance for the unbounded input domain.

Paper
Code

Learning Remote Sensing Object Detection with Single Point Supervision

1 code implementation • 23 May 2023 • Shitian He, Huanxin Zou, Yingqian Wang, Boyang Li, Xu Cao, Ning Jing

In this paper, we make the first attempt to achieve RS object detection with single point supervision, and propose a PSOD method tailored for RS images.

Object object-detection +1

Paper
Code

CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers

no code implementations • 13 May 2023 • Yunsheng Ma, Wenqian Ye, Xu Cao, Amr Abdelraouf, Kyungtae Han, Rohit Gupta, Ziran Wang

Driver intention prediction seeks to anticipate drivers' actions by analyzing their behaviors with respect to surrounding traffic environments.

Paper
Add Code

Multi-View Azimuth Stereo via Tangent Space Consistency

1 code implementation • CVPR 2023 • Xu Cao, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita

We present a method for 3D reconstruction only using calibrated multi-view surface azimuth maps.

3D Reconstruction 3D Shape Reconstruction

Paper
Code

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction

no code implementations • 21 Jan 2023 • Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang, Fan Yang, Saeed Maleki, Yi Zhu, Xu Cao, Cheng Li, Mao Yang, Lintao Zhang, Lidong Zhou

SuperScaler is a system that facilitates the design and generation of highly flexible parallelization plans.

Scheduling

Paper
Add Code

THMA: Tencent HD Map AI System for Creating HD Map Annotations

no code implementations • 14 Dec 2022 • Kun Tang, Xu Cao, Zhipeng Cao, Tong Zhou, Erlong Li, Ao Liu, Shengtao Zou, Chang Liu, Shuqi Mei, Elena Sizikova, Chao Zheng

THMA has been deployed by the Tencent Map team to provide services to downstream companies and users, serving over 1, 000 labeling workers and producing more than 30, 000 kilometers of HD map data per day at most.

Active Learning Weakly-supervised Learning

Paper
Add Code

ECON: Explicit Clothed humans Optimized via Normal integration

1 code implementation • CVPR 2023 • Yuliang Xiu, Jinlong Yang, Xu Cao, Dimitrios Tzionas, Michael J. Black

To increase robustness for these cases, existing work uses an explicit parametric body model to constrain surface reconstruction, but this limits the recovery of free-form surfaces such as loose clothing that deviates from the body.

Surface Reconstruction

1,021

Paper
Code

ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis

1 code implementation • 30 Oct 2022 • Xu Cao, Wenqian Ye, Elena Sizikova, Xue Bai, Megan Coffee, Hongwu Zeng, Jianguo Cao

Research progress in the field of ASD facial analysis in pediatric patients has been hindered due to a lack of well-established baselines.

Paper
Code

Automatic Infectious Disease Classification Analysis with Concept Discovery

1 code implementation • 28 Aug 2022 • Elena Sizikova, Joshua Vendrow, Xu Cao, Rachel Grotheer, Jamie Haddock, Lara Kassab, Alona Kryshchenko, Thomas Merkh, R. W. M. A. Madushani, Kenny Moise, Annie Ulichney, Huy V. Vo, Chuntian Wang, Megan Coffee, Kathryn Leonard, Deanna Needell

Automatic infectious disease classification from images can facilitate needed medical diagnoses.

Classification

Paper
Code

A Compacted Structure for Cross-domain learning on Monocular Depth and Flow Estimation

no code implementations • 25 Aug 2022 • Yu Chen, Xu Cao, Xiaoyi Lin, Baoru Huang, Xiao-Yun Zhou, Jian-Qing Zheng, Guang-Zhong Yang

A dual-head mechanism is used to predict optical flow for rigid and non-rigid motion based on a divide-and-conquer manner, which significantly improves the optical flow estimation performance.

Autonomous Driving Depth Estimation +1

Paper
Add Code

Improving Computed Tomography (CT) Reconstruction via 3D Shape Induction

1 code implementation • 23 Aug 2022 • Elena Sizikova, Xu Cao, Ashia Lewis, Kenny Moise, Megan Coffee

Chest computed tomography (CT) imaging adds valuable insight in the diagnosis and management of pulmonary infectious diseases, like tuberculosis (TB).

3D Reconstruction Computed Tomography (CT) +1

Paper
Code

AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

1 code implementation • 11 May 2022 • Xu Cao, Xiaoye Li, Liya Ma, Yi Huang, Xuan Feng, Zening Chen, Hongwu Zeng, Jianguo Cao

We show that AggPose outperforms hybrid model HRFormer and TokenPose in the infant pose estimation dataset.

Ranked #7 on Keypoint Detection on MS COCO

Keypoint Detection

Paper
Code

Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance

1 code implementation • CVPR 2021 • Xu Cao, Boxin Shi, Fumio Okura, Yasuyuki Matsushita

Experimental results on analytically computed, synthetic, and real-world surfaces show that our method yields accurate and stable reconstruction for both orthographic and perspective normal maps.

Surface Reconstruction

Paper
Code

VeniBot: Towards Autonomous Venipuncture with Semi-supervised Vein Segmentation from Ultrasound Images

no code implementations • 27 May 2021 • Yu Chen, Yuxuan Wang, Bolin Lai, Zijie Chen, Xu Cao, Nanyang Ye, Zhongyuan Ren, Junbo Zhao, Xiao-Yun Zhou, Peng Qi

In the modern medical care, venipuncture is an indispensable procedure for both diagnosis and treatment.

Segmentation

Paper
Add Code

VeniBot: Towards Autonomous Venipuncture with Automatic Puncture Area and Angle Regression from NIR Images

no code implementations • 27 May 2021 • Xu Cao, Zijie Chen, Bolin Lai, Yuxuan Wang, Yu Chen, Zhengqing Cao, Zhilin Yang, Nanyang Ye, Junbo Zhao, Xiao-Yun Zhou, Peng Qi

For the automation, we focus on the positioning part and propose a Dual-In-Dual-Out network based on two-step learning and two-task learning, which can achieve fully automatic regression of the suitable puncture area and angle from near-infrared(NIR) images.

Navigate regression

Paper
Add Code

Treatment Planning System for Electron FLASH Radiotherapy: Open-source for Clinical Implementation

no code implementations • 9 Mar 2021 • Mahbubur Rahman, M. Ramish Ashraf, David J. Gladstone, Petr Bruza, Lesley A. Jarvis, Philip E. Schaner, Xu Cao, Brian W. Pogue, P. Jack Hoopes, Rongxiao Zhang

eFLASH-RT plans were MC forward calculated in Geant4 for a mouse brain treatment and compared to a conventional (Conv-RT) plan in Eclipse for a human patient with metastatic renal cell carcinoma.

Medical Physics

Paper
Add Code

Balanced Joint Adversarial Training for Robust Intent Detection and Slot Filling

no code implementations • COLING 2020 • Xu Cao, Deyi Xiong, Chongyang Shi, Chao Wang, Yao Meng, Changjian Hu

Joint intent detection and slot filling has recently achieved tremendous success in advancing the performance of utterance understanding.

Intent Detection slot-filling +1

Paper
Add Code

Stereoscopic Flash and No-Flash Photography for Shape and Albedo Recovery

no code implementations • CVPR 2020 • Xu Cao, Michael Waechter, Boxin Shi, Ye Gao, Bo Zheng, Yasuyuki Matsushita

From the stereo image pair, we recover a rough shape that captures low-frequency shape variation without high-frequency details.

Shadow Detection

Paper
Add Code

CAggNet: Crossing Aggregation Network for Medical Image Segmentation

no code implementations • 16 Apr 2020 • Xu Cao, Yanghao Lin

In this paper, we present Crossing Aggregation Network (CAggNet), a novel densely connected semantic segmentation approach for medical image analysis.

Image Segmentation Medical Image Segmentation +3

Paper
Add Code

UCT-ADP Progressive Bias Algorithm for Solving Gomoku

1 code implementation • 11 Dec 2019 • Xu Cao, Yanghao Lin

We combine Adaptive Dynamic Programming (ADP), a reinforcement learning method and UCB applied to trees (UCT) algorithm with a more powerful heuristic function based on Progressive Bias method and two pruning strategies for a traditional board game Gomoku.

Paper
Code

Neural Style Transfer for Point Clouds

no code implementations • 14 Mar 2019 • Xu Cao, Weimin WANG, Katashi Nagao

How can we edit or transform the geometric or color property of a point cloud?

Style Transfer

Paper
Add Code

Point Cloud Colorization Based on Densely Annotated 3D Shape Dataset

no code implementations • 12 Oct 2018 • Xu Cao, Katashi Nagao

This paper introduces DensePoint, a densely sampled and annotated point cloud dataset containing over 10, 000 single objects across 16 categories, by merging different kind of information from two existing datasets.

Colorization

Paper
Add Code

L2GSCI: Local to Global Seam Cutting and Integrating for Accurate Face Contour Extraction

no code implementations • 5 Mar 2017 • Yongwei Nie, Xu Cao, Chengjiang Long, Ping Li, Guiqing Li

Current face alignment algorithms can robustly find a set of landmarks along face contour.

Face Alignment

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.