Search Results for author: Tong Wu

Found 76 papers, 30 papers with code

DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading

no code implementations15 Apr 2024 Tong Wu, Jia-Mu Sun, Yu-Kun Lai, Yuewen Ma, Leif Kobbelt, Lin Gao

To address these issues, we introduce DeferredGS, a method for decoupling and editing the Gaussian splatting representation using deferred shading.

ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance

no code implementations19 Mar 2024 Yongwei Chen, Tengfei Wang, Tong Wu, Xingang Pan, Kui Jia, Ziwei Liu

Though promising results have been achieved in single object generation, these methods often struggle to model complex 3D assets that inherently contain multiple objects.

3D Generation Object

Recent Advances in 3D Gaussian Splatting

no code implementations17 Mar 2024 Tong Wu, Yu-Jie Yuan, Ling-Xiao Zhang, Jie Yang, Yan-Pei Cao, Ling-Qi Yan, Lin Gao

The emergence of 3D Gaussian Splatting (3DGS) has greatly accelerated the rendering speed of novel view synthesis.

3D Reconstruction Dynamic Reconstruction +1

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

1 code implementation4 Mar 2024 Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

Specifically, it is powered by a text-conditioned tri-plane latent diffusion model, which quickly generates coarse 3D samples for fast prototyping.

3D Generation Text to 3D +1

Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion

no code implementations3 Mar 2024 Linhan Xia, Junbang Liu, Tong Wu

This research presents a novel depth estimation algorithm based on a Transformer-encoder architecture, tailored for the NYU and KITTI Depth Dataset.

Depth Estimation SSIM

Sinkhorn Distance Minimization for Knowledge Distillation

1 code implementation27 Feb 2024 Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

We propose the Sinkhorn Knowledge Distillation (SinKD) that exploits the Sinkhorn distance to ensure a nuanced and precise assessment of the disparity between teacher and student distributions.

Knowledge Distillation

Growing from Exploration: A self-exploring framework for robots based on foundation models

no code implementations24 Jan 2024 Shoujie Li, Ran Yu, Tong Wu, JunWen Zhong, Xiao-Ping Zhang, Wenbo Ding

In this work, we propose a framework named GExp, which enables robots to explore and learn autonomously without human intervention.

Few-Shot Learning

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

1 code implementation8 Jan 2024 Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas Guibas, Dahua Lin, Gordon Wetzstein

These metrics lack the flexibility to generalize to different evaluation criteria and might not align well with human preferences.

3D Generation Text to 3D

Guidelines in Wastewater-based Epidemiology of SARS-CoV-2 with Diagnosis

no code implementations26 Dec 2023 Madiha Fatima, Zhihua Cao, Aichun Huang, Shengyuan Wu, Xinxian Fan, Yi Wang, Liu Jiren, Ziyun Zhu, Qiongrou Ye, Yuan Ma, Joseph K. F Chow, Peng Jia, Yangshou Liu, Yubin Lin, Manjun Ye, Tong Wu, ZHIXUN LI, Cong Cai, Wenhai Zhang, Cheris H. Q. Ding, Yuanzhe Cai, Feijuan Huang

With the global spread and increasing transmission rate of SARS-CoV-2, more and more laboratories and researchers are turning their attention to wastewater-based epidemiology (WBE), hoping it can become an effective tool for large-scale testing and provide more ac-curate predictions of the number of infected individuals.

Epidemiology

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

1 code implementation22 Dec 2023 Zhangyang Qi, Ye Fang, Mengchen Zhang, Zeyi Sun, Tong Wu, Ziwei Liu, Dahua Lin, Jiaqi Wang, Hengshuang Zhao

We conducted a series of structured experiments to evaluate their performance in various industrial application scenarios, offering a comprehensive perspective on their practical utility.

Building Lane-Level Maps from Aerial Images

1 code implementation20 Dec 2023 Jiawei Yao, Xiaochao Pan, Tong Wu, Xiaofeng Zhang

In this paper, we introduce for the first time a large-scale aerial image dataset built for lane detection, with high-quality polyline lane annotations on high-resolution images of around 80 kilometers of road.

Autonomous Driving Lane Detection

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

no code implementations7 Dec 2023 Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xinggang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu

Extensive experiments demonstrate the effectiveness of HyperDreamer in modeling region-aware materials with high-resolution textures and enabling user-friendly editing.

Semantic Segmentation

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

1 code implementation6 Dec 2023 Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Alpha-CLIP not only preserves the visual recognition ability of CLIP but also enables precise control over the emphasis of image contents.

3D Generation

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

1 code implementation5 Dec 2023 Zhangyang Qi, Ye Fang, Zeyi Sun, Xiaoyang Wu, Tong Wu, Jiaqi Wang, Dahua Lin, Hengshuang Zhao

Multimodal Large Language Models (MLLMs) have excelled in 2D image-text comprehension and image generation, but their understanding of the 3D world is notably deficient, limiting progress in 3D language understanding and generation.

3D Generation Reading Comprehension

Retargeting Visual Data with Deformation Fields

no code implementations22 Nov 2023 Tim Elsner, Julia Berger, Tong Wu, Victor Czech, Lin Gao, Leif Kobbelt

Seam carving is an image editing method that enable content-aware resizing, including operations like removing objects.

PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses

1 code implementation19 Oct 2023 Chong Xiang, Tong Wu, Sihui Dai, Jonathan Petit, Suman Jana, Prateek Mittal

State-of-the-art defenses against adversarial patch attacks can now achieve strong certifiable robustness with a marginal drop in model utility.

Large-Vocabulary 3D Diffusion Model with Transformer

no code implementations14 Sep 2023 Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu

To this end, we propose a novel triplane-based 3D-aware Diffusion model with TransFormer, DiffTF, for handling challenges via three aspects.

3D Generation

Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN

no code implementations16 Aug 2023 Jiawei Yao, Tong Wu, Xiaofeng Zhang

To explore the differences between Transformers and CNNs, we employ a sparse pixel approach to contrastively analyze the distinctions between the two.

Monocular Depth Estimation

Robust Data Clustering with Outliers via Transformed Tensor Low-Rank Representation

1 code implementation18 Jul 2023 Tong Wu

Recently, tensor low-rank representation (TLRR) has become a popular tool for tensor data recovery and clustering, due to its empirical success and theoretical guarantees.

Clustering Outlier Detection

Towards Trustworthy Explanation: On Causal Rationalization

1 code implementation25 Jun 2023 Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai

With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction.

Causal Inference

Differential Privacy for Class-based Data: A Practical Gaussian Mechanism

no code implementations8 Jun 2023 Raksha Ramakrishna, Anna Scaglione, Tong Wu, Nikhil Ravi, Sean Peisert

In this paper, we present a notion of differential privacy (DP) for data that comes from different classes.

Privacy-Preserving In-Context Learning for Large Language Models

no code implementations2 May 2023 Tong Wu, Ashwinee Panda, Jiachen T. Wang, Prateek Mittal

Based on the general paradigm of DP-ICL, we instantiate several techniques showing how to privatize ICL for text classification and language generation.

In-Context Learning Privacy Preserving +3

A Randomized Approach for Tight Privacy Accounting

no code implementations17 Apr 2023 Jiachen T. Wang, Saeed Mahloujifar, Tong Wu, Ruoxi Jia, Prateek Mittal

In this paper, we propose a new differential privacy paradigm called estimate-verify-release (EVR), which addresses the challenges of providing a strict upper bound for privacy parameter in DP compositions by converting an estimate of privacy parameter into a formal guarantee.

Privacy Preserving

V3Det: Vast Vocabulary Visual Detection Dataset

no code implementations ICCV 2023 Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin

2) Hierarchical Category Organization: The vast vocabulary of V3Det is organized by a hierarchical category tree which annotates the inclusion relationship among categories, encouraging the exploration of category relationships in vast and open vocabulary object detection.

Chatbot Object +2

Enhancing Text Generation with Cooperative Training

1 code implementation16 Mar 2023 Tong Wu, Hao Wang, Zhongshen Zeng, Wei Wang, Hai-Tao Zheng, Jiaxing Zhang

Recently, there has been a surge in the use of generated data to enhance the performance of downstream models, largely due to the advancements in pre-trained language models.

MRPC QQP +2

Constrained Reinforcement Learning for Predictive Control in Real-Time Stochastic Dynamic Optimal Power Flow

no code implementations21 Feb 2023 Tong Wu, Anna Scaglione, Daniel Arnold

This paper presents a novel primal-dual approach for learning optimal constrained DRL policies for dynamic optimal power flow problems, with the aim of controlling power generations and battery outputs.

reinforcement-learning Reinforcement Learning (RL)

Uncovering Adversarial Risks of Test-Time Adaptation

no code implementations29 Jan 2023 Tong Wu, Feiran Jia, Xiangyu Qi, Jiachen T. Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal

Recently, test-time adaptation (TTA) has been proposed as a promising solution for addressing distribution shifts.

Test-time Adaptation

SLAN: Self-Locator Aided Network for Vision-Language Understanding

no code implementations ICCV 2023 Jiang-Tian Zhai, Qi Zhang, Tong Wu, Xing-Yu Chen, Jiang-Jiang Liu, Ming-Ming Cheng

By aggregating vision-language information, the region filter selects key regions and the region adaptor updates their coordinates with text guidance.

Image Retrieval Retrieval

SLAN: Self-Locator Aided Network for Cross-Modal Understanding

no code implementations28 Nov 2022 Jiang-Tian Zhai, Qi Zhang, Tong Wu, Xing-Yu Chen, Jiang-Jiang Liu, Bo Ren, Ming-Ming Cheng

By aggregating cross-modal information, the region filter selects key regions and the region adaptor updates their coordinates with text guidance.

Image Retrieval Retrieval

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

1 code implementation26 Aug 2022 Tong Wu, Jiaqi Wang, Xingang Pan, Xudong Xu, Christian Theobalt, Ziwei Liu, Dahua Lin

Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene.

Surface Reconstruction

Complex-Value Spatio-temporal Graph Convolutional Neural Networks and its Applications to Electric Power Systems AI

no code implementations17 Aug 2022 Tong Wu, Anna Scaglione, Daniel Arnold

The effective representation, precessing, analysis, and visualization of large-scale structured data over graphs are gaining a lot of attention.

Cyber Attack Detection

Just Rotate it: Deploying Backdoor Attacks via Rotation Transformation

no code implementations22 Jul 2022 Tong Wu, Tianhao Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal

Our attack can be easily deployed in the real world since it only requires rotating the object, as we show in both image classification and object detection applications.

Data Augmentation Image Classification +3

Human-Robot Commensality: Bite Timing Prediction for Robot-Assisted Feeding in Groups

no code implementations7 Jul 2022 Jan Ondras, Abrar Anwar, Tong Wu, Fanjun Bu, Malte Jung, Jorge Jose Ortiz, Tapomayukh Bhattacharjee

While existing robotic systems for feeding people with mobility limitations focus on solitary dining, commensality, the act of eating together, is often the practice of choice.

Towards A Proactive ML Approach for Detecting Backdoor Poison Samples

2 code implementations26 May 2022 Xiangyu Qi, Tinghao Xie, Jiachen T. Wang, Tong Wu, Saeed Mahloujifar, Prateek Mittal

First, we uncover a post-hoc workflow underlying most prior work, where defenders passively allow the attack to proceed and then leverage the characteristics of the post-attacked model to uncover poison samples.

Spatio-Temporal Graph Convolutional Neural Networks for Physics-Aware Grid Learning Algorithms

no code implementations31 Mar 2022 Tong Wu, Ignacio Losada Carreno, Anna Scaglione, Daniel Arnold

This paper proposes a model-free Volt-VAR control (VVC) algorithm via the spatio-temporal graph ConvNet-based deep reinforcement learning (STGCN-DRL) framework, whose goal is to control smart inverters in an unbalanced distribution system.

reinforcement-learning Reinforcement Learning (RL)

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

no code implementations16 Mar 2022 Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang

Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines.

Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion

1 code implementation NeurIPS 2021 Tong Wu, Liang Pan, Junzhe Zhang, Tai Wang, Ziwei Liu, Dahua Lin

We adopt DCD to evaluate the point cloud completion task, where experimental results show that DCD pays attention to both the overall structure and local geometric details and provides a more reliable evaluation even when CD and EMD contradict each other.

Point Cloud Completion

Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion

1 code implementation24 Nov 2021 Tong Wu, Liang Pan, Junzhe Zhang, Tai Wang, Ziwei Liu, Dahua Lin

We adopt DCD to evaluate the point cloud completion task, where experimental results show that DCD pays attention to both the overall structure and local geometric details and provides a more reliable evaluation even when CD and EMD contradict each other.

Point Cloud Completion

Few-Shot Object Detection via Association and DIscrimination

1 code implementation NeurIPS 2021 Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin

1) In the association step, in contrast to implicitly leveraging multiple base classes, we construct a compact novel class feature space via explicitly imitating a specific base class feature space.

Few-Shot Object Detection Object +3

Modeling spatial waves of Wolbachia invasion for controlling mosquito-borne diseases

no code implementations24 Aug 2021 Zhuolin Qu, Tong Wu, James Mac Hyman

Field trials and modeling studies have shown that the fraction of infection among the mosquitoes must exceed a threshold level for the infection to persist.

Generalizing Nucleus Recognition Model in Multi-source Images via Pruning

no code implementations6 Jul 2021 Jiatong Cai, Chenglu Zhu, Can Cui, Honglin Li, Tong Wu, Shichuan Zhang, Lin Yang

In addition, the model is optimized by fine-tuning on merged domains to eliminate the interference of class mismatching among various domains.

Domain Generalization

Adversarial Robustness under Long-Tailed Distribution

1 code implementation CVPR 2021 Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin

We then perform a systematic study on existing long-tailed recognition methods in conjunction with the adversarial training framework.

Adversarial Robustness

Towards Evaluating and Training Verifiably Robust Neural Networks

1 code implementation CVPR 2021 Zhaoyang Lyu, Minghao Guo, Tong Wu, Guodong Xu, Kehuan Zhang, Dahua Lin

Recent works have shown that interval bound propagation (IBP) can be used to train verifiably robust neural networks.

RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning

no code implementations31 Mar 2021 Tong Wu, Jorge Ortiz

We introduce a new semi-supervised, time series anomaly detection algorithm that uses deep reinforcement learning (DRL) and active learning to efficiently learn and adapt to anomalies in real-world time series data.

Active Learning Anomaly Detection +4

Hot electron generation through near-field excitation of plasmonic nanoresonators

no code implementations11 Mar 2021 Felix Binkowski, Tong Wu, Philippe Lalanne, Sven Burger, Alexander O. Govorov

We theoretically study hot electron generation through the emission of a dipole source coupled to a nanoresonator on a metal surface.

Optics Mesoscale and Nanoscale Physics

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

1 code implementation13 Oct 2020 Jianrong Wang, Tong Wu, Shanyu Wang, Mei Yu, Qiang Fang, Ju Zhang, Li Liu

To this end, in this work, we present a novel end-to-end 3D lip motion Network (3LMNet) by utilizing the sentence-level 3D lip motion (S3DLM) to recognize speakers in both the text-independent and text-dependent contexts.

Sentence Speaker Recognition +1

TM-NET: Deep Generative Networks for Textured Meshes

no code implementations13 Oct 2020 Lin Gao, Tong Wu, Yu-Jie Yuan, Ming-Xian Lin, Yu-Kun Lai, Hao Zhang

We introduce a conditional autoregressive model for texture generation, which can be conditioned on both part geometry and textures already generated for other parts to achieve texture compatibility.

Graphics

Physical Adversarial Attack on Vehicle Detector in the Carla Simulator

no code implementations31 Jul 2020 Tong Wu, Xuefei Ning, Wenshuo Li, Ranran Huang, Huazhong Yang, Yu Wang

In this paper, we tackle the issue of physical adversarial examples for object detectors in the wild.

Adversarial Attack

Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets

1 code implementation ECCV 2020 Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, Dahua Lin

We present a new loss function called Distribution-Balanced Loss for the multi-label recognition problems that exhibit long-tailed class distributions.

Binary Classification General Classification +2

Adversarial Robustness of Deep Sensor Fusion Models

no code implementations23 Jun 2020 Shaojie Wang, Tong Wu, Ayan Chakrabarti, Yevgeniy Vorobeychik

First, we find that the fusion model is usually both more accurate, and more robust against single-source attacks than single-sensor deep neural networks.

Adversarial Robustness Autonomous Driving +4

Meta Segmentation Network for Ultra-Resolution Medical Images

no code implementations19 Feb 2020 Tong Wu, Yuan Xie, Yanyun Qu, Bicheng Dai, Shuxin Chen

MSN can fast generate the weights of fusion layers through a simple meta-learner, requiring only a few training samples and epochs to converge.

Image Segmentation Meta-Learning +2

Representation Learning of EHR Data via Graph-Based Medical Entity Embedding

no code implementations7 Oct 2019 Tong Wu, Yunlong Wang, Yue Wang, Emily Zhao, Yilian Yuan, Zhi Yang

Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare informatics that turns heterogeneous medical records into structured and actionable information.

Graph Embedding Representation Learning

Enhancing Model Interpretability and Accuracy for Disease Progression Prediction via Phenotype-Based Patient Similarity Learning

no code implementations26 Sep 2019 Yue Wang, Tong Wu, Yunlong Wang, Gao Wang

Models have been proposed to extract temporal patterns from longitudinal electronic health records (EHR) for clinical predictive models.

regression

Defending Against Physically Realizable Attacks on Image Classification

2 code implementations ICLR 2020 Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Finally, we demonstrate that adversarial training using our new attack yields image classification models that exhibit high robustness against the physically realizable attacks we study, offering the first effective generic defense against such attacks.

Classification General Classification +1

AIBench: An Industry Standard Internet Service AI Benchmark Suite

no code implementations13 Aug 2019 Wanling Gao, Fei Tang, Lei Wang, Jianfeng Zhan, Chunxin Lan, Chunjie Luo, Yunyou Huang, Chen Zheng, Jiahui Dai, Zheng Cao, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Tong Wu, Minghe Yu, Chongkang Tan, Huan Li, Xinhui Tian, Yatao Li, Junchao Shao, Zhenyu Wang, Xiaoyu Wang, Hainan Ye

On the basis of the AIBench framework, abstracting the real-world data sets and workloads from one of the top e-commerce providers, we design and implement the first end-to-end Internet service AI benchmark, which contains the primary modules in the critical paths of an industry scale application and is scalable to deploy on different cluster scales.

Benchmarking Learning-To-Rank

SDM-NET: Deep Generative Network for Structured Deformable Mesh

no code implementations13 Aug 2019 Lin Gao, Jie Yang, Tong Wu, Yu-Jie Yuan, Hongbo Fu, Yu-Kun Lai, Hao Zhang

At the structural level, we train a Structured Parts VAE (SP-VAE), which jointly learns the part structure of a shape collection and the part geometries, ensuring a coherence between global shape structure and surface details.

Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model

no code implementations1 Jul 2019 Fan Zhang, Tong Wu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

Many computational models were proposed to extract temporal patterns from clinical time series for each patient and among patient group for predictive healthcare.

Time Series Time Series Analysis

Human Action Attribute Learning From Video Data Using Low-Rank Representations

no code implementations23 Dec 2016 Tong Wu, Prudhvi Gurram, Raghuveer M. Rao, Waheed U. Bajwa

Representation of human actions as a sequence of human body movements or action attributes enables the development of models for human activity recognition and summarization.

Action Recognition Attribute +3

Learning the nonlinear geometry of high-dimensional data: Models and algorithms

no code implementations21 Dec 2014 Tong Wu, Waheed U. Bajwa

This paper revisits the problem of data-driven learning of these geometric structures and puts forth two new nonlinear geometric models for data describing "related" objects/phenomena.

Clustering Vocal Bursts Intensity Prediction

Painting Analysis Using Wavelets and Probabilistic Topic Models

no code implementations26 Jan 2014 Tong Wu, Gungor Polatkan, David Steel, William Brown, Ingrid Daubechies, Robert Calderbank

In this paper, computer-based techniques for stylistic analysis of paintings are applied to the five panels of the 14th century Peruzzi Altarpiece by Giotto di Bondone.

Clustering Topic Models

Cannot find the paper you are looking for? You can Submit a new open access paper.