Search Results for author: Tong Wu

Found 76 papers, 30 papers with code

Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation

no code implementations • ECCV 2020 • Qingqiu Huang, Lei Yang, Huaiyi Huang, Tong Wu, Dahua Lin

Captioned images are widely available on the web, while the captions often contain the names of the subjects in the images.

Face Model Face Recognition

Paper
Add Code

DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading

no code implementations • 15 Apr 2024 • Tong Wu, Jia-Mu Sun, Yu-Kun Lai, Yuewen Ma, Leif Kobbelt, Lin Gao

To address these issues, we introduce DeferredGS, a method for decoupling and editing the Gaussian splatting representation using deferred shading.

Paper
Add Code

ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance

no code implementations • 19 Mar 2024 • Yongwei Chen, Tengfei Wang, Tong Wu, Xingang Pan, Kui Jia, Ziwei Liu

Though promising results have been achieved in single object generation, these methods often struggle to model complex 3D assets that inherently contain multiple objects.

3D Generation Object

Paper
Add Code

Recent Advances in 3D Gaussian Splatting

no code implementations • 17 Mar 2024 • Tong Wu, Yu-Jie Yuan, Ling-Xiao Zhang, Jie Yang, Yan-Pei Cao, Ling-Qi Yan, Lin Gao

The emergence of 3D Gaussian Splatting (3DGS) has greatly accelerated the rendering speed of novel view synthesis.

3D Reconstruction Dynamic Reconstruction +1

Paper
Add Code

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

1 code implementation • 4 Mar 2024 • Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

Specifically, it is powered by a text-conditioned tri-plane latent diffusion model, which quickly generates coarse 3D samples for fast prototyping.

3D Generation Text to 3D +1

546

Paper
Code

Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion

no code implementations • 3 Mar 2024 • Linhan Xia, Junbang Liu, Tong Wu

This research presents a novel depth estimation algorithm based on a Transformer-encoder architecture, tailored for the NYU and KITTI Depth Dataset.

Depth Estimation SSIM

Paper
Add Code

Sinkhorn Distance Minimization for Knowledge Distillation

1 code implementation • 27 Feb 2024 • Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

We propose the Sinkhorn Knowledge Distillation (SinKD) that exploits the Sinkhorn distance to ensure a nuanced and precise assessment of the disparity between teacher and student distributions.

Knowledge Distillation

Paper
Code

Growing from Exploration: A self-exploring framework for robots based on foundation models

no code implementations • 24 Jan 2024 • Shoujie Li, Ran Yu, Tong Wu, JunWen Zhong, Xiao-Ping Zhang, Wenbo Ding

In this work, we propose a framework named GExp, which enables robots to explore and learn autonomously without human intervention.

Few-Shot Learning

Paper
Add Code

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

1 code implementation • 8 Jan 2024 • Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas Guibas, Dahua Lin, Gordon Wetzstein

These metrics lack the flexibility to generalize to different evaluation criteria and might not align well with human preferences.

3D Generation Text to 3D

172

Paper
Code

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

1 code implementation • 5 Jan 2024 • DeepSeek-AI, :, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, JianZhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, A. X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou

The rapid development of open-source large language models (LLMs) has been truly remarkable.

1,104

Paper
Code

Guidelines in Wastewater-based Epidemiology of SARS-CoV-2 with Diagnosis

no code implementations • 26 Dec 2023 • Madiha Fatima, Zhihua Cao, Aichun Huang, Shengyuan Wu, Xinxian Fan, Yi Wang, Liu Jiren, Ziyun Zhu, Qiongrou Ye, Yuan Ma, Joseph K. F Chow, Peng Jia, Yangshou Liu, Yubin Lin, Manjun Ye, Tong Wu, ZHIXUN LI, Cong Cai, Wenhai Zhang, Cheris H. Q. Ding, Yuanzhe Cai, Feijuan Huang

With the global spread and increasing transmission rate of SARS-CoV-2, more and more laboratories and researchers are turning their attention to wastewater-based epidemiology (WBE), hoping it can become an effective tool for large-scale testing and provide more ac-curate predictions of the number of infected individuals.

Epidemiology

Paper
Add Code

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

1 code implementation • 22 Dec 2023 • Zhangyang Qi, Ye Fang, Mengchen Zhang, Zeyi Sun, Tong Wu, Ziwei Liu, Dahua Lin, Jiaqi Wang, Hengshuang Zhao

We conducted a series of structured experiments to evaluate their performance in various industrial application scenarios, offering a comprehensive perspective on their practical utility.

182

Paper
Code

Building Lane-Level Maps from Aerial Images

1 code implementation • 20 Dec 2023 • Jiawei Yao, Xiaochao Pan, Tong Wu, Xiaofeng Zhang

In this paper, we introduce for the first time a large-scale aerial image dataset built for lane detection, with high-quality polyline lane annotations on high-resolution images of around 80 kilometers of road.

Autonomous Driving Lane Detection

Paper
Code

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

no code implementations • 7 Dec 2023 • Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xinggang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu

Extensive experiments demonstrate the effectiveness of HyperDreamer in modeling region-aware materials with high-resolution textures and enabling user-friendly editing.

Semantic Segmentation

Paper
Add Code

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

1 code implementation • 6 Dec 2023 • Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Alpha-CLIP not only preserves the visual recognition ability of CLIP but also enables precise control over the emphasis of image contents.

3D Generation

478

Paper
Code

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

1 code implementation • 5 Dec 2023 • Zhangyang Qi, Ye Fang, Zeyi Sun, Xiaoyang Wu, Tong Wu, Jiaqi Wang, Dahua Lin, Hengshuang Zhao

Multimodal Large Language Models (MLLMs) have excelled in 2D image-text comprehension and image generation, but their understanding of the 3D world is notably deficient, limiting progress in 3D language understanding and generation.

3D Generation Reading Comprehension

250

Paper
Code

Retargeting Visual Data with Deformation Fields

no code implementations • 22 Nov 2023 • Tim Elsner, Julia Berger, Tong Wu, Victor Czech, Lin Gao, Leif Kobbelt

Seam carving is an image editing method that enable content-aware resizing, including operations like removing objects.

Paper
Add Code

Towards Robust Text Retrieval with Progressive Learning

1 code implementation • 20 Nov 2023 • Tong Wu, Yulei Qin, Enwei Zhang, Zihan Xu, Yuting Gao, Ke Li, Xing Sun

However, existing embedding models for text retrieval usually have three non-negligible limitations.

Machine Reading Comprehension Question Answering +2

Paper
Code

PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses

1 code implementation • 19 Oct 2023 • Chong Xiang, Tong Wu, Sihui Dai, Jonathan Petit, Suman Jana, Prateek Mittal

State-of-the-art defenses against adversarial patch attacks can now achieve strong certifiable robustness with a marginal drop in model utility.

Paper
Code

Geometry-Guided Ray Augmentation for Neural Surface Reconstruction with Sparse Views

no code implementations • 9 Oct 2023 • Jiawei Yao, Chen Wang, Tong Wu, Chuming Li

In this paper, we propose a novel method for 3D scene and object reconstruction from sparse multi-view images.

Object Reconstruction Surface Reconstruction

Paper
Add Code

1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report:A Concise Pipeline for Egocentric Hand Pose Reconstruction

no code implementations • 7 Oct 2023 • Zhishan Zhou, Zhi Lv, Shihao Zhou, Minqiang Zou, Tong Wu, Mochen Yu, Yao Tang, Jiajun Liang

This report introduce our work on Egocentric 3D Hand Pose Estimation workshop.

3D Hand Pose Estimation

Paper
Add Code

Large-Vocabulary 3D Diffusion Model with Transformer

no code implementations • 14 Sep 2023 • Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu

To this end, we propose a novel triplane-based 3D-aware Diffusion model with TransFormer, DiffTF, for handling challenges via three aspects.

3D Generation

Paper
Add Code

Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN

no code implementations • 16 Aug 2023 • Jiawei Yao, Tong Wu, Xiaofeng Zhang

To explore the differences between Transformers and CNNs, we employ a sparse pixel approach to contrastively analyze the distinctions between the two.

Monocular Depth Estimation

Paper
Add Code

Robust Data Clustering with Outliers via Transformed Tensor Low-Rank Representation

1 code implementation • 18 Jul 2023 • Tong Wu

Recently, tensor low-rank representation (TLRR) has become a popular tool for tensor data recovery and clustering, due to its empirical success and theoretical guarantees.

Clustering Outlier Detection

Paper
Code

Towards Trustworthy Explanation: On Causal Rationalization

1 code implementation • 25 Jun 2023 • Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai

With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction.

Causal Inference

Paper
Code

Differential Privacy for Class-based Data: A Practical Gaussian Mechanism

no code implementations • 8 Jun 2023 • Raksha Ramakrishna, Anna Scaglione, Tong Wu, Nikhil Ravi, Sean Peisert

In this paper, we present a notion of differential privacy (DP) for data that comes from different classes.

Paper
Add Code

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

1 code implementation • NeurIPS 2023 • Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, Nan Duan, Weizhu Chen

Diffusion models have gained significant attention in the realm of image generation due to their exceptional performance.

Common Sense Reasoning Denoising +4

608

Paper
Code

Privacy-Preserving In-Context Learning for Large Language Models

no code implementations • 2 May 2023 • Tong Wu, Ashwinee Panda, Jiachen T. Wang, Prateek Mittal

Based on the general paradigm of DP-ICL, we instantiate several techniques showing how to privatize ICL for text classification and language generation.

In-Context Learning Privacy Preserving +3

Paper
Add Code

A Randomized Approach for Tight Privacy Accounting

no code implementations • 17 Apr 2023 • Jiachen T. Wang, Saeed Mahloujifar, Tong Wu, Ruoxi Jia, Prateek Mittal

In this paper, we propose a new differential privacy paradigm called estimate-verify-release (EVR), which addresses the challenges of providing a strict upper bound for privacy parameter in DP compositions by converting an estimate of privacy parameter into a formal guarantee.

Privacy Preserving

Paper
Add Code

V3Det: Vast Vocabulary Visual Detection Dataset

no code implementations • ICCV 2023 • Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin

2) Hierarchical Category Organization: The vast vocabulary of V3Det is organized by a hierarchical category tree which annotates the inclusion relationship among categories, encouraging the exploration of category relationships in vast and open vocabulary object detection.

Chatbot Object +2

Paper
Add Code

Enhancing Text Generation with Cooperative Training

1 code implementation • 16 Mar 2023 • Tong Wu, Hao Wang, Zhongshen Zeng, Wei Wang, Hai-Tao Zheng, Jiaxing Zhang

Recently, there has been a surge in the use of generated data to enhance the performance of downstream models, largely due to the advancements in pre-trained language models.

MRPC QQP +2

Paper
Code

Constrained Reinforcement Learning for Predictive Control in Real-Time Stochastic Dynamic Optimal Power Flow

no code implementations • 21 Feb 2023 • Tong Wu, Anna Scaglione, Daniel Arnold

This paper presents a novel primal-dual approach for learning optimal constrained DRL policies for dynamic optimal power flow problems, with the aim of controlling power generations and battery outputs.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Uncovering Adversarial Risks of Test-Time Adaptation

no code implementations • 29 Jan 2023 • Tong Wu, Feiran Jia, Xiangyu Qi, Jiachen T. Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal

Recently, test-time adaptation (TTA) has been proposed as a promising solution for addressing distribution shifts.

Test-time Adaptation

Paper
Add Code

LF-checker: Machine Learning Acceleration of Bounded Model Checking for Concurrency Verification (Competition Contribution)

1 code implementation • 22 Jan 2023 • Tong Wu, Edoardo Manino, Fatimah Aljaafari, Pavlos Petoumenos, Lucas C. Cordeiro

We describe and evaluate LF-checker, a metaverifier tool based on machine learning.

Paper
Code

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

1 code implementation • CVPR 2023 • Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu

Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of large-scale realscanned 3D databases.

Novel View Synthesis Object +1

416

Paper
Code

SLAN: Self-Locator Aided Network for Vision-Language Understanding

no code implementations • ICCV 2023 • Jiang-Tian Zhai, Qi Zhang, Tong Wu, Xing-Yu Chen, Jiang-Jiang Liu, Ming-Ming Cheng

By aggregating vision-language information, the region filter selects key regions and the region adaptor updates their coordinates with text guidance.

Image Retrieval Retrieval

Paper
Add Code

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

1 code implementation • 22 Dec 2022 • Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen

In this paper, we introduce a novel dIffusion language modEl pre-training framework for text generation, which we call GENIE.

Denoising Language Modelling +1

608

Paper
Code

SLAN: Self-Locator Aided Network for Cross-Modal Understanding

no code implementations • 28 Nov 2022 • Jiang-Tian Zhai, Qi Zhang, Tong Wu, Xing-Yu Chen, Jiang-Jiang Liu, Bo Ren, Ming-Ming Cheng

By aggregating cross-modal information, the region filter selects key regions and the region adaptor updates their coordinates with text guidance.

Image Retrieval Retrieval

Paper
Add Code

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

1 code implementation • 26 Aug 2022 • Tong Wu, Jiaqi Wang, Xingang Pan, Xudong Xu, Christian Theobalt, Ziwei Liu, Dahua Lin

Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene.

Surface Reconstruction

400

Paper
Code

Complex-Value Spatio-temporal Graph Convolutional Neural Networks and its Applications to Electric Power Systems AI

no code implementations • 17 Aug 2022 • Tong Wu, Anna Scaglione, Daniel Arnold

The effective representation, precessing, analysis, and visualization of large-scale structured data over graphs are gaining a lot of attention.

Cyber Attack Detection

Paper
Add Code

Just Rotate it: Deploying Backdoor Attacks via Rotation Transformation

no code implementations • 22 Jul 2022 • Tong Wu, Tianhao Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal

Our attack can be easily deployed in the real world since it only requires rotating the object, as we show in both image classification and object detection applications.

Data Augmentation Image Classification +3

Paper
Add Code

Human-Robot Commensality: Bite Timing Prediction for Robot-Assisted Feeding in Groups

no code implementations • 7 Jul 2022 • Jan Ondras, Abrar Anwar, Tong Wu, Fanjun Bu, Malte Jung, Jorge Jose Ortiz, Tapomayukh Bhattacharjee

While existing robotic systems for feeding people with mobility limitations focus on solitary dining, commensality, the act of eating together, is often the practice of choice.

Paper
Add Code

Towards A Proactive ML Approach for Detecting Backdoor Poison Samples

2 code implementations • 26 May 2022 • Xiangyu Qi, Tinghao Xie, Jiachen T. Wang, Tong Wu, Saeed Mahloujifar, Prateek Mittal

First, we uncover a post-hoc workflow underlying most prior work, where defenders passively allow the attack to proceed and then leverage the characteristics of the post-attacked model to uncover poison samples.

114

Paper
Code

Spatio-Temporal Graph Convolutional Neural Networks for Physics-Aware Grid Learning Algorithms

no code implementations • 31 Mar 2022 • Tong Wu, Ignacio Losada Carreno, Anna Scaglione, Daniel Arnold

This paper proposes a model-free Volt-VAR control (VVC) algorithm via the spatio-temporal graph ConvNet-based deep reinforcement learning (STGCN-DRL) framework, whose goal is to control smart inverters in an unbalanced distribution system.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

no code implementations • 16 Mar 2022 • Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang

Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines.

Paper
Add Code

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results

2 code implementations • 22 Dec 2021 • Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández, Qinlong Wang, Yang Yang

Based on the MVP dataset, this paper reports methods and results in the Multi-View Partial Point Cloud Challenge 2021 on Completion and Registration.

3D Reconstruction Point Cloud Completion +2

149

Paper
Code

Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion

1 code implementation • NeurIPS 2021 • Tong Wu, Liang Pan, Junzhe Zhang, Tai Wang, Ziwei Liu, Dahua Lin

We adopt DCD to evaluate the point cloud completion task, where experimental results show that DCD pays attention to both the overall structure and local geometric details and provides a more reliable evaluation even when CD and EMD contradict each other.

Point Cloud Completion

133

Paper
Code

Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion

1 code implementation • 24 Nov 2021 • Tong Wu, Liang Pan, Junzhe Zhang, Tai Wang, Ziwei Liu, Dahua Lin

Point Cloud Completion

133

Paper
Code

Few-Shot Object Detection via Association and DIscrimination

1 code implementation • NeurIPS 2021 • Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin

1) In the association step, in contrast to implicitly leveraging multiple base classes, we construct a compact novel class feature space via explicitly imitating a specific base class feature space.

Few-Shot Object Detection Object +3

Paper
Code

Modeling spatial waves of Wolbachia invasion for controlling mosquito-borne diseases

no code implementations • 24 Aug 2021 • Zhuolin Qu, Tong Wu, James Mac Hyman

Field trials and modeling studies have shown that the fraction of infection among the mosquitoes must exceed a threshold level for the infection to persist.

Paper
Add Code

Generalizing Nucleus Recognition Model in Multi-source Images via Pruning

no code implementations • 6 Jul 2021 • Jiatong Cai, Chenglu Zhu, Can Cui, Honglin Li, Tong Wu, Shichuan Zhang, Lin Yang

In addition, the model is optimized by fine-tuning on merged domains to eliminate the interference of class mismatching among various domains.

Domain Generalization

Paper
Add Code

Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation

1 code implementation • CVPR 2021 • Tong Wu, Junshi Huang, Guangyu Gao, Xiaoming Wei, Xiaolin Wei, Xuan Luo, Chi Harold Liu

In inference, we directly use the activation masks from the DA layer as pseudo-labels for segmentation.

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Code

Adversarial Robustness under Long-Tailed Distribution

1 code implementation • CVPR 2021 • Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin

We then perform a systematic study on existing long-tailed recognition methods in conjunction with the adversarial training framework.

Adversarial Robustness

Paper
Code

Towards Evaluating and Training Verifiably Robust Neural Networks

1 code implementation • CVPR 2021 • Zhaoyang Lyu, Minghao Guo, Tong Wu, Guodong Xu, Kehuan Zhang, Dahua Lin

Recent works have shown that interval bound propagation (IBP) can be used to train verifiably robust neural networks.

Paper
Code

RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning

no code implementations • 31 Mar 2021 • Tong Wu, Jorge Ortiz

We introduce a new semi-supervised, time series anomaly detection algorithm that uses deep reinforcement learning (DRL) and active learning to efficiently learn and adapt to anomalies in real-world time series data.

Active Learning Anomaly Detection +4

Paper
Add Code

Hot electron generation through near-field excitation of plasmonic nanoresonators

no code implementations • 11 Mar 2021 • Felix Binkowski, Tong Wu, Philippe Lalanne, Sven Burger, Alexander O. Govorov

We theoretically study hot electron generation through the emission of a dipole source coupled to a nanoresonator on a metal surface.

Optics Mesoscale and Nanoscale Physics

Paper
Add Code

Privacy-Preserving Distributed Optimal Power Flow with Partially Homomorphic Encryption

no code implementations • 21 Jan 2021 • Tong Wu, Changhong Zhao, Ying-Jun Angela Zhang

In this way, the dual update of ADMM can be encrypted by PHE.

Privacy Preserving

Paper
Add Code

A Coarse-to-Fine Auto-Sampler For Long-tailed Image Recognition

no code implementations • CUHK Course IERG5350 2020 • Tong Wu, Hao Li

The long-tail distributed data in the real world has always been a great challenge for deep learning.

Data Augmentation Representation Learning

Paper
Add Code

Shaping Deep Feature Space towards Gaussian Mixture for Visual Classification

no code implementations • 18 Nov 2020 • Weitao Wan, Jiansheng Chen, Cheng Yu, Tong Wu, Yuanyi Zhong, Ming-Hsuan Yang

In this work, we propose a Gaussian mixture (GM) loss function for deep neural networks for visual classification.

Classification General Classification +1

Paper
Add Code

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

1 code implementation • 13 Oct 2020 • Jianrong Wang, Tong Wu, Shanyu Wang, Mei Yu, Qiang Fang, Ju Zhang, Li Liu

To this end, in this work, we present a novel end-to-end 3D lip motion Network (3LMNet) by utilizing the sentence-level 3D lip motion (S3DLM) to recognize speakers in both the text-independent and text-dependent contexts.

Sentence Speaker Recognition +1

Paper
Code

TM-NET: Deep Generative Networks for Textured Meshes

no code implementations • 13 Oct 2020 • Lin Gao, Tong Wu, Yu-Jie Yuan, Ming-Xian Lin, Yu-Kun Lai, Hao Zhang

We introduce a conditional autoregressive model for texture generation, which can be conditioned on both part geometry and textures already generated for other parts to achieve texture compatibility.

Graphics

Paper
Add Code

Physical Adversarial Attack on Vehicle Detector in the Carla Simulator

no code implementations • 31 Jul 2020 • Tong Wu, Xuefei Ning, Wenshuo Li, Ranran Huang, Huazhong Yang, Yu Wang

In this paper, we tackle the issue of physical adversarial examples for object detectors in the wild.

Adversarial Attack

Paper
Add Code

Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets

1 code implementation • ECCV 2020 • Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, Dahua Lin

We present a new loss function called Distribution-Balanced Loss for the multi-label recognition problems that exhibit long-tailed class distributions.

Ranked #7 on Long-tail Learning on VOC-MLT

Binary Classification General Classification +2

349

Paper
Code

Adversarial Robustness of Deep Sensor Fusion Models

no code implementations • 23 Jun 2020 • Shaojie Wang, Tong Wu, Ayan Chakrabarti, Yevgeniy Vorobeychik

First, we find that the fusion model is usually both more accurate, and more robust against single-source attacks than single-sensor deep neural networks.

Adversarial Robustness Autonomous Driving +4

Paper
Add Code

Meta Segmentation Network for Ultra-Resolution Medical Images

no code implementations • 19 Feb 2020 • Tong Wu, Yuan Xie, Yanyun Qu, Bicheng Dai, Shuxin Chen

MSN can fast generate the weights of fusion layers through a simple meta-learner, requiring only a few training samples and epochs to converge.

Image Segmentation Meta-Learning +2

Paper
Add Code

Representation Learning of EHR Data via Graph-Based Medical Entity Embedding

no code implementations • 7 Oct 2019 • Tong Wu, Yunlong Wang, Yue Wang, Emily Zhao, Yilian Yuan, Zhi Yang

Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare informatics that turns heterogeneous medical records into structured and actionable information.

Graph Embedding Representation Learning

Paper
Add Code

Enhancing Model Interpretability and Accuracy for Disease Progression Prediction via Phenotype-Based Patient Similarity Learning

no code implementations • 26 Sep 2019 • Yue Wang, Tong Wu, Yunlong Wang, Gao Wang

Models have been proposed to extract temporal patterns from longitudinal electronic health records (EHR) for clinical predictive models.

regression

Paper
Add Code

Defending Against Physically Realizable Attacks on Image Classification

2 code implementations • ICLR 2020 • Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Finally, we demonstrate that adversarial training using our new attack yields image classification models that exhibit high robustness against the physically realizable attacks we study, offering the first effective generic defense against such attacks.

Classification General Classification +1

Paper
Code

AIBench: An Industry Standard Internet Service AI Benchmark Suite

no code implementations • 13 Aug 2019 • Wanling Gao, Fei Tang, Lei Wang, Jianfeng Zhan, Chunxin Lan, Chunjie Luo, Yunyou Huang, Chen Zheng, Jiahui Dai, Zheng Cao, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Tong Wu, Minghe Yu, Chongkang Tan, Huan Li, Xinhui Tian, Yatao Li, Junchao Shao, Zhenyu Wang, Xiaoyu Wang, Hainan Ye

On the basis of the AIBench framework, abstracting the real-world data sets and workloads from one of the top e-commerce providers, we design and implement the first end-to-end Internet service AI benchmark, which contains the primary modules in the critical paths of an industry scale application and is scalable to deploy on different cluster scales.

Benchmarking Learning-To-Rank

Paper
Add Code

SDM-NET: Deep Generative Network for Structured Deformable Mesh

no code implementations • 13 Aug 2019 • Lin Gao, Jie Yang, Tong Wu, Yu-Jie Yuan, Hongbo Fu, Yu-Kun Lai, Hao Zhang

At the structural level, we train a Structured Parts VAE (SP-VAE), which jointly learns the part structure of a shape collection and the part geometries, ensuring a coherence between global shape structure and surface details.

Paper
Add Code

A Mobile Cloud Collaboration Fall Detection System Based on Ensemble Learning

no code implementations • 5 Jul 2019 • Tong Wu, Yang Gu, Yiqiang Chen, Yunlong Xiao, Jiwei Wang

Falls are one of the important causes of accidental or unintentional injury death worldwide.

Ensemble Learning Specificity

Paper
Add Code

Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model

no code implementations • 1 Jul 2019 • Fan Zhang, Tong Wu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

Many computational models were proposed to extract temporal patterns from clinical time series for each patient and among patient group for predictive healthcare.

Time Series Time Series Analysis

Paper
Add Code

Deep Compressive Autoencoder for Action Potential Compression in Large-Scale Neural Recording

1 code implementation • 14 Sep 2018 • Tong Wu, Wenfeng Zhao, Edward Keefer, Zhi Yang

The proposed model is built upon a deep compressive autoencoder (CAE) with discrete latent embeddings.

Quantization Spike Sorting

Paper
Code

Human Action Attribute Learning From Video Data Using Low-Rank Representations

no code implementations • 23 Dec 2016 • Tong Wu, Prudhvi Gurram, Raghuveer M. Rao, Waheed U. Bajwa

Representation of human actions as a sequence of human body movements or action attributes enables the development of models for human activity recognition and summarization.

Action Recognition Attribute +3

Paper
Add Code

Learning the nonlinear geometry of high-dimensional data: Models and algorithms

no code implementations • 21 Dec 2014 • Tong Wu, Waheed U. Bajwa

This paper revisits the problem of data-driven learning of these geometric structures and puts forth two new nonlinear geometric models for data describing "related" objects/phenomena.

Clustering Vocal Bursts Intensity Prediction

Paper
Add Code

Painting Analysis Using Wavelets and Probabilistic Topic Models

no code implementations • 26 Jan 2014 • Tong Wu, Gungor Polatkan, David Steel, William Brown, Ingrid Daubechies, Robert Calderbank

In this paper, computer-based techniques for stylistic analysis of paintings are applied to the five panels of the 14th century Peruzzi Altarpiece by Giotto di Bondone.

Clustering Topic Models

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.