Search Results for author: Zhao Wang

Found 34 papers, 13 papers with code

Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration

no code implementations 21 Mar 2024 Zhihao Wang, Yulin Zhou, Ningyu Zhang, Xiaosong Yang, Jun Xiao, Zhao Wang

We believe our work could provide a novel perspective for considering uncertainty quality in the general motion prediction task and encourage further studies in this field.

Human motion prediction, motion prediction

An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals

no code implementations 14 Mar 2024 Zhao Wang, Xiaomeng Li, Na Li, Longlong Shu

This study aimed to develop a deep learning model for the classification of bearing faults in wind turbine generators from acoustic signals.

Efficient Transferability Assessment for Selection of Pre-trained Detectors

no code implementations 14 Mar 2024 Zhao Wang, Aoxue Li, Zhenguo Li, Qi Dou

Given this zoo, we adopt 7 target datasets from 5 diverse domains as the downstream target tasks for evaluation.

Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization

no code implementations 14 Mar 2024 Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou

Without using knowledge distillation, ensemble models, or extra training data during detector training, our proposed MIC outperforms previous SOTA methods trained with these complex techniques on LVIS.

Contrastive Learning, Knowledge Distillation +2

EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting

1 code implementation 21 Jan 2024 Lingting Zhu, Zhao Wang, Jiahao Cui, Zhenchao Jin, Guying Lin, Lequan Yu

Specifically, our approach incorporates deformation fields to handle dynamic scenes, depth-guided supervision with spatial-temporal weight masks to optimize 3D targets with tool occlusion from a single viewpoint, and surface-aligned regularization terms to capture better geometry.

3D Reconstruction

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

no code implementations 18 Jan 2024 Zhao Wang, Aoxue Li, Enze Xie, Lingting Zhu, Yong Guo, Qi Dou, Zhenguo Li

Customized text-to-video generation aims to generate high-quality videos guided by text prompts and subject references.

Object, Text-to-Video Generation +1

A graph-based multimodal framework to predict gentrification

no code implementations 25 Dec 2023 Javad Eshtiyagh, Baotong Zhang, Yujing Sun, Linhui Wu, Zhao Wang

Gentrification--the transformation of a low-income urban area caused by the influx of affluent residents--has many revitalizing benefits.

Multimodal Deep Learning

Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach

no code implementations 24 Sep 2023 Binzhe Li, Bolin Chen, Zhao Wang, Shiqi Wang, Yan Ye

In this letter, we envision a new metaverse communication paradigm for virtual avatar faces and develop semantic face compression with compact 3D facial descriptors.

Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

1 code implementation 29 Jun 2023 Zhao Wang, Chang Liu, Shaoting Zhang, Qi Dou

Foundation models have exhibited remarkable success in various applications, such as disease diagnosis and text report generation.

Segmentation, Transfer Learning

3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation

1 code implementation 23 Jun 2023 Shizhan Gong, Yuan Zhong, Wenao Ma, Jinpeng Li, Zhao Wang, Jingyang Zhang, Pheng-Ann Heng, Qi Dou

Notably, the original SAM architecture is designed for 2D natural images and therefore cannot effectively extract 3D spatial information from volumetric medical data.

Image Segmentation, Medical Image Segmentation +2

Interactive Face Video Coding: A Generative Compression Framework

2 code implementations 20 Feb 2023 Bolin Chen, Zhao Wang, Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye

In this paper, we propose a novel framework for Interactive Face Video Coding (IFVC), which allows humans to interact with the intrinsic visual representations instead of the signals.

$C^*$-algebra Net: A New Approach Generalizing Neural Network Parameters to $C^*$-algebra

no code implementations 20 Jun 2022 Yuka Hashimoto, Zhao Wang, Tomoko Matsui

We apply our framework to practical problems such as density estimation and few-shot learning and show that our framework enables us to learn features of data even with a limited number of samples.

Density Estimation, Few-Shot Learning

Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives

no code implementations 25 Apr 2022 Shaoning Xiao, Long Chen, Kaifeng Gao, Zhao Wang, Yi Yang, Zhimeng Zhang, Jun Xiao

From the feature perspective, we break down the video into trajectories and are the first to leverage trajectory features in VideoQA to enhance the alignment between the two modalities.

Question Answering, Video Question Answering

Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients

1 code implementation 7 Apr 2022 Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

We propose federation of unsupervised learning (FedUL), where the unlabeled data are transformed into surrogate labeled data for each of the clients, a modified model is trained by supervised FL, and the wanted model is recovered from the modified model.

Federated Learning

Self-attention based anchor proposal for skeleton-based action recognition

no code implementations 17 Dec 2021 Ruijie Hou, Zhao Wang

Skeleton sequences are widely used for action recognition tasks due to their lightweight and compact characteristics.

Action Recognition, Skeleton Based Action Recognition

Enhancing Model Robustness and Fairness with Causality: A Regularization Approach

1 code implementation EMNLP (CINLP) 2021 Zhao Wang, Kai Shu, Aron Culotta

In this paper, we propose a simple and intuitive regularization approach to integrate causal knowledge during model training and build a robust and fair model by emphasizing causal features and de-emphasizing spurious features.

Causal Inference, counterfactual +1

Unsupervised Federated Learning is Possible

no code implementations ICLR 2022 Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

We propose federation of unsupervised learning (FedUL), where the unlabeled data are transformed into surrogate labeled data for each of the clients, a modified model is trained by supervised FL, and the wanted model is recovered from the modified model.

Federated Learning

End-to-end Compression Towards Machine Vision: Network Architecture Design and Optimization

no code implementations 1 Jul 2021 Shurun Wang, Zhao Wang, Shiqi Wang, Yan Ye

In this paper, we show that the design and optimization of network architecture could be further improved for compression towards machine vision.

Object Detection

Efficient Ring-topology Decentralized Federated Learning with Deep Generative Models for Industrial Artificial Intelligent

no code implementations 15 Apr 2021 Zhao Wang, Yifan Hu, Jun Xiao, Chao Wu

A novel ring FL topology and a map-reduce based synchronizing method are designed in the proposed RDFL to improve decentralized FL performance and bandwidth utilization.

Federated Learning

Multi-Density Attention Network for Loop Filtering in Video Compression

no code implementations 8 Apr 2021 Zhao Wang, Changyue Ma, Yan Ye

In this paper, we propose an on-line scaling based multi-density attention network for loop filtering in video compression.

Video Compression

A Cross Channel Context Model for Latents in Deep Image Compression

no code implementations 4 Mar 2021 Changyue Ma, Zhao Wang, Ruling Liao, Yan Ye

The proposed cross channel context model is combined with the joint autoregressive and hierarchical prior entropy model.

Image Compression, MS-SSIM +1

Minimization of ion micromotion with artificial neural network

no code implementations 3 Mar 2021 Yang Liu, Qi-feng Lao, Peng-fei Lu, Xin-xin Rao, Hao Wu, Teng Liu, Kun-xu Wang, Zhao Wang, Ming-shen Li, Feng Zhu, Luo Le

Minimizing the micromotion of a single trapped ion in a linear Paul trap is tedious and time-consuming work, but it is of great importance for cooling the ion into the motional ground state as well as for maintaining a long coherence time, which is crucial for quantum information processing and quantum computation.

Atomic Physics, Quantum Physics

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

1 code implementation 18 Dec 2020 Zhao Wang, Aron Culotta

However, the classifier trained on the combined data is more robust and performs well on both the original test data and the counterfactual test data (e.g., 12%-25% increase in accuracy compared with the traditional classifier).

counterfactual, General Classification +2

Identifying Spurious Correlations for Robust Text Classification

1 code implementation Findings of the Association for Computational Linguistics 2020 Zhao Wang, Aron Culotta

The predictions of text classifiers are often driven by spurious correlations -- e.g., the term 'Spielberg' correlates with positively reviewed movies, even though the term itself does not semantically convey a positive sentiment.

feature selection, General Classification +5

Are Words Commensurate with Actions? Quantifying Commitment to a Cause from Online Public Messaging

no code implementations 6 Oct 2020 Zhao Wang, Jennifer Cutler, Aron Culotta

Often, this public messaging is aimed at aligning the entity with a particular cause or issue, such as the environment or public health.

Text Classification

Contrastive Cross-site Learning with Redesigned Net for COVID-19 CT Classification

1 code implementation 15 Sep 2020 Zhao Wang, Quande Liu, Qi Dou

The pandemic of coronavirus disease 2019 (COVID-19) has led to a global public health crisis spreading across hundreds of countries.

COVID-19 Diagnosis, General Classification

Neural Architecture Search on Acoustic Scene Classification

no code implementations 30 Dec 2019 Jixiang Li, Chuming Liang, Bo Zhang, Zhao Wang, Fei Xiang, Xiangxiang Chu

Convolutional neural networks are widely adopted in Acoustic Scene Classification (ASC) tasks, but they generally carry a heavy computational burden.

Acoustic Scene Classification, Classification +3

When do Words Matter? Understanding the Impact of Lexical Choice on Audience Perception using Individual Treatment Effect Estimation

no code implementations 12 Nov 2018 Zhao Wang, Aron Culotta

However, we lack general methods for estimating the causal effect of lexical choice on the perception of a specific sentence.

Sentence
