Search Results for author: Dadong Wang

Found 18 papers, 3 papers with code

Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos

no code implementations CVPR 2024 Chen Liu, Peike Patrick Li, Qingtao Yu, Hongwei Sheng, Dadong Wang, Lincheng Li, Xin Yu

Considering that pixel-level annotations are difficult to obtain in some complex scenes, we also provide bounding boxes to indicate the sounding regions.

Benchmarking

BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge

no code implementations 20 Aug 2023 Chen Liu, Peike Li, Hu Zhang, Lincheng Li, Zi Huang, Dadong Wang, Xin Yu

In a nutshell, our BAVS is designed to eliminate the interference of background noise or off-screen sounds in segmentation by establishing the audio-visual correspondences in an explicit manner.

Audio Classification Segmentation

Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics

no code implementations 31 Jul 2023 Chen Liu, Peike Li, Xingqun Qi, Hu Zhang, Lincheng Li, Dadong Wang, Xin Yu

However, we observed that prior methods tend to segment a certain salient object in a video regardless of the audio information.

Object Segmentation +1

Unleashing the Potential of Regularization Strategies in Learning with Noisy Labels

no code implementations 11 Jul 2023 Hui Kang, Sheng Liu, Huaxi Huang, Jun Yu, Bo Han, Dadong Wang, Tongliang Liu

In recent years, research on learning with noisy labels has focused on devising novel algorithms that can achieve robustness to noisy training labels while generalizing to clean data.

Learning with noisy labels

ESceme: Vision-and-Language Navigation with Episodic Scene Memory

1 code implementation 2 Mar 2023 Qi Zheng, Daqing Liu, Chaoyue Wang, Jing Zhang, Dadong Wang, DaCheng Tao

Vision-and-language navigation (VLN) simulates a visual agent that follows natural-language navigation instructions in real-world scenes.

Vision and Language Navigation

Cross-Modal Contrastive Learning for Robust Reasoning in VQA

1 code implementation 21 Nov 2022 Qi Zheng, Chaoyue Wang, Daqing Liu, Dadong Wang, DaCheng Tao

For each positive pair, we regard the images from different graphs as negative samples and derive a multi-positive version of contrastive learning.

Contrastive Learning Question Answering +1

CNN-based Local Vision Transformer for COVID-19 Diagnosis

no code implementations 5 Jul 2022 Hongyan Xu, Xiu Su, Dadong Wang

Deep learning technology can be used as an assistive technology to help doctors quickly and accurately identify COVID-19 infections.

COVID-19 Diagnosis Image Classification

Multi-scale alignment and Spatial ROI Module for COVID-19 Diagnosis

no code implementations 4 Jul 2022 Hongyan Xu, Dadong Wang, Arcot Sowmya

However, in CT and CXR images, the infected area occupies only a small part of the image.

COVID-19 Diagnosis

Bypass Network for Semantics Driven Image Paragraph Captioning

no code implementations 21 Jun 2022 Qi Zheng, Chaoyue Wang, Dadong Wang

Most existing methods model the coherence through the topic transition that dynamically infers a topic vector from preceding sentences.

Image Paragraph Captioning Sentence

Visual Superordinate Abstraction for Robust Concept Learning

no code implementations 28 May 2022 Qi Zheng, Chaoyue Wang, Dadong Wang, DaCheng Tao

Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks.

Attribute Question Answering +1

Monitoring of Pigmented Skin Lesions Using 3D Whole Body Imaging

no code implementations 14 May 2022 David Ahmedt-Aristizabal, Chuong Nguyen, Lachlan Tychsen-Smith, Ashley Stacey, Shenghong Li, Joseph Pathikulangara, Lars Petersson, Dadong Wang

A modular camera rig arranged in a cylindrical configuration was designed to automatically capture images of the entire skin surface of a subject synchronously from multiple angles.

Image Reconstruction Lesion Detection +1

Computer-Aided Extraction of Select MRI Markers of Cerebral Small Vessel Disease: A Systematic Review

no code implementations 4 Apr 2022 Jiyang Jiang, Dadong Wang, Yang Song, Perminder S. Sachdev, Wei Wen

Cerebral small vessel disease (CSVD) is a major vascular contributor to cognitive impairment in ageing, including dementias.

Transfer Learning

Video-based cattle identification and action recognition

1 code implementation 14 Oct 2021 Chuong Nguyen, Dadong Wang, Karl Von Richter, Philip Valencia, Flavio A. P. Alvarenga, Gregory Bishop-Hurley

We demonstrate a working prototype for the monitoring of cow welfare by automatically analysing the animal behaviours.

Action Recognition

PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting

no code implementations 16 Jan 2020 Saeed Amirgholipour, Xiangjian He, Wenjing Jia, Dadong Wang, Lei Liu

For this purpose, a classifier evaluates the density level of the input features and then passes them to the corresponding high and low crowded DAD modules.

Crowd Counting Decoder

A-CCNN: Adaptive CCNN for Density Estimation and Crowd Counting

no code implementations 19 Apr 2018 Saeed Amirgholipour Kasmani, Xiangjian He, Wenjing Jia, Dadong Wang, Michelle Zeibots

Crowd counting, for estimating the number of people in a crowd using vision-based computer techniques, has attracted much interest in the research community.

Crowd Counting Density Estimation +1

A Structural Correlation Filter Combined with A Multi-task Gaussian Particle Filter for Visual Tracking

no code implementations 3 Mar 2018 Manna Dai, Shuying Cheng, Xiangjian He, Dadong Wang

First, it can detect the tracked target in a large-scale search scope via weak KCF trackers and evaluate the reliability of weak trackers' decisions for a Gaussian particle filter to make a strong decision, and hence it can tackle fast motions, appearance variations, occlusions and re-detections.

Visual Tracking
