Search Results for author: Ruoyu Wang

Found 28 papers, 12 papers with code

Multitask frame-level learning for few-shot sound event detection

no code implementations17 Mar 2024 Liang Zou, Genwei Yan, Ruoyu Wang, Jun Du, Meng Lei, Tian Gao, Xin Fang

This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples.

Data Augmentation Event Detection +1

StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images

1 code implementation14 Mar 2024 Robert Jewsbury, Ruoyu Wang, Abhir Bhalerao, Nasir Rajpoot, Quoc Dang Vu

Stain normalization algorithms aim to transform the color and intensity characteristics of a source multi-gigapixel histology image to match those of a target image, mitigating inconsistencies in the appearance of stains used to highlight cellular components in the images.

Computational Efficiency Instance Segmentation +3

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

1 code implementation7 Mar 2024 Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee

In this paper, we investigate this contrasting phenomenon from the perspective of modality bias and reveal that an excessive modality bias on the audio caused by dropout is the underlying reason.

Audio-Visual Speech Recognition Knowledge Distillation +2

DETER: Detecting Edited Regions for Deterring Generative Manipulations

no code implementations16 Dec 2023 Sai Wang, Ye Zhu, Ruoyu Wang, Amaya Dharmasiri, Olga Russakovsky, Yu Wu

While face swapping and attribute editing are performed on similar face regions such as eyes and nose, the inpainting operation can be performed on random image regions, removing the spurious correlations of previous datasets.

Attribute Face Swapping +1

An Automated Pipeline for Tumour-Infiltrating Lymphocyte Scoring in Breast Cancer

1 code implementation10 Nov 2023 Adam J Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir M Rajpoot, Shan E Ahmed Raza

Tumour-infiltrating lymphocytes (TILs) are considered as a valuable prognostic markers in both triple-negative and human epidermal growth factor receptor 2 (HER2) positive breast cancer.

whole slide images

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

1 code implementation17 Sep 2023 Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee

We propose a novel neural speaker diarization system using memory-aware multi-speaker embedding with sequence-to-sequence architecture (NSD-MS2S), which integrates the strengths of memory-aware multi-speaker embedding (MA-MSE) and sequence-to-sequence (Seq2Seq) architecture, leading to improvement in both efficiency and performance.

speaker-diarization Speaker Diarization

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

no code implementations15 Sep 2023 Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao

This pioneering effort aims to set the first benchmark for the AVTSE task, offering fresh insights into enhancing the ac-curacy of back-end speech recognition systems through AVTSE in challenging and real acoustic environments.

Audio-Visual Speech Recognition speech-recognition +2

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

no code implementations28 Aug 2023 Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee

This technical report details our submission system to the CHiME-7 DASR Challenge, which focuses on speaker diarization and speech recognition under complex multi-speaker scenarios.

speaker-diarization Speaker Diarization +2

Concavity-Induced Distance for Unoriented Point Cloud Decomposition

no code implementations19 Jun 2023 Ruoyu Wang, Yanfei Xue, Bharath Surianarayanan, Dong Tian, Chen Feng

We propose Concavity-induced Distance (CID) as a novel way to measure the dissimilarity between a pair of points in an unoriented point cloud.

Instance Segmentation Semantic Segmentation

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation

1 code implementation14 Jun 2023 Ruoyu Wang, Yongqi Yang, Zhihao Qian, Ye Zhu, Yu Wu

In this work, we investigate the diffusion (physics) in diffusion (machine learning) properties and propose our Cyclic One-Way Diffusion (COW) method to control the direction of diffusion phenomenon given a pre-trained frozen diffusion model for versatile customization application scenarios, where the low-level pixel information from the conditioning needs to be preserved.

Denoising Image Generation

P$^2$SDF for Neural Indoor Scene Reconstruction

no code implementations1 Mar 2023 Jing Li, Jinpeng Yu, Ruoyu Wang, Zhengxin Li, Zhengyu Zhang, Lina Cao, Shenghua Gao

As the unsupervised plane segments are usually noisy and inaccurate, we propose to assign different weights to the sampled points on the plane in plane estimation as well as the regularization loss.

Indoor Scene Reconstruction Surface Reconstruction

An Aggregation of Aggregation Methods in Computational Pathology

no code implementations2 Nov 2022 Mohsin Bilal, Robert Jewsbury, Ruoyu Wang, Hammam M. AlGhamdi, Amina Asif, Mark Eastwood, Nasir Rajpoot

Image analysis and machine learning algorithms operating on multi-gigapixel whole-slide images (WSIs) often process a large number of tiles (sub-images) and require aggregating predictions from the tiles in order to predict WSI-level labels.

Multiple Instance Learning whole slide images

Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods

no code implementations19 Aug 2022 Chao Chen, Xinhao Liu, Xuchu Xu, Yiming Li, Li Ding, Ruoyu Wang, Chen Feng

Inspired by noisy label learning, we propose a novel self-supervised framework named \textit{TF-VPR} that uses temporal neighborhoods and learnable feature neighborhoods to discover unknown spatial neighborhoods.

Data Augmentation Representation Learning +1

Breaking Correlation Shift via Conditional Invariant Regularizer

no code implementations14 Jul 2022 Mingyang Yi, Ruoyu Wang, Jiachen Sun, Zhenguo Li, Zhi-Ming Ma

The correlation shift is caused by the spurious attributes that correlate to the class label, as the correlation between them may vary in training and test data.

TIAger: Tumor-Infiltrating Lymphocyte Scoring in Breast Cancer for the TiGER Challenge

1 code implementation23 Jun 2022 Adam Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir Rajpoot, Shan E Ahmed Raza

The Tumor InfiltratinG lymphocytes in breast cancER (TiGER) challenge, aims to assess the prognostic significance of computer-generated TILs scores for predicting survival as part of a Cox proportional hazards model.

Out-of-distribution Generalization with Causal Invariant Transformations

no code implementations CVPR 2022 Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu

In this work, we obviate these assumptions and tackle the OOD problem without explicitly recovering the causal feature.

Out-of-Distribution Generalization

ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning

1 code implementation8 Feb 2022 Zhenkun Shi, Qianqian Yuan, Ruoyu Wang, Hoaran Li, Xiaoping Liao, Hongwu Ma

Take UniPort protein "A0A0U5GJ41" as an example (1. 14.-.-), ECRECer annotated it with "1. 14. 11. 38", which supported by further protein structure analysis based on AlphaFold2.

Benchmarking Protein Language Model

Active Learning-Based Optimization of Scientific Experimental Design

no code implementations29 Dec 2021 Ruoyu Wang

Active learning (AL) is a machine learning algorithm that can achieve greater accuracy with fewer labeled training instances, for having the ability to ask oracles to label the most valuable unlabeled data chosen iteratively and heuristically by query strategies.

Active Learning Experimental Design

Improving OOD Generalization with Causal Invariant Transformations

no code implementations29 Sep 2021 Ruoyu Wang, Mingyang Yi, Shengyu Zhu, Zhitang Chen

In this work, we obviate these assumptions and tackle the OOD problem without explicitly recovering the causal feature.

Constructing Flow Graphs from Procedural Cybersecurity Texts

1 code implementation Findings (ACL) 2021 Kuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang, Chitta Baral

We must read the whole text to identify the relevant information or identify the instruction flows to complete a task, which is prone to failures.

Sentence Sentence Embeddings

Deep Weakly Supervised Positioning

no code implementations10 Apr 2021 Ruoyu Wang, Xuchu Xu, Li Ding, Yang Huang, Chen Feng

PoseNet can map a photo to the position where it is taken, which is appealing in robotics.

Characterization of Excess Risk for Locally Strongly Convex Population Risk

1 code implementation4 Dec 2020 Mingyang Yi, Ruoyu Wang, Zhi-Ming Ma

Our bounds underscore that with locally strongly convex population risk, the models trained by any proper iterative algorithm can generalize well, even for non-convex problems, and $d$ is large.

SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings

1 code implementation CVPR 2020 Wenyu Han, Siyuan Xiang, Chenhui Liu, Ruoyu Wang, Chen Feng

Our experiments show that although convolutional networks have achieved superhuman performance in many visual learning tasks, their spatial reasoning performance on SPARE3D tasks is either lower than average human performance or even close to random guesses.

Real-time Soft Body 3D Proprioception via Deep Vision-based Sensing

1 code implementation8 Apr 2019 Ruoyu Wang, Shiheng Wang, Songyu Du, Erdong Xiao, Wenzhen Yuan, Chen Feng

Soft bodies made from flexible and deformable materials are popular in many robotics applications, but their proprioceptive sensing has been a long-standing challenge.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.