Search Results for author: Yun Wang

Found 18 papers, 7 papers with code

Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks

no code implementations14 Oct 2021 Sangeeta Srivastava, Yun Wang, Andros Tjandra, Anurag Kumar, Chunxi Liu, Kritika Singh, Yatharth Saraf

While self-supervised speech representation learning has been popular in the speech research community, very few works have comprehensively analyzed audio representation learning for non-speech audio tasks.

Fine-tuning Representation Learning +1

Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study

no code implementations7 Oct 2021 Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer

Detection of common events and scenes from audio is useful for extracting and understanding human contexts in daily life.

Event Detection

PTNet: A High-Resolution Infant MRI Synthesizer Based on Transformer

1 code implementation28 May 2021 Xuzhe Zhang, Xinzi He, Jia Guo, Nabil Ettehadi, Natalie Aw, David Semanek, Jonathan Posner, Andrew Laine, Yun Wang

Magnetic resonance imaging (MRI) noninvasively provides critical information about how human brain structures develop across stages of life.

Wasserstein Coupled Graph Learning for Cross-Modal Retrieval

no code implementations ICCV 2021 Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang

Then, a Wasserstein coupled dictionary, containing multiple pairs of counterpart graph keys with each key corresponding to one modality, is constructed for further feature learning.

Cross-Modal Retrieval Graph Embedding +1

An Infrared Communication System based on Handstand Pendulum

no code implementations9 Sep 2020 Xingchen Li, Changlu Li, Yun Wang, Mengqi Lei

In this system, 940nm infrared light is mainly used for audio signal transmission, and an handstand pendulum based on PID is used to control the angle and stability of infrared light emission.

Instance-Aware Graph Convolutional Network for Multi-Label Classification

no code implementations19 Aug 2020 Yun Wang, Tong Zhang, Zhen Cui, Chunyan Xu, Jian Yang

For label diffusion of instance-awareness in graph convolution, rather than using the statistical label correlation alone, an image-dependent label correlation matrix (LCM), fusing both the statistical LCM and an individual one of each image instance, is constructed for graph inference on labels to inject adaptive information of label-awareness into the learned features of the model.

Classification General Classification +2

Selective Attention Encoders by Syntactic Graph Convolutional Networks for Document Summarization

no code implementations18 Mar 2020 Haiyang Xu, Yun Wang, Kun Han, Baochang Ma, Junwen Chen, Xiangang Li

Abstractive text summarization is a challenging task, and one need to design a mechanism to effectively extract salient information from the source text and then generate a summary.

Abstractive Text Summarization Document Summarization

Learning Alignment for Multimodal Emotion Recognition from Speech

1 code implementation6 Sep 2019 Haiyang Xu, HUI ZHANG, Kun Han, Yun Wang, Yiping Peng, Xiangang Li

Further, emotion recognition will be beneficial from using audio-textual multimodal information, it is not trivial to build a system to learn from multimodality.

Multimodal Emotion Recognition Speech Emotion Recognition +1

A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling

3 code implementations22 Oct 2018 Yun Wang, Juncheng Li, Florian Metze

This paper compares five types of pooling functions both theoretically and experimentally, with special focus on their performance of localization.

Sound Audio and Speech Processing

Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling

2 code implementations22 Oct 2018 Yun Wang, Florian Metze

Research on sound event detection (SED) with weak labeling has mostly focused on presence/absence labeling, which provides no temporal information at all about the event occurrences.

Sound Audio and Speech Processing

Feedback-Controlled Sequential Lasso Screening

no code implementations21 Aug 2016 Yun Wang, Xu Chen, Peter J. Ramadge

In this context, we propose and explore a feedback controlled sequential screening scheme.

Model Selection

The Symmetry of a Simple Optimization Problem in Lasso Screening

no code implementations21 Aug 2016 Yun Wang, Peter J. Ramadge

Recently dictionary screening has been proposed as an effective way to improve the computational efficiency of solving the lasso problem, which is one of the most commonly used method for learning sparse representations.

Screening Tests for Lasso Problems

no code implementations19 May 2014 Zhen James Xiang, Yun Wang, Peter J. Ramadge

For a given target vector, dictionary screening quickly identifies a subset of dictionary columns that will receive zero weight in a solution of the corresponding lasso problem.

Unsupervised Feature Learning by Deep Sparse Coding

no code implementations20 Dec 2013 Yunlong He, Koray Kavukcuoglu, Yun Wang, Arthur Szlam, Yanjun Qi

In this paper, we propose a new unsupervised feature learning framework, namely Deep Sparse Coding (DeepSC), that extends sparse coding to a multi-layer architecture for visual object recognition tasks.

Object Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.