Search Results for author: Shuo Liu

Found 34 papers, 8 papers with code

Multi-Channel CNN-based Object Detection for Enhanced Situation Awareness

no code implementations • 30 Nov 2017 • Shuo Liu, Zheng Liu

In this study, we propose a novel detection algorithm for military objects by fusing multi-channel CNNs.

Computational Efficiency Object +3

Efficient Traffic-Sign Recognition with Scale-aware CNN

no code implementations • 31 May 2018 • Yuchen Yang, Shuo Liu, Wei Ma, Qiuyuan Wang, Zheng Liu

The paper presents a Traffic Sign Recognition (TSR) system that can quickly and accurately recognize traffic signs of different sizes in images.

General Classification Traffic Sign Recognition

IR2VI: Enhanced Night Environmental Perception by Unsupervised Thermal Image Translation

no code implementations • 25 Jun 2018 • Shuo Liu, Vijay John, Erik Blasch, Zheng Liu, Ying Huang

Context enhancement is critical for night vision (NV) applications, especially for the dark night situation without any artificial lights.

Translation

Single-Channel Speech Separation with Auxiliary Speaker Embeddings

no code implementations • 24 Jun 2019 • Shuo Liu, Gil Keren, Björn Schuller

We present a novel source separation model to decompose a single-channel speech signal into two speech segments belonging to two different speakers.

Speech Separation

AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

no code implementations • 10 Jul 2019 • Fabien Ringeval, Björn Schuller, Michel Valstar, Nicholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic

The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions.

Emotion Recognition

N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System

1 code implementation • 16 Nov 2019 • Shuo Liu, Gil Keren, Björn Schuller

N-HANS is a Python toolkit for in-the-wild audio enhancement, including speech, music, and general audio denoising, separation, and selective noise or source suppression.

Sound Audio and Speech Processing

An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

no code implementations • 30 Apr 2020 • Jing Han, Kun Qian, Meishu Song, Zijiang Yang, Zhao Ren, Shuo Liu, Juan Liu, Huaiyuan Zheng, Wei Ji, Tomoya Koike, Xiao Li, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller

In particular, by analysing speech recordings from these patients, we construct audio-only-based models to automatically categorise the health state of patients from four aspects, including the severity of illness, sleep quality, fatigue, and anxiety.

Sleep Quality

Byzantine Fault-Tolerant Distributed Machine Learning Using Stochastic Gradient Descent (SGD) and Norm-Based Comparative Gradient Elimination (CGE)

no code implementations • 11 Aug 2020 • Nirupam Gupta, Shuo Liu, Nitin H. Vaidya

We show that the CGE gradient-filter guarantees fault-tolerance against a bounded fraction of Byzantine agents under standard stochastic assumptions, and is computationally simpler compared to many existing gradient-filters such as multi-KRUM, geometric median-of-means, and the spectral filters.

On designing finite time iterative learning control based on steady state frequency response

no code implementations • 6 Oct 2021 • Shuo Liu, Richard W. Longman, Benjamas Panomruttanarug

Iterative Learning Control (ILC) is useful in spacecraft applications for repeated high-precision scanning maneuvers.

Modifying and optimizing the inverse of the frequency response circulant matrix as an iterative learning control compensator

no code implementations • 6 Oct 2021 • Shuo Liu, Richard W. Longman

The purpose of this paper is to create a method of designing ILC compensators based on steady state frequency response, and have the ILC converge to zero error in spite of transients and bandwidth.

Multistage linguistic conditioning of convolutional layers for speech emotion recognition

no code implementations • 13 Oct 2021 • Andreas Triantafyllopoulos, Uwe Reichel, Shuo Liu, Stephan Huber, Florian Eyben, Björn W. Schuller

In this contribution, we investigate the effectiveness of deep fusion of text and audio features for categorical and dimensional speech emotion recognition (SER).

Speech Emotion Recognition

Semi-supervised Multi-task Learning for Semantics and Depth

no code implementations • 14 Oct 2021 • Yufeng Wang, Yi-Hsuan Tsai, Wei-Chih Hung, Wenrui Ding, Shuo Liu, Ming-Hsuan Yang

Multi-Task Learning (MTL) aims to enhance the model generalization by sharing representations between related tasks for better performance.

Depth Estimation Multi-Task Learning +1

Utilizing Redundancy in Cost Functions for Resilience in Distributed Optimization and Learning

no code implementations • 21 Oct 2021 • Shuo Liu, Nirupam Gupta, Nitin Vaidya

We demonstrate, both theoretically and empirically, the merits of our proposed redundancy model in improving the robustness of DGD against asynchronous and Byzantine agents, and their extensions to distributed stochastic gradient descent (D-SGD) for robust distributed machine learning with asynchronous and Byzantine agents.

Distributed Optimization

Audio Self-supervised Learning: A Survey

no code implementations • 2 Mar 2022 • Shuo Liu, Adria Mallol-Ragolta, Emilia Parada-Cabaleiro, Kun Qian, Xin Jing, Alexander Kathan, Bin Hu, Bjoern W. Schuller

Inspired by humans' cognitive ability to generalise knowledge and skills, Self-Supervised Learning (SSL) aims to discover general representations from large-scale data without requiring human annotations, which are expensive and time-consuming to obtain.

Self-Supervised Learning

A Temporal-oriented Broadcast ResNet for COVID-19 Detection

no code implementations • 31 Mar 2022 • Xin Jing, Shuo Liu, Emilia Parada-Cabaleiro, Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Björn W. Schuller

Detecting COVID-19 from audio signals, such as breathing and coughing, can be used as a fast and efficient pre-testing method to reduce the virus transmission.

Computational Efficiency

TLP: A Deep Learning-based Cost Model for Tensor Program Tuning

1 code implementation • 7 Nov 2022 • Yi Zhai, Yu Zhang, Shuo Liu, Xiaomeng Chu, Jie Peng, Jianmin Ji, Yanyong Zhang

Instead of extracting features from the tensor program itself, TLP extracts features from the schedule primitives.

Multi-Task Learning

Impact of Redundancy on Resilience in Distributed Optimization and Learning

no code implementations • 16 Nov 2022 • Shuo Liu, Nirupam Gupta, Nitin H. Vaidya

In particular, we introduce the notion of $(f, r; \epsilon)$-resilience to characterize how well the true solution is approximated in the presence of up to $f$ Byzantine faulty agents, and up to $r$ slow agents (or stragglers) -- smaller $\epsilon$ represents a better approximation.

Distributed Optimization

Towards Selection of Text-to-speech Data to Augment ASR Training

no code implementations • 30 May 2023 • Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli

This paper presents a method for selecting appropriate synthetic speech samples from a given large text-to-speech (TTS) dataset as supplementary training data for an automatic speech recognition (ASR) model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning Acceleration

no code implementations • 14 Jun 2023 • Zhiqiang Que, Shuo Liu, Markus Rognlien, Ce Guo, Jose G. F. Coutinho, Wayne Luk

This paper introduces a novel optimization framework for deep neural network (DNN) hardware accelerators, enabling the rapid development of customized and automated design flows.

A Method to Speed Up Convergence of Iterative Learning Control for High Precision Repetitive Motions

no code implementations • 29 Jul 2023 • Richard W. Longman, Shuo Liu, Tarek A. Elsharhawy

Iterative Learning Control (ILC) records previous run tracking error, adjusts the next run command, aiming for zero tracking error in the real world, not our model of the world.

Startup Acquisitions: Acquihires and Talent Hoarding

no code implementations • 19 Aug 2023 • Jean-Michel Benkert, Igor Letina, Shuo Liu

We study how competitive forces may drive firms to inefficiently acquire startup talent.

FishMOT: A Simple and Effective Method for Fish Tracking Based on IoU Matching

1 code implementation • 6 Sep 2023 • Shuo Liu, Lulu Han, Xiaoyang Liu, Junli Ren, Fang Wang, Ying Liu, Yuanshan Lin

Wherein, a basic module performs target association based on the IoU of detection boxes between successive frames to deal with morphological changes of fish; an interaction module combines the IoU of detection boxes and the IoU of fish entities to handle occlusions; and a refind module uses spatio-temporal information to overcome tracking failures resulting from detections missed by the detector in complex environments.

Fish Detection Multi-Object Tracking +3

Inductive Cognitive Diagnosis for Fast Student Learning in Web-Based Online Intelligent Education Systems

1 code implementation • 17 Apr 2024 • Shuo Liu, Junhao Shen, Hong Qian, Aimin Zhou

To this end, this paper proposes an inductive cognitive diagnosis model (ICDM) for fast inference of new students' mastery levels in web-based online intelligent education systems (WOIESs).

cognitive diagnosis

SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis

no code implementations • 19 Apr 2024 • Hongzhi Qi, Hanfei Liu, Jianqiang Li, Qing Zhao, Wei Zhai, Dan Luo, Tian Yu He, Shuo Liu, Bing Xiang Yang, Guanghui Fu

Seven pre-trained models were evaluated in two tasks: high and low suicide risk, and fine-grained suicide risk classification on a level of 0 to 10.

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

no code implementations • 23 Apr 2024 • Zhen Yang, Fang Liu, Zhongxing Yu, Jacky Wai Keung, Jia Li, Shuo Liu, Yifan Hong, Xiaoxue Ma, Zhi Jin, Ge Li

Specifically, UniTrans first crafts a series of test cases for target programs with the assistance of source programs.
