Search Results for author: Yiming Wang

Found 78 papers, 29 papers with code

Noise-injected Consistency Training and Entropy-constrained Pseudo Labeling for Semi-supervised Extractive Summarization

1 code implementation COLING 2022 Yiming Wang, Qianren Mao, Junnan Liu, Weifeng Jiang, Hongdong Zhu, JianXin Li

Labeling large amounts of extractive summarization data is often prohibitive expensive due to time, financial, and expertise constraints, which poses great challenges to incorporating summarization system in practical applications.

Extractive Summarization

Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation

no code implementations15 Mar 2024 Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Yiming Wang

Moreover, we formally define the task of Instruction Error Detection and Localization, and establish an evaluation protocol on top of our benchmark dataset.

One for all: A novel Dual-space Co-training baseline for Large-scale Multi-View Clustering

no code implementations28 Jan 2024 Zisen Kong, Zhiqiang Fu, Dongxia Chang, Yiming Wang, Yao Zhao

We jointly optimize the construction of the latent consistent anchor graph and the feature transformation to generate a discriminative anchor graph.

Clustering

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

1 code implementation18 Jan 2024 Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu

We introduce R-Judge, a benchmark crafted to evaluate the proficiency of LLMs in judging and identifying safety risks given agent interaction records.

Benchmarking

Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding

no code implementations4 Dec 2023 Guofeng Mei, Luigi Riz, Yiming Wang, Fabio Poiesi

To this end, we introduce the first training-free aggregation technique that leverages the point cloud's 3D geometric structure to improve the quality of the transferred Vision-Language Models.

Segmentation Semantic Segmentation

PrivateLoRA For Efficient Privacy Preserving LLM

no code implementations23 Nov 2023 Yiming Wang, Yu Lin, Xiaodong Zeng, Guannan Zhang

To our knowledge, our proposed framework is the first efficient and privacy-preserving LLM solution in the literature.

Language Modelling Large Language Model +1

MultiLoRA: Democratizing LoRA for Better Multi-Task Learning

no code implementations20 Nov 2023 Yiming Wang, Yu Lin, Xiaodong Zeng, Guannan Zhang

Further investigation into weight update matrices of MultiLoRA exhibits reduced dependency on top singular vectors and more democratic unitary transform contributions.

Multi-Task Learning Natural Language Understanding +1

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

1 code implementation20 Nov 2023 Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks.

Delving into CLIP latent space for Video Anomaly Recognition

1 code implementation4 Oct 2023 Luca Zanella, Benedetta Liberatori, Willi Menapace, Fabio Poiesi, Yiming Wang, Elisa Ricci

We tackle the complex problem of detecting and recognising anomalies in surveillance videos at the frame level, utilising only video-level supervision.

Anomaly Detection Multiple Instance Learning +1

ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer Layers

no code implementations3 Oct 2023 Yiming Wang, Jinyu Li

In this paper, we aim to reduce model size by reparameterizing model weights across Transformer encoder layers and assuming a special weight composition and structure.

speech-recognition Speech Recognition

Survey on video anomaly detection in dynamic scenes with moving cameras

no code implementations14 Aug 2023 Runyu Jiao, Yi Wan, Fabio Poiesi, Yiming Wang

The increasing popularity of compact and inexpensive cameras, e. g.~dash cameras, body cameras, and cameras equipped on robots, has sparked a growing interest in detecting anomalies within dynamic scenes recorded by moving cameras.

Anomaly Detection Video Anomaly Detection

Attentive Multimodal Fusion for Optical and Scene Flow

1 code implementation28 Jul 2023 Youjie Zhou, Guofeng Mei, Yiming Wang, Fabio Poiesi, Yi Wan

This paper presents an investigation into the estimation of optical and scene flow using RGBD information in scenarios where the RGB modality is affected by noise or captured in dark environments.

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

1 code implementation30 Jun 2023 Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang

Neural-symbolic methods have demonstrated efficiency in enhancing the reasoning abilities of large language models (LLMs).

Domain Generalization In-Context Learning +1

Vocabulary-free Image Classification

1 code implementation NeurIPS 2023 Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, Elisa Ricci

We thus formalize a novel task, termed as Vocabulary-free Image Classification (VIC), where we aim to assign to an input image a class that resides in an unconstrained language-induced semantic space, without the prerequisite of a known vocabulary.

Classification Image Classification +4

Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos

1 code implementation24 May 2023 Błażej Leporowski, Arian Bakhtiarnia, Nicole Bonnici, Adrian Muscat, Luca Zanella, Yiming Wang, Alexandros Iosifidis

We introduce the first audio-visual dataset for traffic anomaly detection taken from real-world scenes, called MAVAD, with a diverse range of weather and illumination conditions.

Anomaly Detection

Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method

1 code implementation22 May 2023 Yiming Wang, Zhuosheng Zhang, Rui Wang

Further, we propose a Summary Chain-of-Thought (SumCoT) technique to elicit LLMs to generate summaries step by step, which helps them integrate more fine-grained details of source documents into the final summaries that correlate with the human writing mindset.

Benchmarking Hallucination

Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models

1 code implementation20 Mar 2023 Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang, Alessio Del Bue

We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning.

Sentence Sentence Ordering +1

3DSGrasp: 3D Shape-Completion for Robotic Grasp

no code implementations2 Jan 2023 Seyed S. Mohammadi, Nuno F. Duarte, Dimitris Dimou, Yiming Wang, Matteo Taiana, Pietro Morerio, Atabak Dehban, Plinio Moreno, Alexandre Bernardino, Alessio Del Bue, Jose Santos-Victor

However, in practice, PCDs are often incomplete when objects are viewed from few and sparse viewpoints before the grasping action, leading to the generation of wrong or inaccurate grasp poses.

Robotic Grasping

Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds

1 code implementation ICCV 2023 Ze Yang, Ruibo Li, Evan Ling, Chi Zhang, Yiming Wang, Dezhao Huang, Keng Teck Ma, Minhoe Hur, Guosheng Lin

To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output.

Continual Semantic Segmentation Knowledge Distillation +1

Learning with linear mixed model for group recommendation systems

no code implementations17 Dec 2022 Baode Gao, Guangpeng Zhan, Hanzhang Wang, Yiming Wang, Shengxin Zhu

Accurate prediction of users' responses to items is one of the main aims of many computational advising applications.

Recommendation Systems

NeuS2: Fast Learning of Neural Implicit Surfaces for Multi-view Reconstruction

1 code implementation ICCV 2023 Yiming Wang, Qin Han, Marc Habermann, Kostas Daniilidis, Christian Theobalt, Lingjie Liu

Recent methods for neural surface representation and rendering, for example NeuS, have demonstrated the remarkably high-quality reconstruction of static scenes.

Surface Reconstruction

Query Your Model with Definitions in FrameNet: An Effective Method for Frame Semantic Role Labeling

1 code implementation5 Dec 2022 Ce Zheng, Yiming Wang, Baobao Chang

Such methods usually model role classification as naive multi-class classification and treat arguments individually, which neglects label semantics and interactions between arguments and thus hindering performance and generalization of models.

Classification Multi-class Classification +1

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation

1 code implementation18 Nov 2022 Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Scott Wen-tau Yih, Daniel Fried, Sida Wang, Tao Yu

We introduce DS-1000, a code generation benchmark with a thousand data science problems spanning seven Python libraries, such as NumPy and Pandas.

Code Generation Memorization

Oracle-guided Contrastive Clustering

no code implementations1 Nov 2022 Mengdie Wang, Liyuan Shang, Suyun Zhao, Yiming Wang, Hong Chen, Cuiping Li, XiZhao Wang

Accordingly, the query results, guided by oracles with distinctive demands, may drive the OCC's clustering results in a desired orientation.

Active Learning Clustering +2

Leveraging commonsense for object localisation in partial scenes

no code implementations1 Nov 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Alessio Del Bue, Yiming Wang

With the proposed graph-based scene representation, we estimate the unknown position of the target object using a Graph Neural Network that implements a novel attentional message passing mechanism.

Object Position

ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing

1 code implementation20 Oct 2022 Giulio Mattolin, Luca Zanella, Elisa Ricci, Yiming Wang

Unsupervised Domain Adaptation (UDA) for object detection aims to adapt a model trained on a source domain to detect instances from a new target domain for which annotations are not available.

Object Detection Unsupervised Domain Adaptation

CTCBERT: Advancing Hidden-unit BERT with CTC Objectives

no code implementations16 Oct 2022 Ruchao Fan, Yiming Wang, Yashesh Gaur, Jinyu Li

We examine CTCBERT on IDs from HuBERT Iter1, HuBERT Iter2, and PBERT.

Neural Novel Actor: Learning a Generalized Animatable Neural Representation for Human Actors

no code implementations25 Aug 2022 Yiming Wang, Qingzhe Gao, Libin Liu, Lingjie Liu, Christian Theobalt, Baoquan Chen

The learned representation can be used to synthesize novel view images of an arbitrary person from a sparse set of cameras, and further animate them with the user's pose control.

Attribute

PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation

1 code implementation9 Jul 2022 Bin Ren, Hao Tang, Yiming Wang, Xia Li, Wei Wang, Nicu Sebe

For semantic-guided cross-view image translation, it is crucial to learn where to sample pixels from the source view image and where to reallocate them guided by the target view semantic map, especially when there is little overlap or drastic view difference between the source and target images.

Generative Adversarial Network

Long-tailed Recognition by Learning from Latent Categories

no code implementations2 Jun 2022 Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.

Data Augmentation Long-tail Learning

Spatial Commonsense Graph for Object Localisation in Partial Scenes

1 code implementation CVPR 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming Wang, Alessio Del Bue

The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the SCG; second, we propose a Localisation Module based on circular intersection to estimate the object position using all the predicted pairwise distances in order to be independent of any reference system.

Object Position

Behavior Recognition Based on the Integration of Multigranular Motion Features

no code implementations7 Mar 2022 Lizong Zhang, Yiming Wang, Bei Hui, Xiujian Zhang, Sijuan Liu, Shuxin Feng

Specifically, behavior recognition may even rely more on the modeling of temporal information containing short-range and long-range motions; this contrasts with computer vision tasks involving images that focus on the understanding of spatial information.

Action Recognition

ACTIVE:Augmentation-Free Graph Contrastive Learning for Partial Multi-View Clustering

no code implementations1 Mar 2022 Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, Yao Zhao

In this paper, we propose an augmentation-free graph contrastive learning framework, namely ACTIVE, to solve the problem of partial multi-view clustering.

Clustering Contrastive Learning +1

Graph-based Generative Face Anonymisation with Pose Preservation

1 code implementation10 Dec 2021 Nicola Dall'Asen, Yiming Wang, Hao Tang, Luca Zanella, Elisa Ricci

With the goal to maintain the geometric attributes of the source face, i. e., the facial pose and expression, and to promote more natural face generation, we propose to exploit a Bipartite Graph to explicitly model the relations between the facial landmarks of the source identity and the ones of the condition identity through a deep model.

Face Detection Face Generation

Loop closure detection using local 3D deep descriptors

1 code implementation31 Oct 2021 Youjie Zhou, Yiming Wang, Fabio Poiesi, Qi Qin, Yi Wan

We compare our L3D-based loop closure approach with recent approaches on LiDAR data and achieve state-of-the-art loop closure detection accuracy.

Loop Closure Detection

Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition

no code implementations11 Oct 2021 Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu

In this paper we propose wav2vec-Switch, a method to encode noise robustness into contextualized representations of speech via contrastive learning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +7

Double Low-Rank Representation With Projection Distance Penalty for Clustering

no code implementations CVPR 2021 Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang

This paper presents a novel, simple yet robust self-representation method, i. e., Double Low-Rank Representation with Projection Distance penalty (DLRRPD) for clustering.

Clustering

Consistent Multiple Graph Embedding for Multi-View Clustering

no code implementations11 May 2021 Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao

Specifically, a multiple graph auto-encoder(M-GAE) is designed to flexibly encode the complementary information of multi-view data using a multi-graph attention fusion encoder.

Clustering Graph Attention +1

Seeing All From a Few: Nodes Selection Using Graph Pooling for Graph Clustering

no code implementations30 Apr 2021 Yiming Wang, Dongxia Chang, Zhiqian Fu, Yao Zhao

This paper is the first attempt to employ graph pooling technique for node clustering and we propose a novel dual graph embedding network (DGEN), which is designed as a two-step graph encoder connected by a graph pooling layer to learn the graph embedding.

Clustering Graph Clustering +2

Auto-weighted low-rank representation for clustering

no code implementations26 Apr 2021 Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang

In this paper, a novel unsupervised low-rank representation model, i. e., Auto-weighted Low-Rank Representation (ALRR), is proposed to construct a more favorable similarity graph (SG) for clustering.

Clustering Representation Learning

Wake Word Detection with Streaming Transformers

no code implementations8 Feb 2021 Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

Modern wake word detection systems usually rely on neural networks for acoustic modeling.

From Point to Space: 3D Moving Human Pose Estimation Using Commodity WiFi

no code implementations28 Dec 2020 Yiming Wang, Lingchao Guo, Zhaoming Lu, Xiangming Wen, Shuang Zhou, Wanyu Meng

To reconstruct 3D poses of people who move throughout the space rather than a fixed point, we fuse the amplitude and phase into Channel State Information (CSI) images which can provide both pose and position information.

3D Pose Estimation Position

Subject-independent Human Pose Image Construction with Commodity Wi-Fi

no code implementations22 Dec 2020 Shuang Zhou, Lingchao Guo, Zhaoming Lu, Xiangming Wen, Wei Zheng, Yiming Wang

Existing papers achieve good results when constructing the images of subjects who are in the prior training samples.

Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration

1 code implementation ECCV 2020 Yiming Wang, Alessio Del Bue

In this work we address the problem of autonomous 3D exploration of an unknown indoor environment using a depth camera.

3D Reconstruction

Single Image Human Proxemics Estimation for Visual Social Distancing

1 code implementation3 Nov 2020 Maya Aghaei, Matteo Bustreo, Yiming Wang, Gianluca Bailo, Pietro Morerio, Alessio Del Bue

In this work, we address the problem of estimating the so-called "Social Distancing" given a single uncalibrated image in unconstrained scenarios.

PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR

1 code implementation20 May 2020 Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur

We present PyChain, a fully parallelized PyTorch implementation of end-to-end lattice-free maximum mutual information (LF-MMI) training for the so-called \emph{chain models} in the Kaldi automatic speech recognition (ASR) toolkit.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Wake Word Detection with Alignment-Free Lattice-Free MMI

1 code implementation17 May 2020 Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

Always-on spoken language interfaces, e. g. personal digital assistants, rely on a wake word to start processing spoken input.

Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

1 code implementation18 Sep 2019 Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur

We present Espresso, an open-source, modular, extensible end-to-end neural automatic speech recognition (ASR) toolkit based on the deep learning library PyTorch and the popular neural machine translation toolkit fairseq.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

End-to-end Anchored Speech Recognition

no code implementations6 Feb 2019 Yiming Wang, Xing Fan, I-Fan Chen, Yuzong Liu, Tongfei Chen, Björn Hoffmeister

The anchored segment refers to the wake-up word part of an audio stream, which contains valuable speaker information that can be used to suppress interfering speech and background noise.

Multi-Task Learning speech-recognition +1

Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks

1 code implementation Interspeech 2018 2018 Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Yarmohammadi, Sanjeev Khudanpur

Time Delay Neural Networks (TDNNs), also known as onedimensional Convolutional Neural Networks (1-d CNNs), are an efficient and well-performing neural network architecture for speech recognition.

speech-recognition Speech Recognition

A GPU-based WFST Decoder with Exact Lattice Generation

no code implementations9 Apr 2018 Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur

We describe initial work on an extension of the Kaldi toolkit that supports weighted finite-state transducer (WFST) decoding on Graphics Processing Units (GPUs).

Scheduling

Purely sequence-trained neural networks for ASR based on lattice-free MMI

no code implementations INTERSPEECH 2016 2016 Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahrmani, Vimal Manohar, Xingyu Na, Yiming Wang, Sanjeev Khudanpur

Models trained with LFMMI provide a relative word error rate reduction of ∼11. 5%, over those trained with cross-entropy objective function, and ∼8%, over those trained with cross-entropy and sMBR objective functions.

Language Modelling Speech Recognition

Accelerated Mini-batch Randomized Block Coordinate Descent Method

no code implementations NeurIPS 2014 Tuo Zhao, Mo Yu, Yiming Wang, Raman Arora, Han Liu

When the regularization function is block separable, we can solve the minimization problems in a randomized block coordinate descent (RBCD) manner.

Sparse Learning Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.