Search Results for author: Chao Yang

Found 96 papers, 28 papers with code

Feature selection for classification with class-separability strategy and data envelopment analysis

no code implementations6 May 2014 Yishi Zhang, Chao Yang, Anrong Yang, Chan Xiong, Xingchi Zhou, Zigang Zhang

In this paper, a novel feature selection method is presented, which is based on Class-Separability (CS) strategy and Data Envelopment Analysis (DEA).

feature selection General Classification

Low-rank SIFT: An Affine Invariant Feature for Place Recognition

no code implementations7 Aug 2014 Chao Yang, Shengnan Caih, Jingdong Wang, Long Quan

As an extension of SIFT, our method seeks to add prior to solve the ill-posed affine parameter estimation problem and normalizes them directly, and is applicable to objects with regular structures.

feature selection Translation

Exact Hybrid Covariance Thresholding for Joint Graphical Lasso

no code implementations7 Mar 2015 Qingming Tang, Chao Yang, Jian Peng, Jinbo Xu

This paper proposes a novel hybrid covariance thresholding algorithm that can effectively identify zero entries in the precision matrices and split a large joint graphical lasso problem into small subproblems.

Symmetry-aware Depth Estimation using Deep Neural Networks

no code implementations20 Apr 2016 Guilin Liu, Chao Yang, Zimo Li, Duygu Ceylan, Qi-Xing Huang

Due to the abundance of 2D product images from the Internet, developing efficient and scalable algorithms to recover the missing depth information is central to many applications.

Depth Estimation

High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

1 code implementation CVPR 2017 Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

Recent advances in deep learning have shown exciting promise in filling large holes in natural images with semantically plausible and context aware details, impacting fundamental image manipulation tasks such as object removal.

Image Inpainting Image Manipulation +1

Realistic Dynamic Facial Textures From a Single Image Using GANs

no code implementations ICCV 2017 Kyle Olszewski, Zimo Li, Chao Yang, Yi Zhou, Ronald Yu, Zeng Huang, Sitao Xiang, Shunsuke Saito, Pushmeet Kohli, Hao Li

By retargeting the PCA expression geometry from the source, as well as using the newly inferred texture, we can both animate the face and perform video face replacement on the source video using the target appearance.

Shape Inpainting using 3D Generative Adversarial Network and Recurrent Convolutional Networks

1 code implementation ICCV 2017 Weiyue Wang, Qiangui Huang, Suya You, Chao Yang, Ulrich Neumann

The 3D-ED-GAN is a 3D convolutional neural network trained with a generative adversarial paradigm to fill missing 3D data in low-resolution.

Generative Adversarial Network

Contextual-based Image Inpainting: Infer, Match, and Translate

no code implementations ECCV 2018 Yuhang Song, Chao Yang, Zhe Lin, Xiaofeng Liu, Qin Huang, Hao Li, C. -C. Jay Kuo

We study the task of image inpainting, which is to fill in the missing region of an incomplete image with plausible contents.

Image Inpainting Translation

Deep Learning: A Tool for Computational Nuclear Physics

no code implementations8 Mar 2018 Gianina Alina Negoita, Glenn R. Luecke, James P. Vary, Pieter Maris, Andrey M. Shirokov, Ik Jae Shin, Youngman Kim, Esmond G. Ng, Chao Yang

In recent years, several successful applications of the Artificial Neural Networks (ANNs) have emerged in nuclear physics and high-energy physics, as well as in biology, chemistry, meteorology, and other fields of science.

Computational Physics Nuclear Theory

Image Inpainting using Block-wise Procedural Training with Annealed Adversarial Counterpart

no code implementations23 Mar 2018 Chao Yang, Yuhang Song, Xiaofeng Liu, Qingming Tang, C. -C. Jay Kuo

We present a new approach to address the difficulty of training a very deep generative model to synthesize high-quality photo-realistic inpainting.

Facial Inpainting Image Harmonization

SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

1 code implementation9 May 2018 Yuhang Song, Chao Yang, Yeji Shen, Peng Wang, Qin Huang, C. -C. Jay Kuo

In this paper, we focus on image inpainting task, aiming at recovering the missing area of an incomplete image given the context information.

Image Inpainting Interactive Segmentation +2

PortraitGAN for Flexible Portrait Manipulation

no code implementations5 Jul 2018 Jiali Duan, Xiaoyuan Guo, Yuhang Song, Chao Yang, C. -C. Jay Kuo

Previous methods have dealt with discrete manipulation of facial attributes such as smile, sad, angry, surprise etc, out of canonical expressions and they are not scalable, operating in single modality.

A Survey on Deep Transfer Learning

no code implementations6 Aug 2018 Chuanqi Tan, Fuchun Sun, Tao Kong, Wenchang Zhang, Chao Yang, Chunfang Liu

As a new classification platform, deep learning has recently received increasing attention from researchers and has been successfully applied to many domains.

General Classification Transfer Learning

Deep learning: Extrapolation tool for ab initio nuclear theory

no code implementations6 Oct 2018 Gianina Alina Negoita, James P. Vary, Glenn R. Luecke, Pieter Maris, Andrey M. Shirokov, Ik Jae Shin, Youngman Kim, Esmond G. Ng, Chao Yang, Matthew Lockner, Gurpur M. Prabhu

The NCSM and other approaches require an extrapolation of the results obtained in a finite basis space to the infinite basis space limit and assessment of the uncertainty of those extrapolations.

Coherent Semantic Attention for Image Inpainting

1 code implementation ICCV 2019 Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang

The latest deep learning-based approaches have shown promising results for the challenging task of inpainting missing regions of an image.

Image Inpainting

Context-Integrated and Feature-Refined Network for Lightweight Object Parsing

no code implementations26 Jul 2019 Bin Jiang, Wenxuan Tu, Chao Yang, Junsong Yuan

The core components of CIFReNet are the Long-skip Refinement Module (LRM) and the Multi-scale Context Integration Module (MCIM).

Scene Parsing Semantic Segmentation

Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance

no code implementations16 Nov 2019 Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Chao Yang, Bin Fang, Huaping Liu

In this paper, we study Reinforcement Learning from Demonstrations (RLfD) that improves the exploration efficiency of Reinforcement Learning (RL) by providing expert demonstrations.

reinforcement-learning Reinforcement Learning (RL)

Constrained R-CNN: A general image manipulation detection model

no code implementations19 Nov 2019 Chao Yang, Huizhou Li, Fangting Lin, Bin Jiang, Hao Zhao

Finally, the coarse localization information guides the model to further learn the finer local features and segment out the tampered region.

General Classification Image Forensics +4

Boundary-Aware Salient Object Detection via Recurrent Two-Stream Guided Refinement Network

no code implementations11 Dec 2019 Fangting Lin, Chao Yang, Huizhou Li, Bin Jiang

Recent deep learning based salient object detection methods which utilize both saliency and boundary features have achieved remarkable performance.

object-detection RGB Salient Object Detection +1

Towards Disentangled Representations for Human Retargeting by Multi-view Learning

no code implementations12 Dec 2019 Chao Yang, Xiaofeng Liu, Qingming Tang, C. -C. Jay Kuo

We study the problem of learning disentangled representations for data across multiple domains and its applications in human retargeting.

MULTI-VIEW LEARNING

Unconstrained Facial Expression Transfer using Style-based Generator

1 code implementation12 Dec 2019 Chao Yang, Ser-Nam Lim

Given two face images, our method can create plausible results that combine the appearance of one image and the expression of the other.

Image Manipulation

One-Stage Inpainting with Bilateral Attention and Pyramid Filling Block

no code implementations18 Dec 2019 Hongyu Liu, Bin Jiang, Wei Huang, Chao Yang

However, the two-stage architecture is time-consuming, the contextual information lack high-level semantics and ignores both the semantic relevance and distance information of hole's feature patches, these limitations result in blurry textures and distorted structures of final result.

Image Inpainting Texture Synthesis

One-Shot Domain Adaptation For Face Generation

no code implementations CVPR 2020 Chao Yang, Ser-Nam Lim

To generate images of the same distribution, we introduce a style-mixing technique that transfers the low-level statistics from the target to faces randomly generated with the model.

Domain Adaptation Face Generation

Efficient Alternating Least Squares Algorithms for Low Multilinear Rank Approximation of Tensors

no code implementations6 Apr 2020 Chuanfu Xiao, Chao Yang, Min Li

In this paper, we propose a new class of truncated HOSVD algorithms based on alternating least squares (ALS) for efficiently computing the low multilinear rank approximation of tensors.

PFNN: A Penalty-Free Neural Network Method for Solving a Class of Second-Order Boundary-Value Problems on Complex Geometries

2 code implementations14 Apr 2020 Hailong Sheng, Chao Yang

We present PFNN, a penalty-free neural network method, to efficiently solve a class of second-order boundary-value problems on complex geometries.

Fast and Robust Registration of Aerial Images and LiDAR data Based on Structrual Features and 3D Phase Correlation

no code implementations21 Apr 2020 Bai Zhu, Yuanxin Ye, Chao Yang, Liang Zhou, Huiyu Liu, Yungang Cao

Subsequently, a robust structural feature descriptor is build based on dense gradient features, and the 3D phase correlation is used to detect control points (CPs) between aerial images and LiDAR data in the frequency domain, where the image matching is accelerated by the 3D Fast Fourier Transform (FFT).

Improving Co-registration for Sentinel-1 SAR and Sentinel-2 Optical images

no code implementations22 May 2020 Yuanxin Ye, Chao Yang, Bai Zhu, Youquan He, Huarong Jia

Finally, the obtained correspondences are employed to measure the misregistration shifts between the images.

On the Efficient Evaluation of the Exchange Correlation Potential on Graphics Processing Unit Clusters

2 code implementations7 Jul 2020 David B. Williams-Young, Wibe A. de Jong, Hubertus J. J. van Dam, Chao Yang

We demonstrate the performance and scalability of the implementation of the purposed method in the NWChemEx software package by comparing to the existing scalable CPU XC integration in NWChem.

Computational Physics Distributed, Parallel, and Cluster Computing Chemical Physics

Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations

1 code implementation ECCV 2020 Hongyu Liu, Bin Jiang, Yibing Song, Wei Huang, Chao Yang

We use CNN features from the deep and shallow layers of the encoder to represent structures and textures of an input image, respectively.

Image Inpainting

Gated Res2Net for Multivariate Time Series Analysis

1 code implementation19 Sep 2020 Chao Yang, Mingxing Jiang, Zhongwen Guo, Yu-An Liu

Through the utilization of gated mechanism, the network can control the process of information sending hence can better capture and utilize the both the temporal information and the correlations between the feature maps.

Time Series Time Series Analysis

a-Tucker: Input-Adaptive and Matricization-Free Tucker Decomposition for Dense Tensors on CPUs and GPUs

no code implementations20 Oct 2020 Min Li, Chuanfu Xiao, Chao Yang

A mode-wise flexible Tucker decomposition algorithm is proposed to enable the switch of different solvers for the factor matrices and core tensor, and a machine-learning adaptive solver selector is applied to automatically cope with the variations of both the input data and the hardware.

CoRe: An Efficient Coarse-refined Training Framework for BERT

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Yuechuan Li, Chao Yang, Ming Yan, Jingqiao Zhang, Fangquan Lin

In the second phase, we transform the trained relaxed BERT model into the original BERT and further retrain the model.

Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Chao Yang, Yuechuan Li, Ru He, Jingqiao Zhang

In BERT training, the backward computation is much more time-consuming than the forward computation, especially in the distributed training setting in which the backward computation time further includes the communication time for gradient synchronization.

Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition

5 code implementations10 Dec 2020 BinBin Zhang, Di wu, Zhuoyuan Yao, Xiong Wang, Fan Yu, Chao Yang, Liyong Guo, Yaguang Hu, Lei Xie, Xin Lei

In this paper, we present a novel two-pass approach to unify streaming and non-streaming end-to-end (E2E) speech recognition in a single model.

Sentence speech-recognition +1

Learning content and context with language bias for Visual Question Answering

1 code implementation21 Dec 2020 Chao Yang, Su Feng, Dongsheng Li, HuaWei Shen, Guoqing Wang, Bin Jiang

Many works concentrate on how to reduce language bias which makes models answer questions ignoring visual content and language context.

Question Answering Visual Question Answering

VoxelHop: Successive Subspace Learning for ALS Disease Classification Using Structural MRI

no code implementations13 Jan 2021 Xiaofeng Liu, Fangxu Xing, Chao Yang, C. -C. Jay Kuo, Suma Babu, Georges El Fakhri, Thomas Jenkins, Jonghye Woo

Deep learning has great potential for accurate detection and classification of diseases with medical imaging data, but the performance is often limited by the number of training datasets and memory requirements.

Classification Dimensionality Reduction +1

Symmetric-Constrained Irregular Structure Inpainting for Brain MRI Registration with Tumor Pathology

no code implementations17 Jan 2021 Xiaofeng Liu, Fangxu Xing, Chao Yang, C. -C. Jay Kuo, Georges ElFakhri, Jonghye Woo

Deformable registration of magnetic resonance images between patients with brain tumors and healthy subjects has been an important tool to specify tumor geometry through location alignment and facilitate pathological analysis.

Brain Tumor Segmentation Image Inpainting +3

WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit

4 code implementations2 Feb 2021 Zhuoyuan Yao, Di wu, Xiong Wang, BinBin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, Xin Lei

In this paper, we propose an open source, production first, and production ready speech recognition toolkit called WeNet in which a new two-pass approach is implemented to unify streaming and non-streaming end-to-end (E2E) speech recognition in a single model.

speech-recognition Speech Recognition

FDNet: A Deep Learning Approach with Two Parallel Cross Encoding Pathways for Precipitation Nowcasting

no code implementations6 May 2021 Bi-Ying Yan, Chao Yang, Feng Chen, Kohei Takeda, Changjun Wang

To the best of our knowledge, this is the first network architecture with flow and deformation separation to model the evolution of radar echoes for precipitation nowcasting.

Optical Flow Estimation

Composite Localization for Human Pose Estimation

no code implementations15 May 2021 ZiFan Chen, Xin Qin, Chao Yang, Li Zhang

This work proposes a novel deep learning framework for human pose estimation called composite localization to divide the complex learning objective into two simpler ones: a sparse heatmap to find the keypoint's approximate location and two short-distance offsetmaps to obtain its final precise coordinates.

Distance regression Pose Estimation

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

no code implementations10 Jun 2021 Di wu, BinBin Zhang, Chao Yang, Zhendong Peng, Wenjing Xia, Xiaoyu Chen, Xin Lei

On the experiment of AISHELL-1, we achieve a 4. 63\% character error rate (CER) with a non-streaming setup and 5. 05\% with a streaming setup with 320ms latency by U2++.

Data Augmentation speech-recognition +1

SAS: Self-Augmentation Strategy for Language Model Pre-training

1 code implementation14 Jun 2021 Yifei Xu, Jingqiao Zhang, Ru He, Liangzhu Ge, Chao Yang, Cheng Yang, Ying Nian Wu

In this paper, we propose a self-augmentation strategy (SAS) where a single network is utilized for both regular pre-training and contextualized data augmentation for the training in later epochs.

Data Augmentation Language Modelling +2

Adapting Off-the-Shelf Source Segmenter for Target Medical Image Segmentation

no code implementations23 Jun 2021 Xiaofeng Liu, Fangxu Xing, Chao Yang, Georges El Fakhri, Jonghye Woo

To alleviate this, in this work, we target source free UDA for segmentation, and propose to adapt an ``off-the-shelf" segmentation model pre-trained in the source domain to the target domain, with an adaptive batch-wise normalization statistics adaptation framework.

Image Segmentation Medical Image Segmentation +3

Economic Dispatch of an Integrated Microgrid Based on the Dynamic Process of CCGT Plant

no code implementations5 Jul 2021 Zhiyi Lin, Chunyue Song, Jun Zhao, Chao Yang, Huan Yin

Intra-day economic dispatch of an integrated microgrid is a fundamental requirement to integrate distributed generators.

energy management Management

A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data

no code implementations2 Sep 2021 Chao Yang, Debajyoti Chowdhury, Zhenmiao Zhang, William K. Cheung, Aiping Lu, Zhao Xiang Bian, Lu Zhang

Metagenomics has equipped us with new avenues of investigating the microbiome, from studying a single species to a complex community in a dynamic ecosystem.

Cultural Vocal Bursts Intensity Prediction

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

1 code implementation7 Oct 2021 BinBin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di wu, Zhendong Peng

In this paper, we present WenetSpeech, a multi-domain Mandarin corpus consisting of 10000+ hours high-quality labeled speech, 2400+ hours weakly labeled speech, and about 10000 hours unlabeled speech, with 22400+ hours in total.

Label Error Detection Optical Character Recognition +4

A rank-adaptive higher-order orthogonal iteration algorithm for truncated Tucker decomposition

no code implementations25 Oct 2021 Chuanfu Xiao, Chao Yang

We propose a novel rank-adaptive higher-order orthogonal iteration (HOOI) algorithm to compute the truncated Tucker decomposition of higher-order tensors with a given error tolerance, and prove that the method is locally optimal and monotonically convergent.

End-to-end Adaptive Distributed Training on PaddlePaddle

1 code implementation6 Dec 2021 Yulong Ao, Zhihua Wu, dianhai yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang

The experiments demonstrate that our framework can satisfy various requirements from the diversity of applications and the heterogeneity of resources with highly competitive performance.

Language Modelling Recommendation Systems +1

DAS-PINNs: A deep adaptive sampling method for solving high-dimensional partial differential equations

1 code implementation28 Dec 2021 Kejun Tang, Xiaoliang Wan, Chao Yang

In this work we propose a deep adaptive sampling (DAS) method for solving partial differential equations (PDEs), where deep neural networks are utilized to approximate the solutions of PDEs and deep generative models are employed to generate new collocation points that refine the training set.

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

3 code implementations29 Mar 2022 BinBin Zhang, Di wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu

Recently, we made available WeNet, a production-oriented end-to-end speech recognition toolkit, which introduces a unified two-pass (U2) framework and a built-in runtime to address the streaming and non-streaming decoding modes in a single model.

Language Modelling speech-recognition +1

GUIM -- General User and Item Embedding with Mixture of Representation in E-commerce

no code implementations2 Jul 2022 Chao Yang, Ru He, Fangquan Lin, Suoyuan Song, Jingqiao Zhang, Cheng Yang

Our goal is to build general representation (embedding) for each user and each product item across Alibaba's businesses, including Taobao and Tmall which are among the world's biggest e-commerce websites.

Contrastive Learning Marketing

Privacy Preservation by Local Design in Cooperative Networked Control Systems

no code implementations8 Jul 2022 Chao Yang, Wen Yang, Hongbo Shi

In this paper, we study the privacy preservation problem in a cooperative networked control system working for the task of LQG control.

Research on Multi-Objective Planning of Electric Vehicle Charging Stations Considering the Condition of Urban Traffic Network

no code implementations27 Aug 2022 Limeng Wang, Chao Yang, Yi Zhang, Fanjin Bu

How to weigh various factors to construct a reasonable model of charging station location and capacity has become a major difficulty in the field of electric vehicle charging facility planning.

R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor

no code implementations5 Dec 2022 Bai Zhu, Chao Yang, Jinkun Dai, Jianwei Fan, Yuanxin Ye

Automatically identifying feature correspondences between multimodal images is facing enormous challenges because of the significant differences both in radiation and geometry.

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

1 code implementation20 Feb 2023 Chang Chen, Min Li, Zhihua Wu, dianhai yu, Chao Yang

In this paper, we propose TA-MoE, a topology-aware routing strategy for large-scale MoE trainging, from a model-system co-design perspective, which can dynamically adjust the MoE dispatch pattern according to the network topology.

Probing reaction channels via reinforcement learning

no code implementations27 May 2023 Senwei Liang, Aditya N. Singh, Yuanran Zhu, David T. Limmer, Chao Yang

We propose a reinforcement learning based method to identify important configurations that connect reactant and product states along chemical reaction paths.

reinforcement-learning

Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

no code implementations30 May 2023 Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

The key idea is to use a deep generative model to adjust random samples in the training set such that the residual induced by the approximate PDE solution can maintain a smooth profile when it is being minimized.

Discovering Intrinsic Spatial-Temporal Logic Rules to Explain Human Actions

no code implementations NeurIPS 2023 Chengzhi Cao, Chao Yang, Shuang Li

Our approach is inspired by the fact that human actions are usually driven by their intentions or desires, and are influenced by environmental factors such as the spatial relationships with surrounding objects.

Sports Analytics

Sensor Selection for Remote State Estimation with QoS Requirement Constraints

no code implementations26 Jun 2023 Huiwen Yang, Lingying Huang, Chao Yang, Yilin Mo, Ling Shi

By utilizing the solution of the relaxed problem, we propose a heuristic sensor selection algorithm which can provide a good suboptimal solution.

A Phase-Coded Time-Domain Interleaved OTFS Waveform with Improved Ambiguity Function

no code implementations26 Jul 2023 Jiajun Zhu, Yanqun Tang, Chao Yang, Chi Zhang, Haoran Yin, Jiaojiao Xiong, Yuhua Chen

To enhance the sensing performance of the orthogonal time frequency space (OTFS) waveform, we propose a novel time-domain interleaved cyclic-shifted P4-coded OTFS (TICP4-OTFS) with improved ambiguity function.

Reinforcement Logic Rule Learning for Temporal Point Processes

no code implementations11 Aug 2023 Chao Yang, Lu Wang, Kun Gao, Shuang Li

Leveraging the temporal point process modeling and learning framework, the rule content and weights will be gradually optimized until the likelihood of the observational event sequences is optimal.

Point Processes

Hawkes Processes with Delayed Granger Causality

no code implementations11 Aug 2023 Chao Yang, Hengyuan Miao, Shuang Li

We aim to explicitly model the delayed Granger causal effects based on multivariate Hawkes processes.

3D Implicit Transporter for Temporally Consistent Keypoint Discovery

1 code implementation ICCV 2023 Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao

To address this issue, the Transporter method was introduced for 2D data, which reconstructs the target frame from the source frame to incorporate both spatial and temporal information.

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

1 code implementation5 Oct 2023 Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao

A single language model (LM), despite aligning well with an average labeler through reinforcement learning from human feedback (RLHF), may not universally suit diverse human preferences.

Language Modelling Long Form Question Answering

Learning nonlinear integral operators via Recurrent Neural Networks and its application in solving Integro-Differential Equations

no code implementations13 Oct 2023 Hardeep Bassi, Yuanran Zhu, Senwei Liang, Jia Yin, Cian C. Reeves, Vojtech Vlcek, Chao Yang

In this paper, we propose using LSTM-RNNs (Long Short-Term Memory-Recurrent Neural Networks) to learn and represent nonlinear integral operators that appear in nonlinear integro-differential equations (IDEs).

Numerical Integration

MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

1 code implementation29 Nov 2023 Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao

The security concerns surrounding Large Language Models (LLMs) have been extensively explored, yet the safety of Multimodal Large Language Models (MLLMs) remains understudied.

Critic-Guided Decision Transformer for Offline Reinforcement Learning

no code implementations21 Dec 2023 Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao

Recent advancements in offline reinforcement learning (RL) have underscored the capabilities of Return-Conditioned Supervised Learning (RCSL), a paradigm that learns the action distribution based on target returns for each state in a supervised manner.

D4RL Offline RL +3

SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning

no code implementations24 Jan 2024 Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian

Moreover, existing reinforcement learning (RL) based methods overlook the structured relationships, underutilizing the potential of RL in structured reasoning.

Question Answering reinforcement-learning +1

Framework of Resilient Transmission Network Reconfiguration Considering Cyber-Attacks

no code implementations28 Jan 2024 Chao Yang, Gaoqi Liang, Steven R. Weller, Shaoyan Li, Junhua Zhao, ZhaoYang Dong

Fast and reliable transmission network reconfiguration is critical in improving power grid resilience to cyber-attacks.

Safety of Multimodal Large Language Models on Images and Text

1 code implementation1 Feb 2024 Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao

In this paper, we systematically survey current efforts on the evaluation, attack, and defense of MLLMs' safety on images and text.

Unveiling Latent Causal Rules: A Temporal Point Process Approach for Abnormal Event Explanation

no code implementations3 Feb 2024 Yiling Kuang, Chao Yang, Yang Yang, Shuang Li

In the M-step, we update both the rule set and model parameters to enhance the likelihood function's lower bound.

Point Processes

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

1 code implementation14 Feb 2024 Zhichen Dong, Zhanhui Zhou, Chao Yang, Jing Shao, Yu Qiao

Large Language Models (LLMs) are now commonplace in conversation applications.

Deep adaptive sampling for surrogate modeling without labeled data

1 code implementation17 Feb 2024 Xili Wang, Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

In this work, we present a deep adaptive sampling method for surrogate modeling ($\text{DAS}^2$), where we generalize the deep adaptive sampling (DAS) method [62] [Tang, Wan and Yang, 2023] to build surrogate models for low-regularity parametric differential equations.

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

1 code implementation19 Feb 2024 Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao

Large language models (LLMs) need to undergo safety alignment to ensure safe conversations with humans.

Language Modelling

TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

1 code implementation20 Feb 2024 Xiang Li, Yunshi Lan, Chao Yang

Recently, numerous new benchmarks have been established to evaluate the performance of large language models (LLMs) via either computing a holistic score or employing another LLM as a judge.

Question Generation Question-Generation

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

no code implementations22 Feb 2024 Junting Chen, Yao Mu, Qiaojun Yu, Tianming Wei, Silang Wu, Zhecheng Yuan, Zhixuan Liang, Chao Yang, Kaipeng Zhang, Wenqi Shao, Yu Qiao, Huazhe Xu, Mingyu Ding, Ping Luo

To bridge this ``ideal-to-real'' gap, this paper presents \textbf{RobotScript}, a platform for 1) a deployable robot manipulation pipeline powered by code generation; and 2) a code generation benchmark for robot manipulation tasks in free-form natural language.

Code Generation Common Sense Reasoning +2

Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning

no code implementations27 Feb 2024 Zhaoxun Ju, Chao Yang, Hongbo Wang, Yu Qiao, Fuchun Sun

Language-conditioned robot behavior plays a vital role in executing complex tasks by associating human commands or instructions with perception and actions.

Imitation Learning Quantization

GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping

1 code implementation14 Mar 2024 Yuhang Zheng, Xiangyu Chen, Yupeng Zheng, Songen Gu, Runyi Yang, Bu Jin, Pengfei Li, Chengliang Zhong, Zengmao Wang, Lina Liu, Chao Yang, Dawei Wang, Zhen Chen, Xiaoxiao Long, Meiqing Wang

In particular, we propose an Efficient Feature Distillation (EFD) module that employs contrastive learning to efficiently and accurately distill language embeddings derived from foundational models.

Contrastive Learning Robotic Grasping

Privacy Preservation by Intermittent Transmission in Cooperative LQG Control Systems

no code implementations25 Mar 2024 Wenhao Lin, Yuqing Ni, Wen Yang, Chao Yang

Under the given threshold of the control performance loss, a trade-off optimization problem is proposed.

VideoDistill: Language-aware Vision Distillation for Video Question Answering

no code implementations1 Apr 2024 Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao

In this paper, we are inspired by the human recognition and learning pattern and propose VideoDistill, a framework with language-aware (i. e., goal-driven) behavior in both vision perception and answer generation process.

Answer Generation Question Answering +1

LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction

no code implementations1 Apr 2024 Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao

LLaMA-Excitor ensures a self-adaptive allocation of additional attention to input instructions, thus effectively preserving LLMs' pre-trained knowledge when fine-tuning LLMs on low-quality instruction-following datasets.

Image Captioning Instruction Following

Cannot find the paper you are looking for? You can Submit a new open access paper.