Search Results for author: Yang Gao

Found 241 papers, 103 papers with code

Impact of Preference Noise on the Alignment Performance of Generative Language Models

no code implementations • 15 Apr 2024 • Yang Gao, Dana Alon, Donald Metzler

A key requirement in developing Generative Language Models (GLMs) is to have their values aligned with human values.

Paper
Add Code

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

1 code implementation • 13 Apr 2024 • Qinghe Ma, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

To fully utilize the information within the intermediate domain, we propose a symmetric Guidance training strategy (SymGD), which additionally offers direct guidance to unlabeled data by merging pseudo labels from intermediate samples.

Image Segmentation Segmentation +4

Paper
Code

ONNXPruner: ONNX-Based General Model Pruning Adapter

no code implementations • 10 Apr 2024 • Dongdong Ren, Wenbin Li, Tianyu Ding, Lei Wang, Qi Fan, Jing Huo, Hongbing Pan, Yang Gao

However, the practical application of these algorithms across various models and platforms remains a significant challenge.

Paper
Add Code

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

2 code implementations • 9 Apr 2024 • Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang

The Large Vision-Language Model (LVLM) field has seen significant advancements, yet its progression has been hindered by challenges in comprehending fine-grained visual content due to limited resolution.

Ranked #11 on Visual Question Answering on MM-Vet

4k Language Modelling +1

1,555

Paper
Code

Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data

no code implementations • 8 Apr 2024 • Tim Baumgärtner, Yang Gao, Dana Alon, Donald Metzler

Reinforcement Learning from Human Feedback (RLHF) is a popular method for aligning Language Models (LM) with human values and preferences.

Paper
Add Code

Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

1 code implementation • 31 Mar 2024 • Wenxiao Deng, Wenbin Li, Tianyu Ding, Lei Wang, Hongguang Zhang, Kuihua Huang, Jing Huo, Yang Gao

However, these methods face two primary limitations: the dispersed feature distribution within the same class in synthetic datasets, reducing class discrimination, and an exclusive focus on mean feature consistency, lacking precision and comprehensiveness.

Paper
Code

InternLM2 Technical Report

1 code implementation • 26 Mar 2024 • Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

Ranked #5 on Long-Context Understanding on Ada-LEval (BestAnswer)

4k Long-Context Understanding

5,117

Paper
Code

QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping

no code implementations • 21 Mar 2024 • Zhuang Xiong, Wei Jiang, Yang Gao, Feng Liu, Hongfu Sun

In this work, we developed a 3D image patch-based diffusion model, namely QSMDiff, for robust QSM reconstruction across different scan parameters, alongside simultaneous super-resolution and image-denoising tasks.

Image Denoising Image Generation +1

Paper
Add Code

SETA: Semantic-Aware Token Augmentation for Domain Generalization

1 code implementation • 18 Mar 2024 • Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

In this paper, we study the impact of prior CNN-based augmentation methods on token-based models, revealing their performance is suboptimal due to the lack of incentivizing the model to learn holistic shape information.

Data Augmentation Domain Generalization

Paper
Code

Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation

1 code implementation • 17 Mar 2024 • Shumeng Li, Lei Qi, Qian Yu, Jing Huo, Yinghuan Shi, Yang Gao

Segment Anything Model (SAM) fine-tuning has shown remarkable performance in medical image segmentation in a fully supervised manner, but requires precise annotations.

Image Segmentation Segmentation +2

Paper
Code

EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

no code implementations • 1 Mar 2024 • Shengjie Wang, Shaohuai Liu, Weirui Ye, Jiacheng You, Yang Gao

We have expanded the performance of EfficientZero to multiple domains, encompassing both continuous and discrete actions, as well as visual and low-dimensional inputs.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

1 code implementation • 1 Mar 2024 • Ruiqian Nai, Zixin Wen, Ji Li, Yuanzhi Li, Yang Gao

This paper further investigates the necessity of disentangled representation in downstream applications.

Disentanglement Informativeness +1

Paper
Code

Can Transformers Capture Spatial Relations between Objects?

no code implementations • 1 Mar 2024 • Chuan Wen, Dinesh Jayaraman, Yang Gao

Spatial relationships between objects represent key scene information for humans to understand and interact with the world.

Relation

Paper
Add Code

Data-freeWeight Compress and Denoise for Large Language Models

no code implementations • 26 Feb 2024 • Runyu Peng, Yunhua Zhou, Qipeng Guo, Yang Gao, Hang Yan, Xipeng Qiu, Dahua Lin

Significantly, our method is characterized by without necessitating additional involvement of any corpus, while simultaneously preserving orthogonality in conjunction with pruning and quantization methods.

Quantization

Paper
Add Code

Distributionally Robust Graph-based Recommendation System

1 code implementation • 20 Feb 2024 • Bohao Wang, Jiawei Chen, Changdong Li, Sheng Zhou, Qihao Shi, Yang Gao, Yan Feng, Chun Chen, Can Wang

DR-GNN addresses two core challenges: 1) To enable DRO to cater to graph data intertwined with GNN, we reinterpret GNN as a graph smoothing regularizer, thereby facilitating the nuanced application of DRO; 2) Given the typically sparse nature of recommendation data, which might impede robust optimization, we introduce slight perturbations in the training distribution to expand its support.

Recommendation Systems

Paper
Code

Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios

no code implementations • 4 Feb 2024 • Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Song

Due to the inability to receive signals from the Global Navigation Satellite System (GNSS) in extreme conditions, achieving accurate and robust navigation for Unmanned Aerial Vehicles (UAVs) is a challenging task.

Paper
Add Code

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

1 code implementation • 29 Jan 2024 • Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

We introduce InternLM-XComposer2, a cutting-edge vision-language model excelling in free-form text-image composition and comprehension.

Ranked #16 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

1,555

Paper
Code

General Flow as Foundation Affordance for Scalable Robot Learning

no code implementations • 21 Jan 2024 • Chengbo Yuan, Chuan Wen, Tong Zhang, Yang Gao

Our predicted flow offers actionable geometric and physics guidance, thus facilitating stable zero-shot skill transfer in real-world scenarios. We deploy our method with a policy based on closed-loop flow prediction.

Paper
Add Code

Identifying and Analyzing Task-Encoding Tokens in Large Language Models

no code implementations • 20 Jan 2024 • Yu Bai, Heyan Huang, Cesare Spinoso-Di Piano, Marc-Antoine Rondeau, Sanxing Chen, Yang Gao, Jackie Chi Kit Cheung

In-context learning (ICL) has become an effective solution for few-shot learning in natural language processing.

Computational Efficiency Few-Shot Learning +1

Paper
Add Code

Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations

1 code implementation • 11 Jan 2024 • Na Wang, Lei Qi, Jintao Guo, Yinghuan Shi, Yang Gao

2) From the feature perspective, the simple Tail Interaction module implicitly enhances potential correlations among all samples from all source domains, facilitating the acquisition of domain-invariant representations across multiple domains for the model.

Data Augmentation Domain Generalization

Paper
Code

Any-point Trajectory Modeling for Policy Learning

no code implementations • 28 Dec 2023 • Chuan Wen, Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel

Learning from demonstration is a powerful method for teaching robots new skills, and having more demonstration data often improves policy learning.

Trajectory Modeling Transfer Learning

Paper
Add Code

PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance

1 code implementation • 28 Dec 2023 • Taicai Chen, Yue Duan, Dong Li, Lei Qi, Yinghuan Shi, Yang Gao

Based on this technique, we assign appropriate training weights to unlabeled data to enhance the construction of a discriminative latent space.

Bayesian Optimization Pseudo Label

Paper
Code

Social-Transmotion: Promptable Human Trajectory Prediction

1 code implementation • 26 Dec 2023 • Saeed Saadatnejad, Yang Gao, Kaouther Messaoud, Alexandre Alahi

We translate the idea of a prompt from Natural Language Processing (NLP) to the task of human trajectory prediction, where a prompt can be a sequence of x-y coordinates on the ground, bounding boxes in the image plane, or body pose keypoints in either 2D or 3D.

Autonomous Vehicles Trajectory Prediction

Paper
Code

Heterogeneous Graph Neural Architecture Search with GPT-4

1 code implementation • 14 Dec 2023 • Haoyuan Dong, Yang Gao, Haishuai Wang, Hong Yang, Peng Zhang

The basic idea of GHGNAS is to design a set of prompts that can guide GPT-4 toward the task of generating new heterogeneous graph neural architectures.

Neural Architecture Search

Paper
Code

Graph vs. Sequence: An Empirical Study on Knowledge Forms for Knowledge-Grounded Dialogue

no code implementations • 13 Dec 2023 • Yizhe Yang, Heyan Huang, Yihang Liu, Yang Gao

Knowledge-grounded dialogue is a task of generating an informative response based on both the dialogue history and external knowledge source.

Knowledge Graphs Model Selection

Paper
Add Code

Digital Life Project: Autonomous 3D Characters with Social Intelligence

no code implementations • 7 Dec 2023 • Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment.

Ranked #2 on Motion Synthesis on InterHuman

Motion Captioning Motion Synthesis

Paper
Add Code

Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning

no code implementations • 29 Nov 2023 • Yingdong Hu, Fanqi Lin, Tong Zhang, Li Yi, Yang Gao

In this study, we are interested in imbuing robots with the capability of physically-grounded task planning.

Paper
Add Code

TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer

no code implementations • 14 Nov 2023 • Huashan Sun, Yixiao Wu, Yinghao Li, Jiawei Li, Yizhe Yang, Yang Gao

In summary, we present the TSST task, a new benchmark for style transfer and emphasizing human-oriented evaluation, exploring and advancing the performance of current LLMs.

Style Transfer Text Style Transfer

Paper
Add Code

Plug-and-Play Latent Feature Editing for Orientation-Adaptive Quantitative Susceptibility Mapping Neural Networks

1 code implementation • 14 Nov 2023 • Yang Gao, Zhuang Xiong, Shanshan Shan, Yin Liu, Pengfei Rong, Min Li, Alan H Wilman, G. Bruce Pike, Feng Liu, Hongfu Sun

The proposed OA-LFE-empowered iQSM, which we refer to as iQSM+, is trained in a self-supervised manner on a specially-designed simulation brain dataset.

Paper
Code

Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization

no code implementations • 6 Nov 2023 • Kun Lei, Zhengmao He, Chenhao Lu, Kaizhe Hu, Yang Gao, Huazhe Xu

Owning to the alignment of objectives in two phases, the RL agent can transfer between offline and online learning seamlessly.

Reinforcement Learning (RL)

Paper
Add Code

JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds

1 code implementation • 5 Nov 2023 • Saeed Saadatnejad, Yang Gao, Hamid Rezatofighi, Alexandre Alahi

To address this, we introduce a novel dataset for end-to-end trajectory forecasting, facilitating the evaluation of models in scenarios involving less-than-ideal preceding modules such as tracking.

Autonomous Navigation Benchmarking +1

Paper
Code

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics

1 code implementation • 30 Oct 2023 • Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger

Specifically, we propose a novel competition setting in which we select a list of allowed LLMs and disallow fine-tuning to ensure a focus on prompting.

Machine Translation Text Generation

Paper
Code

MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications

no code implementations • 24 Oct 2023 • Yizhe Yang, Huashan Sun, Jiawei Li, Runheng Liu, Yinghao Li, Yuhang Liu, Heyan Huang, Yang Gao

Large Language Models (LLMs) have demonstrated remarkable performance across various natural language tasks, marking significant strides towards general artificial intelligence.

Language Modelling Large Language Model

Paper
Add Code

DexCatch: Learning to Catch Arbitrary Objects with Dexterous Hands

no code implementations • 13 Oct 2023 • Fengbo Lan, Shengjie Wang, Yunzhe Zhang, Haotian Xu, Oluwatosin Oseni, Yang Gao, Tao Zhang

Achieving human-like dexterous manipulation remains a crucial area of research in robotics.

Paper
Add Code

Imitation Learning from Observation with Automatic Discount Scheduling

no code implementations • 11 Oct 2023 • Yuyang Liu, Weijun Dong, Yingdong Hu, Chuan Wen, Zhao-Heng Yin, Chongjie Zhang, Yang Gao

Nonetheless, we identify that tasks characterized by a progress dependency property pose significant challenges for such approaches; in these tasks, the agent needs to initially learn the expert's preceding behaviors before mastering the subsequent ones.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Rethink Baseline of Integrated Gradients from the Perspective of Shapley Value

no code implementations • 7 Oct 2023 • Shuyang Liu, Zixuan Chen, Ge Shi, Ji Wang, Changjie Fan, Yu Xiong, Runze Wu Yujing Hu, Ze Ji, Yang Gao

Thus, we propose a novel baseline construction method called Shapley Integrated Gradients (SIG) that searches for a set of baselines by proportional sampling to partly simulate the computation path of Shapley Value.

Paper
Add Code

Foundation Reinforcement Learning: towards Embodied Generalist Agents with Foundation Prior Assistance

no code implementations • 4 Oct 2023 • Weirui Ye, Yunsheng Zhang, Mengchen Wang, Shengjie Wang, Xianfan Gu, Pieter Abbeel, Yang Gao

Our method tolerates the unavoidable noise in embodied foundation models.

Quantization reinforcement-learning

Paper
Add Code

Graph Neural Architecture Search with GPT-4

no code implementations • 30 Sep 2023 • Haishuai Wang, Yang Gao, Xin Zheng, Peng Zhang, Hongyang Chen, Jiajun Bu, Philip S. Yu

In this paper, we integrate GPT-4 into GNAS and propose a new GPT-4 based Graph Neural Architecture Search method (GPT4GNAS for short).

Neural Architecture Search

Paper
Add Code

CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture

no code implementations • 28 Sep 2023 • Zixuan Chen, Ze Ji, Shuyang Liu, Jing Huo, Yiyu Chen, Yang Gao

Heuristically, we extend the usual notion of action to a dual Cognition (high-level)-Action (low-level) architecture by introducing intuitive human cognitive priors, and propose a novel skill IL framework through human-robot interaction, called Cognition-Action-based Skill Imitation Learning (CasIL), for the robotic agent to effectively cognize and imitate the critical skills from raw visual demonstrations.

Imitation Learning

Paper
Add Code

OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement

1 code implementation • 19 Sep 2023 • Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon, Don Metzler

We propose the first multilingual scientific documents dataset, Open-access Multilingual Scientific Documents (OpenMSD), which has 74M papers in 103 languages and 778M citation pairs.

32,732

Paper
Code

Exploring Flat Minima for Domain Generalization with Large Learning Rates

no code implementations • 12 Sep 2023 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Instead, we observe that leveraging a large learning rate can simultaneously promote weight diversity and facilitate the identification of flat regions in the loss landscape.

Domain Generalization Semantic Segmentation

Paper
Add Code

A Theoretical Explanation of Activation Sparsity through Flat Minima and Adversarial Robustness

no code implementations • 6 Sep 2023 • Ze Peng, Lei Qi, Yinghuan Shi, Yang Gao

Although having attributed it to training dynamics, existing theoretical explanations of activation sparsity are restricted to shallow networks, small training steps and special training, despite its emergence in deep models standardly trained for a large number of steps.

Paper
Add Code

InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules

1 code implementation • 26 Aug 2023 • Yanqi Bao, Tianyu Ding, Jing Huo, Wenbin Li, Yuxin Li, Yang Gao

By utilizing multiple plug-and-play HyperNet modules, InsertNeRF dynamically tailors NeRF's weights to specific reference scenes, transforming multi-scale sampling-aware features into scene-specific representations.

Paper
Code

IOMatch: Simplifying Open-Set Semi-Supervised Learning with Joint Inliers and Outliers Utilization

1 code implementation • ICCV 2023 • Zekun Li, Lei Qi, Yinghuan Shi, Yang Gao

Semi-supervised learning (SSL) aims to leverage massive unlabeled data when labels are expensive to obtain.

open-set classification

Paper
Code

Efficient Last-iterate Convergence Algorithms in Solving Games

no code implementations • 22 Aug 2023 • Linjian Meng, Zhenxing Ge, Wenbin Li, Bo An, Yang Gao

Recent works propose a Reward Transformation (RT) framework for MWU, which removes the uniqueness condition and achieves competitive performance with OMWU.

counterfactual

Paper
Add Code

DomainAdaptor: A Novel Approach to Test-time Adaptation

1 code implementation • ICCV 2023 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

To deal with the domain shift between training and test samples, current methods have primarily focused on learning generalizable features during training and ignore the specificity of unseen samples that are also critical during the test.

Specificity Test-time Adaptation

Paper
Code

Quantitative Susceptibility Mapping through Model-based Deep Image Prior (MoDIP)

no code implementations • 18 Aug 2023 • Zhuang Xiong, Yang Gao, Yin Liu, Amir Fazlollahi, Peter Nestor, Feng Liu, Hongfu Sun

The data-driven approach of supervised learning methods has limited applicability in solving dipole inversion in Quantitative Susceptibility Mapping (QSM) with varying scan parameters across different objects.

Image Reconstruction

Paper
Add Code

Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs

1 code implementation • 5 Aug 2023 • Yanqi Bao, Yuxin Li, Jing Huo, Tianyu Ding, Xinyue Liang, Wenbin Li, Yang Gao

Neural Radiance Fields from Sparse input} (NeRF-S) have shown great potential in synthesizing novel views with a limited number of observed viewpoints.

Attribute

Paper
Code

3D Medical Image Segmentation with Sparse Annotation via Cross-Teaching between 3D and 2D Networks

1 code implementation • 30 Jul 2023 • Heng Cai, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

Our experimental results on the MMWHS dataset demonstrate that our method outperforms the state-of-the-art (SOTA) semi-supervised segmentation methods.

Image Segmentation Medical Image Segmentation +3

Paper
Code

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

1 code implementation • ICCV 2023 • Wei Cheng, Ruixiang Chen, Wanqi Yin, Siming Fan, Keyu Chen, Honglin He, Huiwen Luo, Zhongang Cai, Jingbo Wang, Yang Gao, Zhengming Yu, Zhengyu Lin, Daxuan Ren, Lei Yang, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Bo Dai, Kwan-Yee Lin

Realistic human-centric rendering plays a key role in both computer vision and computer graphics.

Camera Calibration Novel View Synthesis

198

Paper
Code

Policy Contrastive Imitation Learning

no code implementations • 6 Jul 2023 • Jialei Huang, ZhaoHeng Yin, Yingdong Hu, Yang Gao

However, the performance of AIL is still unsatisfactory on the more challenging tasks.

Binary Classification Imitation Learning +1

Paper
Add Code

Towards Explainable Evaluation Metrics for Machine Translation

no code implementations • 22 Jun 2023 • Christoph Leiter, Piyawat Lertvittayakumjorn, Marina Fomicheva, Wei Zhao, Yang Gao, Steffen Eger

In this context, we also discuss the latest state-of-the-art approaches to explainable metrics based on generative models such as ChatGPT and GPT4.

Machine Translation Translation

Paper
Add Code

A Universal Semantic-Geometric Representation for Robotic Manipulation

no code implementations • 18 Jun 2023 • Tong Zhang, Yingdong Hu, Hanchen Cui, Hang Zhao, Yang Gao

To this end, we present $\textbf{Semantic-Geometric Representation} (\textbf{SGR})$, a universal perception module for robotics that leverages the rich semantic information of large-scale pre-trained 2D models and inherits the merits of 3D spatial reasoning.

Paper
Add Code

Shades of meaning: Uncovering the geometry of ambiguous word representations through contextualised language models

no code implementations • 26 Apr 2023 • Benedetta Cevoli, Chris Watkins, Yang Gao, Kathleen Rastle

Lexical ambiguity presents a profound and enduring challenge to the language sciences.

Paper
Add Code

Programmatically Grounded, Compositionally Generalizable Robotic Manipulation

no code implementations • 26 Apr 2023 • Renhao Wang, Jiayuan Mao, Joy Hsu, Hang Zhao, Jiajun Wu, Yang Gao

Robots operating in the real world require both rich manipulation skills as well as the ability to semantically reason about when to apply those skills.

Imitation Learning

Paper
Add Code

For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal

no code implementations • 10 Apr 2023 • Yingdong Hu, Renhao Wang, Li Erran Li, Yang Gao

Our study yields a series of intriguing results, including the discovery that the effectiveness of pre-training is highly dependent on the choice of the downstream policy learning algorithm.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

Seer: Language Instructed Video Prediction with Latent Diffusion Models

no code implementations • 27 Mar 2023 • Xianfan Gu, Chuan Wen, Weirui Ye, Jiaming Song, Yang Gao

Imagining the future trajectory is the key for robots to make sound planning and successfully reach their goals.

Denoising Video Prediction

Paper
Add Code

Orthogonal Annotation Benefits Barely-supervised Medical Image Segmentation

1 code implementation • CVPR 2023 • Heng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

Subsequently, by introducing unlabeled volumes, we propose a dual-network paradigm named Dense-Sparse Co-training (DeSCO) that exploits dense pseudo labels in early stage and sparse labels in later stage and meanwhile forces consistent output of two networks.

Image Segmentation Semantic Segmentation +1

Paper
Code

Real-time scheduling of renewable power systems through planning-based reinforcement learning

no code implementations • 9 Mar 2023 • Shaohuai Liu, Jinbo Liu, Weirui Ye, Nan Yang, Guanglun Zhang, Haiwang Zhong, Chongqing Kang, Qirong Jiang, Xuri Song, Fangchun Di, Yang Gao

The well-trained scheduling agent significantly reduces renewable curtailment and load shedding, which are issues arising from traditional scheduling's reliance on inaccurate day-ahead forecasts.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Decision Transformer under Random Frame Dropping

1 code implementation • 3 Mar 2023 • Kaizhe Hu, Ray Chen Zheng, Yang Gao, Huazhe Xu

Typical RL methods usually require considerable online interaction data that are costly and unsafe to collect in the real world.

Offline RL

Paper
Code

Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization

no code implementations • 28 Feb 2023 • Haotian Xu, Shengjie Wang, Zhaolei Wang, Yunzhe Zhang, Qing Zhuo, Yang Gao, Tao Zhang

In the early stage, our method loosens the practical constraints of unsafe transitions (adding extra safety budget) with the aid of a new metric we propose.

Efficient Exploration Reinforcement Learning (RL)

Paper
Add Code

Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding

1 code implementation • 3 Feb 2023 • Mingyang Chen, Wen Zhang, Zhen Yao, Yushan Zhu, Yang Gao, Jeff Z. Pan, Huajun Chen

In our proposed model, Entity-Agnostic Representation Learning (EARL), we only learn the embeddings for a small set of entities and refer to them as reserved entities.

Entity Embeddings Knowledge Graph Embedding +3

Paper
Code

Few-shot Semantic Segmentation with Support-induced Graph Convolutional Network

no code implementations • 9 Jan 2023 • Jie Liu, Yanqi Bao, Wenzhe Yin, Haochen Wang, Yang Gao, Jan-Jakob Sonke, Efstratios Gavves

However, the appearance variations between objects from the same category could be extremely large, leading to unreliable feature matching and query mask prediction.

Ranked #40 on Few-Shot Semantic Segmentation on PASCAL-5i (1-Shot)

Few-Shot Semantic Segmentation

Paper
Add Code

A Policy Optimization Method Towards Optimal-time Stability

no code implementations • 2 Jan 2023 • Shengjie Wang, Fengbo Lan, Xiang Zheng, Yuxue Cao, Oluwatosin Oseni, Haotian Xu, Tao Zhang, Yang Gao

In current model-free reinforcement learning (RL) algorithms, stability criteria based on sampling methods are commonly utilized to guide policy optimization.

Reinforcement Learning (RL)

Paper
Add Code

PanoViT: Vision Transformer for Room Layout Estimation from a Single Panoramic Image

no code implementations • 23 Dec 2022 • Weichao Shen, Yuan Dong, Zonghao Chen, Zhengyi Zhao, Yang Gao, Zhu Liu

In this paper, we propose PanoViT, a panorama vision transformer to estimate the room layout from a single panoramic image.

Position Room Layout Estimation

Paper
Add Code

Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning

no code implementations • 17 Dec 2022 • Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu

Hence, we propose Pre-trained Image Encoder for Generalizable visual reinforcement learning (PIE-G), a simple yet effective framework that can generalize to the unseen visual scenarios in a zero-shot manner.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix

no code implementations • 26 Nov 2022 • Wenbin Li, Meihao Kong, Xuesong Yang, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

In this study, we present a new unified contrastive learning representation framework (named UniCLR) suitable for all the above four kinds of methods from a novel perspective of basic affinity matrix.

Contrastive Learning Representation Learning

Paper
Add Code

Affine Transformation Edited and Refined Deep Neural Network for Quantitative Susceptibility Mapping

no code implementations • 25 Nov 2022 • Zhuang Xiong, Yang Gao, Feng Liu, Hongfu Sun

We propose an end-to-end AFfine Transformation Edited and Refined (AFTER) deep neural network for QSM, which is robust against arbitrary acquisition orientation and spatial resolution up to 0. 6 mm isotropic at the finest.

Paper
Add Code

Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions

no code implementations • 23 Oct 2022 • Weirui Ye, Pieter Abbeel, Yang Gao

This paper proposes the Virtual MCTS (V-MCTS), a variant of MCTS that spends more search time on harder states and less search time on simpler states adaptively.

Atari Games Board Games

Paper
Add Code

Planning for Sample Efficient Imitation Learning

1 code implementation • 18 Oct 2022 • Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao

Inspired by the recent success of EfficientZero in RL, we propose EfficientImitate (EI), a planning-based imitation learning method that can achieve high in-environment sample efficiency and performance simultaneously.

Imitation Learning

Paper
Code

Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient

1 code implementation • 10 Oct 2022 • Wubing Chen, Wenbin Li, Xiao Liu, Shangdong Yang, Yang Gao

Empirically, we evaluate MAPPG on the well-known matrix game and differential game, and verify that MAPPG can converge to the global optimum for both discrete and continuous action spaces.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Code

Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery

1 code implementation • CVPR 2023 • Wenbin Li, Zhichen Fan, Jing Huo, Yang Gao

Specifically, we propose an inter-class sKLD constraint to effectively exploit the disjoint relationship between labelled and unlabelled classes, enforcing the separability for different classes in the embedding space.

Novel Class Discovery

Paper
Code

Predictive Inference with Feature Conformal Prediction

1 code implementation • 1 Oct 2022 • Jiaye Teng, Chuan Wen, Dinghuai Zhang, Yoshua Bengio, Yang Gao, Yang Yuan

Conformal prediction is a distribution-free technique for establishing valid prediction intervals.

Conformal Prediction Image Segmentation +5

Paper
Code

PLN: Parasitic-Like Network for Barely Supervised Medical Image Segmentation

1 code implementation • IEEE Transactions on Medical Imaging 2022 • Shumeng Li, Heng Cai; Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

In this paper, by introducing an extremely sparse annotation way of labeling only one slice per 3D image, we investigate a novel barely-supervised segmentation setting with only a few sparsely-labeled images along with a large amount of unlabeled images.

Image Segmentation Medical Image Segmentation +2

Paper
Code

USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation

no code implementations • 28 Sep 2022 • Zhengrong Xue, Zhecheng Yuan, Jiashun Wang, Xueqian Wang, Yang Gao, Huazhe Xu

Can a robot manipulate intra-category unseen objects in arbitrary poses with the help of a mere demonstration of grasping pose on a single object instance?

Keypoint Detection Object

Paper
Add Code

MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees

no code implementations • 15 Sep 2022 • Zichuan Liu, Yuanyang Zhu, Zhi Wang, Yang Gao, Chunlin Chen

While achieving tremendous success in various fields, existing multi-agent reinforcement learning (MARL) with a black-box neural network architecture makes decisions in an opaque manner that hinders humans from understanding the learned knowledge and how input observations influence decisions.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Semantic-Aware Fine-Grained Correspondence

1 code implementation • 21 Jul 2022 • Yingdong Hu, Renhao Wang, Kaifeng Zhang, Yang Gao

Establishing visual correspondence across images is a challenging and essential task.

Pose Tracking Self-Supervised Learning +4

Paper
Code

Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction

no code implementations • 20 Jul 2022 • Chia-Chi Chuang, Donglin Yang, Chuan Wen, Yang Gao

This is especially the case with image observations, where a single image only includes one view of the scene, and it suffers from a lack of motion information and object occlusions.

Imitation Learning

Paper
Add Code

EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer

1 code implementation • 20 Jul 2022 • Chenyu Yang, Wanrong He, Yingqing Xu, Yang Gao

Most existing methods view makeup transfer as transferring color distributions of different facial regions and ignore details such as eye shadows and blushes.

139

Paper
Code

Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming

no code implementations • 22 Jun 2022 • Chuan Wen, Jianing Qian, Jierui Lin, Jiaye Teng, Dinesh Jayaraman, Yang Gao

Across applications spanning supervised classification and sequential control, deep learning has been reported to find "shortcut" solutions that fail catastrophically under minor changes in the data distribution.

Autonomous Driving Classification +5

Paper
Add Code

Auto-Encoding Adversarial Imitation Learning

no code implementations • 22 Jun 2022 • Kaifeng Zhang, Rui Zhao, Ziming Zhang, Yang Gao

In this work, we propose Auto-Encoding Adversarial Imitation Learning (AEAIL), a robust and scalable AIL framework.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

An Empirical Study on Disentanglement of Negative-free Contrastive Learning

1 code implementation • 9 Jun 2022 • Jinkun Cao, Ruiqian Nai, Qing Yang, Jialei Huang, Yang Gao

In this paper, we examine negative-free contrastive learning methods to study the disentanglement property empirically.

Contrastive Learning Disentanglement

Paper
Code

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

no code implementations • 28 Apr 2022 • Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

4D human sensing and modeling are fundamental tasks in vision and graphics with numerous applications.

Fine-grained Action Recognition Pose Estimation

Paper
Add Code

$G^2$: Enhance Knowledge Grounded Dialogue via Ground Graph

no code implementations • 27 Apr 2022 • Yizhe Yang, Yang Gao, Jiawei Li, Heyan Huang

Besides, a Ground Graph Aware Transformer ($G^2AT$) is proposed to enhance knowledge grounded response generation.

Response Generation

Paper
Add Code

On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice

no code implementations • 11 Apr 2022 • Ankit Shah, Hira Dhamyal, Yang Gao, Daniel Arancibia, Mario Arancibia, Bhiksha Raj, Rita Singh

Lately, there has been a global effort by multiple research groups to detect COVID-19 from voice.

Paper
Add Code

PSP: Pre-trained Soft Prompts for Few-Shot Abstractive Summarization

no code implementations • COLING 2022 • Xiaochen Liu, Yang Gao, Yu Bai, Jiawei Li, Yinan Hu, Heyan Huang, Boxing Chen

Few-shot abstractive summarization has become a challenging task in natural language generation.

Abstractive Text Summarization Text Generation

Paper
Add Code

BFRnet: A deep learning-based MR background field removal method for QSM of the brain containing significant pathological susceptibility sources

1 code implementation • 6 Apr 2022 • Xuanyu Zhu, Yang Gao, Feng Liu, Stuart Crozier, Hongfu Sun

The BFRnet method is compared with three conventional BFR methods and one previous deep learning method using simulated and in vivo brains from 4 healthy and 2 hemorrhagic subjects.

Paper
Code

DePA: Improving Non-autoregressive Machine Translation with Dependency-Aware Decoder

1 code implementation • 30 Mar 2022 • Jiaao Zhan, Qian Chen, Boxing Chen, Wen Wang, Yu Bai, Yang Gao

We propose a novel and general Dependency-Aware Decoder (DePA) to enhance target dependency modeling in the decoder of fully NAT models from two perspectives: decoder self-attention and decoder input.

Machine Translation Translation

Paper
Code

MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization

3 code implementations • 27 Mar 2022 • Yue Duan, Zhen Zhao, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi, Yang Gao

The core issue in semi-supervised learning (SSL) lies in how to effectively leverage unlabeled data, whereas most existing methods tend to put a great emphasis on the utilization of high-confidence samples yet seldom fully explore the usage of low-confidence samples.

Ranked #1 on Semi-Supervised Image Classification on Mini-ImageNet, 1000 Labels

Semi-Supervised Image Classification

Paper
Code

Playing Lottery Tickets in Style Transfer Models

no code implementations • 25 Mar 2022 • Meihao Kong, Jing Huo, Wenbin Li, Jing Wu, Yu-Kun Lai, Yang Gao

(2) Using iterative magnitude pruning, we find the matching subnetworks at 89. 2% sparsity in AdaIN and 73. 7% sparsity in SANet, which demonstrates that style transfer models can play lottery tickets too.

Style Transfer

Paper
Add Code

Towards Explainable Evaluation Metrics for Natural Language Generation

1 code implementation • 21 Mar 2022 • Christoph Leiter, Piyawat Lertvittayakumjorn, Marina Fomicheva, Wei Zhao, Yang Gao, Steffen Eger

We also provide a synthesizing overview over recent approaches for explainable machine translation metrics and discuss how they relate to those goals and properties.

Machine Translation Text Generation +2

Paper
Code

TCM-SD: A Benchmark for Probing Syndrome Differentiation via Natural Language Processing

1 code implementation • CCL 2022 • Mucheng Ren, Heyan Huang, Yuxiang Zhou, Qianwen Cao, Yuan Bu, Yang Gao

Therefore, in this paper, we focus on the core task of the TCM diagnosis and treatment system -- syndrome differentiation (SD) -- and we introduce the first public large-scale dataset for SD, called TCM-SD.

Language Modelling

Paper
Code

Ask to Understand: Question Generation for Multi-hop Question Answering

no code implementations • 17 Mar 2022 • Jiawei Li, Mucheng Ren, Yang Gao, Yizhe Yang

Specifically, we carefully design an end-to-end QG module on the basis of a classical QA module, which could help the model understand the context by asking inherently logical sub-questions, thus inheriting interpretability from the QD-based method and showing superior performance.

Multi-hop Question Answering Question Answering +2

Paper
Add Code

CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation

1 code implementation • 17 Mar 2022 • Renhao Wang, Hang Zhao, Yang Gao

Many recent approaches in contrastive learning have worked to close the gap between pretraining on iconic images like ImageNet and pretraining on complex scenes like COCO.

Contrastive Learning Object +1

Paper
Code

Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game

no code implementations • 11 Mar 2022 • Linjian Meng, Yang Gao

In this paper, we propose a generalized framework for this learning setting.

Paper
Add Code

Keeping Minimal Experience to Achieve Efficient Interpretable Policy Distillation

no code implementations • 2 Mar 2022 • Xiao Liu, Shuyang Liu, Wenbin Li, Shangdong Yang, Yang Gao

Although deep reinforcement learning has become a universal solution for complex control tasks, its real-world applicability is still limited because lacking security guarantees for policies.

Paper
Add Code

Transformers in Medical Image Analysis: A Review

no code implementations • 24 Feb 2022 • Kelei He, Chen Gan, Zhuoyuan Li, Islem Rekik, Zihao Yin, Wen Ji, Yang Gao, Qian Wang, Junfeng Zhang, Dinggang Shen

Transformers have dominated the field of natural language processing, and recently impacted the computer vision area.

Image Generation

Paper
Add Code

Online Attentive Kernel-Based Temporal Difference Learning

no code implementations • 22 Jan 2022 • Guang Yang, Xingguo Chen, Shangdong Yang, Huihui Wang, Shaokang Dong, Yang Gao

Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA.

Acrobot Reinforcement Learning (RL)

Paper
Add Code

MVDG: A Unified Multi-view Framework for Domain Generalization

1 code implementation • 23 Dec 2021 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Beyond the training stage, overfitting could also cause unstable prediction in the test stage.

Domain Generalization Meta-Learning

Paper
Code

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

2 code implementations • 22 Dec 2021 • Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data.

Reinforcement Learning (RL)

Paper
Code

PLACE dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization

1 code implementation • 7 Dec 2021 • Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

Particularly, the proposed method can generate a variety of data variants to better deal with the overfitting issue.

Domain Generalization

Paper
Code

Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration

2 code implementations • NeurIPS 2021 • Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang

Efficient exploration in deep cooperative multi-agent reinforcement learning (MARL) still remains challenging in complex coordination problems.

Efficient Exploration Multi-agent Reinforcement Learning +4

Paper
Code

Instant tissue field and magnetic susceptibility mapping from MR raw phase using Laplacian enabled deep neural networks

2 code implementations • 15 Nov 2021 • Yang Gao, Zhuang Xiong, Amir Fazlollahi, Peter J Nestor, Viktor Vegh, Fatima Nasrallah, Craig Winter, G. Bruce Pike, Stuart Crozier, Feng Liu, Hongfu Sun

In addition, experiments on patients with intracranial hemorrhage and multiple sclerosis were also performed to test the generalization of the novel neural networks.

Paper
Code

Mastering Atari Games with Limited Data

3 code implementations • NeurIPS 2021 • Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao

Recently, there has been significant progress in sample efficient image-based RL algorithms; however, consistent human-level performance on the Atari game benchmark remains an elusive goal.

Ranked #2 on Atari Games 100k on Atari 100k

Atari Games Atari Games 100k

2,373

Paper
Code

Discovering Non-monotonic Autoregressive Orderings with Variational Inference

1 code implementation • 27 Oct 2021 • Xuanlin Li, Brandon Trabucco, Dong Huk Park, Michael Luo, Sheng Shen, Trevor Darrell, Yang Gao

Permutations then serve as target generation orders for training an insertion-based Transformer language model.

Image Captioning Language Modelling +3

Paper
Code

NAS-FCOS: Efficient Search for Object Detection Architectures

1 code implementation • 24 Oct 2021 • Ning Wang, Yang Gao, Hao Chen, Peng Wang, Zhi Tian, Chunhua Shen, Yanning Zhang

Neural Architecture Search (NAS) has shown great potential in effectively reducing manual effort in network design by automatically discovering optimal architectures.

Neural Architecture Search Object +2

187

Paper
Code

Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation

1 code implementation • 17 Oct 2021 • Yinghuan Shi, Jian Zhang, Tong Ling, Jiwen Lu, Yefeng Zheng, Qian Yu, Lei Qi, Yang Gao

In semi-supervised medical image segmentation, most previous works draw on the common assumption that higher entropy means higher uncertainty.

Image Segmentation Segmentation +2

Paper
Code

Unifying Cross-lingual Summarization and Machine Translation with Compression Rate

1 code implementation • 15 Oct 2021 • Yu Bai, Heyan Huang, Kai Fan, Yang Gao, Yiming Zhu, Jiaao Zhan, Zewen Chi, Boxing Chen

Through introducing compression rate, the information ratio between the source and the target text, we regard the MT task as a special CLS task with a compression rate of 100%.

Data Augmentation Machine Translation +1

Paper
Code

Better Pseudo-label: Joint Domain-aware Label and Dual-classifier for Semi-supervised Domain Generalization

no code implementations • 10 Oct 2021 • Ruiqi Wang, Lei Qi, Yinghuan Shi, Yang Gao

Also, considering inconsistent goals between generalization and pseudo-labeling: former prevents overfitting on all source domains while latter might overfit the unlabeled source domains for high accuracy, we employ a dual-classifier to independently perform pseudo-labeling and domain generalization in the training process.

Domain Generalization Pseudo Label +1

Paper
Add Code

The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results

1 code implementation • EMNLP (Eval4NLP) 2021 • Marina Fomicheva, Piyawat Lertvittayakumjorn, Wei Zhao, Steffen Eger, Yang Gao

In this paper, we introduce the Eval4NLP-2021shared task on explainable quality estimation.

Sentence Translation

Paper
Code

Disentangling Properties of Contrastive Methods

no code implementations • 29 Sep 2021 • Jinkun Cao, Qing Yang, Jialei Huang, Yang Gao

In this paper, we explored the possibility of using contrastive methods to learn a disentangled representation, a discriminative approach that is drastically different from previous approaches.

Disentanglement Object Recognition

Paper
Add Code

Auto-Encoding Inverse Reinforcement Learning

no code implementations • 29 Sep 2021 • Kaifeng Zhang, Rui Zhao, Ziming Zhang, Yang Gao

Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts

no code implementations • 29 Sep 2021 • Chuan Wen, Jianing Qian, Jierui Lin, Dinesh Jayaraman, Yang Gao

When operating in partially observed settings, it is important for a control policy to fuse information from a history of observations.

Autonomous Driving Continuous Control +1

Paper
Add Code

Attention-based Interpretation and Response to The Trade-Off of Adversarial Training

no code implementations • 29 Sep 2021 • Changbin Shao, Wenbin Li, ZhenHua Feng, Jing Huo, Yang Gao

To boost the robustness of a model against adversarial examples, adversarial training has been regarded as a benchmark method.

Paper
Add Code

To be Closer: Learning to Link up Aspects with Opinions

1 code implementation • EMNLP 2021 • Yuxiang Zhou, Lejian Liao, Yang Gao, Zhanming Jie, Wei Lu

Dependency parse trees are helpful for discovering the opinion words in aspect-based sentiment analysis (ABSA).

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Paper
Code

LibFewShot: A Comprehensive Library for Few-shot Learning

1 code implementation • 10 Sep 2021 • Wenbin Li, Ziyi, Wang, Xuesong Yang, Chuanqi Dong, Pinzhuo Tian, Tiexin Qin, Jing Huo, Yinghuan Shi, Lei Wang, Yang Gao, Jiebo Luo

Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmarks with various backbone architectures to evaluate common pitfalls and effects of different training tricks.

Data Augmentation Few-Shot Image Classification +2

803

Paper
Code

Supporting Complaints Investigation for Nursing and Midwifery Regulatory Agencies

no code implementations • ACL 2021 • Piyawat Lertvittayakumjorn, Ivan Petej, Yang Gao, Yamuna Krishnamurthy, Anna Van Der Gaag, Robert Jago, Kostas Stathis

Health professional regulators aim to protect the health and well-being of patients and the public by setting standards for scrutinising and overseeing the training and conduct of health and care professionals.

Decision Making

Paper
Add Code

Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

1 code implementation • 24 Jul 2021 • Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

Together, the above two schemes give rise to a novel double-branch encoder segmentation framework for medical image segmentation, namely Crosslink-Net.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Trip-ROMA: Self-Supervised Learning with Triplets and Random Mappings

1 code implementation • 22 Jul 2021 • Wenbin Li, Xuesong Yang, Meihao Kong, Lei Wang, Jing Huo, Yang Gao, Jiebo Luo

However, in small data regimes, we can not obtain a sufficient number of negative pairs or effectively avoid the over-fitting problem when negatives are not used at all.

Representation Learning Self-Supervised Learning +1

Paper
Code

Differentiable Architecture Pruning for Transfer Learning

no code implementations • 7 Jul 2021 • Nicolo Colombo, Yang Gao

We propose a new gradient-based approach for extracting sub-architectures from a given large model.

Transfer Learning

Paper
Add Code

Keyframe-Focused Visual Imitation Learning

no code implementations • 11 Jun 2021 • Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

Imitation learning trains control policies by mimicking pre-recorded expert demonstrations.

Continuous Control Graph Learning +1

Paper
Add Code

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

1 code implementation • CVPR 2022 • Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao

In this work, we first construct a strong baseline of self-training (namely ST) for semi-supervised semantic segmentation via injecting strong data augmentations (SDA) on unlabeled images to alleviate overfitting noisy labels as well as decouple similar predictions between the teacher and student.

Semi-Supervised Semantic Segmentation

223

Paper
Code

Feature-based Style Randomization for Domain Generalization

no code implementations • 6 Jun 2021 • Yue Wang, Lei Qi, Yinghuan Shi, Yang Gao

As a recent noticeable topic, domain generalization (DG) aims to first learn a generic model on multiple source domains and then directly generalize to an arbitrary unseen target domain without any additional adaption.

Data Augmentation Domain Generalization

Paper
Add Code

Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio

no code implementations • 4 Jun 2021 • Gokce Keskin, Minhua Wu, Brian King, Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas

An ASR model that operates on both primary and auxiliary data can achieve better accuracy compared to a primary-only solution; and a model that can serve both primary-only (PO) and primary-plus-auxiliary (PPA) modes is highly desirable.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Prediction or Comparison: Toward Interpretable Qualitative Reasoning

no code implementations • Findings (ACL) 2021 • Mucheng Ren, Heyan Huang, Yang Gao

Qualitative relationships illustrate how changing one property (e. g., moving velocity) affects another (e. g., kinetic energy) and constitutes a considerable portion of textual knowledge.

Question Answering

Paper
Add Code

Deep grey matter quantitative susceptibility mapping from small spatial coverages using deep learning

no code implementations • 1 Jun 2021 • Xuanyu Zhu, Yang Gao, Feng Liu, Stuart Crozier, Hongfu Sun

Method: A recently proposed deep learning-based QSM method, namely xQSM, is investigated to assess the accuracy of dipole inversion on reduced brain coverages.

Paper
Add Code

Cross-Lingual Abstractive Summarization with Limited Parallel Resources

1 code implementation • ACL 2021 • Yu Bai, Yang Gao, Heyan Huang

Employing one unified decoder to generate the sequential concatenation of monolingual and cross-lingual summaries, MCLAS makes the monolingual summarization task a prerequisite of the cross-lingual summarization (CLS) task.

Abstractive Text Summarization Cross-Lingual Abstractive Summarization +1

Paper
Code

Deep Learning Traversability Estimator for Mobile Robots in Unstructured Environments

1 code implementation • 23 May 2021 • Marco Visca, Sampo Kuutti, Roger Powell, Yang Gao, Saber Fallah

Terrain traversability analysis plays a major role in ensuring safe robotic navigation in unstructured environments.

Paper
Code

Cross-Modality Brain Tumor Segmentation via Bidirectional Global-to-Local Unsupervised Domain Adaptation

1 code implementation • 17 May 2021 • Kelei He, Wen Ji, Tao Zhou, Zhuoyuan Li, Jing Huo, Xin Zhang, Yang Gao, Dinggang Shen, Bing Zhang, Junfeng Zhang

Specifically, a bidirectional image synthesis and segmentation module is proposed to segment the brain tumor using the intermediate data distributions generated for the two domains, which includes an image-to-image translator and a shared-weighted segmentation network.

Brain Tumor Segmentation Image Generation +3

Paper
Code

Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition

no code implementations • 12 May 2021 • Bhargav Pulugundla, Yang Gao, Brian King, Gokce Keskin, Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas

The end-to-end 2D Conv-Attention model is compared with a multi-head self-attention and superdirective-based neural beamformers.

speech-recognition Speech Recognition

Paper
Add Code

Adapting by Pruning: A Case Study on BERT

1 code implementation • 7 May 2021 • Yang Gao, Nicolo Colombo, Wei Wang

Adapting pre-trained neural models to downstream tasks has become the standard practice for obtaining high-quality models.

124

Paper
Code

Local descriptor-based multi-prototype network for few-shot Learning

no code implementations • Pattern Recognition 2021 • Hongwei Huang 1, Zhangkai Wu 1, Wenbin Li 2, Jing Huo 2, ∗, Yang Gao

Prototype-based few-shot learning methods are promising in that they are simple yet effective to handle any-shot problems, and many prototype associated works are raised since then.

Few-Shot Image Classification Few-Shot Learning

Paper
Add Code

CAT: Cross-Attention Transformer for One-Shot Object Detection

no code implementations • 30 Apr 2021 • Weidong Lin, Yuyan Deng, Yang Gao, Ning Wang, Jinghao Zhou, Lingqiao Liu, Lei Zhang, Peng Wang

Given a query patch from a novel class, one-shot object detection aims to detect all instances of that class in a target image through the semantic similarity comparison.

Object object-detection +3

Paper
Add Code

Conv1D Energy-Aware Path Planner for Mobile Robots in Unstructured Environments

no code implementations • 4 Apr 2021 • Marco Visca, Arthur Bouton, Roger Powell, Yang Gao, Saber Fallah

Driving energy consumption plays a major role in the navigation of mobile robots in challenging environments, especially if they are left to operate unattended under limited on-board power.

Self-Supervised Learning

Paper
Add Code

Towards Self-Adaptive Metric Learning On the Fly

no code implementations • 3 Apr 2021 • Yang Gao, Yi-Fan Li, Swarup Chandra, Latifur Khan, Bhavani Thuraisingham

In this paper, we present a new online metric learning framework that attempts to tackle the challenge by learning an ANN-based metric with adaptive model complexity from a stream of constraints.

Image Classification Image Retrieval +2

Paper
Add Code

SetConv: A New Approach for Learning from Imbalanced Data

no code implementations • EMNLP 2020 • Yang Gao, Yi-Fan Li, Yu Lin, Charu Aggarwal, Latifur Khan

For many real-world classification problems, e. g., sentiment classification, most existing machine learning methods are biased towards the majority class when the Imbalance Ratio (IR) is high.

BIG-bench Machine Learning Classification +3

Paper
Add Code

Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation

1 code implementation • CVPR 2021 • Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli

In this paper, we propose an end-to-end Prototypical Cross-domain Self-Supervised Learning (PCS) framework for Few-shot Unsupervised Domain Adaptation (FUDA).

Ranked #6 on Semantic Segmentation on DensePASS

Contrastive Learning Self-Supervised Learning +2

Paper
Code

Mining Latent Classes for Few-shot Segmentation

1 code implementation • ICCV 2021 • Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao

Our method aims to alleviate this problem and enhance the feature embedding on latent novel classes.

Ranked #41 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)

Few-Shot Semantic Segmentation

Paper
Code

NDT-Transformer: Large-Scale 3D Point Cloud Localisation using the Normal Distribution Transform Representation

1 code implementation • 23 Mar 2021 • Zhicheng Zhou, Cheng Zhao, Daniel Adolfsson, Songzhi Su, Yang Gao, Tom Duckett, Li Sun

Benefiting from the NDT representation and NDT-Transformer network, the learned global descriptors are enriched with both geometrical and contextual information.

Ranked #13 on Point Cloud Retrieval on Oxford RobotCar (LiDAR 4096 points)

Autonomous Driving Loop Closure Detection +2

Paper
Code

Accelerating Quantitative Susceptibility Mapping using Compressed Sensing and Deep Neural Network

2 code implementations • 17 Mar 2021 • Yang Gao, Martijn Cloos, Feng Liu, Stuart Crozier, G. Bruce Pike, Hongfu Sun

In this study, a learning-based Deep Complex Residual Network (DCRNet) is proposed to recover both the magnitude and phase images from incoherently undersampled data, enabling high acceleration of QSM acquisition.

SSIM

Paper
Code

Mutual Information State Intrinsic Control

2 code implementations • ICLR 2021 • Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu

Reinforcement learning has been shown to be highly successful at many challenging tasks.

Paper
Code

Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning

no code implementations • 10 Mar 2021 • Bernie Wang, Simon Xu, Kurt Keutzer, Yang Gao, Bichen Wu

To address this, we propose a novel self-supervised learning task, which we named Trajectory Contrastive Learning (TCL), to improve meta-training.

Contrastive Learning Meta Reinforcement Learning +3

Paper
Add Code

A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

no code implementations • 7 Feb 2021 • Zekun Li, Wei Zhao, Feng Shi, Lei Qi, Xingzhi Xie, Ying WEI, Zhongxiang Ding, Yang Gao, Shangjie Wu, Jun Liu, Yinghuan Shi, Dinggang Shen

How to fast and accurately assess the severity level of COVID-19 is an essential problem, when millions of people are suffering from the pandemic around the world.

COVID-19 Diagnosis Data Augmentation +3

Paper
Add Code

Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation

no code implementations • 18 Jan 2021 • Xiaoting Han, Lei Qi, Qian Yu, Ziqi Zhou, Yefeng Zheng, Yinghuan Shi, Yang Gao

These typical methods usually utilize a translation network to transform images from the source domain to target domain or train the pixel-level classifier merely using translated source images and original target images.

Image Segmentation Medical Image Segmentation +4

Paper
Add Code

Reinforcement Learning with Latent Flow

2 code implementations • NeurIPS 2021 • Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin

Temporal information is essential to learning effective policies with Reinforcement Learning (RL).

Ranked #1 on Montezuma's Revenge on Atari 2600 Montezuma's Revenge

Continuous Control Montezuma's Revenge +4

Paper
Code

LoFGAN: Fusing Local Representations for Few-Shot Image Generation

1 code implementation • ICCV 2021 • Zheng Gu, Wenbin Li, Jing Huo, Lei Wang, Yang Gao

Given only a few available images for a novel unseen category, few-shot image generation aims to generate more data for this category.

Generative Adversarial Network Image Generation

Paper
Code

Discovering Autoregressive Orderings with Variational Inference

1 code implementation • ICLR 2021 • Xuanlin Li, Brandon Trabucco, Dong Huk Park, Michael Luo, Sheng Shen, Trevor Darrell, Yang Gao

One strategy to recover this information is to decode both the content and location of tokens.

Code Generation Image Captioning +2

Paper
Code

Discrete Predictive Representation for Long-horizon Planning

no code implementations • 1 Jan 2021 • Thanard Kurutach, Julia Peng, Yang Gao, Stuart Russell, Pieter Abbeel

Discrete representations have been key in enabling robots to plan at more abstract levels and solve temporally-extended tasks more efficiently for decades.

Object Reinforcement Learning (RL)

Paper
Add Code

Maximizing absorption in photon trapping ultra-fast silicon photodetectors

no code implementations • 22 Dec 2020 • Cesar Bartolo-Perez, Wayesh Qarony, Soroush Ghandiparsi, Ahmed S. Mayet, Ahasan Ahamed, Hilal Cansizoglu, Yang Gao, Ekaterina Ponizovskaya Devine, Toshishige Yamada, Aly F Elrefaie, Shih-Yuan Wang, M. Saif Islam

Photon trapping structures address this trade-off by enhancing the light-matter interactions, but maximizing their performance remains a challenge due to a multitude of factors influencing their design and fabrication.

Optics Applied Physics

Paper
Add Code

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

no code implementations • NeurIPS 2020 • Chuan Wen, Jierui Lin, Trevor Darrell, Dinesh Jayaraman, Yang Gao

Imitation learning trains policies to map from input observations to the actions that an expert would choose.

Imitation Learning

Paper
Add Code

CariMe: Unpaired Caricature Generation with Multiple Exaggerations

2 code implementations • 1 Oct 2020 • Zheng Gu, Chuanqi Dong, Jing Huo, Wenbin Li, Yang Gao

Previous caricature generation methods are obsessed with predicting definite image warping from a given photo while ignoring the intrinsic representation and distribution for exaggerations in caricatures.

Caricature Image-to-Image Translation

Paper
Code

Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification

no code implementations • 11 Sep 2020 • Nicolo Colombo, Yang Gao

To find the optimal weight-agnostic network, we use a novel and computationally efficient method that translates the hard architecture-search problem into a feasible optimization problem. More specifically, we look at the optimal task-specific architectures as the optimal configuration of binary networks with {0, 1}-valued weights, which can be found through an approximate gradient descent strategy.

General Classification

Paper
Add Code

ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation

no code implementations • 7 Sep 2020 • Sicheng Zhao, Yezhen Wang, Bo Li, Bichen Wu, Yang Gao, Pengfei Xu, Trevor Darrell, Kurt Keutzer

They require prior knowledge of real-world statistics and ignore the pixel-level dropout noise gap and the spatial feature gap between different domains.

Autonomous Driving Domain Adaptation +3

Paper
Add Code

Learning-based Computer-aided Prescription Model for Parkinson's Disease: A Data-driven Perspective

no code implementations • 31 Jul 2020 • Yinghuan Shi, Wanqi Yang, Kim-Han Thung, Hao Wang, Yang Gao, Yang Pan, Li Zhang, Dinggang Shen

Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug.

Paper
Add Code

Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition

1 code implementation • ECCV 2020 • Wen Ji, Kelei He, Jing Huo, Zheng Gu, Yang Gao

The implementation of the proposed method is available at https://github. com/KeleiHe/DAAN.

Attribute Caricature +1

Paper
Code

HF-UNet: Learning Hierarchically Inter-Task Relevance in Multi-Task U-Net for Accurate Prostate Segmentation

no code implementations • 21 May 2020 • Kelei He, Chunfeng Lian, Bing Zhang, Xin Zhang, Xiaohuan Cao, Dong Nie, Yang Gao, Junfeng Zhang, Dinggang Shen

In this paper, we tackle the challenging task of prostate segmentation in CT images by a two-stage network with 1) the first stage to fast localize, and 2) the second stage to accurately segment the prostate.

Multi-Task Learning Segmentation

Paper
Add Code

Manifold Alignment for Semantically Aligned Style Transfer

1 code implementation • ICCV 2021 • Jing Huo, Shiyin Jin, Wenbin Li, Jing Wu, Yu-Kun Lai, Yinghuan Shi, Yang Gao

In this paper, we make a new assumption that image features from the same semantic region form a manifold and an image with multiple semantic regions follows a multi-manifold distribution.

Semantic Segmentation Style Transfer

Paper
Code

MetricUNet: Synergistic Image- and Voxel-Level Learning for Precise CT Prostate Segmentation via Online Sampling

no code implementations • 15 May 2020 • Kelei He, Chunfeng Lian, Ehsan Adeli, Jing Huo, Yang Gao, Bing Zhang, Junfeng Zhang, Dinggang Shen

Therefore, the proposed network has a dual-branch architecture that tackles two tasks: 1) a segmentation sub-network aiming to generate the prostate segmentation, and 2) a voxel-metric learning sub-network aiming to improve the quality of the learned feature space supervised by a metric loss.

Metric Learning Multi-Task Learning +2

Paper
Add Code

Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

no code implementations • 8 May 2020 • Kelei He, Wei Zhao, Xingzhi Xie, Wen Ji, Mingxia Liu, Zhenyu Tang, Feng Shi, Yang Gao, Jun Liu, Junfeng Zhang, Dinggang Shen

Considering that only a few infection regions in a CT image are related to the severity assessment, we first represent each input image by a bag that contains a set of 2D image patches (with each cropped from a specific slice).

Segmentation

Paper
Add Code

SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization

1 code implementation • ACL 2020 • Yang Gao, Wei Zhao, Steffen Eger

Compared to the state-of-the-art unsupervised evaluation metrics, SUPERT correlates better with human ratings by 18-39%.

Document Summarization Multi-Document Summarization +4

Paper
Code

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

1 code implementation • ACL 2020 • Wei Zhao, Goran Glavaš, Maxime Peyrard, Yang Gao, Robert West, Steffen Eger

We systematically investigate a range of metrics based on state-of-the-art cross-lingual semantic representations obtained with pretrained M-BERT and LASER.

Language Modelling Machine Translation +4

Paper
Code

xQSM-Quantitative Susceptibility Mapping with Octave Convolutional Neural Networks

1 code implementation • 14 Apr 2020 • Yang Gao, Xuanyu Zhu, Stuart Crozier, Feng Liu, Hongfu Sun

Quantitative susceptibility mapping (QSM) is a valuable magnetic resonance imaging (MRI) contrast mechanism that has demonstrated broad clinical applications.

Image and Video Processing

Paper
Code

Diversity Helps: Unsupervised Few-shot Learning via Distribution Shift-based Data Augmentation

1 code implementation • 13 Apr 2020 • Tiexin Qin, Wenbin Li, Yinghuan Shi, Yang Gao

Importantly, we highlight the value and importance of the distribution diversity in the augmentation-based pretext few-shot tasks, which can effectively alleviate the overfitting problem and make the few-shot model learn more robust feature representations.

Ranked #12 on Unsupervised Few-Shot Image Classification on Tiered ImageNet 5-way (5-shot)

Data Augmentation Unsupervised Few-Shot Image Classification +1

Paper
Code

Crossover-Net: Leveraging the Vertical-Horizontal Crossover Relation for Robust Segmentation

no code implementations • 3 Apr 2020 • Qian Yu, Yinghuan Shi, Yefeng Zheng, Yang Gao, Jianbing Zhu, Yakang Dai

Robust segmentation for non-elongated tissues in medical images is hard to realize due to the large variation of the shape, size, and appearance of these tissues in different patients.

Relation Segmentation

Paper
Add Code

Generalizable Model-agnostic Semantic Segmentation via Target-specific Normalization

1 code implementation • 27 Mar 2020 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Semantic segmentation in a supervised learning manner has achieved significant progress in recent years.

Domain Generalization Segmentation +1

Paper
Code

Deep Learning on Knowledge Graph for Recommender System: A Survey

no code implementations • 25 Mar 2020 • Yang Gao, Yi-Fan Li, Yu Lin, Hang Gao, Latifur Khan

Recent advances in research have demonstrated the effectiveness of knowledge graphs (KG) in providing valuable external knowledge to improve recommendation systems (RS).

Graph Embedding Knowledge Graphs +1

Paper
Add Code

Phylogenetic Study of 2019-nCoV by Using Alignment Free Method (Evolutionary Bifurcation of Novel Coronavirus Mutants)

no code implementations • 3 Mar 2020 • Yang Gao, Tao Li, Liaofu Luo

It is found that there exist three types of virus mutations, namely, the mutation among sub-branches of the same branch, the off-root mutation and the root-oriented mutation between large branches of the tree.

Paper
Add Code

Automatic Data Augmentation via Deep Reinforcement Learning for Effective Kidney Tumor Segmentation

no code implementations • 22 Feb 2020 • Tiexin Qin, Ziyuan Wang, Kelei He, Yinghuan Shi, Yang Gao, Dinggang Shen

Conventional data augmentation realized by performing simple pre-processing operations (\eg, rotation, crop, \etc) has been validated for its advantage in enhancing the performance for medical image segmentation.

Data Augmentation Image Segmentation +5

Paper
Add Code

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

no code implementations • 5 Feb 2020 • Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu

In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Asymmetric Distribution Measure for Few-shot Learning

no code implementations • 1 Feb 2020 • Wenbin Li, Lei Wang, Jing Huo, Yinghuan Shi, Yang Gao, Jiebo Luo

Given the natural asymmetric relation between a query image and a support class, we argue that an asymmetric measure is more suitable for metric-based few-shot learning.

Few-Shot Image Classification Few-Shot Learning

Paper
Add Code

MW-GAN: Multi-Warping GAN for Caricature Generation with Multi-Style Geometric Exaggeration

no code implementations • 7 Jan 2020 • Haodi Hou, Jing Huo, Jing Wu, Yu-Kun Lai, Yang Gao

Given an input face photo, the goal of caricature generation is to produce stylized, exaggerated caricatures that share the same identity as the photo.

Caricature Style Transfer

Paper
Add Code

Multi-Agent Game Abstraction via Graph Attention Neural Network

no code implementations • 25 Nov 2019 • Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao

Traditional methods attempt to use pre-defined rules to capture the interaction relationship between agents.

Graph Attention Multi-agent Reinforcement Learning

Paper
Add Code

Differentiable Meta-learning Model for Few-shot Semantic Segmentation

no code implementations • 23 Nov 2019 • Pinzhuo Tian, Zhangkai Wu, Lei Qi, Lei Wang, Yinghuan Shi, Yang Gao

To address the annotation scarcity issue in some cases of semantic segmentation, there have been a few attempts to develop the segmentation model in the few-shot learning paradigm.

Few-Shot Semantic Segmentation Object +2

Paper
Add Code

Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and Summarisation

1 code implementation • 22 Nov 2019 • Edwin Simpson, Yang Gao, Iryna Gurevych

For many NLP applications, such as question answering and summarisation, the goal is to select the best solution from a large space of candidates to meet a particular user's needs.

Bayesian Optimisation Community Question Answering +1

Paper
Code

Defensive Few-shot Learning

1 code implementation • 16 Nov 2019 • Wenbin Li, Lei Wang, Xingxing Zhang, Lei Qi, Jing Huo, Yang Gao, Jiebo Luo

(2) how to narrow the distribution gap between clean and adversarial examples under the few-shot setting?

Adversarial Defense Few-Shot Learning

Paper
Code

Visual cryptography in single-pixel imaging

no code implementations • 12 Nov 2019 • Shuming Jiao, Jun Feng, Yang Gao, Ting Lei, Xiaocong Yuan

The secret image can be recovered when identical illumination patterns are projected onto multiple visual key images and a single detector is used to record the total light intensities.

Paper
Add Code

Does deep learning always outperform simple linear regression in optical imaging?

no code implementations • 31 Oct 2019 • Shuming Jiao, Yang Gao, Jun Feng, Ting Lei, Xiaocong Yuan

Despite the success, the limitations and drawbacks of deep learning in optical imaging have been seldom investigated.

regression

Paper
Add Code

Data hiding in complex-amplitude modulation using a digital micromirror device

no code implementations • 24 Oct 2019 • Shuming Jiao, Dongfang Zhang, Chonglei Zhang, Yang Gao, Ting Lei, Xiaocong Yuan

A digital micromirror device (DMD) is an amplitude-type spatial light modulator.

Paper
Add Code

Automatic Data Augmentation by Learning the Deterministic Policy

1 code implementation • 18 Oct 2019 • Yinghuan Shi, Tiexin Qin, Yong liu, Jiwen Lu, Yang Gao, Dinggang Shen

By introducing an unified optimization goal, DeepAugNet intends to combine the data augmentation and the deep model training in an end-to-end training manner which is realized by simultaneously training a hybrid architecture of dueling deep Q-learning algorithm and a surrogate deep model.

Data Augmentation Q-Learning

Paper
Code

Zero-shot Policy Learning with Spatial Temporal RewardDecomposition on Contingency-aware Observation

1 code implementation • 17 Oct 2019 • Huazhe Xu, Boyuan Chen, Yang Gao, Trevor Darrell

The agent is first presented with previous experiences in the training environment, along with task description in the form of trajectory-level sparse rewards.

Continuous Control Model Predictive Control +2

Paper
Code

GUIDEGAN: ATTENTION BASED SPATIAL GUIDANCE FOR IMAGE-TO-IMAGE TRANSLATION

no code implementations • 25 Sep 2019 • Yu Lin, Yigong Wang, YiFan Li, Zhuoyi Wang, Yang Gao, Latifur Khan

To tackle this problem, we propose a GuideGAN based on attention mechanism.

Generative Adversarial Network Image-to-Image Translation +1

Paper
Add Code

Scoring-Aggregating-Planning: Learning task-agnostic priors from interactions and sparse rewards for zero-shot generalization

no code implementations • 25 Sep 2019 • Huazhe Xu, Boyuan Chen, Yang Gao, Trevor Darrell

In this paper, we propose Scoring-Aggregating-Planning (SAP), a framework that can learn task-agnostic semantics and dynamics priors from arbitrary quality interactions as well as the corresponding sparse rewards and then plan on unseen tasks in zero-shot condition.

Zero-shot Generalization

Paper
Add Code

From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

no code implementations • 6 Sep 2019 • Weixun Wang, Tianpei Yang, Yong liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao

In this paper, we design a novel Dynamic Multiagent Curriculum Learning (DyMA-CL) to solve large-scale problems by starting from learning on a multiagent scenario with a small size and progressively increasing the number of agents.

Paper
Add Code

MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance

4 code implementations • IJCNLP 2019 • Wei Zhao, Maxime Peyrard, Fei Liu, Yang Gao, Christian M. Meyer, Steffen Eger

A robust evaluation metric has a profound impact on the development of text generation systems.

Data-to-Text Generation Image Captioning +2

185

Paper
Code

Better Rewards Yield Better Summaries: Learning to Summarise Without References

2 code implementations • IJCNLP 2019 • Florian Böhm, Yang Gao, Christian M. Meyer, Ori Shapira, Ido Dagan, Iryna Gurevych

Human evaluation experiments show that, compared to the state-of-the-art supervised-learning systems and ROUGE-as-rewards RL summarisation systems, the RL systems using our learned rewards during training generate summarieswith higher human ratings.

Reinforcement Learning (RL)

Paper
Code

Progressive Cross-camera Soft-label Learning for Semi-supervised Person Re-identification

no code implementations • 15 Aug 2019 • Lei Qi, Lei Wang, Jing Huo, Yinghuan Shi, Yang Gao

In this paper, we focus on the semi-supervised person re-identification (Re-ID) case, which only has the intra-camera (within-camera) labels but not inter-camera (cross-camera) labels.

Semi-Supervised Person Re-Identification

Paper
Add Code

GreyReID: A Two-stream Deep Framework with RGB-grey Information for Person Re-identification

no code implementations • 14 Aug 2019 • Lei Qi, Lei Wang, Jing Huo, Yinghuan Shi, Yang Gao

Moreover, in the training process, we adopt the joint learning scheme to simultaneously train each branch by the independent loss function, which can enhance the generalization ability of each branch.

Person Re-Identification

Paper
Add Code

Adversarial Camera Alignment Network for Unsupervised Cross-camera Person Re-identification

no code implementations • 2 Aug 2019 • Lei Qi, Lei Wang, Jing Huo, Yinghuan Shi, Xin Geng, Yang Gao

To achieve the camera alignment, we develop a Multi-Camera Adversarial Learning (MCAL) to map images of different cameras into a shared subspace.

Person Re-Identification

Paper
Add Code

Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation

1 code implementation • 30 Jul 2019 • Yang Gao, Christian M. Meyer, Mohsen Mesgar, Iryna Gurevych

The predominant RL paradigm for summarisation learns a cross-input policy, which requires considerable time, data and parameter tuning due to the huge search spaces and the delayed rewards.

Decision Making Learning-To-Rank +2

Paper
Code

Action Semantics Network: Considering the Effects of Actions in Multiagent Systems

1 code implementation • ICLR 2020 • Weixun Wang, Tianpei Yang, Yong liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao

ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them.

Starcraft Starcraft II

Paper
Code

Deep Learning for Spacecraft Pose Estimation from Photorealistic Rendering

1 code implementation • 9 Jul 2019 • Pedro F. Proenca, Yang Gao

On-orbit proximity operations in space rendezvous, docking and debris removal require precise and robust 6D pose estimation under a wide range of lighting conditions and against highly textured background, i. e., the Earth.

6D Pose Estimation 6D Pose Estimation using RGB +1

Paper
Code

NAS-FCOS: Fast Neural Architecture Search for Object Detection

3 code implementations • CVPR 2020 • Ning Wang, Yang Gao, Hao Chen, Peng Wang, Zhi Tian, Chunhua Shen, Yanning Zhang

The success of deep neural networks relies on significant architecture engineering.

Ranked #124 on Object Detection on COCO test-dev

Neural Architecture Search Object +2

27,693

Paper
Code

Preference-based Interactive Multi-Document Summarisation

1 code implementation • 7 Jun 2019 • Yang Gao, Christian M. Meyer, Iryna Gurevych

Interactive NLP is a promising paradigm to close the gap between automatic NLP systems and the human upper bound.

Active Learning reinforcement-learning +1

Paper
Code

Known-plaintext attack and ciphertext-only attack for encrypted single-pixel imaging

no code implementations • 31 May 2019 • Shuming Jiao, Yang Gao, Ting Lei, Zhenwei Xie, Xiaocong Yuan

In many previous works, a single-pixel imaging (SPI) system is constructed as an optical image encryption system.

Cryptanalysis

Paper
Add Code

Optical machine learning with incoherent light and a single-pixel detector

no code implementations • 24 Apr 2019 • Shuming Jiao, Jun Feng, Yang Gao, Ting Lei, Zhenwei Xie, Xiaocong Yuan

Like an optical computer, the system can perform machine learning tasks such as number digit recognition in an all-optical manner.

BIG-bench Machine Learning

Paper
Add Code

GraphNAS: Graph Neural Architecture Search with Reinforcement Learning

1 code implementation • 22 Apr 2019 • Yang Gao, Hong Yang, Peng Zhang, Chuan Zhou, Yue Hu

On node classification tasks, GraphNAS can design a novel network architecture that rivals the best human-invented architecture in terms of test set accuracy.

Ranked #13 on Node Classification on PPI

General Classification Neural Architecture Search +3

170

Paper
Code

Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation

1 code implementation • NAACL 2019 • Ori Shapira, David Gabay, Yang Gao, Hadar Ronen, Ramakanth Pasunuru, Mohit Bansal, Yael Amsterdamer, Ido Dagan

Conducting a manual evaluation is considered an essential part of summary evaluation methodology.

Paper
Code

Thinkey: A Scalable Blockchain Architecture

no code implementations • 9 Apr 2019 • Shan Chen, Weiguo Dai, Yuanxi Dai, Hao Fu, Yang Gao, Jianqi Guo, Haoqing He, Yuhong Liu

This paper presents Thinkey, an efficient, secure, infinitely scalable and decentralized blockchain architecture.

Cryptography and Security

Paper
Add Code

A Novel Unsupervised Camera-aware Domain Adaptation Framework for Person Re-identification

no code implementations • ICCV 2019 • Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, Yang Gao

For the first issue, we highlight the presence of camera-level sub-domains as a unique characteristic of person Re-ID, and develop camera-aware domain adaptation to reduce the discrepancy not only between source and target domains but also across these sub-domains.

Ranked #19 on Unsupervised Domain Adaptation on Market to Duke

Person Re-Identification Representation Learning +1

Paper
Add Code

Risk Averse Robust Adversarial Reinforcement Learning

no code implementations • 31 Mar 2019 • Xinlei Pan, Daniel Seita, Yang Gao, John Canny

In this paper we introduce risk-averse robust adversarial reinforcement learning (RARARL), using a risk-averse protagonist and a risk-seeking adversary.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.