Search Results for author: Yang Wei

Found 17 papers, 8 papers with code

Enhance Reasoning for Large Language Models in the Game Werewolf

no code implementations 4 Feb 2024 Shuang Wu, Liwen Zhu, Tao Yang, Shiwei Xu, Qiang Fu, Yang Wei, Haobo Fu

This paper presents an innovative framework that integrates Large Language Models (LLMs) with an external Thinker module to enhance the reasoning capabilities of LLM-based agents.

Prompt Engineering

Exploring Self-Supervised Learning for SAR ATR: A Knowledge-Guided Predictive Perspective

no code implementations 26 Nov 2023 Weijie Li, Yang Wei, Tianpeng Liu, Yuenan Hou, YuXuan Li, Zhen Liu, Yongxiang Liu, Li Liu

Furthermore, we employ local masks and multi-scale features to accommodate the large image scale and target scale variations in remote sensing scenarios.

Representation Learning Self-Supervised Learning

Make Pixels Dance: High-Dynamic Video Generation

no code implementations 18 Nov 2023 Yan Zeng, Guoqiang Wei, Jiani Zheng, Jiaxin Zou, Yang Wei, Yuchen Zhang, Hang Li

Creating high-dynamic videos such as motion-rich actions and sophisticated visual effects poses a significant challenge in the field of artificial intelligence.

Text-to-Video Generation Video Generation

Patch Is Not All You Need

no code implementations 21 Aug 2023 Changzhen Li, Jie Zhang, Yang Wei, Zhilong Ji, Jinfeng Bai, Shiguang Shan

Vision Transformers have achieved great success in computer vision, delivering exceptional performance across various tasks.

What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?

2 code implementations 5 Jul 2023 Yan Zeng, Hanbo Zhang, Jiani Zheng, Jiangnan Xia, Guoqiang Wei, Yang Wei, Yuchen Zhang, Tao Kong

However, the performance of these models heavily relies on design choices such as network structures, training data, and training strategies, and these choices have not been extensively discussed in the literature, making it difficult to quantify progress in this field.

Instruction Following Language Modelling

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

2 code implementations 22 Dec 2021 Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data.

Reinforcement Learning (RL)

LightSeq2: Accelerated Training for Transformer-based Models on GPUs

1 code implementation 12 Oct 2021 Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei LI

In this paper, we present LightSeq2, a system to accelerate training for a general family of Transformer models on GPUs.

Machine Translation Speech Recognition +1

Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game

no code implementations ICLR 2022 Haobo Fu, Weiming Liu, Shuang Wu, Yijia Wang, Tao Yang, Kai Li, Junliang Xing, Bin Li, Bo Ma, Qiang Fu, Yang Wei

The deep policy gradient method has demonstrated promising results in many large-scale games, where the agent learns purely from its own experience.

counterfactual Policy Gradient Methods

In-Order Chart-Based Constituent Parsing

no code implementations 8 Feb 2021 Yang Wei, Yuanbin Wu, Man Lan

We propose a novel in-order chart-based model for constituent parsing.

SRQA: Synthetic Reader for Factoid Question Answering

1 code implementation 2 Sep 2020 Jiuniu Wang, Wenjia Xu, Xingyu Fu, Yang Wei, Li Jin, Ziyan Chen, Guangluan Xu, Yirong Wu

This model enhances the question answering system in the multi-document scenario from three aspects: model structure, optimization goal, and training method, corresponding to Multilayer Attention (MA), Cross Evidence (CE), and Adversarial Training (AT) respectively.

Question Answering

A Span-based Linearization for Constituent Trees

1 code implementation ACL 2020 Yang Wei, Yuanbin Wu, Man Lan

We propose a novel linearization of a constituent tree, together with a new locally normalized model.

Sentence

RRU-Net: The Ringed Residual U-Net for Image Splicing Forgery Detection

1 code implementation CVPR 2019 Workshop Xiuli Bi, Yang Wei, Bin Xiao, Weisheng Li

The core idea of RRU-Net is to strengthen the way a CNN learns; it is inspired by the recall and consolidation mechanisms of the human brain and implemented through the propagation and feedback of residuals in the CNN.

Attribute

TightCap: 3D Human Shape Capture with Clothing Tightness Field

1 code implementation 4 Apr 2019 Xin Chen, Anqi Pang, Yang Wei, Lan Xu, Jingyi Yu

In this paper, we present TightCap, a data-driven scheme to capture both the human shape and dressed garments accurately with only a single 3D human scan, which enables numerous applications such as virtual try-on, biometrics and body evaluation.

Virtual Try-on

Adversarial Metric Learning

no code implementations 9 Feb 2018 Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li

In the distinguishment stage, a metric is exhaustively learned to distinguish both the adversarial pairs and the original training pairs.

Metric Learning
