Search Results for author: Wei Liang

Found 23 papers, 10 papers with code

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

1 code implementation • 26 Mar 2024 • Zan Wang, Yixin Chen, Baoxiong Jia, Puhao Li, Jinlu Zhang, Jingze Zhang, Tengyu Liu, Yixin Zhu, Wei Liang, Siyuan Huang

Despite significant advancements in text-to-motion synthesis, generating language-guided human motion within 3D environments poses substantial challenges.

Motion Synthesis

Paper
Code

Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller

no code implementations • 12 Mar 2024 • Chuanqi Zang, Jiji Tang, Rongsheng Zhang, Zeng Zhao, Tangjie Lv, Mingtao Pei, Wei Liang

Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream.

Story Generation

Paper
Add Code

Language-driven All-in-one Adverse Weather Removal

no code implementations • 3 Dec 2023 • Hao Yang, Liyuan Pan, Yan Yang, Wei Liang

Then, with the guidance of degradation prior, we sparsely select restoration experts from a candidate list dynamically based on a Mixture-of-Experts (MoE) structure.

Paper
Add Code

DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation

no code implementations • ICCV 2023 • Hanqing Wang, Wei Liang, Luc van Gool, Wenguan Wang

VLN-CE is a recently released embodied task, where AI agents need to navigate a freely traversable environment to reach a distant target location, given language instructions.

Decision Making Navigate +1

Paper
Add Code

MEWL: Few-shot multimodal word learning with referential uncertainty

1 code implementation • 1 Jun 2023 • Guangyuan Jiang, Manjie Xu, Shiji Xin, Wei Liang, Yujia Peng, Chi Zhang, Yixin Zhu

To fill in this gap, we introduce the MachinE Word Learning (MEWL) benchmark to assess how machines learn word meaning in grounded visual scenes.

Paper
Code

Quantifying and Defending against Privacy Threats on Federated Knowledge Graph Embedding

no code implementations • 6 Apr 2023 • Yuke Hu, Wei Liang, Ruofan Wu, Kai Xiao, Weiqiang Wang, Xiaochen Li, Jinfei Liu, Zhan Qin

Knowledge Graph Embedding (KGE) is a fundamental technique that extracts expressive representation from knowledge graph (KG) to facilitate diverse downstream tasks.

Knowledge Graph Embedding

Paper
Add Code

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

2 code implementations • CVPR 2023 • Siyuan Huang, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu, Wei Liang, Song-Chun Zhu

SceneDiffuser provides a unified model for solving scene-conditioned generation, optimization, and planning.

Denoising Grasp Generation +2

314

Paper
Code

The state-of-the-art 3D anisotropic intracranial hemorrhage segmentation on non-contrast head CT: The INSTANCE challenge

no code implementations • 9 Jan 2023 • Xiangyu Li, Gongning Luo, Kuanquan Wang, Hongyu Wang, Jun Liu, Xinjie Liang, Jie Jiang, Zhenghao Song, Chunyue Zheng, Haokai Chi, Mingwang Xu, Yingte He, Xinghua Ma, Jingwen Guo, Yifan Liu, Chuanpu Li, Zeli Chen, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Antoine P. Sanner, Anirban Mukhopadhyay, Ahmed E. Othman, Xingyu Zhao, Weiping Liu, Jinhuang Zhang, Xiangyuan Ma, Qinghui Liu, Bradley J. MacIntosh, Wei Liang, Moona Mazher, Abdul Qayyum, Valeriia Abramova, Xavier Lladó, Shuo Li

It is intended to resolve the above-mentioned problems and promote the development of both intracranial hemorrhage segmentation and anisotropic data processing.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering

no code implementations • CVPR 2023 • Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang

For textual data, the model prefers the local phrase semantics which may deviate from the global semantics in long sentences.

Question Answering Video Question Answering

Paper
Add Code

Towards Versatile Embodied Navigation

1 code implementation • 30 Oct 2022 • Hanqing Wang, Wei Liang, Luc van Gool, Wenguan Wang

With the emergence of varied visual navigation tasks (e. g, image-/object-/audio-goal and vision-language navigation) that specify the target in different ways, the community has made appealing advances in training specialized agents capable of handling individual navigation tasks well.

Decision Making Vision-Language Navigation +1

Paper
Code

HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes

1 code implementation • 18 Oct 2022 • Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Wei Liang, Siyuan Huang

Learning to generate diverse scene-aware and goal-oriented human motions in 3D scenes remains challenging due to the mediocre characteristics of the existing datasets on Human-Scene Interaction (HSI); they only have limited scale/quality and lack semantics.

106

Paper
Code

LGC-Net: A Lightweight Gyroscope Calibration Network for Efficient Attitude Estimation

no code implementations • 19 Sep 2022 • Yaohua Liu, Wei Liang, Jinqiang Cui

This paper presents a lightweight, efficient calibration neural network model for denoising low-cost microelectromechanical system (MEMS) gyroscope and estimating the attitude of a robot in real-time.

Denoising

Paper
Add Code

Confidence Band Estimation for Survival Random Forests

1 code implementation • 26 Apr 2022 • Sarah Elizabeth Formentini, Wei Liang, Ruoqing Zhu

The idea is to estimate the variance-covariance matrix of the cumulative hazard function prediction on a grid of time points.

valid

Paper
Code

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation

1 code implementation • CVPR 2022 • Hanqing Wang, Wei Liang, Jianbing Shen, Luc van Gool, Wenguan Wang

Since the rise of vision-language navigation (VLN), great progress has been made in instruction following -- building a follower to navigate environments under the guidance of instructions.

counterfactual Data Augmentation +3

Paper
Code

PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene Rearrangement Planning

no code implementations • 10 May 2021 • Hanqing Wang, Zan Wang, Wei Liang, Lap-Fai Yu

Scene Rearrangement Planning (SRP) is an interior task proposed recently.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Structured Scene Memory for Vision-Language Navigation

1 code implementation • CVPR 2021 • Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen

Recently, numerous algorithms have been developed to tackle the problem of vision-language navigation (VLN), i. e., entailing an agent to navigate 3D environments through following linguistic instructions.

Decision Making Navigate +1

Paper
Code

Active Visual Information Gathering for Vision-Language Navigation

1 code implementation • ECCV 2020 • Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen

Vision-language navigation (VLN) is the task of entailing an agent to carry out navigational instructions inside photo-realistic environments.

Vision-Language Navigation

Paper
Code

An Efficient Target Detection and Recognition Method in Aerial Remote-sensing Images Based on Multiangle Regions-of-Interest

no code implementations • 22 Jul 2019 • Guangcun Shan, Hongyu Wang, Wei Liang, Congcong Liu, Qizi Ma, Quan Quan

Recently, deep learning technology have been extensively used in the field of image recognition.

Object Detection Region Proposal

Paper
Add Code

Robust Encoder-Decoder Learning Framework towards Offline Handwritten Mathematical Expression Recognition Based on Multi-Scale Deep Neural Network

no code implementations • 8 Feb 2019 • Guangcun Shan, Hongyu Wang, Wei Liang

Offline handwritten mathematical expression recognition is a challenging task, because handwritten mathematical expressions mainly have two problems in the process of recognition.

Paper
Add Code

3D Face Synthesis Driven by Personality Impression

no code implementations • 27 Sep 2018 • Yining Lang, Wei Liang, Yujia Wang, Lap-Fai Yu

In this paper, we propose a novel approach to synthesize 3D faces based on personality impression for creating virtual characters.

Graphics

Paper
Add Code

Deep Single-View 3D Object Reconstruction with Visual Hull Embedding

1 code implementation • 10 Sep 2018 • Hanqing Wang, Jiaolong Yang, Wei Liang, Xin Tong

The key idea of our method is to leverage object mask and pose estimation from CNNs to assist the 3D shape learning by constructing a probabilistic single-view visual hull inside of the network.

3D Object Reconstruction Object +1

Paper
Code

Transferring Objects: Joint Inference of Container and Human Pose

no code implementations • ICCV 2017 • Hanqing Wang, Wei Liang, Lap-Fai Yu

In the inference phase, given a scanned 3D scene with different object candidates and a dictionary of human poses, our approach infers the best object as a container together with human pose for transferring a given object.

Object

Paper
Add Code

Chinese Short Text Classification Based on Domain Knowledge

no code implementations • IJCNLP 2013 • Xiao Feng, Yang shen, Chengyong Liu, Wei Liang, Shuwu Zhang

General Classification text-classification +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.