Search Results for author: Lei Yang

Found 144 papers, 57 papers with code

面向垂直领域的阅读理解数据增强方法(Method for reading comprehension data enhancement in vertical field)

no code implementations • CCL 2020 • Zhengwei Lv, Lei Yang, Zhizhong Shi, Xiao Liang, Tao Lei, Duoxing Liu

阅读理解问答系统是利用语义理解等自然语言处理技术, 根据输入问题, 对非结构化文档数据进行分析, 生成一个答案, 具有很高的研究和应用价值。在垂直领域应用过程中, 阅读理解问答数据标注成本高且用户问题表达复杂多样, 使得阅读理解问答系统准确率低、鲁棒性差。针对这一问题, 本文提出一种面向垂直领域的阅读理解问答数据的增强方法, 该方法基于真实用户问题, 构造阅读理解训练数据, 一方面降低标注成本, 另一方面增加训练数据多样性, 提升模型的准确率和鲁棒性。本文用汽车领域数据对该方法进行实验验证, 其结果表明该方法对垂直领域阅读理解模型的准确率和鲁棒性均能有效提升。

Reading Comprehension

Paper
Add Code

SemanticAdv: Generating Adversarial Examples via Attribute-conditioned Image Editing

1 code implementation • ECCV 2020 • Haonan Qiu, Chaowei Xiao, Lei Yang, Xinchen Yan, Honglak Lee, Bo Li

Deep neural networks (DNNs) have achieved great successes in various vision applications due to their strong expressive power.

Adversarial Attack Attribute +2

Paper
Code

Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation

no code implementations • ECCV 2020 • Qingqiu Huang, Lei Yang, Huaiyi Huang, Tong Wu, Dahua Lin

Captioned images are widely available on the web, while the captions often contain the names of the subjects in the images.

Face Model Face Recognition

Paper
Add Code

RoadBEV: Road Surface Reconstruction in Bird's Eye View

1 code implementation • 9 Apr 2024 • Tong Zhao, Lei Yang, Yichen Xie, Mingyu Ding, Masayoshi Tomizuka, Yintao Wei

This paper uniformly proposes two simple yet effective models for road elevation reconstruction in BEV named RoadBEV-mono and RoadBEV-stereo, which estimate road elevation with monocular and stereo images, respectively.

Autonomous Driving Monocular Depth Estimation +2

Paper
Code

Large Motion Model for Unified Multi-Modal Motion Generation

no code implementations • 1 Apr 2024 • Mingyuan Zhang, Daisheng Jin, Chenyang Gu, Fangzhou Hong, Zhongang Cai, Jingfang Huang, Chongzhi Zhang, Xinying Guo, Lei Yang, Ying He, Ziwei Liu

In this work, we present Large Motion Model (LMM), a motion-centric, multi-modal framework that unifies mainstream motion generation tasks into a generalist model.

Paper
Add Code

Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

no code implementations • 27 Mar 2024 • Li SiYao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm.

Paper
Add Code

AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation

no code implementations • 26 Mar 2024 • Qingping Sun, Yanjun Wang, Ailing Zeng, Wanqi Yin, Chen Wei, Wenjia Wang, Haiyi Mei, Chi Sing Leung, Ziwei Liu, Lei Yang, Zhongang Cai

Expressive human pose and shape estimation (a. k. a.

Human Detection

Paper
Add Code

WHAC: World-grounded Humans and Cameras

1 code implementation • 19 Mar 2024 • Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang

In this study, we aim to recover expressive parametric human models (i. e., SMPL-X) and corresponding camera poses jointly, by leveraging the synergy between three critical players: the world, the human, and the camera.

Pose Estimation

171

Paper
Code

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

no code implementations • 18 Mar 2024 • Ziying Song, Lei Yang, Shaoqing Xu, Lin Liu, Dongyang Xu, Caiyan Jia, Feiyang Jia, Li Wang

Additionally, we propose a Global Align module to rectify the misalignment between LiDAR and camera BEV features.

3D Object Detection Autonomous Driving +3

Paper
Add Code

Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience

no code implementations • 15 Mar 2024 • Xiaohang Yu, Zhengxian Yang, Shi Pan, Yuqi Han, Haoxiang Wang, Jun Zhang, Shi Yan, Borong Lin, Lei Yang, Tao Yu, Lu Fang

We have built a custom mobile multi-camera large-space dense light field capture system, which provides a series of high-quality and sufficiently dense light field images for various scenarios.

3D Reconstruction 3D Scene Reconstruction +1

Paper
Add Code

Neuromorphic Synergy for Video Binarization

1 code implementation • 20 Feb 2024 • ShiJie Lin, Xiang Zhang, Lei Yang, Lei Yu, Bin Zhou, Xiaowei Luo, Wenping Wang, Jia Pan

We also develop an efficient integration method to propagate this binary image to high frame rate binary video.

Binarization Camera Calibration +1

Paper
Code

Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed Improvement of Utility-Privacy Trade-off

no code implementations • 10 Feb 2024 • Yuecheng Li, Tong Wang, Chuan Chen, Jian Lou, Bin Chen, Lei Yang, Zibin Zheng

This implies that our FedCEO can effectively recover the disrupted semantic information by smoothing the global semantic space for different privacy settings and continuous training processes.

Federated Learning

Paper
Add Code

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation • 29 Jan 2024 • Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

Paper
Code

Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook

no code implementations • 12 Jan 2024 • Ziying Song, Lin Liu, Feiyang Jia, Yadan Luo, Guoxin Zhang, Lei Yang, Li Wang, Caiyan Jia

In the realm of modern autonomous driving, the perception system is indispensable for accurately assessing the state of the surrounding environment, thereby enabling informed prediction and planning.

3D Object Detection Autonomous Driving +2

Paper
Add Code

RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM

1 code implementation • 8 Jan 2024 • Ziying Song, Guoxing Zhang, Lin Liu, Lei Yang, Shaoqing Xu, Caiyan Jia, Feiyang Jia, Li Wang

To align SAM or SAM-AD with multi-modal methods, we then introduce AD-FPN for upsampling the image features extracted by SAM.

3D Object Detection Autonomous Driving +2

Paper
Code

A Physics-guided Generative AI Toolkit for Geophysical Monitoring

no code implementations • 6 Jan 2024 • Junhuan Yang, Hanchen Wang, Yi Sheng, Youzuo Lin, Lei Yang

Full-waveform inversion (FWI) plays a vital role in geoscience to explore the subsurface.

SSIM

Paper
Add Code

On Optimal Sampling for Learning SDF Using MLPs Equipped with Positional Encoding

no code implementations • 2 Jan 2024 • Guying Lin, Lei Yang, YuAn Liu, Congyi Zhang, Junhui Hou, Xiaogang Jin, Taku Komura, John Keyser, Wenping Wang

Sampling against this intrinsic frequency following the Nyquist-Sannon sampling theorem allows us to determine an appropriate training sampling rate.

Paper
Add Code

Discrete Distribution Networks

no code implementations • 29 Dec 2023 • Lei Yang

This selected output is then fed back into the network as a condition for the second layer, thereby generating new outputs more similar to the GT.

Paper
Add Code

FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing

1 code implementation • NeurIPS 2023 • Mingyuan Zhang, Huirong Li, Zhongang Cai, Jiawei Ren, Lei Yang, Ziwei Liu

Notably, FineMoGen further enables zero-shot motion editing capabilities with the aid of modern large language models (LLM), which faithfully manipulates motion sequences with fine-grained instructions.

Ranked #2 on Motion Synthesis on KIT Motion-Language

Motion Synthesis

Paper
Code

Learning Dense Correspondence for NeRF-Based Face Reenactment

no code implementations • 16 Dec 2023 • Songlin Yang, Wei Wang, Yushi Lan, Xiangyu Fan, Bo Peng, Lei Yang, Jing Dong

Therefore, we are inspired to ask: Can we learn the dense correspondence between different NeRF-based face representations without a 3D parametric model prior?

Face Reenactment

Paper
Add Code

TrojFair: Trojan Fairness Attacks

no code implementations • 16 Dec 2023 • Mengxin Zheng, Jiaqi Xue, Yi Sheng, Lei Yang, Qian Lou, Lei Jiang

TrojFair is a stealthy Fairness attack that is resilient to existing model fairness audition detectors since the model for clean inputs is fair.

Fairness

Paper
Add Code

Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

1 code implementation • 15 Dec 2023 • Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang

Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model for efficient visual retrieval.

Image Retrieval Retrieval +1

Paper
Code

Towards Robust and Expressive Whole-body Human Pose and Shape Estimation

1 code implementation • NeurIPS 2023 • Hui EnPang, Zhongang Cai, Lei Yang, Qingyi Tao, Zhonghua Wu, Tianwei Zhang, Ziwei Liu

Whole-body pose and shape estimation aims to jointly predict different behaviors (e. g., pose, hand gesture, facial expression) of the entire human body from a monocular image.

Paper
Code

Speed Up Federated Learning in Heterogeneous Environment: A Dynamic Tiering Approach

1 code implementation • 9 Dec 2023 • Seyed Mahmoud Sajjadi Mohammadabadi, Syed Zawad, Feng Yan, Lei Yang

The dynamic tier scheduler assigns clients to suitable tiers to minimize the overall training time in each round.

Ranked #1 on Image Classification on CIFAR-10 (training time (s) metric)

Federated Learning Image Classification

Paper
Code

PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation

1 code implementation • NeurIPS 2023 • Zhaoxi Chen, Fangzhou Hong, Haiyi Mei, Guangcong Wang, Lei Yang, Ziwei Liu

We present PrimDiffusion, the first diffusion-based framework for 3D human generation.

3D Inpainting Denoising

Paper
Code

Digital Life Project: Autonomous 3D Characters with Social Intelligence

no code implementations • 7 Dec 2023 • Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment.

Ranked #2 on Motion Synthesis on InterHuman

Motion Captioning Motion Synthesis

Paper
Add Code

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing

no code implementations • 3 Dec 2023 • Fan Yang, Tianyi Chen, Xiaosheng He, Zhongang Cai, Lei Yang, Si Wu, Guosheng Lin

We propose AttriHuman-3D, an editable 3D human generation model, which address the aforementioned problems with attribute decomposition and indexing.

Attribute Disentanglement

Paper
Add Code

Generative Hierarchical Temporal Transformer for Hand Action Recognition and Motion Prediction

no code implementations • 29 Nov 2023 • Yilin Wen, Hao Pan, Takehiko Ohkawa, Lei Yang, Jia Pan, Yoichi Sato, Taku Komura, Wenping Wang

We present a novel framework that concurrently tackles hand action recognition and 3D future hand motion prediction.

Action Recognition motion prediction

Paper
Add Code

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting

1 code implementation • 24 Nov 2023 • YiWen Chen, Zilong Chen, Chi Zhang, Feng Wang, Xiaofeng Yang, Yikai Wang, Zhongang Cai, Lei Yang, Huaping Liu, Guosheng Lin

3D editing plays a crucial role in many areas such as gaming and virtual reality.

855

Paper
Code

Machine-Learned Atomic Cluster Expansion Potentials for Fast and Quantum-Accurate Thermal Simulations of Wurtzite AlN

no code implementations • 20 Nov 2023 • Guang Yang, Yuan-Bin Liu, Lei Yang, Bing-Yang Cao

Using the atomic cluster expansion (ACE) framework, we develop a machine learning interatomic potential for fast and accurately modelling the phonon transport properties of wurtzite aluminum nitride.

Paper
Add Code

Community-Aware Efficient Graph Contrastive Learning via Personalized Self-Training

no code implementations • 18 Nov 2023 • Yuecheng Li, YanMing Hu, Lele Fu, Chuan Chen, Lei Yang, Zibin Zheng

However, for unsupervised and structure-related tasks such as community detection, current GCL algorithms face difficulties in acquiring the necessary community-level information, resulting in poor performance.

Community Detection Contrastive Learning +1

Paper
Add Code

Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text

no code implementations • 13 Nov 2023 • Zhongfei Qing, Zhongang Cai, Zhitao Yang, Lei Yang

Generating natural human motion from a story has the potential to transform the landscape of animation, gaming, and film industries.

Motion Synthesis Position

Paper
Add Code

Contrastive Deep Nonnegative Matrix Factorization for Community Detection

1 code implementation • 4 Nov 2023 • Yuecheng Li, Jialong Chen, Chuan Chen, Lei Yang, Zibin Zheng

Recently, nonnegative matrix factorization (NMF) has been widely adopted for community detection, because of its better interpretability.

Ranked #1 on Community Detection on Pubmed

Community Detection Contrastive Learning +3

Paper
Code

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection

1 code implementation • 24 Oct 2023 • Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Tong He, Yonghui Li, Wanli Ouyang

It models the uncertainty propagation relationship of the geometry projection during training, improving the stability and efficiency of the end-to-end model learning.

Monocular 3D Object Detection object-detection

126

Paper
Code

Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS

no code implementations • 21 Oct 2023 • Li Wang, Xinyu Zhang, Fachuan Zhao, Chuze Wu, Yichen Wang, Ziying Song, Lei Yang, Jun Li, Huaping Liu

The proposed Fuzzy-NMS module combines the volume and clustering density of candidate bounding boxes, refining them with a fuzzy classification method and optimizing the appropriate suppression thresholds to reduce uncertainty in the NMS process.

3D Object Detection object-detection

Paper
Add Code

FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer

no code implementations • 20 Oct 2023 • Xinyu Zhang, Li Wang, Zhiqiang Jiang, Kun Dai, Tao Xie, Lei Yang, Wenhao Yu, Yang shen, Jun Li

However, these methods only integrate long-range context information among keypoints with a fixed receptive field, which constrains the network from reconciling the importance of features with different receptive fields to realize complete image perception, hence limiting the matching accuracy.

Homography Estimation Pose Estimation +1

Paper
Add Code

Training and Predicting Visual Error for Real-Time Applications

1 code implementation • 13 Oct 2023 • João Libório Cardoso, Bernhard Kerbl, Lei Yang, Yury Uralsky, Michael Wimmer

Specifically, we train and deploy a neural network to estimate the visual error resulting from reusing shading or using reduced shading rates.

Paper
Code

GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection

no code implementations • ICCV 2023 • Ziying Song, Haiyue Wei, Lin Bai, Lei Yang, Caiyan Jia

Through the projection calibration between the image and point cloud, we project the nearest neighbors of point cloud features onto the image features.

3D Object Detection Autonomous Driving +3

Paper
Add Code

Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving

1 code implementation • 11 Oct 2023 • Xinyu Zhang, Li Wang, Jian Chen, Cheng Fang, Lei Yang, Ziying Song, Guangqi Yang, Yichen Wang, Xiaofei Zhang, Jun Li, Zhiwei Li, Qingshan Yang, Zhenlin Zhang, Shuzhi Sam Ge

Compared with commonly used 3D radars, the latest 4D radars have precise vertical resolution and higher point cloud density, making it a highly promising sensor for autonomous driving in complex environmental perception.

3D Object Detection Autonomous Driving +1

101

Paper
Code

Evaluating Explanation Methods for Vision-and-Language Navigation

no code implementations • 10 Oct 2023 • Guanqi Chen, Lei Yang, Guanhua Chen, Jia Pan

The ability to navigate robots with natural language instructions in an unknown environment is a crucial step for achieving embodied artificial intelligence (AI).

Decision Making Navigate +3

Paper
Add Code

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations • 30 Sep 2023 • Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

Paper
Add Code

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

no code implementations • 28 Sep 2023 • Lei Yang, Tao Tang, Jun Li, Peng Chen, Kun Yuan, Li Wang, Yi Huang, Xinyu Zhang, Kaicheng Yu

In essence, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection Autonomous Driving +2

Paper
Add Code

Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval

no code implementations • 20 Sep 2023 • Chen Jiang, Kaiming Huang, Sifeng He, Xudong Yang, Wei zhang, Xiaobo Zhang, Yuan Cheng, Lei Yang, Qing Wang, Furong Xu, Tan Pan, Wei Chu

SSAN is based on two newly proposed modules in video retrieval: (1) An efficient Self-supervised Keyframe Extraction (SKE) module to reduce redundant frame features, (2) A robust Similarity Pattern Detection (SPD) module for temporal alignment.

Retrieval Video Retrieval

Paper
Add Code

Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization

1 code implementation • 13 Sep 2023 • Zhenguang Liu, Xinyang Yu, Ruili Wang, Shuai Ye, Zhe Ma, Jianfeng Dong, Sifeng He, Feng Qian, Xiaobo Zhang, Roger Zimmermann, Lei Yang

We theoretically analyzed the mutual information between the label and the disentangled features, arriving at a loss that maximizes the extraction of task-relevant information from the original feature.

Disentanglement

Paper
Code

ReliTalk: Relightable Talking Portrait Generation from a Single Video

1 code implementation • 5 Sep 2023 • Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu

Our key insight is to decompose the portrait's reflectance from implicitly learned audio-driven facial normals and images.

Single-Image Portrait Relighting

Paper
Code

ImmersiveNeRF: Hybrid Radiance Fields for Unbounded Immersive Light Field Reconstruction

no code implementations • 4 Sep 2023 • Xiaohang Yu, Haoxiang Wang, Yuqi Han, Lei Yang, Tao Yu, Qionghai Dai

This paper proposes a hybrid radiance field representation for unbounded immersive light field reconstruction which supports high-quality rendering and aggressive view extrapolation.

Segmentation

Paper
Add Code

PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds

no code implementations • 28 Aug 2023 • Zhongang Cai, Liang Pan, Chen Wei, Wanqi Yin, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

To tackle these challenges, we propose a principled framework, PointHPS, for accurate 3D HPS from point clouds captured in real-world settings, which iteratively refines point features through a cascaded architecture.

3D human pose and shape estimation

Paper
Add Code

Muffin: A Framework Toward Multi-Dimension AI Fairness by Uniting Off-the-Shelf Models

no code implementations • 26 Aug 2023 • Yi Sheng, Junhuan Yang, Lei Yang, Yiyu Shi, Jingtongf Hu, Weiwen Jiang

Model fairness (a. k. a., bias) has become one of the most critical problems in a wide range of AI applications.

Attribute Autonomous Driving +1

Paper
Add Code

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

1 code implementation • 22 Aug 2023 • YiWen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin

Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs).

3D Generation Text to 3D

201

Paper
Code

HumanLiff: Layer-wise 3D Human Generation with Diffusion Model

no code implementations • 18 Aug 2023 • Shoukang Hu, Fangzhou Hong, Tao Hu, Liang Pan, Haiyi Mei, Weiye Xiao, Lei Yang, Ziwei Liu

In this work, we propose HumanLiff, the first layer-wise 3D human generative model with a unified diffusion process.

3D Generation Neural Rendering

Paper
Add Code

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

1 code implementation • ICCV 2023 • Wei Cheng, Ruixiang Chen, Wanqi Yin, Siming Fan, Keyu Chen, Honglin He, Huiwen Luo, Zhongang Cai, Jingbo Wang, Yang Gao, Zhengming Yu, Zhengyu Lin, Daxuan Ren, Lei Yang, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Bo Dai, Kwan-Yee Lin

Realistic human-centric rendering plays a key role in both computer vision and computer graphics.

Camera Calibration Novel View Synthesis

199

Paper
Code

FedCME: Client Matching and Classifier Exchanging to Handle Data Heterogeneity in Federated Learning

no code implementations • 17 Jul 2023 • Jun Nie, Danyang Xiao, Lei Yang, Weigang Wu

This can alleviate the performance degradation on the aggregated global model.

Federated Learning

Paper
Add Code

ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

no code implementations • 27 Jun 2023 • Yakun Yu, Mingjun Zhao, Shi-ang Qi, Feiran Sun, Baoxun Wang, Weidong Guo, Xiaoli Wang, Lei Yang, Di Niu

Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker.

Contrastive Learning General Knowledge +2

Paper
Add Code

ZeRO++: Extremely Efficient Collective Communication for Giant Model Training

1 code implementation • 16 Jun 2023 • Guanhua Wang, Heyang Qin, Sam Ade Jacobs, Connor Holmes, Samyam Rajbhandari, Olatunji Ruwase, Feng Yan, Lei Yang, Yuxiong He

Zero Redundancy Optimizer (ZeRO) has been used to train a wide range of large language models on massive GPUs clusters due to its ease of use, efficiency, and good scalability.

Quantization

32,692

Paper
Code

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

1 code implementation • NeurIPS 2023 • Dongwei Pan, Long Zhuo, Jingtan Piao, Huiwen Luo, Wei Cheng, Yuxin Wang, Siming Fan, Shengqi Liu, Lei Yang, Bo Dai, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Kwan-Yee Lin

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

2k Image Matting +2

217

Paper
Code

NeRF2: Neural Radio-Frequency Radiance Fields

no code implementations • 10 May 2023 • Xiaopeng Zhao, Zhenlin An, Qingrui Pan, Lei Yang

Inspired by the great success of using a neural network to describe the optical field in computer vision, we propose a neural radio-frequency radiance field, NeRF$^\textbf{2}$, which represents a continuous volumetric scene function that makes sense of an RF signal's propagation.

Indoor Localization Position

Paper
Add Code

Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval

1 code implementation • CVPR 2023 • Tan Pan, Furong Xu, Xudong Yang, Sifeng He, Chen Jiang, Qingpei Guo, Feng Qian Xiaobo Zhang, Yuan Cheng, Lei Yang, Wei Chu

For traditional model upgrades, the old model will not be replaced by the new one until the embeddings of all the images in the database are re-computed by the new model, which takes days or weeks for a large amount of data.

Image Retrieval Retrieval

Paper
Code

Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

1 code implementation • 26 Apr 2023 • Bingqian Lin, Zicong Chen, Mingjie Li, Haokun Lin, Hang Xu, Yi Zhu, Jianzhuang Liu, Wenjia Cai, Lei Yang, Shen Zhao, Chenfei Wu, Ling Chen, Xiaojun Chang, Yi Yang, Lei Xing, Xiaodan Liang

In MOTOR, we combine two kinds of basic medical knowledge, i. e., general and specific knowledge, in a complementary manner to boost the general pretraining process.

Medical Visual Question Answering Question Answering +1

Paper
Code

Search-Map-Search: A Frame Selection Paradigm for Action Recognition

no code implementations • CVPR 2023 • Mingjun Zhao, Yakun Yu, Xiaoli Wang, Lei Yang, Di Niu

To overcome the limitations of existing methods, we propose a Search-Map-Search learning paradigm which combines the advantages of heuristic search and supervised learning to select the best combination of frames from a video as one entity.

Action Recognition Video Understanding

Paper
Add Code

AoI-Delay Tradeoff in Mobile Edge Caching: A Mixed-Order Drift-Plus-Penalty Algorithm

no code implementations • 18 Apr 2023 • Ran Li, Chuan Huang, Xiaoqi Qin, Lei Yang

Mobile edge caching (MEC) is a promising technique to improve the quality of service (QoS) for mobile users (MU) by bringing data to the network edge.

Decision Making Scheduling

Paper
Add Code

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model

1 code implementation • ICCV 2023 • Mingyuan Zhang, Xinying Guo, Liang Pan, Zhongang Cai, Fangzhou Hong, Huirong Li, Lei Yang, Ziwei Liu

However, the performance on more diverse motions remains unsatisfactory.

Ranked #1 on Motion Synthesis on KIT Motion-Language

Denoising Motion Synthesis +1

292

Paper
Code

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling

1 code implementation • ICCV 2023 • Zhitao Yang, Zhongang Cai, Haiyi Mei, Shuai Liu, Zhaoxi Chen, Weiye Xiao, Yukun Wei, Zhongfei Qing, Chen Wei, Bo Dai, Wayne Wu, Chen Qian, Dahua Lin, Ziwei Liu, Lei Yang

Synthetic data has emerged as a promising source for 3D human research as it offers low-cost access to large-scale human datasets.

Human Mesh Recovery Neural Rendering

171

Paper
Code

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction

1 code implementation • ICCV 2023 • Wenjia Wang, Yongtao Ge, Haiyi Mei, Zhongang Cai, Qingping Sun, Yanjun Wang, Chunhua Shen, Lei Yang, Taku Komura

As it is hard to calibrate single-view RGB images in the wild, existing 3D human mesh reconstruction (3DHMR) methods either use a constant large focal length or estimate one based on the background environment context, which can not tackle the problem of the torso, limb, hand or face distortion caused by perspective camera projection when the camera is close to the human body.

Ranked #5 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation 3D Reconstruction

Paper
Code

SHERF: Generalizable Human NeRF from a Single Image

1 code implementation • ICCV 2023 • Shoukang Hu, Fangzhou Hong, Liang Pan, Haiyi Mei, Lei Yang, Ziwei Liu

To this end, we propose a bank of 3D-aware hierarchical features, including global, point-level, and pixel-aligned features, to facilitate informative encoding.

3D Human Reconstruction

285

Paper
Code

BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection

1 code implementation • CVPR 2023 • Lei Yang, Kaicheng Yu, Tao Tang, Jun Li, Kun Yuan, Li Wang, Xinyu Zhang, Peng Chen

In essence, instead of predicting the pixel-wise depth, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

Ranked #3 on 3D Object Detection on Rope3D

3D Object Detection Autonomous Driving +1

173

Paper
Code

On-Device Unsupervised Image Segmentation

no code implementations • 24 Feb 2023 • Junhuan Yang, Yi Sheng, Yuzhou Zhang, Weiwen Jiang, Lei Yang

What's more, for a larger size image in the BBBC005 dataset, the existing approach cannot be accommodated to Raspberry PI due to out of memory; on the other hand, SegHDC can obtain segmentation results within 3 minutes while achieving a 0. 9587 IoU score.

Image Segmentation Segmentation +2

Paper
Add Code

Web Photo Source Identification based on Neural Enhanced Camera Fingerprint

1 code implementation • 18 Feb 2023 • Feng Qian, Sifeng He, Honghao Huang, Huanyu Ma, Xiaobo Zhang, Lei Yang

With the growing popularity of smartphone photography in recent years, web photos play an increasingly important role in all walks of life.

Metric Learning

Paper
Code

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

no code implementations • 30 Jan 2023 • Anyi Rao, Xuekun Jiang, Yuwei Guo, Linning Xu, Lei Yang, Libiao Jin, Dahua Lin, Bo Dai

Amateurs working on mini-films and short-form videos usually spend lots of time and effort on the multi-round complicated process of setting and adjusting scenes, plots, and cameras to deliver satisfying video shots.

Paper
Add Code

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

1 code implementation • CVPR 2023 • Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu

Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of large-scale realscanned 3D databases.

Novel View Synthesis Object +1

420

Paper
Code

Label-Efficient Interactive Time-Series Anomaly Detection

no code implementations • 30 Dec 2022 • Hong Guo, Yujing Wang, Jieyu Zhang, Zhengjie Lin, Yunhai Tong, Lei Yang, Luoxing Xiong, Congrui Huang

Time-series anomaly detection is an important task and has been widely applied in the industry.

Active Learning Time Series +2

Paper
Add Code

Development of A Real-time POCUS Image Quality Assessment and Acquisition Guidance System

no code implementations • 16 Dec 2022 • Zhenge Jia, Yiyu Shi, Jingtong Hu, Lei Yang, Benjamin Nti

Point-of-care ultrasound (POCUS) is one of the most commonly applied tools for cardiac function imaging in the clinical routine of the emergency department and pediatric intensive care unit.

Image Quality Assessment

Paper
Add Code

TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision

2 code implementations • 23 Nov 2022 • Sifeng He, Yue He, Minlong Lu, Chen Jiang, Xudong Yang, Feng Qian, Xiaobo Zhang, Lei Yang, Jiandong Zhang

Previous methods typically start from frame-to-frame similarity matrix generated by cosine similarity between frame-level features of the input video pair, and then detect and refine the boundaries of copied segments on similarity matrix under temporal constraints.

Retrieval Video Retrieval

108

Paper
Code

DCVQE: A Hierarchical Transformer for Video Quality Assessment

no code implementations • 10 Oct 2022 • Zutong Li, Lei Yang

Inspired by our observation on the actions of human annotation, we put forward a Divide and Conquer Video Quality Estimator (DCVQE) for NR-VQA.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Add Code

A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis

no code implementations • 7 Oct 2022 • Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang

Then we used keypoint decomposition to extract video synthesis controlling parameters from the backend output and the source image.

Paper
Add Code

Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms

1 code implementation • 21 Sep 2022 • Hui En Pang, Zhongang Cai, Lei Yang, Tianwei Zhang, Ziwei Liu

Experiments with 10 backbones, ranging from CNNs to transformers, show the knowledge learnt from a proximity task is readily transferable to human mesh recovery.

3D human pose and shape estimation Benchmarking +1

112

Paper
Code

Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos

1 code implementation • CVPR 2023 • Yilin Wen, Hao Pan, Lei Yang, Jia Pan, Taku Komura, Wenping Wang

Understanding dynamic hand motions and actions from egocentric RGB videos is a fundamental yet challenging task due to self-occlusion and ambiguity.

3D Hand Pose Estimation Action Recognition

Paper
Code

CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion

no code implementations • 6 Sep 2022 • Li Wang, Xinyu Zhang, Wenyuan Qin, Xiaoyu Li, Lei Yang, Zhiwei Li, Lei Zhu, Hong Wang, Jun Li, Huaping Liu

As such, we propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT), which uses both camera and LiDAR data and significantly reduces tracking failures caused by occlusion and false detection.

3D Multi-Object Tracking Autonomous Driving +2

Paper
Add Code

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

2 code implementations • 31 Aug 2022 • Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo, Lei Yang, Ziwei Liu

Instead of a deterministic language-motion mapping, MotionDiffuse generates motions through a series of denoising steps in which variations are injected.

Ranked #17 on Motion Synthesis on KIT Motion-Language

Denoising Motion Synthesis

778

Paper
Code

Federated Self-Supervised Contrastive Learning and Masked Autoencoder for Dermatological Disease Diagnosis

no code implementations • 24 Aug 2022 • Yawen Wu, Dewen Zeng, Zhepeng Wang, Yi Sheng, Lei Yang, Alaina J. James, Yiyu Shi, Jingtong Hu

Self-supervised learning (SSL) methods, contrastive learning (CL) and masked autoencoders (MAE), can leverage the unlabeled data to pre-train models, followed by fine-tuning with limited labels.

Contrastive Learning Federated Learning +1

Paper
Add Code

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

1 code implementation • 10 Jul 2022 • Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4. 65% improvement on AP@0. 7 for car detection, reaching 18. 54% AP@0. 7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard.

Autonomous Driving Model Optimization +2

Paper
Code

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space

1 code implementation • 7 Jul 2022 • Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo

It is challenging because the ground-truth model ranking for each task can only be generated by fine-tuning the pre-trained models on the target dataset, which is brute-force and computationally expensive.

Ranked #2 on Transferability on classification benchmark

Transferability

Paper
Code

Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions

no code implementations • 25 Jun 2022 • Weilin Wan, Lei Yang, Lingjie Liu, Zhuoying Zhang, Ruixing Jia, Yi-King Choi, Jia Pan, Christian Theobalt, Taku Komura, Wenping Wang

We also observe that an object's intrinsic physical properties are useful for the object motion prediction, and thus design a set of object dynamic descriptors to encode such intrinsic properties.

Human-Object Interaction Detection motion prediction +1

Paper
Add Code

RT-DNAS: Real-time Constrained Differentiable Neural Architecture Search for 3D Cardiac Cine MRI Segmentation

no code implementations • 8 Jun 2022 • Qing Lu, Xiaowei Xu, Shunjie Dong, Cong Hao, Lei Yang, Cheng Zhuo, Yiyu Shi

Accurately segmenting temporal frames of cine magnetic resonance imaging (MRI) is a crucial step in various real-time MRI guided cardiac interventions.

MRI segmentation Neural Architecture Search

Paper
Add Code

Data Imputation for Multivariate Time Series Sensor Data with Large Gaps of Missing Data

1 code implementation • IEEE Sensors Journal 2022 • Rui Wu, Scott D. Hamshaw, Lei Yang, Dustin W. Kincaid, Randall Etheridge, Amir Ghasemkhani

Imputation of missing sensor-collected data is often an important step prior to machine learning and statistical data analysis.

Imputation Time Series +1

Paper
Code

AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

1 code implementation • 17 May 2022 • Fangzhou Hong, Mingyuan Zhang, Liang Pan, Zhongang Cai, Lei Yang, Ziwei Liu

Our key insight is to take advantage of the powerful vision-language model CLIP for supervising neural human generation, in terms of 3D geometry, texture and animation.

Language Modelling Motion Synthesis +1

1,039

Paper
Code

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

no code implementations • 28 Apr 2022 • Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

4D human sensing and modeling are fundamental tasks in vision and graphics with numerous applications.

Fine-grained Action Recognition Pose Estimation

Paper
Add Code

Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation

3 code implementations • CVPR 2022 • Jiankun Li, Peisen Wang, Pengfei Xiong, Tao Cai, Ziwei Yan, Lei Yang, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu

With the advent of convolutional neural networks, stereo matching algorithms have recently gained tremendous progress.

Stereo Matching

443

Paper
Code

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

1 code implementation • 16 Mar 2022 • Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu

This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can achieve 10 times efficiency improvement over existing works without any performance degradation, named DeciWatch.

Ranked #1 on 2D Human Pose Estimation on JHMDB (2D poses only)

2D Human Pose Estimation 3D Human Pose Estimation +2

169

Paper
Code

A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection

1 code implementation • CVPR 2022 • Sifeng He, Xudong Yang, Chen Jiang, Gang Liang, Wei zhang, Tan Pan, Qing Wang, Furong Xu, Chunguang Li, Jingxiong Liu, Hui Xu, Kaiming Huang, Yuan Cheng, Feng Qian, Xiaobo Zhang, Lei Yang

In this paper, we introduce VCSL (Video Copy Segment Localization), a new comprehensive segment-level annotated video copy dataset.

Benchmarking Copy Detection

108

Paper
Code

Visual-Tactile Sensing for Real-time Liquid Volume Estimation in Grasping

no code implementations • 23 Feb 2022 • Fan Zhu, Ruixing Jia, Lei Yang, Youcan Yan, Zheng Wang, Jia Pan, Wenping Wang

We propose a deep visuo-tactile model for realtime estimation of the liquid inside a deformable container in a proprioceptive way. We fuse two sensory modalities, i. e., the raw visual inputs from the RGB camera and the tactile cues from our specific tactile sensor without any extra sensor calibrations. The robotic system is well controlled and adjusted based on the estimation model in real time.

Multi-Task Learning

Paper
Add Code

The Larger The Fairer? Small Neural Networks Can Achieve Fairness for Edge Devices

no code implementations • 23 Feb 2022 • Yi Sheng, Junhuan Yang, Yawen Wu, Kevin Mao, Yiyu Shi, Jingtong Hu, Weiwen Jiang, Lei Yang

Results show that FaHaNa can identify a series of neural networks with higher fairness and accuracy on a dermatology dataset.

Face Recognition Fairness +2

Paper
Add Code

Federated Contrastive Learning for Dermatological Disease Diagnosis via On-device Learning

no code implementations • 14 Feb 2022 • Yawen Wu, Dewen Zeng, Zhepeng Wang, Yi Sheng, Lei Yang, Alaina J. James, Yiyu Shi, Jingtong Hu

The recently developed self-supervised learning approach, contrastive learning (CL), can leverage the unlabeled data to pre-train a model, after which the model is fine-tuned on limited labeled data for dermatological disease diagnosis.

Contrastive Learning Federated Learning +1

Paper
Add Code

Automated Architecture Search for Brain-inspired Hyperdimensional Computing

no code implementations • 11 Feb 2022 • Junhuan Yang, Yi Sheng, Sizhe Zhang, Ruixuan Wang, Kenneth Foreman, Mikell Paige, Xun Jiao, Weiwen Jiang, Lei Yang

On the Clintox dataset, which tries to learn features from developed drugs that passed/failed clinical trials for toxicity reasons, the searched HDC architecture obtains the state-of-the-art ROC-AUC scores, which are 0. 80% higher than the manually designed HDC and 9. 75% higher than conventional neural networks.

Drug Discovery

Paper
Add Code

SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation

no code implementations • 26 Jan 2022 • Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian

We adopt the time-domain speech separation method and the recently proposed Graph-PIT to build a super low-latency online speech separation model, which is very important for the real application.

Speech Separation

Paper
Add Code

SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos

2 code implementations • 27 Dec 2021 • Ailing Zeng, Lei Yang, Xuan Ju, Jiefeng Li, Jianyi Wang, Qiang Xu

With a simple yet effective motion-aware fully-connected network, SmoothNet improves the temporal smoothness of existing pose estimators significantly and enhances the estimation accuracy of those challenging frames as a side-effect.

2D Human Pose Estimation 3D Human Pose Estimation +2

5,006

Paper
Code

SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement

1 code implementation • NeurIPS 2021 • Heyang Qin, Samyam Rajbhandari, Olatunji Ruwase, Feng Yan, Lei Yang, Yuxiong He

In this paper, we propose a fully automated and lightweight adaptive batching methodology to enable fine-grained batch size adaption (e. g., at a mini-batch level) that can achieve state-of-the-art performance with record breaking batch sizes.

Computational Efficiency

Paper
Code

BSC: Block-based Stochastic Computing to Enable Accurate and Efficient TinyML

no code implementations • 12 Nov 2021 • Yuhong Song, Edwin Hsing-Mean Sha, Qingfeng Zhuge, Rui Xu, Yongzhuo Zhang, Bingzhe Li, Lei Yang

Unlike ML on the edge, TinyML with a limited energy supply has higher demands on low-power execution.

Paper
Add Code

Dense Representative Tooth Landmark/axis Detection Network on 3D Model

no code implementations • 8 Nov 2021 • Guangshun Wei, Zhiming Cui, Jie Zhu, Lei Yang, Yuanfeng Zhou, Pradeep Singh, Min Gu, Wenping Wang

Results show that our method can produce tooth landmarks with high accuracy.

Paper
Add Code

Robust Event Classification Using Imperfect Real-world PMU Data

no code implementations • 19 Oct 2021 • Yunchuan Liu, Lei Yang, Amir Ghasemkhani, Hanif Livani, Virgilio A. Centeno, Pin-Yu Chen, Junshan Zhang

Specifically, the data preprocessing step addresses the data quality issues of PMU measurements (e. g., bad data and missing data); in the fine-grained event data extraction step, a model-free event detection method is developed to accurately localize the events from the inaccurate event timestamps in the event logs; and the feature engineering step constructs the event features based on the patterns of different event types, in order to improve the performance and the interpretability of the event classifiers.

Classification Event Detection +1

Paper
Add Code

Detecting Gender Bias in Transformer-based Models: A Case Study on BERT

no code implementations • 15 Oct 2021 • Bingbing Li, Hongwu Peng, Rajat Sainju, Junhuan Yang, Lei Yang, Yueying Liang, Weiwen Jiang, Binghui Wang, Hang Liu, Caiwen Ding

In this paper, we propose a novel gender bias detection method by utilizing attention map for transformer-based models.

Bias Detection Gender Bias Detection

Paper
Add Code

Playing for 3D Human Recovery

no code implementations • 14 Oct 2021 • Zhongang Cai, Mingyuan Zhang, Jiawei Ren, Chen Wei, Daxuan Ren, Zhengyu Lin, Haiyu Zhao, Lei Yang, Chen Change Loy, Ziwei Liu

Specifically, we contribute GTA-Human, a large-scale 3D human dataset generated with the GTA-V game engine, featuring a highly diverse set of subjects, actions, and scenarios.

Paper
Add Code

iShape: A First Step Towards Irregular Shape Instance Segmentation

no code implementations • 30 Sep 2021 • Lei Yang, Yan Zi Wei, Yisheng He, Wei Sun, Zhenhang Huang, Haibin Huang, Haoqiang Fan

In this paper, we introduce a brand new dataset to promote the study of instance segmentation for objects with irregular shapes.

Ranked #1 on Instance Segmentation on iShape

Instance Segmentation Segmentation +1

Paper
Add Code

Can Noise on Qubits Be Learned in Quantum Neural Network? A Case Study on QuantumFlow

no code implementations • 8 Sep 2021 • Zhiding Liang, Zhepeng Wang, Junhuan Yang, Lei Yang, JinJun Xiong, Yiyu Shi, Weiwen Jiang

Specifically, this paper targets quantum neural network (QNN), and proposes to learn the errors in the training phase, so that the identified QNN model can be resilient to noise.

Paper
Add Code

Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation

no code implementations • ICCV 2021 • Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu

While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory.

Ranked #23 on Skeleton Based Action Recognition on NTU RGB+D 120

3D Human Pose Estimation 3D Pose Estimation +3

Paper
Add Code

Distributed Attention for Grounded Image Captioning

no code implementations • 2 Aug 2021 • Nenglun Chen, Xingjia Pan, Runnan Chen, Lei Yang, Zhiwen Lin, Yuqiang Ren, Haolei Yuan, Xiaowei Guo, Feiyue Huang, Wenping Wang

We study the problem of weakly supervised grounded image captioning.

Image Captioning Sentence

Paper
Add Code

Geometry Uncertainty Projection Network for Monocular 3D Object Detection

1 code implementation • ICCV 2021 • Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Junjie Yan, Wanli Ouyang

In this paper, we propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages.

Ranked #2 on 3D Object Detection From Monocular Images on Waymo Open Dataset

3D Object Detection From Monocular Images Depth Estimation +3

126

Paper
Code

DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

1 code implementation • 27 Jul 2021 • Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang

Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects.

6D Pose Estimation Metric Learning +2

Paper
Code

IPS300+: a Challenging Multimodal Dataset for Intersection Perception System

no code implementations • 5 Jun 2021 • Huanan Wang, Xinyu Zhang, Jun Li, Zhiwei Li, Lei Yang, Shuyue Pan, Yongqiang Deng

Through an IPS (Intersection Perception System) installed at the diagonal of the intersection, this paper proposes a high-quality multimodal dataset for the intersection perception task.

Paper
Add Code

Lite-FPN for Keypoint-based Monocular 3D Object Detection

1 code implementation • 1 May 2021 • Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Jun Li

3D object detection with a single image is an essential and challenging task for autonomous driving.

Autonomous Driving Monocular 3D Object Detection +2

Paper
Code

Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images

2 code implementations • CVPR 2021 • Haolin Liu, Anran Lin, Xiaoguang Han, Lei Yang, Yizhou Yu, Shuguang Cui

Grounding referring expressions in RGBD image has been an emerging field.

Object Visual Grounding

Paper
Code

The Age of Correlated Features in Supervised Learning based Forecasting

no code implementations • 27 Feb 2021 • MD Kamran Chowdhury Shisher, Heyang Qin, Lei Yang, Feng Yan, Yin Sun

In these applications, a neural network is trained to predict a time-varying target (e. g., solar power), based on multiple correlated features (e. g., temperature, humidity, and cloud coverage).

Paper
Add Code

Solving the Travelling Thief Problem based on Item Selection Weight and Reverse Order Allocation

no code implementations • 16 Dec 2020 • Lei Yang, Zitong Zhang, Xiaotian Jia, Peipei Kang, Wensheng Zhang, Dongya Wang

The Travelling Thief Problem (TTP) is a challenging combinatorial optimization problem that attracts many scholars.

Combinatorial Optimization

Paper
Add Code

Edge Computing Assisted Autonomous Flight for UAV: Synergies between Vision and Communications

no code implementations • 10 Dec 2020 • Quan Chen, Hai Zhu, Lei Yang, Xiaoqian Chen, Sofie Pollin, Evgenii Vinogradov

By proposing a framework of Edge Computing Assisted Autonomous Flight (ECAAF), we illustrate that vision and communications can interact with and assist each other with the aid of edge computing and offloading, and further speed up the UAV mission completion.

Edge-computing Trajectory Planning Networking and Internet Architecture Robotics Systems and Control Systems and Control

Paper
Add Code

CoEdge: Cooperative DNN Inference with Adaptive Workload Partitioning over Heterogeneous Edge Devices

no code implementations • 6 Dec 2020 • Liekang Zeng, Xu Chen, Zhi Zhou, Lei Yang, Junshan Zhang

CoEdge utilizes available computation and communication resources at the edge and dynamically partitions the DNN inference workload adaptive to devices' computing capabilities and network conditions.

Paper
Add Code

Event Cause Analysis in Distribution Networks using Synchro Waveform Measurements

no code implementations • 25 Aug 2020 • Iman Niazazari, Hanif Livani, Amir Ghasemkhani, Yunchuan Liu, Lei Yang

This paper presents a machine learning method for event cause analysis to enhance situational awareness in distribution networks.

BIG-bench Machine Learning

Paper
Add Code

Deep Neural Network based Wide-Area Event Classification in Power Systems

no code implementations • 24 Aug 2020 • Iman Niazazari, Amir Ghasemkhani, Yunchuan Liu, Shuchismita Biswas, Hanif Livani, Lei Yang, Virgilio Centeno

This paper presents a wide-area event classification in transmission power grids.

Bayesian Optimization Classification +1

Paper
Add Code

E-Tree Learning: A Novel Decentralized Model Learning Framework for Edge AI

no code implementations • 4 Aug 2020 • Lei Yang, Yanyan Lu, Jiannong Cao, Jiaming Huang, Mingjin Zhang

In this paper, we propose a novel decentralized model learning approach, namely E-Tree, which makes use of a well-designed tree structure imposed on the edge devices.

Clustering Federated Learning

Paper
Add Code

Mapping in a cycle: Sinkhorn regularized unsupervised learning for point cloud shapes

1 code implementation • ECCV 2020 • Lei Yang, Wenxi Liu, Zhiming Cui, Nenglun Chen, Wenping Wang

We propose an unsupervised learning framework with the pretext task of finding dense correspondences between point cloud shapes from the same category based on the cycle-consistency formulation.

Paper
Code

Learn to Propagate Reliably on Noisy Affinity Graphs

no code implementations • ECCV 2020 • Lei Yang, Qingqiu Huang, Huaiyi Huang, Linning Xu, Dahua Lin

Recent works have shown that exploiting unlabeled data through label propagation can substantially reduce the labeling cost, which has been a critical issue in developing visual recognition models.

Open-Ended Question Answering

Paper
Add Code

Standing on the Shoulders of Giants: Hardware and Neural Architecture Co-Search with Hot Start

1 code implementation • 17 Jul 2020 • Weiwen Jiang, Lei Yang, Sakyasingha Dasgupta, Jingtong Hu, Yiyu Shi

To tackle this issue, HotNAS builds a chain of tools to design hardware to support compression, based on which a global optimizer is developed to automatically co-search all the involved search spaces.

Neural Architecture Search

Paper
Code

Learning to Cluster Faces via Confidence and Connectivity Estimation

3 code implementations • CVPR 2020 • Lei Yang, Dapeng Chen, Xiaohang Zhan, Rui Zhao, Chen Change Loy, Dahua Lin

With the vertex confidence and edge connectivity, we can naturally organize more relevant vertices on the affinity graph and group them into clusters.

Clustering Connectivity Estimation +2

698

Paper
Code

Co-Exploration of Neural Architectures and Heterogeneous ASIC Accelerator Designs Targeting Multiple Tasks

no code implementations • 10 Feb 2020 • Lei Yang, Zheyu Yan, Meng Li, Hyoukjun Kwon, Liangzhen Lai, Tushar Krishna, Vikas Chandra, Weiwen Jiang, Yiyu Shi

Neural Architecture Search (NAS) has demonstrated its power on various AI accelerating platforms such as Field Programmable Gate Arrays (FPGAs) and Graphic Processing Units (GPUs).

Neural Architecture Search

Paper
Add Code

Deep Human Answer Understanding for Natural Reverse QA

no code implementations • 1 Dec 2019 • Rujing Yao, Linlin Hou, Lei Yang, Jie Gui, Qing Yin, Ou wu

This study focuses on a reverse question answering (QA) procedure, in which machines proactively raise questions and humans supply the answers.

Question Answering

Paper
Add Code

Blockchain for Future Smart Grid: A Comprehensive Survey

1 code implementation • 8 Nov 2019 • Muhammad Baqer Mollah, Jun Zhao, Dusit Niyato, Kwok-Yan Lam, Xin Zhang, Amer M. Y. M. Ghias, Leong Hai Koh, Lei Yang

In this paper, we aim to provide a comprehensive survey on application of blockchain in smart grid.

Cryptography and Security Distributed, Parallel, and Cluster Computing Networking and Internet Architecture Social and Information Networks Systems and Control Systems and Control

Paper
Code

Device-Circuit-Architecture Co-Exploration for Computing-in-Memory Neural Accelerators

no code implementations • 31 Oct 2019 • Weiwen Jiang, Qiuwen Lou, Zheyu Yan, Lei Yang, Jingtong Hu, Xiaobo Sharon Hu, Yiyu Shi

In this paper, we are the first to bring the computing-in-memory architecture, which can easily transcend the memory wall, to interplay with the neural architecture search, aiming to find the most efficient neural architectures with high network accuracy and maximized hardware efficiency.

Neural Architecture Search

Paper
Add Code

Practical Low Latency Proof of Work Consensus

2 code implementations • 25 Sep 2019 • Lei Yang, Vivek Bagaria, Gerui Wang, Mohammad Alizadeh, David Tse, Giulia Fanti, Pramod Viswanath

Bitcoin is the first fully-decentralized permissionless blockchain protocol to achieve a high level of security, but at the expense of poor throughput and latency.

Distributed, Parallel, and Cluster Computing Cryptography and Security Networking and Internet Architecture

Paper
Code

Hardware/Software Co-Exploration of Neural Architectures

1 code implementation • 6 Jul 2019 • Weiwen Jiang, Lei Yang, Edwin Sha, Qingfeng Zhuge, Shouzhen Gu, Sakyasingha Dasgupta, Yiyu Shi, Jingtong Hu

We propose a novel hardware and software co-exploration framework for efficient neural architecture search (NAS).

Neural Architecture Search

Paper
Code

A Pvalue-guided Anomaly Detection Approach Combining Multiple Heterogeneous Log Parser Algorithms on IIoT Systems

no code implementations • 5 Jul 2019 • Xueshuo Xie, Zhi Wang, Xuhang Xiao, Lei Yang, Shenwei Huang, Tao Li

In this paper, we use blockchain to prevent logs from being tampered with and propose a pvalue-guided anomaly detection approach.

Cryptography and Security

Paper
Add Code

Sparse Solutions of a Class of Constrained Optimization Problems

no code implementations • 1 Jul 2019 • Lei Yang, Xiaojun Chen, Shuhuang Xiang

Specifically, without any condition on the matrix $A$, we provide upper bounds in cardinality and infinity norm for the optimal solutions, and show that all optimal solutions must be on the boundary of the feasible set when $0<p<1$.

Denoising

Paper
Add Code

SemanticAdv: Generating Adversarial Examples via Attribute-conditional Image Editing

1 code implementation • 19 Jun 2019 • Haonan Qiu, Chaowei Xiao, Lei Yang, Xinchen Yan, Honglak Lee, Bo Li

In this paper, we aim to explore the impact of semantic manipulation on DNNs predictions by manipulating the semantic attributes of images and generate "unrestricted adversarial examples".

Attribute Face Recognition +1

Paper
Code

AUTOHOME-ORCA at SemEval-2019 Task 8: Application of BERT for Fact-Checking in Community Forums

no code implementations • SEMEVAL 2019 • Zhengwei Lv, Duoxing Liu, Haifeng Sun, Xiao Liang, Tao Lei, Zhizhong Shi, Feng Zhu, Lei Yang

In order to address this task, we propose a system based on the BERT model with meta information of questions.

Community Question Answering Fact Checking

Paper
Add Code

Adaptive Intelligent Secondary Control of Microgrids Using a Biologically-Inspired Reinforcement Learning

no code implementations • 2 May 2019 • Mohammad Jafari, Vahid Sarfi, Amir Ghasemkhani, Hanif Livani, Lei Yang, Hao Xu

In this paper, a biologically-inspired adaptive intelligent secondary controller is developed for microgrids to tackle system dynamics uncertainties, faults, and/or disturbances.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Learning to Cluster Faces on an Affinity Graph

3 code implementations • CVPR 2019 • Lei Yang, Xiaohang Zhan, Dapeng Chen, Junjie Yan, Chen Change Loy, Dahua Lin

Face recognition sees remarkable progress in recent years, and its performance has reached a very high level.

Clustering Face Recognition +1

698

Paper
Code

Accuracy vs. Efficiency: Achieving Both through FPGA-Implementation Aware Neural Architecture Search

no code implementations • 31 Jan 2019 • Weiwen Jiang, Xinyi Zhang, Edwin H. -M. Sha, Lei Yang, Qingfeng Zhuge, Yiyu Shi, Jingtong Hu

In addition, with a performance abstraction model to analyze the latency of neural architectures without training, our framework can quickly prune architectures that do not satisfy the specification, leading to higher efficiency.

Neural Architecture Search

Paper
Add Code

RPC: A Large-Scale Retail Product Checkout Dataset

no code implementations • 22 Jan 2019 • Xiu-Shen Wei, Quan Cui, Lei Yang, Peng Wang, Lingqiao Liu

The main challenge of this problem comes from the large scale and the fine-grained nature of the product categories as well as the difficulty for collecting training images that reflect the realistic checkout scenarios due to continuous update of the products.

Paper
Add Code

A Fast Globally Linearly Convergent Algorithm for the Computation of Wasserstein Barycenters

no code implementations • 12 Sep 2018 • Lei Yang, Jia Li, Defeng Sun, Kim-Chuan Toh

When the support points of the barycenter are pre-specified, this problem can be modeled as a linear programming (LP) problem whose size can be extremely large.

Paper
Add Code

Feature Fusion through Multitask CNN for Large-scale Remote Sensing Image Segmentation

no code implementations • 24 Jul 2018 • Shihao Sun, Lei Yang, Wenjie Liu, Ruirui Li

In recent years, Fully Convolutional Networks (FCN) has been widely used in various semantic segmentation tasks, including multi-modal remote sensing imagery.

Image Segmentation Segmentation +1

Paper
Add Code

TreeSegNet: Adaptive Tree CNNs for Subdecimeter Aerial Image Segmentation

no code implementations • 29 Apr 2018 • Kai Yue, Lei Yang, Ruirui Li, Wei Hu, Fan Zhang, Wei Li

For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions.

Image Segmentation Segmentation +1

Paper
Add Code

Accelerated Training for Massive Classification via Dynamic Class Selection

no code implementations • 5 Jan 2018 • Xingcheng Zhang, Lei Yang, Junjie Yan, Dahua Lin

Massive classification, a classification task defined over a vast number of classes (hundreds of thousands or even millions), has become an essential part of many real-world systems, such as face recognition.

Classification Face Recognition +1

Paper
Add Code

Proximal Gradient Method with Extrapolation and Line Search for a Class of Nonconvex and Nonsmooth Problems

no code implementations • 18 Nov 2017 • Lei Yang

To solve this class of problems, we propose a proximal gradient method with extrapolation and line search (PGels).

Variable Selection

Paper
Add Code

DeepUNet: A Deep Fully Convolutional Network for Pixel-level Sea-Land Segmentation

2 code implementations • 1 Sep 2017 • Ruirui Li, Wenjie Liu, Lei Yang, Shihao Sun, Wei Hu, Fan Zhang, Wei Li

Semantic segmentation is a fundamental research in remote sensing image processing.

Segmentation Semantic Segmentation

Paper
Code

On Scalable Inference with Stochastic Gradient Descent

no code implementations • 1 Jul 2017 • Yixin Fang, Jinfeng Xu, Lei Yang

In many applications involving large dataset or online updating, stochastic gradient descent (SGD) provides a scalable way to compute parameter estimates and has gained increasing popularity due to its numerical convenience and memory efficiency.

Paper
Add Code

A Non-monotone Alternating Updating Method for A Class of Matrix Factorization Problems

no code implementations • 18 May 2017 • Lei Yang, Ting Kei Pong, Xiaojun Chen

Finally, we conduct some numerical experiments using real datasets to compare our method with some existing efficient methods for non-negative matrix factorization and matrix completion.

Matrix Completion

Paper
Add Code

Minimum $n$-Rank Approximation via Iterative Hard Thresholding

no code implementations • 18 Nov 2013 • Min Zhang, Lei Yang, Zheng-Hai Huang

Additionally, combining an effective heuristic for determining $n$-rank, we can also apply the proposed algorithm to solve MnRA when $n$-rank is unknown in advance.

Image Inpainting Video Inpainting

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.