Search Results for author: Ye Yuan

Found 122 papers, 51 papers with code

On the Powerball Method for Optimization

no code implementations • 24 Mar 2016 • Ye Yuan, Mu Li, Jun Liu, Claire J. Tomlin

We propose a new method to accelerate the convergence of optimization algorithms.

Review Networks for Caption Generation

no code implementations • NeurIPS 2016 • Zhilin Yang, Ye Yuan, Yuexin Wu, Ruslan Salakhutdinov, William W. Cohen

We propose a novel extension of the encoder-decoder framework, called a review network.

Caption Generation Image Captioning

Inverse Power Flow Problem

no code implementations • 21 Oct 2016 • Ye Yuan, Steven Low, Omid Ardakanian, Claire Tomlin

We show that the admittance matrix can be uniquely identified from a sequence of measurements corresponding to different steady states when every node in the system is equipped with a measurement device, and a Kron-reduced admittance matrix can be determined even if some nodes in the system are not monitored (hidden nodes).

Words or Characters? Fine-grained Gating for Reading Comprehension

1 code implementation • 6 Nov 2016 • Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov

Previous work combines word-level and character-level representations using concatenation or scalar weighting, which is suboptimal for high-level tasks like reading comprehension.

Ranked #50 on Question Answering on SQuAD1.1 dev

Question Answering Reading Comprehension +1

Joint Hand Detection and Rotation Estimation by Using CNN

no code implementations • 8 Dec 2016 • Xiaoming Deng, Ye Yuan, Yinda Zhang, Ping Tan, Liang Chang, Shuo Yang, Hongan Wang

Hand detection is essential for many hand-related tasks, e.g., parsing hand pose and understanding gestures, which are extremely useful for robotics and human-computer interaction.

General Classification Hand Detection +2

Understanding Convolution for Semantic Segmentation

5 code implementations • 27 Feb 2017 • Panqu Wang, Pengfei Chen, Ye Yuan, Ding Liu, Zehua Huang, Xiaodi Hou, Garrison Cottrell

This framework 1) effectively enlarges the receptive fields (RF) of the network to aggregate global information; 2) alleviates what we call the "gridding issue" caused by the standard dilated convolution operation.

Ranked #20 on Semantic Segmentation on PASCAL VOC 2012 test

Segmentation Semantic Segmentation +1

1,565
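
The "gridding issue" named in the abstract above can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a stack of 1D, 3-tap dilated convolutions and simply enumerates which input offsets can reach one output position. The function name `reachable_offsets` and the dilation rates (2, 2, 2) and (1, 2, 3) are illustrative assumptions.

```python
def reachable_offsets(dilation_rates):
    """Return the sorted input offsets that can influence one output
    position after stacking 3-tap 1D convolutions with the given rates."""
    offsets = {0}
    for rate in dilation_rates:
        # A 3-tap kernel with dilation `rate` reads positions -rate, 0, +rate.
        offsets = {o + tap for o in offsets for tap in (-rate, 0, rate)}
    return sorted(offsets)


# Same dilation rate in every layer (assumed rates, for illustration only).
same_rate = reachable_offsets([2, 2, 2])
# Mixed dilation rates across layers (also an assumed choice).
mixed_rate = reachable_offsets([1, 2, 3])

print(same_rate)
print(mixed_rate)
```

Both stacks span offsets from -6 to 6, but the constant-rate stack only touches even offsets and leaves holes in the receptive field, while the mixed-rate stack covers every offset in that span.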

Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification

no code implementations • 5 Jul 2017 • Po-Yao Huang, Ye Yuan, Zhenzhong Lan, Lu Jiang, Alexander G. Hauptmann

We report on CMU Informedia Lab's system used in Google's YouTube 8 Million Video Understanding Challenge.

Attribute General Classification +3

On Identification of Distribution Grids

1 code implementation • 5 Nov 2017 • Omid Ardakanian, Vincent W. S. Wong, Roel Dobbe, Steven H. Low, Alexandra von Meier, Claire Tomlin, Ye Yuan

Large-scale integration of distributed energy resources into residential distribution feeders necessitates careful control of their operation through power flow analysis.

Face Attention Network: An Effective Face Detector for the Occluded Faces

1 code implementation • 20 Nov 2017 • Jianfeng Wang, Ye Yuan, Gang Yu

The performance of face detection has been largely improved with the development of convolutional neural networks.

Ranked #1 on Occluded Face Detection on MAFA

Data Augmentation Occluded Face Detection

314

SFace: An Efficient Network for Face Detection in Large Scale Variations

no code implementations • 18 Apr 2018 • Jianfeng Wang, Ye Yuan, Boxun Li, Gang Yu, Sun Jian

A new dataset called 4K-Face is also introduced to evaluate the performance of face detection with extremely large scale variations.

4k Face Detection +1

EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection

1 code implementation • Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 2018 • Yaqing Wang, Fenglong Ma, Zhiwei Jin, Ye Yuan, Guangxu Xun, Kishlay Jha, Lu Su, Jing Gao

One of the unique challenges for fake news detection on social media is how to identify fake news on newly emerged events.

Fake News Detection Sentence Classification

219

3D Ego-Pose Estimation via Imitation Learning

no code implementations • ECCV 2018 • Ye Yuan, Kris Kitani

Motivated by this, we propose a novel control-based approach to model human motion with physics simulation and use imitation learning to learn a video-conditioned control policy for ego-pose estimation.

Domain Adaptation Imitation Learning +1

Data-driven Discovery of Cyber-Physical Systems

1 code implementation • 1 Oct 2018 • Ye Yuan, Xiuchuan Tang, Wei Pan, Xiuting Li, Wei Zhou, Hai-Tao Zhang, Han Ding, Jorge Goncalves

Cyber-physical systems (CPSs) embed software into the physical world.

A deep learning-based remaining useful life prediction approach for bearings

1 code implementation • 8 Dec 2018 • Cheng Cheng, Guijun Ma, Yong Zhang, Mingyang Sun, Fei Teng, Han Ding, Ye Yuan

In industrial applications, nearly half the failures of motors are caused by the degradation of rolling element bearings (REBs).

A General End-to-end Diagnosis Framework for Manufacturing Systems

no code implementations • 17 Dec 2018 • Ye Yuan, Guijun Ma, Cheng Cheng, Beitong Zhou, Huan Zhao, Hai-Tao Zhang, Han Ding

A central challenge in the manufacturing sector is the need for a general framework that ensures satisfactory diagnosis and monitoring performance across different manufacturing applications.

Management

Bridging the Gap Between Computational Photography and Visual Recognition

no code implementations • 28 Jan 2019 • Rosaura G. VidalMata, Sreya Banerjee, Brandon RichardWebster, Michael Albright, Pedro Davalos, Scott McCloskey, Ben Miller, Asong Tambo, Sushobhan Ghosh, Sudarshan Nagesh, Ye Yuan, Yueyu Hu, Junru Wu, Wenhan Yang, Xiaoshuai Zhang, Jiaying Liu, Zhangyang Wang, Hwann-Tzong Chen, Tzu-Wei Huang, Wen-Chi Chin, Yi-Chun Li, Mahmoud Lababidi, Charles Otto, Walter J. Scheirer

From the observed results, it is evident that we are in the early days of building a bridge between computational photography and visual recognition, leaving many opportunities for innovation in this area.

Image Restoration Object Recognition

WIDER Face and Pedestrian Challenge 2018: Methods and Results

no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou

This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.

Face Detection Pedestrian Detection +2

Wasserstein Distance based Deep Adversarial Transfer Learning for Intelligent Fault Diagnosis

no code implementations • 2 Mar 2019 • Cheng Cheng, Beitong Zhou, Guijun Ma, Dongrui Wu, Ye Yuan

However, under the diverse working conditions found in industry, deep learning faces two difficulties: first, the well-defined (source domain) and new (target domain) datasets have different feature distributions; second, insufficient or no labelled data in the target domain significantly reduces the accuracy of fault diagnosis.

Transfer Learning

Optimize TSK Fuzzy Systems for Regression Problems: Mini-Batch Gradient Descent with Regularization, DropRule and AdaBound (MBGD-RDA)

1 code implementation • 26 Mar 2019 • Dongrui Wu, Ye Yuan, Yihua Tan

Our final algorithm, mini-batch gradient descent with regularization, DropRule and AdaBound (MBGD-RDA), can achieve fast convergence in training TSK fuzzy systems, and also superior generalization performance in testing.

A Novel GAN-based Fault Diagnosis Approach for Imbalanced Industrial Time Series

no code implementations • 1 Apr 2019 • Wenqian Jiang, Cheng Cheng, Beitong Zhou, Guijun Ma, Ye Yuan

This paper proposes a novel fault diagnosis approach based on generative adversarial networks (GAN) for imbalanced industrial time series where normal samples far outnumber failure cases.

Time Series Time Series Analysis

UG$^{2+}$ Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments

no code implementations • 9 Apr 2019 • Ye Yuan, Wenhan Yang, Wenqi Ren, Jiaying Liu, Walter J. Scheirer, Zhangyang Wang

The UG$^{2+}$ challenge in IEEE CVPR 2019 aims to evoke a comprehensive discussion and exploration about how low-level vision techniques can benefit the high-level automatic visual recognition in various scenarios.

Face Detection

Generative Hybrid Representations for Activity Forecasting with No-Regret Learning

no code implementations • CVPR 2020 • Jiaqi Guan, Ye Yuan, Kris M. Kitani, Nicholas Rhinehart

Automatically reasoning about future human behaviors is a difficult problem but has significant practical applications to assistive systems.

Ego-Pose Estimation and Forecasting as Real-Time PD Control

1 code implementation • ICCV 2019 • Ye Yuan, Kris Kitani

We propose the use of a proportional-derivative (PD) control based policy learned via reinforcement learning (RL) to estimate and forecast 3D human pose from egocentric videos.

Egocentric Pose Estimation Human Pose Forecasting +2
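
The proportional-derivative (PD) control mentioned in the abstract above can be sketched for a single joint. This is a minimal illustration, not the paper's learned policy or simulator: it assumes one rotational joint with unit inertia, hand-picked gains `kp` and `kd`, and a simple semi-implicit Euler integrator. The names `pd_torque` and `simulate_joint` are hypothetical.

```python
def pd_torque(q, q_dot, q_target, kp, kd):
    """PD law: pull the joint toward the target angle and damp its velocity."""
    return kp * (q_target - q) - kd * q_dot


def simulate_joint(q_target, kp=100.0, kd=20.0, inertia=1.0, dt=0.01, steps=1000):
    """Drive a single joint (assumed unit inertia, no gravity) toward q_target."""
    q, q_dot = 0.0, 0.0
    for _ in range(steps):
        tau = pd_torque(q, q_dot, q_target, kp, kd)
        # Semi-implicit Euler: update velocity first, then position.
        q_dot += (tau / inertia) * dt
        q += q_dot * dt
    return q


final_angle = simulate_joint(q_target=1.0)
print(final_angle)
```

In a setup like the one the abstract describes, a learned policy would presumably supply the target angles at each step; here the target is fixed so the behavior of the PD law itself is easy to see.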

Diverse Trajectory Forecasting with Determinantal Point Processes

no code implementations • ICLR 2020 • Ye Yuan, Kris Kitani

To learn the parameters of the DSF, the diversity of the trajectory samples is evaluated by a diversity loss based on a determinantal point process (DPP).

Ranked #5 on Human Pose Forecasting on HumanEva-I

Autonomous Vehicles Human Pose Forecasting +2
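
The idea of scoring sample diversity with a determinantal point process (DPP) can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's loss: it assumes an RBF similarity kernel with a hypothetical `bandwidth` parameter and uses the plain determinant of the kernel matrix as the diversity score. The helper names `rbf_kernel`, `determinant`, and `dpp_diversity` are made up for this example.

```python
import math


def rbf_kernel(points, bandwidth=1.0):
    """Similarity matrix: entries near 1 for close points, near 0 for distant ones."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return [[math.exp(-sq_dist(p, q) / (2 * bandwidth ** 2)) for q in points]
            for p in points]


def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def dpp_diversity(points):
    """Unnormalized DPP score of a set: determinant of its kernel matrix."""
    return determinant(rbf_kernel(points))


# Toy 2D "trajectory endpoints" (made-up data for illustration).
spread_out = [(0.0, 0.0), (3.0, 0.0), (0.0, 3.0)]
clustered = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1)]

diverse_score = dpp_diversity(spread_out)
clustered_score = dpp_diversity(clustered)
print(diverse_score, clustered_score)
```

A larger determinant means the samples are less similar to one another, so a training objective could reward it, for example by minimizing its negative logarithm.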

ABD-Net: Attentive but Diverse Person Re-Identification

5 code implementations • ICCV 2019 • Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang

Attention mechanisms have been shown to be effective for person re-identification (Re-ID).

Ranked #16 on Person Re-Identification on Market-1501-C

Person Re-Identification Retrieval

608

Machine Discovery of Partial Differential Equations from Spatiotemporal Data

1 code implementation • 15 Sep 2019 • Ye Yuan, Junlin Li, Liang Li, Frank Jiang, Xiuchuan Tang, Fumin Zhang, Sheng Liu, Jorge Goncalves, Henning U. Voss, Xiuting Li, Jürgen Kurths, Han Ding

The study presents a general framework for discovering underlying Partial Differential Equations (PDEs) using measured spatiotemporal data.

Is There Mode Collapse? A Case Study on Face Generation and Its Black-box Calibration

no code implementations • 25 Sep 2019 • Zhenyu Wu, Ye Yuan, Zhaowen Wang, Jianming Zhang, Zhangyang Wang, Hailin Jin

Generative adversarial networks (GANs) nowadays are capable of producing images of incredible realism.

Face Generation

PowerSGD: Powered Stochastic Gradient Descent Methods for Accelerated Non-Convex Optimization

no code implementations • 25 Sep 2019 • Jun Liu, Beitong Zhou, Weigao Sun, Ruijuan Chen, Claire J. Tomlin, Ye Yuan

In this paper, we propose a novel technique for improving the stochastic gradient descent (SGD) method to train deep networks, which we term PowerSGD.

SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines

6 code implementations • 14 Nov 2019 • Yinda Xu, Zeyu Wang, Zuoxin Li, Ye Yuan, Gang Yu

Following these guidelines, we design our Fully Convolutional Siamese tracker++ (SiamFC++) by introducing both classification and target state estimation branch (G1), classification score without ambiguity (G2), tracking without prior knowledge (G3), and estimation quality score (G4).

Ranked #2 on Visual Object Tracking on VOT2017/18 (using extra training data)

Classification General Classification +3

811

Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification

1 code implementation • 26 Nov 2019 • Ye Yuan, Wuyang Chen, Tianlong Chen, Yang Yang, Zhou Ren, Zhangyang Wang, Gang Hua

Many real-world applications, such as city-scale traffic monitoring and control, require large-scale re-identification.

A Practical Solution for SAR Despeckling With Adversarial Learning Generated Speckled-to-Speckled Images

no code implementations • 13 Dec 2019 • Ye Yuan, Jian Guan, Pengming Feng, Yanxia Wu

In this letter, we aim to address a synthetic aperture radar (SAR) despeckling problem with the necessity of neither clean (speckle-free) SAR images nor independent speckled image pairs from the same scene, and a practical solution for SAR despeckling (PSD) is proposed.

In Defense of the Triplet Loss Again: Learning Robust Person Re-Identification with Fast Approximated Triplet Loss and Label Distillation

1 code implementation • 17 Dec 2019 • Ye Yuan, Wuyang Chen, Yang Yang, Zhangyang Wang

This work addresses the above two shortcomings of triplet loss, extending its effectiveness to large-scale ReID datasets with potentially noisy labels.

Person Re-Identification

State-Aware Tracker for Real-Time Video Object Segmentation

1 code implementation • CVPR 2020 • Xi Chen, Zuoxin Li, Ye Yuan, Gang Yu, Jianxin Shen, Donglian Qi

For higher efficiency, SAT takes advantage of the inter-frame consistency and deals with each target object as a tracklet.

Segmentation Semantic Segmentation +2

811

Uncertainty Quantification for Deep Context-Aware Mobile Activity Recognition and Unknown Context Discovery

no code implementations • 3 Mar 2020 • Zepeng Huo, Arash Pakbin, Xiaohan Chen, Nathan Hurley, Ye Yuan, Xiaoning Qian, Zhangyang Wang, Shuai Huang, Bobak Mortazavi

Activity recognition in wearable computing faces two key challenges: i) activity characteristics may be context-dependent and change under different contexts or situations; ii) unknown contexts and activities may occur from time to time, requiring flexibility and adaptability of the algorithm.

Clustering Human Activity Recognition +1

PTP: Parallelized Tracking and Prediction with Graph Neural Networks and Diversity Sampling

no code implementations • 17 Mar 2020 • Xinshuo Weng, Ye Yuan, Kris Kitani

We evaluate on KITTI and nuScenes datasets showing that our method with socially-aware feature learning and diversity sampling achieves new state-of-the-art performance on 3D MOT and trajectory prediction.

3D Multi-Object Tracking Trajectory Forecasting

DLow: Diversifying Latent Flows for Diverse Human Motion Prediction

1 code implementation • ECCV 2020 • Ye Yuan, Kris Kitani

To obtain samples from a pretrained generative model, most existing generative human motion prediction methods draw a set of independent Gaussian latent codes and convert them to motion samples.

Ranked #1 on Human Pose Forecasting on AMASS (APD metric)

Human motion prediction Human Pose Forecasting +1

107

BoostTree and BoostForest for Ensemble Learning

1 code implementation • 21 Mar 2020 • Changming Zhao, Dongrui Wu, Jian Huang, Ye Yuan, Hai-Tao Zhang, Ruimin Peng, Zhenhua Shi

Bootstrap aggregating (Bagging) and boosting are two popular ensemble learning approaches, which combine multiple base learners to generate a composite model for more accurate and more reliable performance.

Ensemble Learning General Classification +1

Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation

1 code implementation • CVPR 2020 • Mariko Isogawa, Ye Yuan, Matthew O'Toole, Kris Kitani

We bring together a diverse set of technologies from NLOS imaging, human pose estimation and deep reinforcement learning to construct an end-to-end data processing pipeline that converts a raw stream of photon measurements into a full 3D human pose sequence estimate.

3D Human Pose Estimation Humanoid Control +1

Semi-Supervised Cervical Dysplasia Classification With Learnable Graph Convolutional Network

no code implementations • 1 Apr 2020 • Yanglan Ou, Yuan Xue, Ye Yuan, Tao Xu, Vincent Pisztora, Jia Li, Xiaolei Huang

In this paper, we propose a novel and more flexible GCN model with a feature encoder that adaptively updates the adjacency matrix during learning and demonstrate that this model design leads to improved performance.

Classification General Classification

Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach

1 code implementation • ICDE 2020 • Chi Harold Liu, Yinuo Zhao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Wu, Kin K. Leung

Spatial crowdsourcing (SC) utilizes the potential of a crowd to accomplish certain location based tasks.

Fairness Reinforcement Learning (RL) +1

Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis

1 code implementation • NeurIPS 2020 • Ye Yuan, Kris Kitani

Our approach is the first humanoid control method that successfully learns from a large-scale human motion dataset (Human3.6M) and generates diverse long-term motions.

Humanoid Control Motion Synthesis

148

Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training

1 code implementation • ICML 2020 • Xuxi Chen, Wuyang Chen, Tianlong Chen, Ye Yuan, Chen Gong, Kewei Chen, Zhangyang Wang

Many real-world applications have to tackle the Positive-Unlabeled (PU) learning problem, i.e., learning binary classifiers from a large amount of unlabeled data and a few labeled positive examples.

AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses

1 code implementation • 7 Jul 2020 • Zixuan Xu, Banghuai Li, Miao Geng, Ye Yuan

Based on the prediction of each anchor template, we propose to aggregate the results, which can reduce the landmark uncertainty due to the large poses.

Ranked #1 on Face Alignment on AFLW-Full (Mean NME metric)

Face Alignment Facial Landmark Detection

On Deep Unsupervised Active Learning

no code implementations • 28 Jul 2020 • Changsheng Li, Handong Ma, Zhao Kang, Ye Yuan, Xiao-Yu Zhang, Guoren Wang

Unsupervised active learning, whose goal is to select representative samples in an unsupervised setting for human annotation, has attracted increasing attention in recent years.

Active Learning

Efficient Non-Line-of-Sight Imaging from Transient Sinograms

no code implementations • ECCV 2020 • Mariko Isogawa, Dorian Chan, Ye Yuan, Kris Kitani, Matthew O'Toole

Non-line-of-sight (NLOS) imaging techniques use light that diffusely reflects off of visible surfaces (e.g., walls) to see around corners.

AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation

no code implementations • 16 Aug 2020 • Xinyu Gong, Wuyang Chen, Yifan Jiang, Ye Yuan, Xian-Ming Liu, Qian Zhang, Yuan Li, Zhangyang Wang

Such simplification limits the fusion of information at different scales and fails to maintain high-resolution representations.

2D Human Pose Estimation Neural Architecture Search +1

End-to-End 3D Multi-Object Tracking and Trajectory Forecasting

no code implementations • 25 Aug 2020 • Xinshuo Weng, Ye Yuan, Kris Kitani

To evaluate this hypothesis, we propose a unified solution for 3D MOT and trajectory forecasting which also incorporates two additional novel computational units.

3D Multi-Object Tracking Trajectory Forecasting

PCAL: A Privacy-preserving Intelligent Credit Risk Modeling Framework Based on Adversarial Learning

no code implementations • 6 Oct 2020 • Yuli Zheng, Zhenyu Wu, Ye Yuan, Tianlong Chen, Zhangyang Wang

While machine learning is increasingly used in this field, the resulting large-scale collection of user private information has reinvigorated the privacy debate, considering dozens of data breach incidents every year caused by unauthorized hackers, and (potentially even more) information misuse/abuse by authorized parties.

BIG-bench Machine Learning Privacy Preserving

Scalable Graph Neural Networks via Bidirectional Propagation

1 code implementation • NeurIPS 2020 • Ming Chen, Zhewei Wei, Bolin Ding, Yaliang Li, Ye Yuan, Xiaoyong Du, Ji-Rong Wen

Most notably, GBP can deliver superior performance on a graph with over 60 million nodes and 1.8 billion edges in less than half an hour on a single machine.

Graph Sampling

Automated data extraction of bar chart raster images

no code implementations • 9 Nov 2020 • Alex Carderas, Ye Yuan, Itamar Livnat, Ryan Yanagihara, Rosita Saul, Gabrielle Montes De Oca, Kai Zheng, Andrew W. Browne

Objective: To develop software utilizing optical character recognition toward the automatic extraction of data from bar charts for meta-analysis.

Optical Character Recognition Optical Character Recognition (OCR) +1

Kinematics-Guided Reinforcement Learning for Object-Aware 3D Ego-Pose Estimation

no code implementations • 10 Nov 2020 • Zhengyi Luo, Ryo Hachiuma, Ye Yuan, Shun Iwase, Kris M. Kitani

We propose a method for incorporating object interaction and human body dynamics into the task of 3D ego-pose estimation using a head-mounted camera.

Human-Object Interaction Detection Object +4

Causal inference using deep neural networks

no code implementations • 25 Nov 2020 • Ye Yuan, Xueying Ding, Ziv Bar-Joseph

Causal inference from observation data is a core problem in many scientific fields.

Causal Inference

FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding

2 code implementations • CVPR 2021 • Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang

We present Few-Shot object detection via Contrastive proposals Encoding (FSCE), a simple yet effective approach to learning contrastive-aware object proposal encodings that facilitate the classification of detected objects.

Ranked #13 on Few-Shot Object Detection on MS-COCO (30-shot)

Contrastive Learning Few-Shot Learning +4

272

AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting

2 code implementations • ICCV 2021 • Ye Yuan, Xinshuo Weng, Yanglan Ou, Kris Kitani

Instead, we would prefer a method that allows an agent's state at one time to directly affect another agent's state at a future time.

Ranked #10 on Trajectory Prediction on ETH/UCY

Autonomous Driving Pedestrian Trajectory Prediction +1

243

SimPoE: Simulated Character Control for 3D Human Pose Estimation

no code implementations • CVPR 2021 • Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih

Based on this refined kinematic pose, the policy learns to compute dynamics-based control (e.g., joint torques) of the character to advance the current-frame pose estimate to the pose estimate of the next frame.

Ranked #229 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

LambdaUNet: 2.5D Stroke Lesion Segmentation of Diffusion-weighted MR Images

1 code implementation • 28 Apr 2021 • Yanglan Ou, Ye Yuan, Xiaolei Huang, Kelvin Wong, John Volpi, James Z. Wang, Stephen T. C. Wong

Thus, it is not ideal to apply most existing segmentation methods as they are designed for either 2D or 3D images.

Image Segmentation Lesion Segmentation +1

Dynamics-Regulated Kinematic Policy for Egocentric Pose Estimation

1 code implementation • NeurIPS 2021 • Zhengyi Luo, Ryo Hachiuma, Ye Yuan, Kris Kitani

By comparing the pose instructed by the kinematic model against the pose generated by the dynamics model, we can use their misalignment to further improve the kinematic model.

Egocentric Pose Estimation Human-Object Interaction Detection +2

DeceFL: A Principled Decentralized Federated Learning Framework

1 code implementation • 15 Jul 2021 • Ye Yuan, Jun Liu, Dou Jin, Zuogong Yue, Ruijuan Chen, Maolin Wang, Chuan Sun, Lei Xu, Feng Hua, Xin He, Xinlei Yi, Tao Yang, Hai-Tao Zhang, Shaochun Sui, Han Ding

Although there has been a joint effort in tackling such a critical issue by proposing privacy-preserving machine learning frameworks, such as federated learning, most state-of-the-art frameworks are still built in a centralized way, in which a central client is needed for collecting and distributing model information (instead of data itself) from every other client, leading to high communication pressure and high vulnerability when there exists a failure at or attack on the central client.

Federated Learning Privacy Preserving

Black-Box Diagnosis and Calibration on GAN Intra-Mode Collapse: A Pilot Study

1 code implementation • 23 Jul 2021 • Zhenyu Wu, Zhaowen Wang, Ye Yuan, Jianming Zhang, Zhangyang Wang, Hailin Jin

Existing diversity tests of samples from GANs are usually conducted qualitatively on a small scale, and/or depend on access to the original training data as well as the trained model parameters.

Image Generation

Temporal Knowledge Consistency for Unsupervised Visual Representation Learning

1 code implementation • ICCV 2021 • Weixin Feng, Yuanjiang Wang, Lihua Ma, Ye Yuan, Chi Zhang

The instance discrimination paradigm has become dominant in unsupervised learning.

Representation Learning

120

Font Completion and Manipulation by Cycling Between Multi-Modality Representations

1 code implementation • 30 Aug 2021 • Ye Yuan, Wuyang Chen, Zhaowen Wang, Matthew Fisher, Zhifei Zhang, Zhangyang Wang, Hailin Jin

The novel graph constructor maps a glyph's latent code to its graph representation that matches expert knowledge, which is trained to help the translation task.

Image-to-Image Translation Representation Learning +2

POI Alias Discovery in Delivery Addresses using User Locations

no code implementations • 20 Sep 2021 • Tianfu He, Guochun Chen, Chuishi Meng, Huajun He, Zheyi Pan, Yexin Li, Sijie Ruan, Huimin Ren, Ye Yuan, Ruiyuan Li, Junbo Zhang, Jie Bao, Hui He, Yu Zheng

People often refer to a place of interest (POI) by an alias.

Decision Making

Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis

1 code implementation • 22 Sep 2021 • Zeyuan Yin, Ye Yuan, Panfeng Guo, Pan Zhou

Edge devices in federated learning usually have much more limited computation and communication resources compared to servers in a data center.

Backdoor Attack Federated Learning +1

Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design

1 code implementation • ICLR 2022 • Ye Yuan, Yuda Song, Zhengyi Luo, Wen Sun, Kris Kitani

Specifically, we learn a conditional policy that, in an episode, first applies a sequence of transform actions to modify an agent's skeletal structure and joint attributes, and then applies control actions under the new design.

Decision Making Policy Gradient Methods

Causal Effect Estimation using Variational Information Bottleneck

1 code implementation • 26 Oct 2021 • Zhenyu Lu, Yurong Cheng, Mingjun Zhong, George Stoian, Ye Yuan, Guoren Wang

A typical approach is to formulate causal inference as a supervised learning problem so that counterfactuals can be predicted.

Causal Inference counterfactual

Learning Deep Representation with Energy-Based Self-Expressiveness for Subspace Clustering

no code implementations • 28 Oct 2021 • Yanming Li, Changsheng Li, Shiye Wang, Ye Yuan, Guoren Wang

In this paper, we propose a new deep subspace clustering framework, motivated by the energy-based models.

Clustering Representation Learning +1

FBNet: Feature Balance Network for Urban-Scene Segmentation

no code implementations • 5 Nov 2021 • Lei Gan, Huabin Huang, Banghuai Li, Ye Yuan

In this paper, we present a novel add-on module, named Feature Balance Network (FBNet), to eliminate the feature camouflage in urban-scene segmentation.

Autonomous Driving Image Segmentation +2

Deep Unsupervised Active Learning on Learnable Graphs

no code implementations • 8 Nov 2021 • Handong Ma, Changsheng Li, Xinchu Shi, Ye Yuan, Guoren Wang

To make the learnt graph structure more stable and effective, we take a $k$-nearest neighbor graph into account as a prior, and learn a relation propagation graph structure.

Active Learning Graph structure learning +2

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

1 code implementation • CVPR 2022 • Ye Yuan, Umar Iqbal, Pavlo Molchanov, Kris Kitani, Jan Kautz

Since the joint reconstruction of human motions and camera poses is underconstrained, we propose a global trajectory predictor that generates global human trajectories based on local body movements.

Ranked #1 on Global 3D Human Pose Estimation on EMDB

Global 3D Human Pose Estimation Human Mesh Recovery

340

Boosting Contrastive Learning with Relation Knowledge Distillation

no code implementations • 8 Dec 2021 • Kai Zheng, Yuanjiang Wang, Ye Yuan

We delve into this problem and find that the lightweight model is prone to collapse in semantic space when simply performing instance-wise contrast.

Contrastive Learning Knowledge Distillation +2

On Almost Sure Convergence Rates of Stochastic Gradient Methods

no code implementations • 9 Feb 2022 • Jun Liu, Ye Yuan

We further provide last-iterate almost sure convergence rates analysis for stochastic gradient methods on weakly convex smooth functions, in contrast with most existing results in the literature that only provide convergence in expectation for a weighted average of the iterates.

Syntax-Aware Network for Handwritten Mathematical Expression Recognition

2 code implementations • CVPR 2022 • Ye Yuan, Xiao Liu, Wondimu Dikubab, Hui Liu, Zhilong Ji, Zhongqin Wu, Xiang Bai

In this paper, we propose a simple and efficient method for HMER, which is the first to incorporate syntax information into an encoder-decoder network.

Adaptive Divergence-based Non-negative Latent Factor Analysis

no code implementations • 30 Mar 2022 • Ye Yuan, Guangxiao Yuan, Renfang Wang, Xin Luo

High-Dimensional and Incomplete (HDI) data are frequently found in various industrial applications with complex interactions among numerous nodes, and are commonly non-negative, reflecting the inherent non-negativity of node interactions.

Computational Efficiency

Online No-regret Model-Based Meta RL for Personalized Navigation

no code implementations • 5 Apr 2022 • Yuda Song, Ye Yuan, Wen Sun, Kris Kitani

Our theoretical analysis shows that our method is a no-regret algorithm and we provide the convergence rate in the agnostic setting.

Model-based Reinforcement Learning Model Predictive Control

Self-Supervised Information Bottleneck for Deep Multi-View Subspace Clustering

no code implementations • 26 Apr 2022 • Shiye Wang, Changsheng Li, Yanming Li, Ye Yuan, Guoren Wang

Inheriting the advantages from information bottleneck, SIB-MSC can learn a latent space for each view to capture common information among the latent representations of different views by removing superfluous information from the view itself while retaining sufficient information for the latent representations of other views.

Clustering Multi-view Subspace Clustering

Paper
Add Code

Unified Simulation, Perception, and Generation of Human Behavior

no code implementations • 28 Apr 2022 • Ye Yuan

Understanding and modeling human behavior is fundamental to almost any computer vision or robotics application that involves humans.

Paper
Add Code

A Sampling Theorem for Exact Identification of Continuous-time Nonlinear Dynamical Systems

no code implementations • 29 Apr 2022 • Zhexuan Zeng, Zuogong Yue, Alexandre Mauroy, Jorge Goncalves, Ye Yuan

A necessary and sufficient condition, built from the Koopman operator, is proposed for the exact identification of the CT system from sampled data.

Paper
Add Code

PI-NLF: A Proportional-Integral Approach for Non-negative Latent Factor Analysis

no code implementations • 5 May 2022 • Ye Yuan, Xin Luo

A high-dimensional and incomplete (HDI) matrix frequently appears in various big-data-related applications, which demonstrates the inherently non-negative interactions among numerous nodes.

Computational Efficiency Representation Learning

Paper
Add Code

Symbolic Expression Transformer: A Computer Vision Approach for Symbolic Regression

no code implementations • 24 May 2022 • Jiachen Li, Ye Yuan, Hong-Bin Shen

Symbolic Regression (SR) is a type of regression analysis to automatically find the mathematical expression that best fits the data.

regression Symbolic Regression

Paper
Add Code

Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation

1 code implementation • 3 Jun 2022 • Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T. C. Wong, John Volpi, James Z. Wang, Kelvin Wong

We also propose a new mixture-of-experts (MoE) based decoder, which treats the feature maps from the encoder as experts and selects a suitable set of expert features to predict the label for each pixel.

Image Segmentation Lesion Segmentation +2

Paper
Code

FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks

no code implementations • 14 Jun 2022 • Kaituo Feng, Changsheng Li, Ye Yuan, Guoren Wang

Knowledge distillation (KD) has proven effective at boosting the performance of graph neural networks (GNNs), where the goal is to distill knowledge from a deeper teacher GNN into a shallower student GNN.

Knowledge Distillation reinforcement-learning +2

Paper
Add Code

From Universal Humanoid Control to Automatic Physically Valid Character Creation

no code implementations • 18 Jun 2022 • Zhengyi Luo, Ye Yuan, Kris M. Kitani

Second, we use a design-and-control framework to optimize a humanoid's physical attributes to find body designs that can better imitate the pre-specified human motion sequence(s).

Humanoid Control valid

Paper
Add Code

Embodied Scene-aware Human Pose Estimation

no code implementations • 18 Jun 2022 • Zhengyi Luo, Shun Iwase, Ye Yuan, Kris Kitani

Since 2D third-person observations are coupled with the camera pose, we propose to disentangle the camera pose and use a multi-step projection gradient defined in the global coordinate frame as the movement cue for our embodied agent.

Ranked #306 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation Causal Inference +1

Paper
Add Code

SearchMorph: Multi-scale Correlation Iterative Network for Deformable Registration

no code implementations • 27 Jun 2022 • Xiao Fan, Shuxin Zhuang, Zhemin Zhuang, Ye Yuan, Shunmin Qiu, Alex Noel Joseph Raj, Yibiao Rong

Deformable image registration can obtain dynamic information about images, which is of great significance in medical image analysis.

Image Registration Motion Estimation

Paper
Add Code

Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration

1 code implementation • 28 Jun 2022 • Yanjiang Yu, Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Ye Yuan, Guoren Wang

To this end, we propose a Face Restoration Searching Network (FRSNet) to adaptively search the suitable feature extraction architecture within our specified search space, which can directly contribute to the restoration quality.

Blind Face Restoration Neural Architecture Search

Paper
Code

Robust Knowledge Adaptation for Dynamic Graph Neural Networks

1 code implementation • 22 Jul 2022 • Hanjie Li, Changsheng Li, Kaituo Feng, Ye Yuan, Guoren Wang, Hongyuan Zha

By this means, we can adaptively propagate knowledge to other nodes for learning robust node embedding representations.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition

2 code implementations • 23 Jul 2022 • Bohan Li, Ye Yuan, Dingkang Liang, Xiao Liu, Zhilong Ji, Jinfeng Bai, Wenyu Liu, Xiang Bai

Most recent handwritten mathematical expression recognition (HMER) methods adopt encoder-decoder networks, which directly predict the markup sequences from formula images with an attention mechanism.

Optical Character Recognition (OCR)


Paper
Code

A Nonlinear PID-Enhanced Adaptive Latent Factor Analysis Model

no code implementations • 4 Aug 2022 • Jinli Li, Ye Yuan

High-dimensional and incomplete (HDI) data holds tremendous interactive information in various industrial applications.

Paper
Add Code

Adaptive Latent Factor Analysis via Generalized Momentum-Incorporated Particle Swarm Optimization

no code implementations • 4 Aug 2022 • Jiufang Chen, Ye Yuan

The stochastic gradient descent (SGD) algorithm is an effective learning strategy for building a latent factor analysis (LFA) model on a high-dimensional and incomplete (HDI) matrix.

Paper
Add Code

RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control

no code implementations • 21 Oct 2022 • Zhenggang Tang, Balakumar Sundaralingam, Jonathan Tremblay, Bowen Wen, Ye Yuan, Stephen Tyree, Charles Loop, Alexander Schwing, Stan Birchfield

We present a system for collision-free control of a robot manipulator that uses only RGB views of the world.

Model Predictive Control

Paper
Add Code

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

no code implementations • NeurIPS 2020 • Zixuan Xu, Banghuai Li, Ye Yuan, Anhong Dang

What's more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.

Ranked #11 on Pedestrian Detection on CityPersons

Pedestrian Detection

Paper
Add Code

PALT: Parameter-Lite Transfer of Language Models for Knowledge Graph Completion

1 code implementation • 25 Oct 2022 • Jianhao Shen, Chenguang Wang, Ye Yuan, Jiawei Han, Heng Ji, Koushik Sen, Ming Zhang, Dawn Song

For instance, we outperform full fine-tuning approaches on a KG completion benchmark by tuning only 1% of the parameters.

Ranked #5 on Link Prediction on UMLS

Knowledge Graph Completion Link Prediction +1

Paper
Code

1st Place Solutions for UG2+ Challenge 2022 ATMOSPHERIC TURBULENCE MITIGATION

no code implementations • 30 Oct 2022 • Zhuang Liu, Zhichao Zhao, Ye Yuan, Zhi Qiao, Jinfeng Bai, Zhilong Ji

In this technical report, we briefly introduce the solution of our team "summer" for Atmospheric Turbulence Mitigation in the UG$^2$+ Challenge at CVPR 2022.

Image Quality Assessment Image Reconstruction

Paper
Add Code

Prototype as Query for Few Shot Semantic Segmentation

1 code implementation • 27 Nov 2022 • Leilei Cao, Yibo Guo, Ye Yuan, Qiangguo Jin

In this way, spatial details are better captured and the model can focus on the semantic features of the target class in the query image.

Ranked #11 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

Paper
Code

A Node-collaboration-informed Graph Convolutional Network for Precise Representation to Undirected Weighted Graphs

no code implementations • 30 Nov 2022 • Ying Wang, Ye Yuan, Xin Luo

Based on this idea, a Node-collaboration-informed Graph Convolutional Network (NGCN) is proposed with three-fold ideas: a) Learning latent collaborative information from the interaction of node pairs via a node-collaboration module; b) Building the residual connection and weighted representation propagation to obtain high representation capacity; and c) Implementing the model optimization in an end-to-end fashion to achieve precise representation to the target UWG.

Model Optimization Representation Learning

Paper
Add Code

PhysDiff: Physics-Guided Human Motion Diffusion Model

no code implementations • ICCV 2023 • Ye Yuan, Jiaming Song, Umar Iqbal, Arash Vahdat, Jan Kautz

Specifically, we propose a physics-based motion projection module that uses motion imitation in a physics simulator to project the denoised motion of a diffusion step to a physically-plausible motion.

Denoising

Paper
Add Code

Learning Human Dynamics in Autonomous Driving Scenarios

no code implementations • ICCV 2023 • Jingbo Wang, Ye Yuan, Zhengyi Luo, Kevin Xie, Dahua Lin, Umar Iqbal, Sanja Fidler, Sameh Khamis

In this work, we propose a holistic framework for learning physically plausible human dynamics from real driving scenarios, narrowing the gap between real and simulated human behavior in safety-critical applications.

Autonomous Driving Human Dynamics

Paper
Add Code

Almost Sure Saddle Avoidance of Stochastic Gradient Methods without the Bounded Gradient Assumption

no code implementations • 15 Feb 2023 • Jun Liu, Ye Yuan

We prove that various stochastic gradient descent methods, including the stochastic gradient descent (SGD), stochastic heavy-ball (SHB), and stochastic Nesterov's accelerated gradient (SNAG) methods, almost surely avoid any strict saddle manifold.

Paper
Add Code

EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction

1 code implementation • 23 Feb 2023 • Yang Zhang, Wenbing Huang, Zhewei Wei, Ye Yuan, Zhaohan Ding

Predicting the binding sites of the target proteins plays a fundamental role in drug discovery.

Drug Discovery

Paper
Code

Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion

no code implementations • CVPR 2023 • Davis Rempe, Zhengyi Luo, Xue Bin Peng, Ye Yuan, Kris Kitani, Karsten Kreis, Sanja Fidler, Or Litany

We introduce a method for generating realistic pedestrian trajectories and full-body animations that can be controlled to meet user-defined goals.

Collision Avoidance

Paper
Add Code

Robust Tickets Can Transfer Better: Drawing More Transferable Subnetworks in Transfer Learning

no code implementations • 24 Apr 2023 • Yonggan Fu, Ye Yuan, Shang Wu, Jiayi Yuan, Yingyan Lin

Transfer learning leverages feature representations of deep neural networks (DNNs) pretrained on source tasks with rich data to empower effective finetuning on downstream tasks.

Adversarial Robustness Transfer Learning

Paper
Add Code

NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations

1 code implementation • 10 Jun 2023 • Yonggan Fu, Ye Yuan, Souvik Kundu, Shang Wu, Shunyao Zhang, Yingyan Lin

Generalizable Neural Radiance Fields (GNeRF) are one of the most promising real-world solutions for novel view synthesis, thanks to their cross-scene generalization capability and thus the possibility of instant rendering on new scenes.

Adversarial Robustness Novel View Synthesis

Paper
Code

Shared Growth of Graph Neural Networks via Prompted Free-direction Knowledge Distillation

no code implementations • 2 Jul 2023 • Kaituo Feng, Yikun Miao, Changsheng Li, Ye Yuan, Guoren Wang

Knowledge distillation (KD) has been shown to be effective in boosting the performance of graph neural networks (GNNs), where the typical objective is to distill knowledge from a deeper teacher GNN into a shallower student GNN.

Knowledge Distillation Transfer Learning

Paper
Add Code

DREAM: Domain-free Reverse Engineering Attributes of Black-box Model

no code implementations • 20 Jul 2023 • Rongqing Li, Jiaqi Yu, Changsheng Li, Wenhan Luo, Ye Yuan, Guoren Wang

There is a crucial limitation: these works assume the dataset used for training the target model to be known beforehand and leverage this dataset for model attribute attack.

Attribute

Paper
Add Code

TREA: Tree-Structure Reasoning Schema for Conversational Recommendation

1 code implementation • 20 Jul 2023 • Wendi Li, Wei Wei, Xiaoye Qu, Xian-Ling Mao, Ye Yuan, Wenfeng Xie, Dangyang Chen

TREA constructs a multi-hierarchical scalable tree as the reasoning structure to clarify the causal relationships between mentioned entities, and fully utilizes historical conversations to generate more reasonable and suitable responses for recommended results.

Knowledge Graphs Recommendation Systems

Paper
Code

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition

no code implementations • 21 Aug 2023 • Zhuang Liu, Ye Yuan, Zhilong Ji, Jingfeng Bai, Xiang Bai

Then we design a semantic-aware module (SAM), which projects the visual and classification features into a semantic space.

Graph Representation Learning

Paper
Add Code

FIMO: A Challenge Formal Dataset for Automated Theorem Proving

1 code implementation • 8 Sep 2023 • Chengwu Liu, Jianhao Shen, Huajian Xin, Zhengying Liu, Ye Yuan, Haiming Wang, Wei Ju, Chuanyang Zheng, Yichun Yin, Lin Li, Ming Zhang, Qun Liu

We present FIMO, an innovative dataset comprising formal mathematical problem statements sourced from the International Mathematical Olympiad (IMO) Shortlisted Problems.

Automated Theorem Proving

Paper
Code

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

1 code implementation • 16 Oct 2023 • Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu

Automated theorem proving (ATP) has become an appealing domain for exploring the reasoning ability of the recent successful generative language models.

Automated Theorem Proving Benchmarking +1

Paper
Code

Learning to Generate Parameters of ConvNets for Unseen Image Data

no code implementations • 18 Oct 2023 • Shiye Wang, Kaituo Feng, Changsheng Li, Ye Yuan, Guoren Wang

Typical Convolutional Neural Networks (ConvNets) depend heavily on large amounts of image data and resort to an iterative optimization algorithm (e.g., SGD or Adam) to learn network parameters, which makes training very time- and resource-intensive.

Paper
Add Code

PACE: Human and Camera Motion Estimation from in-the-wild Videos

no code implementations • 20 Oct 2023 • Muhammed Kocabas, Ye Yuan, Pavlo Molchanov, Yunrong Guo, Michael J. Black, Otmar Hilliges, Jan Kautz, Umar Iqbal

This design combines the strengths of SLAM and motion priors, which leads to significant improvements in human and camera motion estimation.

Motion Estimation

Paper
Add Code

Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations

no code implementations • 24 Oct 2023 • Ye Yuan, Xin Li, Yong Heng, Leiji Zhang, Mingzhong Wang

Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations.

Imitation Learning

Paper
Add Code

GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

no code implementations • 18 Dec 2023 • Ye Yuan, Xueting Li, Yangyi Huang, Shalini De Mello, Koki Nagano, Jan Kautz, Umar Iqbal

Gaussian splatting has emerged as a powerful 3D representation that harnesses the advantages of both explicit (mesh) and implicit (NeRF) 3D representations.

Paper
Add Code

AGG: Amortized Generative 3D Gaussians for Single Image to 3D

no code implementations • 8 Jan 2024 • Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat

To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization.

3D Generation 3D Reconstruction +2

Paper
Add Code

An ADRC-Incorporated Stochastic Gradient Descent Algorithm for Latent Factor Analysis

no code implementations • 13 Jan 2024 • Jinli Li, Ye Yuan

However, such a model commonly encounters the problem of slow convergence because a standard SGD algorithm only considers the current learning error to compute the stochastic gradient without considering the historical and future state of the learning error.

Computational Efficiency

Paper
Add Code

Tensor Graph Convolutional Network for Dynamic Graph Representation Learning

no code implementations • 13 Jan 2024 • Ling Wang, Ye Yuan

Dynamic graphs (DG) describe dynamic interactions between entities in many practical scenarios.

Graph Representation Learning

Paper
Add Code

Preparing Lessons for Progressive Training on Language Models

1 code implementation • 17 Jan 2024 • Yu Pan, Ye Yuan, Yichun Yin, Jiaxin Shi, Zenglin Xu, Ming Zhang, Lifeng Shang, Xin Jiang, Qun Liu

The rapid progress of Transformers in artificial intelligence has come at the cost of increased resource consumption and greenhouse gas emissions due to growing model sizes.

Paper
Code

FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models

no code implementations • 19 Jan 2024 • Ziqiang Yuan, Kaiyuan Wang, Shoutai Zhu, Ye Yuan, Jingya Zhou, Yanlin Zhu, Wenqi Wei

To address the limited data resources and reduce the annotation cost, we introduce FinLLMs, a method for generating financial question-answering data based on common financial formulas using Large Language Models.

Question Answering

Paper
Add Code

Structured Entity Extraction Using Large Language Models

no code implementations • 6 Feb 2024 • Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra

Recent advances in machine learning have significantly impacted the field of information extraction, with Large Language Models (LLMs) playing a pivotal role in extracting structured information from unstructured text.

Paper
Add Code

Measuring Vision-Language STEM Skills of Neural Models

1 code implementation • 27 Feb 2024 • Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan, Ming Zhang, Chenguang Wang

Compared to existing datasets that often focus on examining expert-level ability, our dataset includes fundamental skills and questions designed based on the K-12 curriculum.

Multimodal Reasoning

Paper
Code

On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving

1 code implementation • 2 Mar 2024 • Kaituo Feng, Changsheng Li, Dongchun Ren, Ye Yuan, Guoren Wang

However, their oversized neural networks, which unavoidably require more computational time and resources during inference, make them impractical for deployment on resource-constrained systems. To handle this, knowledge distillation offers a promising approach that compresses models by enabling a smaller student model to learn from a larger teacher model.

Autonomous Driving Knowledge Distillation +1

Paper
Code

Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases

1 code implementation • 15 Mar 2024 • Jiarui Li, Ye Yuan, Zehua Zhang

We propose an end-to-end system design that uses Retrieval Augmented Generation (RAG) to improve the factual accuracy of Large Language Models (LLMs) for domain-specific and time-sensitive queries related to private knowledge-bases.

Retrieval

Paper
Code

Measuring Social Norms of Large Language Models

no code implementations • 3 Apr 2024 • Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang

This enables the direct comparison of the social understanding of large language models to humans, more specifically, elementary students.

Paper
Add Code
