Search Results for author: Yang Zhou

Found 121 papers, 40 papers with code

RigNet: Neural Rigging for Articulated Characters

1 code implementation 1 May 2020 Zhan Xu, Yang Zhou, Evangelos Kalogerakis, Chris Landreth, Karan Singh

We present RigNet, an end-to-end automated method for producing animation rigs from input character models.

SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

2 code implementations ICCV 2019 Yang Zhou, Zachary While, Evangelos Kalogerakis

In this paper we propose a neural message passing approach to augment an input 3D indoor scene with new objects matching their surroundings.

3D Object Recognition Scene Generation

MakeItTalk: Speaker-Aware Talking-Head Animation

3 code implementations 27 Apr 2020 Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, Dingzeyu Li

We present a method that generates expressive talking heads from a single facial image with audio as the only input.

Talking Face Generation Talking Head Generation

LRM: Large Reconstruction Model for Single Image to 3D

1 code implementation 8 Nov 2023 Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan

We propose the first Large Reconstruction Model (LRM) that predicts the 3D model of an object from a single input image within just 5 seconds.

Image to 3D

Non-Stationary Texture Synthesis by Adversarial Expansion

1 code implementation 11 May 2018 Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang

We demonstrate that this conceptually simple approach is highly effective for capturing large-scale structures, as well as other non-stationary attributes of the input exemplar.

Generative Adversarial Network Texture Synthesis

Skeleton-free Pose Transfer for Stylized 3D Characters

1 code implementation 28 Jul 2022 Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou

We present the first method that automatically transfers poses between stylized 3D characters without skeletal rigging.

Pose Transfer

LLM Inference Unveiled: Survey and Roofline Model Insights

2 code implementations 26 Feb 2024 Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer

Our survey stands out from traditional literature reviews by not only summarizing the current state of research but also introducing a framework based on the roofline model for systematic analysis of LLM inference techniques.

Knowledge Distillation Language Modelling +3

Rethinking Performance Gains in Image Dehazing Networks

1 code implementation 23 Sep 2022 Yuda Song, Yang Zhou, Hui Qian, Xin Du

Image dehazing is an active topic in low-level vision, and many image dehazing networks have been proposed with the rapid development of deep learning.

Image Dehazing Single Image Dehazing

A Survey on Multimodal Large Language Models for Autonomous Driving

1 code implementation 21 Nov 2023 Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

We first introduce the background of Multimodal Large Language Models (MLLMs), the development of multimodal models using LLMs, and the history of autonomous driving.

Autonomous Driving

MoRig: Motion-aware rigging of character meshes from point clouds

1 code implementation 17 Oct 2022 Zhan Xu, Yang Zhou, Li Yi, Evangelos Kalogerakis

We present MoRig, a method that automatically rigs character meshes driven by single-view point cloud streams capturing the motion of performing characters.

Predicting Animation Skeletons for 3D Articulated Models via Volumetric Nets

1 code implementation 22 Aug 2019 Zhan Xu, Yang Zhou, Evangelos Kalogerakis, Karan Singh

We present a learning method for predicting animation skeletons for input 3D models of articulated characters.

ETNet: Error Transition Network for Arbitrary Style Transfer

1 code implementation NeurIPS 2019 Chunjin Song, Zhijie Wu, Yang Zhou, Minglun Gong, Hui Huang

Numerous valuable efforts have been devoted to achieving arbitrary style transfer since the seminal work of Gatys et al.

Style Transfer

Triplet-Center Loss for Multi-View 3D Object Retrieval

1 code implementation CVPR 2018 Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai, Xiang Bai

Most existing 3D object recognition algorithms focus on leveraging the strong discriminative power of deep learning models with softmax loss for the classification of 3D data, while learning discriminative features with deep metric learning for 3D object retrieval is more or less neglected.

3D Object Recognition 3D Object Retrieval +6

Learning Visibility for Robust Dense Human Body Estimation

1 code implementation 23 Aug 2022 Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang

An alternative approach is to estimate dense vertices of a predefined template body in the image space.

DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing

1 code implementation 1 Nov 2023 Gaoshuang Huang, Yang Zhou, Xiaofei Hu, Chenglong Zhang, Luying Zhao, Wenjian Gan, Mingbo Hou

In this study, we utilize the DINOv2 model as the backbone network for trimming and fine-tuning to extract robust image features.

Visual Place Recognition

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

1 code implementation 18 Mar 2024 Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu

Context information, such as road maps and surrounding agents' states, provides crucial geometric and semantic information for motion behavior prediction.

Autonomous Vehicles motion prediction

UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization

1 code implementation 28 Aug 2023 Rui Zhang, Hongxia Wang, Mingshan Du, Hanqing Liu, Yang Zhou, Qiang Zeng

Our approach introduces a Temporal Feature Abnormal Attention (TFAA) module based on temporal feature reconstruction to enhance the detection of temporal differences.

Binary Classification Temporal Forgery Localization +1

Neural Texture Synthesis With Guided Correspondence

1 code implementation CVPR 2023 Yang Zhou, Kaijian Chen, Rongjun Xiao, Hui Huang

More importantly, the Guided Correspondence loss can function as a general textural loss in, e.g., training generative networks for real-time controlled synthesis and inversion-based single-image editing.

Texture Synthesis

Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

1 code implementation 23 Oct 2023 Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, Dejing Dou

This paper proposes a Parameter-efficient prompt Tuning approach with Adaptive Optimization, i.e., FedPepTAO, to enable efficient and effective FL of LLMs.

Federated Learning

Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion

1 code implementation ICCV 2023 Yutao Jiang, Yang Zhou, Yuan Liang, Wenxi Liu, Jianbo Jiao, Yuhui Quan, Shengfeng He

To address the above issues, we propose Diffuse3D which employs a pre-trained diffusion model for global synthesis, while amending the model to activate depth-aware inference.

Denoising Novel View Synthesis

Generating Non-Stationary Textures using Self-Rectification

1 code implementation 5 Jan 2024 Yang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huang

This paper addresses the challenge of example-based non-stationary texture synthesis.

Texture Synthesis

Deformable One-shot Face Stylization via DINO Semantic Guidance

1 code implementation 1 Mar 2024 Yang Zhou, Zichong Chen, Hui Huang

This paper addresses the complex issue of one-shot face stylization, focusing on the simultaneous consideration of appearance and structure, where previous methods have fallen short.

One-Shot Face Stylization

None Class Ranking Loss for Document-Level Relation Extraction

1 code implementation 1 May 2022 Yang Zhou, Wee Sun Lee

This ignores the context of entity pairs and the label correlations between the none class and pre-defined classes, leading to sub-optimal predictions.

Document-level Relation Extraction Emotion Classification +3

Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations

1 code implementation CVPR 2022 Zhixuan Zhong, Liangyu Chai, Yang Zhou, Bailin Deng, Jia Pan, Shengfeng He

This paper presents a Generative prior ReciprocAted Invertible rescaling Network (GRAIN) for generating faithful high-resolution (HR) images from low-resolution (LR) invertible images with an extreme upscaling factor (64x).

More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous Knowledge

1 code implementation EMNLP 2021 Sixing Wu, Ying Li, Minghui Wang, Dawei Zhang, Yang Zhou, Zhonghai Wu

Despite achieving remarkable performance, previous knowledge-enhanced works usually only use a single-source homogeneous knowledge base of limited knowledge coverage.

Dialogue Generation

Modular Degradation Simulation and Restoration for Under-Display Camera

1 code implementation 23 Sep 2022 Yang Zhou, Yuda Song, Xin Du

Together with a pixel-wise discriminator and supervised loss, we can train the generator to simulate the UDC imaging degradation process.

Generative Adversarial Network Image Restoration

Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

1 code implementation 5 Jun 2023 Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Experiments on three datasets demonstrate the good generality of our method, which outperforms other image-level weakly supervised methods for nuclei instance segmentation, and achieves comparable performance to fully-supervised methods.

Instance Segmentation Multi-Task Learning +4

Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

1 code implementation 30 Jun 2023 Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Foremost, our work demonstrates that the VLPM pre-trained on natural image-text pairs exhibits astonishing potential for downstream tasks in the medical field as well.

Object Detection

Cross-lingual Entity Alignment with Adversarial Kernel Embedding and Adversarial Knowledge Translation

1 code implementation 16 Apr 2021 Gong Zhang, Yang Zhou, Sixing Wu, Zeru Zhang, Dejing Dou

With the guidance of known aligned entities in the context of multiple random walks, an adversarial knowledge translation model is developed to fill and translate masked entities in pairwise random walks from two KGs.

Attribute Entity Alignment +2

PENet: A Joint Panoptic Edge Detection Network

1 code implementation 15 Mar 2023 Yang Zhou, Giuseppe Loianno

In recent years, compact and efficient scene understanding representations have gained popularity in increasing situational awareness and autonomy of robotic systems.

Edge Detection Multi-Task Learning +2

DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

1 code implementation 11 Jul 2023 Zhiwen Yang, Yang Zhou, Hui Zhang, Bingzheng Wei, Yubo Fan, Yan Xu

To address this, we develop a generalist model that shares architecture and parameters across centers to utilize the shared knowledge.

Image Generation

Structure Learning of Probabilistic Graphical Models: A Comprehensive Survey

2 code implementations 29 Nov 2011 Yang Zhou

Probabilistic graphical models combine graph theory and probability theory to enable multivariate statistical modeling.

Marketing

Full 3D Reconstruction of Transparent Objects

no code implementations 9 May 2018 Bojian Wu, Yang Zhou, Yiming Qian, Minglun Gong, Hui Huang

Numerous techniques have been proposed for reconstructing 3D models for opaque objects in past decades.

3D Reconstruction Transparent objects

A Tube-and-Droplet-based Approach for Representing and Analyzing Motion Trajectories

no code implementations 10 Sep 2016 Weiyao Lin, Yang Zhou, Hongteng Xu, Junchi Yan, Mingliang Xu, Jianxin Wu, Zicheng Liu

Our approach first leverages the complete information from given trajectories to construct a thermal transfer field which provides a context-rich way to describe the global motion pattern in a scene.

3D Action Recognition Anomaly Detection +2

A Large-scale Distributed Video Parsing and Evaluation Platform

no code implementations 29 Nov 2016 Kai Yu, Yang Zhou, Da Li, Zhang Zhang, Kaiqi Huang

Visual surveillance systems have become one of the largest data sources of Big Visual Data in the real world.

Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition

no code implementations CVPR 2015 Yang Zhou, Bingbing Ni, Richang Hong, Meng Wang, Qi Tian

Secondly, these object regions are matched and tracked across frames to form a large spatio-temporal graph based on the appearance matching and the dense motion trajectories through them.

Fine-grained Action Recognition Human-Object Interaction Detection +2

Cascaded Interactional Targeting Network for Egocentric Video Analysis

no code implementations CVPR 2016 Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian

Firstly, a novel EM-like learning framework is proposed to train the pixel-level deep convolutional neural network (DCNN) by seamlessly integrating weakly supervised data (i.e., massive bounding box annotations) with a small set of strongly supervised data (i.e., fully annotated hand segmentation maps) to achieve state-of-the-art hand segmentation performance.

Action Recognition Foreground Segmentation +4

Unsupervised Trajectory Clustering via Adaptive Multi-Kernel-Based Shrinkage

no code implementations ICCV 2015 Hongteng Xu, Yang Zhou, Weiyao Lin, Hongyuan Zha

To address the challenges of trajectory clustering, e.g., large variations within a cluster and ambiguities across clusters, we first introduce an adaptive multi-kernel-based estimation process to estimate the "shrunk" positions and speeds of trajectories' points.

Anomaly Detection Clustering +1

Platoon trajectories generation: A unidirectional interconnected LSTM-based car following model

no code implementations 25 Oct 2019 Yangxin Lin, Ping Wang, Yang Zhou, Fan Ding, Chen Wang, Huachun Tan

However, the traffic micro-simulation accuracy of car-following models at the platoon level, especially during traffic oscillations, still needs to be enhanced.

A Video Analysis Method on Wanfang Dataset via Deep Neural Network

no code implementations 28 Feb 2020 Jinlong Kang, Jiaxiang Zheng, Heng Bai, Xiaoting Xue, Yang Zhou, Jun Guo

To solve this problem, in this paper, we describe a new deep-learning-based function for real-time multi-object detection in sports competitions and pedestrian flow detection in public places.

Object object-detection +1

A Probabilistic Model with Commonsense Constraints for Pattern-based Temporal Fact Extraction

no code implementations WS 2020 Yang Zhou, Tong Zhao, Meng Jiang

Textual patterns (e.g., Country's president Person) are specified and/or generated for extracting factual information from unstructured data.

TAG Text Generation

VisemeNet: Audio-Driven Animator-Centric Speech Animation

no code implementations 24 May 2018 Yang Zhou, Zhan Xu, Chris Landreth, Evangelos Kalogerakis, Subhransu Maji, Karan Singh

We present a novel deep-learning based approach to producing animator-centric speech motion curves that drive a JALI or standard FACS-based production face-rig, directly from input audio.

Graphics

Multi-boundary entanglement in Chern-Simons theory with finite gauge groups

no code implementations 3 Mar 2020 Siddharth Dwivedi, Andrea Addazi, Yang Zhou, Puneet Sharma

We study the multi-boundary entanglement structure of the states prepared in (1+1)- and (2+1)-dimensional Chern-Simons theory with a finite discrete gauge group $G$.

High Energy Physics - Theory Mesoscale and Nanoscale Physics Quantum Physics

Neural Latent Dependency Model for Sequence Labeling

no code implementations 10 Nov 2020 Yang Zhou, Yong Jiang, Zechuan Hu, Kewei Tu

One limitation of linear chain CRFs is their inability to model long-range dependencies between labels.

Adversarial Attacks on Deep Graph Matching

no code implementations NeurIPS 2020 Zijie Zhang, Zeru Zhang, Yang Zhou, Yelong Shen, Ruoming Jin, Dejing Dou

Despite achieving remarkable performance, deep graph learning models, such as node classification and network embedding, suffer from harassment caused by small adversarial perturbations.

Adversarial Attack Density Estimation +5

Manifold Partition Discriminant Analysis

no code implementations 23 Nov 2020 Yang Zhou, Shiliang Sun

We propose a novel algorithm for supervised dimensionality reduction named Manifold Partition Discriminant Analysis (MPDA).

Supervised dimensionality reduction

A Top-Down Approach for the Multiple Exercises and Valuation of Employee Stock Options

no code implementations 9 Jun 2019 Tim Leung, Yang Zhou

We propose a new framework to value employee stock options (ESOs) that captures multiple exercises of different quantities over time.

Optimal Dynamic Futures Portfolio in a Regime-Switching Market Framework

no code implementations 14 Oct 2019 Tim Leung, Yang Zhou

We study the problem of dynamically trading futures in a regime-switching market.

$K$-theoretic quasimap wall-crossing

no code implementations 2 Dec 2020 Ming Zhang, Yang Zhou

In this paper, we prove a K-theoretic wall-crossing formula for $\epsilon$-stable quasimaps for all GIT targets in all genera.

Algebraic Geometry Mathematical Physics 14N35

Partially Connected Automated Vehicle Cooperative Control Strategy with a Deep Reinforcement Learning Approach

no code implementations 3 Dec 2020 Haotian Shi, Yang Zhou, Keshu Wu, Xin Wang, Yangxin Lin, Bin Ran

This paper proposes a cooperative longitudinal control strategy for connected and automated vehicles (CAVs) in a partially connected and automated traffic environment, based on a deep reinforcement learning (DRL) algorithm, which enhances the string stability of mixed traffic, car-following efficiency, and energy efficiency.

Reinforcement Learning (RL)

Defect extremal surface as the holographic counterpart of Island formula

no code implementations 14 Dec 2020 Feiyu Deng, Jinwei Chu, Yang Zhou

We propose defect extremal surface as the holographic counterpart of boundary quantum extremal surface.

High Energy Physics - Theory Strongly Correlated Electrons General Relativity and Quantum Cosmology Quantum Physics

Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection

no code implementations 14 Jan 2021 Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou

However, universal language models may perform poorly on fake news detection owing to a lack of large-scale annotated data and insufficient semantic understanding of domain-specific knowledge.

Fake News Detection Language Modelling

CogNet: Bridging Linguistic Knowledge, World Knowledge and Commonsense Knowledge

no code implementations 3 Mar 2021 Chenhao Wang, Yubo Chen, Zhipeng Xue, Yang Zhou, Jun Zhao

In this paper, we present CogNet, a knowledge base (KB) dedicated to integrating three types of knowledge: (1) linguistic knowledge from FrameNet, which schematically describes situations, objects and events.

World Knowledge

Connected and Automated Vehicle Distributed Control for On-ramp Merging Scenario: A Virtual Rotation Approach

no code implementations 28 Mar 2021 Tianyi Chen, Meng Wang, Siyuan Gong, Yang Zhou, Bin Ran

In this study, we propose a rotation-based connected automated vehicle (CAV) distributed cooperative control strategy for an on-ramp merging scenario.

Optimal Dynamic Futures Portfolios Under a Multiscale Central Tendency Ornstein-Uhlenbeck Model

no code implementations 24 Feb 2021 Tim Leung, Yang Zhou

We study the problem of dynamically trading multiple futures whose underlying asset price follows a multiscale central tendency Ornstein-Uhlenbeck (MCTOU) model.

From Distributed Machine Learning to Federated Learning: A Survey

no code implementations 29 Apr 2021 Ji Liu, Jizhou Huang, Yang Zhou, Xuhong Li, Shilei Ji, Haoyi Xiong, Dejing Dou

Because of laws or regulations, the distributed data and computing resources cannot be directly shared among different regions or organizations for machine learning tasks.

BIG-bench Machine Learning Federated Learning

Towards a Better Understanding of Linear Models for Recommendation

no code implementations 27 May 2021 Ruoming Jin, Dong Li, Jing Gao, Zhi Liu, Li Chen, Yang Zhou

Through the derivation and analysis of the closed-form solutions for two basic regression and matrix factorization approaches, we found these two approaches are indeed inherently related but also diverge in how they "scale-down" the singular values of the original user-item interaction matrix.

regression

Focus on Local: Detecting Lane Marker from Bottom Up via Key Point

no code implementations CVPR 2021 Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei Zhang

Mainstream lane marker detection methods are implemented by predicting the overall structure and deriving parametric curves through post-processing.

Lane Detection

Virtual synchronous generator of PV generation without energy storage for frequency support in autonomous microgrid

no code implementations 4 Jul 2021 Cheng Zhong, Huayi Li, Yang Zhou, Yueming Lv, Jikai Chen, Yang Li

PV generation reserves part of the active power in accordance with the pre-defined power versus voltage curve.

Multi-Modal Multi-Instance Learning for Retinal Disease Recognition

no code implementations 25 Sep 2021 Xirong Li, Yang Zhou, Jie Wang, Hailan Lin, Jianchun Zhao, Dayong Ding, Weihong Yu, Youxin Chen

We propose in this paper Multi-Modal Multi-Instance Learning (MM-MIL) for selectively fusing CFP and OCT modalities.

Validating the Lottery Ticket Hypothesis with Inertial Manifold Theory

no code implementations NeurIPS 2021 Zeru Zhang, Jiayin Jin, Zijie Zhang, Yang Zhou, Xin Zhao, Jiaxiang Ren, Ji Liu, Lingfei Wu, Ruoming Jin, Dejing Dou

Despite achieving remarkable efficiency, traditional network pruning techniques often follow manually-crafted heuristics to generate pruned sparse networks.

Network Pruning

Multi-Robot Collaborative Perception with Graph Neural Networks

no code implementations 5 Jan 2022 Yang Zhou, Jiuhong Xiao, Yue Zhou, Giuseppe Loianno

Multi-robot systems such as swarms of aerial robots are naturally suited to offer additional flexibility, resilience, and robustness in several tasks compared to a single robot by enabling cooperation among the agents.

Decision Making Monocular Depth Estimation +1

FedDUAP: Federated Learning with Dynamic Update and Adaptive Pruning Using Shared Data on the Server

no code implementations 25 Apr 2022 Hong Zhang, Ji Liu, Juncheng Jia, Yang Zhou, Huaiyu Dai, Dejing Dou

Despite achieving remarkable performance, Federated Learning (FL) suffers from two critical challenges, i.e., limited computational resources and low training efficiency.

Federated Learning

3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

no code implementations 18 May 2022 Yang Zhou, Zhiwen Yang, Hui Zhang, Eric I-Chao Chang, Yubo Fan, Yan Xu

(2) We adopt a task-driven strategy that couples a segmentation task with a generative adversarial network (GAN) framework to improve the translation performance.

Generative Adversarial Network Translation

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

no code implementations CVPR 2022 Zhuoling Li, Zhan Qu, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang

To tackle this problem, we propose a depth solving system that fully explores the visual clues from the subtasks in M3OD and generates multiple estimations for the depth of each target.

Depth Estimation Monocular 3D Object Detection +2

A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains

no code implementations 25 May 2022 Guodong Sun, Yang Zhou, Huilin Pan, Bo Wu, Ye Hu, Yang Zhang

In this paper, we propose a lightweight NMS-free framework to achieve real-time detection and high accuracy simultaneously.

Fault Detection

Play It Cool: Dynamic Shifting Prevents Thermal Throttling

no code implementations 22 Jun 2022 Yang Zhou, Feng Liang, Ting-Wu Chin, Diana Marculescu

Machine learning (ML) has entered the mobile era where an enormous number of ML models are deployed on edge devices.

Input-agnostic Certified Group Fairness via Gaussian Parameter Smoothing

no code implementations 22 Jun 2022 Jiayin Jin, Zeru Zhang, Yang Zhou, Lingfei Wu

Theoretical analysis is conducted to derive that the Nemytskii operator is smooth and induces a Frechet differentiable smooth manifold.

Fairness

Accelerated Federated Learning with Decoupled Adaptive Optimization

no code implementations 14 Jul 2022 Jiayin Jin, Jiaxiang Ren, Yang Zhou, Lingjuan Lyu, Ji Liu, Dejing Dou

The federated learning (FL) framework enables edge clients to collaboratively learn a shared inference model while keeping the training data private on the clients.

Federated Learning

Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

no code implementations 17 Oct 2022 Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.

Grasp Generation Object +2

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations 17 Oct 2022 Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregates both local image features in pixels and geometric features in vertices.

ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning

no code implementations 10 Nov 2022 Yang Zhou, Yuda Song, Hui Qian, Xin Du

Image restoration tasks have achieved tremendous performance improvements with the rapid advancement of deep neural networks.

Image Restoration

Multi-Job Intelligent Scheduling with Cross-Device Federated Learning

no code implementations 24 Nov 2022 Ji Liu, Juncheng Jia, Beichen Ma, Chendi Zhou, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou

The system model enables a parallel training process of multiple jobs, with a cost model based on the data fairness and the training time of diverse devices during the parallel training process.

Bayesian Optimization Fairness +2

Visual Fault Detection of Multi-scale Key Components in Freight Trains

no code implementations 26 Nov 2022 Yang Zhang, Yang Zhou, Huilin Pan, Bo Wu, Guodong Sun

Fault detection for key components in the braking system of freight trains is critical for ensuring railway transportation safety.

Fault Detection

Cost-minimization predictive energy management of a postal-delivery fuel cell electric vehicle with intelligent battery State-of-Charge Planner

no code implementations 28 Dec 2022 Yang Zhou, Fuzeng Li, Xianfeng Xu, Zhen Zhang, Alexandre Ravey, Marie-Cécile Péra, Ruiqing Ma

Fuel cell electric vehicles have attracted substantial attention in recent decades due to their high-efficiency and zero-emission features, while their high operating costs remain the major barrier to large-scale commercialization.

energy management Management +1

Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking

no code implementations 16 Feb 2023 Zichong Wang, Yang Zhou, Meikang Qiu, Israat Haque, Laura Brown, Yi He, Jianwu Wang, David Lo, Wenbin Zhang

The increasing use of Machine Learning (ML) software can lead to unfair and unethical decisions, thus fairness bugs in software are becoming a growing concern.

Benchmarking counterfactual +1

Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator

1 code implementation 6 Mar 2023 Qing Song, Yang Zhou, Mengjie Hu, Chun Liu

Temporal action localization in videos presents significant challenges in the field of computer vision.

Temporal Action Localization

Medical Phrase Grounding with Region-Phrase Context Contrastive Alignment

no code implementations 14 Mar 2023 Zhihao Chen, Yang Zhou, Anh Tran, Junting Zhao, Liang Wan, Gideon Ooi, Lionel Cheng, Choon Hua Thng, Xinxing Xu, Yong Liu, Huazhu Fu

To enable MedRPG to locate nuanced medical findings with better region-phrase correspondences, we further propose Tri-attention Context contrastive alignment (TaCo).

Phrase Grounding Visual Grounding

L0-norm constraint normalized subband adaptive filtering algorithm: Performance development and AEC application

no code implementations 10 Apr 2023 Dongxu Liu, Haiquan Zhao, Yang Zhou

Limited by a fixed step size and sparsity penalty factor, conventional sparsity-aware normalized subband adaptive filtering (NSAF) type algorithms face a trade-off between high filtering accuracy and fast convergence in sparse system identification.

LEMMA

Single-View View Synthesis with Self-Rectified Pseudo-Stereo

no code implementations 19 Apr 2023 Yang Zhou, Hanjie Wu, Wenxi Liu, Zheng Xiong, Jing Qin, Shengfeng He

In this way, the challenging novel view synthesis process is decoupled into two simpler problems of stereo synthesis and 3D reconstruction.

3D Reconstruction Novel View Synthesis

A Survey on Cross-Architectural IoT Malware Threat Hunting

no code implementations 9 Jun 2023 Anandharaju Durai Raju, Ibrahim Abualhaol, Ronnie Salvador Giagone, Yang Zhou, Shengqiang Huang

In recent years, the increase in non-Windows malware threats has turned the focus of the cybersecurity community.

Malware Detection

Efficient Visual Fault Detection for Freight Train Braking System via Heterogeneous Self Distillation in the Wild

no code implementations 3 Jul 2023 Yang Zhang, Huilin Pan, Yang Zhou, Mingying Li, Guodong Sun

Efficient visual fault detection of freight trains is a critical part of ensuring the safe operation of railways under the restricted hardware environment.

Fault Detection object-detection +1

Fast algorithms for k-submodular maximization subject to a matroid constraint

no code implementations 26 Jul 2023 Shuxian Niu, Qian Liu, Yang Zhou, Min Li

In this paper, we apply a Threshold-Decreasing Algorithm to maximize $k$-submodular functions under a matroid constraint, which reduces the query complexity of the algorithm compared to the greedy algorithm with little loss in approximation ratio.

GRIP: Generating Interaction Poses Using Latent Consistency and Spatial Cues

no code implementations 22 Aug 2023 Omid Taheri, Yi Zhou, Dimitrios Tzionas, Yang Zhou, Duygu Ceylan, Soren Pirk, Michael J. Black

In contrast, we introduce GRIP, a learning-based method that takes, as input, the 3D motion of the body and the object, and synthesizes realistic motion for both hands before, during, and after object interaction.

Mixed Reality Object

Graph-Based Interaction-Aware Multimodal 2D Vehicle Trajectory Prediction using Diffusion Graph Convolutional Networks

no code implementations 5 Sep 2023 Keshu Wu, Yang Zhou, Haotian Shi, Xiaopeng Li, Bin Ran

Within this framework, vehicles' motions are conceptualized as nodes in a time-varying graph, and the traffic interactions are represented by a dynamic adjacency matrix.

Graph Embedding Intent Detection +1

Visual Environment Assessment for Safe Autonomous Quadrotor Landing

no code implementations 16 Nov 2023 Mattia Secchiero, Nishanth Bobbili, Yang Zhou, Giuseppe Loianno

Autonomous identification and evaluation of safe landing zones are of paramount importance for ensuring the safety and effectiveness of aerial robots in the event of system failures, low battery, or the successful completion of specific tasks.

Sequencing-enabled Hierarchical Cooperative CAV On-ramp Merging Control with Enhanced Stability and Feasibility

no code implementations 25 Nov 2023 Sixu Li, Yang Zhou, Xinyue Ye, Jiwan Jiang, Meng Wang

Subsequently, the lower-level control employs a longitudinal distributed model predictive control (MPC) supplemented by a virtual car-following (CF) concept to ensure asymptotic local stability, $l_2$-norm string stability, and safety.

Model Predictive Control

Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

1 code implementation 10 Dec 2023 Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

The spatial invariance and pooling layers of conventional CNNs often neglect crucial global information, resulting in localization errors in fault detection tasks for freight trains.

Fault Detection object-detection +1

Beyond 1D and oversimplified kinematics: A generic analytical framework for surrogate safety measures

no code implementations 12 Dec 2023 Sixu Li, Mohammad Anis, Dominique Lord, Hao Zhang, Yang Zhou, Xinyue Ye

This paper presents a generic analytical framework tailored for surrogate safety measures (SSMs) that is versatile across various highway geometries, capable of encompassing vehicle dynamics of differing dimensionality and fidelity, and suitable for dynamic, real-world environments.

AEDFL: Efficient Asynchronous Decentralized Federated Learning with Heterogeneous Devices

no code implementations 18 Dec 2023 Ji Liu, Tianshi Che, Yang Zhou, Ruoming Jin, Huaiyu Dai, Dejing Dou, Patrick Valduriez

First, we propose an asynchronous FL system model with an efficient model aggregation method for improving the FL convergence.

Federated Learning

In-Hand 3D Object Reconstruction from a Monocular RGB Video

no code implementations 27 Dec 2023 Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen

We evaluate our approach on HO3D and HOD datasets and demonstrate that it outperforms the state-of-the-art methods in terms of reconstruction surface quality, with an improvement of $52\%$ on HO3D and $20\%$ on HOD.

3D Object Reconstruction 3D Reconstruction +2

Jump Cut Smoothing for Talking Heads

no code implementations 9 Jan 2024 Xiaojuan Wang, Taesung Park, Yang Zhou, Eli Shechtman, Richard Zhang

We leverage the appearance of the subject from the other source frames in the video, fusing it with a mid-level representation driven by DensePose keypoints and face landmarks.

ActAnywhere: Subject-Aware Video Background Generation

no code implementations 19 Jan 2024 Boxiao Pan, Zhan Xu, Chun-Hao Paul Huang, Krishna Kumar Singh, Yang Zhou, Leonidas J. Guibas, Jimei Yang

Generating a video background tailored to foreground subject motion is an important problem for the movie industry and visual effects community.

Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM

no code implementations 22 Jan 2024 Zhenzhen Weng, Jingyuan Liu, Hao Tan, Zhan Xu, Yang Zhou, Serena Yeung-Levy, Jimei Yang

We present Human-LRM, a diffusion-guided feed-forward model that predicts the implicit field of a human from a single image.

Overview of Sensing Attacks on Autonomous Vehicle Technologies and Impact on Traffic Flow

no code implementations 26 Jan 2024 Zihao Li, Sixu Li, Hao Zhang, Yang Zhou, Siyang Xie, Yunlong Zhang

While perception systems in Connected and Autonomous Vehicles (CAVs), which encompass both communication technologies and advanced sensors, promise to significantly reduce human driving errors, they also expose CAVs to various cyberattacks.

Autonomous Vehicles

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

no code implementations 22 Feb 2024 Yixuan Ren, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava

With the emergence of text-to-video (T2V) diffusion models, its temporal counterpart, motion customization, has not yet been well investigated.

Video Generation

A Survey of Lottery Ticket Hypothesis

no code implementations 7 Mar 2024 Bohan Liu, Zijie Zhang, Peixiong He, Zhensen Wang, Yang Xiao, Ruimeng Ye, Yang Zhou, Wei-Shinn Ku, Bo Hui

The Lottery Ticket Hypothesis (LTH) states that a dense neural network model contains a highly sparse subnetwork (i.e., winning tickets) that can achieve even better performance than the original model when trained in isolation.

CTSM: Combining Trait and State Emotions for Empathetic Response Model

1 code implementation 22 Mar 2024 Wang Yufeng, Chen Chao, Yang Zhou, Wang Shuhui, Liao Xiangwen

Specifically, to sufficiently perceive emotions in dialogue, we first construct and encode trait and state emotion embeddings, and then we further enhance emotional perception capability through an emotion guidance module that guides emotion representation.

Contrastive Learning Empathetic Response Generation +1

MedRG: Medical Report Grounding with Multi-modal Large Language Model

no code implementations 10 Apr 2024 Ke Zou, Yang Bai, Zhihao Chen, Yang Zhou, Yidi Chen, Kai Ren, Meng Wang, Xuedong Yuan, Xiaojing Shen, Huazhu Fu

Medical Report Grounding is pivotal in identifying the most relevant regions in medical images based on a given phrase query, a critical aspect in medical image analysis and radiological diagnosis.

Language Modelling Large Language Model +2
