Search Results for author: Yang Zhou

Found 121 papers, 40 papers with code

Tell2Design: A Dataset for Language-Guided Floor Plan Generation

2 code implementations • 27 Nov 2023 • Sicong Leng, Yang Zhou, Mohammed Haroon Dupty, Wee Sun Lee, Sam Conrad Joyce, Wei Lu

We make multiple contributions to initiate research on this task.

Conditional Image Generation

7,771

Paper
Code

Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55

1 code implementation • 17 Oct 2017 • Li Yi, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Qirui Wang, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, Yuan Gan, Pengyu Wang, Kun Liu, Fenggen Yu, Panpan Shui, Bingyang Hu, Yan Zhang, Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Minki Jeong, Jaehoon Choi, Changick Kim, Angom Geetchandra, Narasimha Murthy, Bhargava Ramu, Bharadwaj Manda, M. Ramanathan, Gautam Kumar, P Preetham, Siddharth Srivastava, Swati Bhugra, Brejesh lall, Christian Haene, Shubham Tulsiani, Jitendra Malik, Jared Lafer, Ramsey Jones, Siyuan Li, Jie Lu, Shi Jin, Jingyi Yu, Qi-Xing Huang, Evangelos Kalogerakis, Silvio Savarese, Pat Hanrahan, Thomas Funkhouser, Hao Su, Leonidas Guibas

We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database.

3D Part Segmentation 3D Reconstruction +1

1,989

Paper
Code

RigNet: Neural Rigging for Articulated Characters

1 code implementation • 1 May 2020 • Zhan Xu, Yang Zhou, Evangelos Kalogerakis, Chris Landreth, Karan Singh

We present RigNet, an end-to-end automated method for producing animation rigs from input character models.

1,293

Paper
Code

SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

2 code implementations • ICCV 2019 • Yang Zhou, Zachary While, Evangelos Kalogerakis

In this paper we propose a neural message passing approach to augment an input 3D indoor scene with new objects matching their surroundings.

3D Object Recognition Scene Generation

923

Paper
Code

MakeItTalk: Speaker-Aware Talking-Head Animation

3 code implementations • 27 Apr 2020 • Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, DIngzeyu Li

We present a method that generates expressive talking heads from a single facial image with audio as the only input.

Talking Face Generation Talking Head Generation

923

Paper
Code

LRM: Large Reconstruction Model for Single Image to 3D

1 code implementation • 8 Nov 2023 • Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan

We propose the first Large Reconstruction Model (LRM) that predicts the 3D model of an object from a single input image within just 5 seconds.

Image to 3D

714

Paper
Code

Non-Stationary Texture Synthesis by Adversarial Expansion

1 code implementation • 11 May 2018 • Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang

We demonstrate that this conceptually simple approach is highly effective for capturing large-scale structures, as well as other non-stationary attributes of the input exemplar.

Generative Adversarial Network Texture Synthesis

369

Paper
Code

Skeleton-free Pose Transfer for Stylized 3D Characters

1 code implementation • 28 Jul 2022 • Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou

We present the first method that automatically transfers poses between stylized 3D characters without skeletal rigging.

Pose Transfer

175

Paper
Code

LLM Inference Unveiled: Survey and Roofline Model Insights

2 code implementations • 26 Feb 2024 • Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer

Our survey stands out from traditional literature reviews by not only summarizing the current state of research but also by introducing a framework based on roofline model for systematic analysis of LLM inference techniques.

Knowledge Distillation Language Modelling +3

148

Paper
Code

Rethinking Performance Gains in Image Dehazing Networks

1 code implementation • 23 Sep 2022 • Yuda Song, Yang Zhou, Hui Qian, Xin Du

Image dehazing is an active topic in low-level vision, and many image dehazing networks have been proposed with the rapid development of deep learning.

Ranked #2 on Image Dehazing on RS-Haze

Image Dehazing Single Image Dehazing

137

Paper
Code

A Survey on Multimodal Large Language Models for Autonomous Driving

1 code implementation • 21 Nov 2023 • Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

We first introduce the background of Multimodal Large Language Models (MLLMs), the multimodal models development using LLMs, and the history of autonomous driving.

Autonomous Driving

130

Paper
Code

Morig: Motion-aware rigging of character meshes from point clouds

1 code implementation • 17 Oct 2022 • Zhan Xu, Yang Zhou, Li Yi, Evangelos Kalogerakis

We present MoRig, a method that automatically rigs character meshes driven by single-view point cloud streams capturing the motion of performing characters.

Paper
Code

Predicting Animation Skeletons for 3D Articulated Models via Volumetric Nets

1 code implementation • 22 Aug 2019 • Zhan Xu, Yang Zhou, Evangelos Kalogerakis, Karan Singh

We present a learning method for predicting animation skeletons for input 3D models of articulated characters.

Paper
Code

ETNet: Error Transition Network for Arbitrary Style Transfer

1 code implementation • NeurIPS 2019 • Chunjin Song, Zhijie Wu, Yang Zhou, Minglun Gong, Hui Huang

Numerous valuable efforts have been devoted to achieving arbitrary style transfer since the seminal work of Gatys et al.

Style Transfer

Paper
Code

Triplet-Center Loss for Multi-View 3D Object Retrieval

1 code implementation • CVPR 2018 • Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai, Xiang Bai

Most existing 3D object recognition algorithms focus on leveraging the strong discriminative power of deep learning models with softmax loss for the classification of 3D data, while learning discriminative features with deep metric learning for 3D object retrieval is more or less neglected.

3D Object Recognition 3D Object Retrieval +6

Paper
Code

Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge Awareness

1 code implementation • ACL 2020 • Sixing Wu, Ying Li, Dawei Zhang, Yang Zhou, Zhonghai Wu

We collect and build a large-scale Chinese dataset aligned with the commonsense knowledge for dialogue generation.

Dialogue Generation Knowledge Graphs

Paper
Code

Learning Visibility for Robust Dense Human Body Estimation

1 code implementation • 23 Aug 2022 • Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang

An alternative approach is to estimate dense vertices of a predefined template body in the image space.

Paper
Code

DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing

1 code implementation • 1 Nov 2023 • Gaoshuang Huang, Yang Zhou, Xiaofei Hu, Chenglong Zhang, Luying Zhao, Wenjian Gan, Mingbo Hou

In this study, we utilize the DINOv2 model as the backbone network for trimming and fine-tuning to extract robust image features.

Visual Place Recognition

Paper
Code

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

1 code implementation • 18 Mar 2024 • Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu

Context information, such as road maps and surrounding agents' states, provides crucial geometric and semantic information for motion behavior prediction.

Autonomous Vehicles motion prediction

Paper
Code

UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization

1 code implementation • 28 Aug 2023 • Rui Zhang, Hongxia Wang, Mingshan Du, Hanqing Liu, Yang Zhou, Qiang Zeng

Our approach introduces a Temporal Feature Abnormal Attention (TFAA) module based on temporal feature reconstruction to enhance the detection of temporal differences.

Ranked #1 on Temporal Forgery Localization on LAV-DF

Binary Classification Temporal Forgery Localization +1

Paper
Code

Neural Texture Synthesis With Guided Correspondence

1 code implementation • CVPR 2023 • Yang Zhou, Kaijian Chen, Rongjun Xiao, Hui Huang

More importantly, the Guided Correspondence loss can function as a general textural loss in, e. g., training generative networks for real-time controlled synthesis and inversion-based single-image editing.

Texture Synthesis

Paper
Code

Learning Navigational Visual Representations with Semantic Map Supervision

1 code implementation • ICCV 2023 • Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan

Being able to perceive the semantics and the spatial structure of the environment is essential for visual navigation of a household robot.

Representation Learning Self-Supervised Learning +2

Paper
Code

Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

1 code implementation • 23 Oct 2023 • Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, Dejing Dou

This paper proposes a Parameter-efficient prompt Tuning approach with Adaptive Optimization, i. e., FedPepTAO, to enable efficient and effective FL of LLMs.

Federated Learning

Paper
Code

Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion

1 code implementation • ICCV 2023 • Yutao Jiang, Yang Zhou, Yuan Liang, Wenxi Liu, Jianbo Jiao, Yuhui Quan, Shengfeng He

To address the above issues, we propose Diffuse3D which employs a pre-trained diffusion model for global synthesis, while amending the model to activate depth-aware inference.

Denoising Novel View Synthesis

Paper
Code

APES: Articulated Part Extraction from Sprite Sheets

1 code implementation • CVPR 2022 • Zhan Xu, Matthew Fisher, Yang Zhou, Deepali Aneja, Rushikesh Dudhat, Li Yi, Evangelos Kalogerakis

Rigged puppets are one of the most prevalent representations to create 2D character animations.

Paper
Code

Generating Non-Stationary Textures using Self-Rectification

1 code implementation • 5 Jan 2024 • Yang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huang

This paper addresses the challenge of example-based non-stationary texture synthesis.

Texture Synthesis

Paper
Code

Deformable One-shot Face Stylization via DINO Semantic Guidance

1 code implementation • 1 Mar 2024 • Yang Zhou, Zichong Chen, Hui Huang

This paper addresses the complex issue of one-shot face stylization, focusing on the simultaneous consideration of appearance and structure, where previous methods have fallen short.

One-Shot Face Stylization

Paper
Code

None Class Ranking Loss for Document-Level Relation Extraction

1 code implementation • 1 May 2022 • Yang Zhou, Wee Sun Lee

This ignores the context of entity pairs and the label correlations between the none class and pre-defined classes, leading to sub-optimal predictions.

Document-level Relation Extraction Emotion Classification +3

Paper
Code

Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations

1 code implementation • CVPR 2022 • Zhixuan Zhong, Liangyu Chai, Yang Zhou, Bailin Deng, Jia Pan, Shengfeng He

This paper presents a Generative prior ReciprocAted Invertible rescaling Network (GRAIN) for generating faithful high-resolution (HR) images from low-resolution (LR) invertible images with an extreme upscaling factor (64x).

Paper
Code

More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous Knowledge

1 code implementation • EMNLP 2021 • Sixing Wu, Ying Li, Minghui Wang, Dawei Zhang, Yang Zhou, Zhonghai Wu

Despite achieving remarkable performance, previous knowledge-enhanced works usually only use a single-source homogeneous knowledge base of limited knowledge coverage.

Dialogue Generation

Paper
Code

Modular Degradation Simulation and Restoration for Under-Display Camera

1 code implementation • 23 Sep 2022 • Yang Zhou, Yuda Song, Xin Du

Together with a pixel-wise discriminator and supervised loss, we can train the generator to simulate the UDC imaging degradation process.

Generative Adversarial Network Image Restoration

Paper
Code

Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

1 code implementation • 5 Jun 2023 • Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Experiments on three datasets demonstrate the good generality of our method, which outperforms other image-level weakly supervised methods for nuclei instance segmentation, and achieves comparable performance to fully-supervised methods.

Instance Segmentation Multi-Task Learning +4

Paper
Code

Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

1 code implementation • 30 Jun 2023 • Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Foremost, our work demonstrates that the VLPM pre-trained on natural image-text pairs exhibits astonishing potential for downstream tasks in the medical field as well.

object-detection Object Detection

Paper
Code

Cross-lingual Entity Alignment with Adversarial Kernel Embedding and Adversarial Knowledge Translation

1 code implementation • 16 Apr 2021 • Gong Zhang, Yang Zhou, Sixing Wu, Zeru Zhang, Dejing Dou

With the guidance of known aligned entities in the context of multiple random walks, an adversarial knowledge translation model is developed to fill and translate masked entities in pairwise random walks from two KGs.

Attribute Entity Alignment +2

Paper
Code

PENet: A Joint Panoptic Edge Detection Network

1 code implementation • 15 Mar 2023 • Yang Zhou, Giuseppe Loianno

In recent years, compact and efficient scene understanding representations have gained popularity in increasing situational awareness and autonomy of robotic systems.

Edge Detection Multi-Task Learning +2

Paper
Code

DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

1 code implementation • 11 Jul 2023 • Zhiwen Yang, Yang Zhou, HUI ZHANG, Bingzheng Wei, Yubo Fan, Yan Xu

To address this, we develop a generalist model that shares architecture and parameters across centers to utilize the shared knowledge.

Image Generation

Paper
Code

Structure Learning of Probabilistic Graphical Models: A Comprehensive Survey

2 code implementations • 29 Nov 2011 • Yang Zhou

Probabilistic graphical models combine the graph theory and probability theory to give a multivariate statistical modeling.

Marketing

Paper
Code

Full 3D Reconstruction of Transparent Objects

no code implementations • 9 May 2018 • Bojian Wu, Yang Zhou, Yiming Qian, Minglun Gong, Hui Huang

Numerous techniques have been proposed for reconstructing 3D models for opaque objects in past decades.

3D Reconstruction Transparent objects

Paper
Add Code

A Tube-and-Droplet-based Approach for Representing and Analyzing Motion Trajectories

no code implementations • 10 Sep 2016 • Weiyao Lin, Yang Zhou, Hongteng Xu, Junchi Yan, Mingliang Xu, Jianxin Wu, Zicheng Liu

Our approach first leverages the complete information from given trajectories to construct a thermal transfer field which provides a context-rich way to describe the global motion pattern in a scene.

3D Action Recognition Anomaly Detection +2

Paper
Add Code

A Large-scale Distributed Video Parsing and Evaluation Platform

no code implementations • 29 Nov 2016 • Kai Yu, Yang Zhou, Da Li, Zhang Zhang, Kaiqi Huang

Visual surveillance systems have become one of the largest data sources of Big Visual Data in real world.

Paper
Add Code

DeepMove: Learning Place Representations through Large Scale Movement Data

no code implementations • 11 Jul 2018 • Yang Zhou, Yan Huang

DeepMove is spatial and temporal context aware.

Clustering

Paper
Add Code

EFANet: Exchangeable Feature Alignment Network for Arbitrary Style Transfer

no code implementations • 26 Nov 2018 • Zhijie Wu, Chunjin Song, Yang Zhou, Minglun Gong, Hui Huang

Style transfer has been an important topic both in computer vision and graphics.

Style Transfer

Paper
Add Code

Regularized Distance Metric Learning:Theory and Algorithm

no code implementations • NeurIPS 2009 • Rong Jin, Shijun Wang, Yang Zhou

In this paper, we examine the generalization error of regularized distance metric learning.

Face Recognition General Classification +1

Paper
Add Code

Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition

no code implementations • CVPR 2015 • Yang Zhou, Bingbing Ni, Richang Hong, Meng Wang, Qi Tian

Secondly, these object regions are matched and tracked across frames to form a large spatio-temporal graph based on the appearance matching and the dense motion trajectories through them.

Fine-grained Action Recognition Human-Object Interaction Detection +2

Paper
Add Code

Cascaded Interactional Targeting Network for Egocentric Video Analysis

no code implementations • CVPR 2016 • Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian

Firstly, a novel EM-like learning framework is proposed to train the pixel-level deep convolutional neural network (DCNN) by seamlessly integrating weakly supervised data (i. e., massive bounding box annotations) with a small set of strongly supervised data (i. e., fully annotated hand segmentation maps) to achieve state-of-the-art hand segmentation performance.

Action Recognition Foreground Segmentation +4

Paper
Add Code

Unsupervised Trajectory Clustering via Adaptive Multi-Kernel-Based Shrinkage

no code implementations • ICCV 2015 • Hongteng Xu, Yang Zhou, Weiyao Lin, Hongyuan Zha

Facing to the challenges of trajectory clustering, e. g., large variations within a cluster and ambiguities across clusters, we first introduce an adaptive multi-kernel-based estimation process to estimate the `shrunk' positions and speeds of trajectories' points.

Anomaly Detection Clustering +1

Paper
Add Code

Platoon trajectories generation: A unidirectional interconnected LSTM-based car following model

no code implementations • 25 Oct 2019 • Yangxin Lin, Ping Wang, Yang Zhou, Fan Ding, Chen Wang, Huachun Tan

However, the traffic micro-simulation accuracy of car following models in a platoon level, especially during traffic oscillations, still needs to be enhanced.

Paper
Add Code

End-To-End Trainable Video Super-Resolution Based on a New Mechanism for Implicit Motion Estimation and Compensation

no code implementations • 5 Jan 2020 • Xiaohong Liu, Lingshi Kong, Yang Zhou, Jiying Zhao, Jun Chen

Video super-resolution aims at generating a high-resolution video from its low-resolution counterpart.

Motion Compensation Motion Estimation +1

Paper
Add Code

A Video Analysis Method on Wanfang Dataset via Deep Neural Network

no code implementations • 28 Feb 2020 • Jinlong Kang, Jiaxiang Zheng, Heng Bai, Xiaoting Xue, Yang Zhou, Jun Guo

To solve this problem, in this paper, we describe the new function for real-time multi-object detection in sports competition and pedestrians flow detection in public based on deep learning.

Object object-detection +1

Paper
Add Code

A Probabilistic Model with Commonsense Constraints for Pattern-based Temporal Fact Extraction

no code implementations • WS 2020 • Yang Zhou, Tong Zhao, Meng Jiang

Textual patterns (e. g., Country's president Person) are specified and/or generated for extracting factual information from unstructured data.

TAG Text Generation

Paper
Add Code

VisemeNet: Audio-Driven Animator-Centric Speech Animation

no code implementations • 24 May 2018 • Yang Zhou, Zhan Xu, Chris Landreth, Evangelos Kalogerakis, Subhransu Maji, Karan Singh

We present a novel deep-learning based approach to producing animator-centric speech motion curves that drive a JALI or standard FACS-based production face-rig, directly from input audio.

Graphics

Paper
Add Code

Multi-boundary entanglement in Chern-Simons theory with finite gauge groups

no code implementations • 3 Mar 2020 • Siddharth Dwivedi, Andrea Addazi, Yang Zhou, Puneet Sharma

We study the multi-boundary entanglement structure of the states prepared in (1+1) and (2+1) dimensional Chern-Simons theory with finite discrete gauge group $G$.

High Energy Physics - Theory Mesoscale and Nanoscale Physics Quantum Physics

Paper
Add Code

Neural Latent Dependency Model for Sequence Labeling

no code implementations • 10 Nov 2020 • Yang Zhou, Yong Jiang, Zechuan Hu, Kewei Tu

One limitation of linear chain CRFs is their inability to model long-range dependencies between labels.

Paper
Add Code

Adversarial Attacks on Deep Graph Matching

no code implementations • NeurIPS 2020 • Zijie Zhang, Zeru Zhang, Yang Zhou, Yelong Shen, Ruoming Jin, Dejing Dou

Despite achieving remarkable performance, deep graph learning models, such as node classification and network embedding, suffer from harassment caused by small adversarial perturbations.

Adversarial Attack Density Estimation +5

Paper
Add Code

Manifold Partition Discriminant Analysis

no code implementations • 23 Nov 2020 • Yang Zhou, Shiliang Sun

We propose a novel algorithm for supervised dimensionality reduction named Manifold Partition Discriminant Analysis (MPDA).

Supervised dimensionality reduction

Paper
Add Code

A Top-Down Approach for the Multiple Exercises and Valuation of Employee Stock Options

no code implementations • 9 Jun 2019 • Tim Leung, Yang Zhou

We propose a new framework to value employee stock options (ESOs) that captures multiple exercises of different quantities over time.

Paper
Add Code

Optimal Dynamic Futures Portfolio in a Regime-Switching Market Framework

no code implementations • 14 Oct 2019 • Tim Leung, Yang Zhou

We study the problem of dynamically trading futures in a regime-switching market.

Paper
Add Code

$K$-theoretic quasimap wall-crossing

no code implementations • 2 Dec 2020 • Ming Zhang, Yang Zhou

In this paper, we prove a K-theoretic wall-crossing formula for $\epsilon$-stable quasimaps for all GIT targets in all genera.

Algebraic Geometry Mathematical Physics Mathematical Physics 14N35

Paper
Add Code

Partially Connected Automated Vehicle Cooperative Control Strategy with a Deep Reinforcement Learning Approach

no code implementations • 3 Dec 2020 • Haotian Shi, Yang Zhou, Keshu Wu, Xin Wang, Yangxin Lin, Bin Ran

This paper proposes a cooperative strategy of connected and automated vehicles (CAVs) longitudinal control for partially connected and automated traffic environment based on deep reinforcement learning (DRL) algorithm, which enhances the string stability of mixed traffic, car following efficiency, and energy efficiency.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Defect extremal surface as the holographic counterpart of Island formula

no code implementations • 14 Dec 2020 • Feiyu Deng, Jinwei Chu, Yang Zhou

We propose defect extremal surface as the holographic counterpart of boundary quantum extremal surface.

High Energy Physics - Theory Strongly Correlated Electrons General Relativity and Quantum Cosmology Quantum Physics

Paper
Add Code

Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection

no code implementations • 14 Jan 2021 • Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou

However, universal language models may perform weakly in these fake news detection for lack of large-scale annotated data and sufficient semantic understanding of domain-specific knowledge.

Fake News Detection Language Modelling

Paper
Add Code

CogNet: Bridging Linguistic Knowledge, World Knowledge and Commonsense Knowledge

no code implementations • 3 Mar 2021 • Chenhao Wang, Yubo Chen, Zhipeng Xue, Yang Zhou, Jun Zhao

In this paper, we present CogNet, a knowledge base (KB) dedicated to integrating three types of knowledge: (1) linguistic knowledge from FrameNet, which schematically describes situations, objects and events.

World Knowledge

Paper
Add Code

Connected and Automated Vehicle Distributed Control for On-ramp Merging Scenario: A Virtual Rotation Approach

no code implementations • 28 Mar 2021 • Tianyi Chen, Meng Wang, Siyuan Gong, Yang Zhou, Bin Ran

In this study, we propose a rotation-based connected automated vehicle (CAV) distributed cooperative control strategy for an on-ramp merging scenario.

Paper
Add Code

Optimal Dynamic Futures Portfolios Under a Multiscale Central Tendency Ornstein-Uhlenbeck Model

no code implementations • 24 Feb 2021 • Tim Leung, Yang Zhou

We study the problem of dynamically trading multiple futures whose underlying asset price follows a multiscale central tendency Ornstein-Uhlenbeck (MCTOU) model.

Paper
Add Code

From Distributed Machine Learning to Federated Learning: A Survey

no code implementations • 29 Apr 2021 • Ji Liu, Jizhou Huang, Yang Zhou, Xuhong LI, Shilei Ji, Haoyi Xiong, Dejing Dou

Because of laws or regulations, the distributed data and computing resources cannot be directly shared among different regions or organizations for machine learning tasks.

BIG-bench Machine Learning Federated Learning

Paper
Add Code

Infrastructure Assisted Constrained Connected Automated Vehicle Trajectory Optimization on Curved Roads: A Spatial Formulation on a Curvilinear Coordinate

no code implementations • 1 Mar 2021 • Ran Yi, Yang Zhou, Xin Wang, Zhiyuan Liu, Xiaotian Li, Bin Ran

This paper presents an infrastructure assisted constrained connected automated vehicles (CAVs) trajectory optimization method on curved roads.

Model Predictive Control

Paper
Add Code

Towards a Better Understanding of Linear Models for Recommendation

no code implementations • 27 May 2021 • Ruoming Jin, Dong Li, Jing Gao, Zhi Liu, Li Chen, Yang Zhou

Through the derivation and analysis of the closed-form solutions for two basic regression and matrix factorization approaches, we found these two approaches are indeed inherently related but also diverge in how they "scale-down" the singular values of the original user-item interaction matrix.

regression

Paper
Add Code

Focus on Local: Detecting Lane Marker from Bottom Up via Key Point

no code implementations • CVPR 2021 • Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei zhang

Mainstream lane marker detection methods are implemented by predicting the overall structure and deriving parametric curves through post-processing.

Ranked #2 on Lane Detection on TuSimple

Lane Detection

Paper
Add Code

Virtual synchronous generator of PV generation without energy storage for frequency support in autonomous microgrid

no code implementations • 4 Jul 2021 • Cheng Zhong, Huayi Li, Yang Zhou, Yueming Lv, Jikai Chen, Yang Li

PV generation reserve a part of the active power in accordance with the pre-defined power versus voltage curve.

Paper
Add Code

Multi-Modal Multi-Instance Learning for Retinal Disease Recognition

no code implementations • 25 Sep 2021 • Xirong Li, Yang Zhou, Jie Wang, Hailan Lin, Jianchun Zhao, Dayong Ding, Weihong Yu, Youxin Chen

We propose in this paper Multi-Modal Multi-Instance Learning (MM-MIL) for selectively fusing CFP and OCT modalities.

Paper
Add Code

Adversarial Attack against Cross-lingual Knowledge Graph Alignment

no code implementations • EMNLP 2021 • Zeru Zhang, Zijie Zhang, Yang Zhou, Lingfei Wu, Sixing Wu, Xiaoying Han, Dejing Dou, Tianshi Che, Da Yan

Recent literatures have shown that knowledge graph (KG) learning models are highly vulnerable to adversarial attacks.

Adversarial Attack Entity Alignment

Paper
Add Code

Validating the Lottery Ticket Hypothesis with Inertial Manifold Theory

no code implementations • NeurIPS 2021 • Zeru Zhang, Jiayin Jin, Zijie Zhang, Yang Zhou, Xin Zhao, Jiaxiang Ren, Ji Liu, Lingfei Wu, Ruoming Jin, Dejing Dou

Despite achieving remarkable efficiency, traditional network pruning techniques often follow manually-crafted heuristics to generate pruned sparse networks.

Network Pruning

Paper
Add Code

Efficient Device Scheduling with Multi-Job Federated Learning

no code implementations • 11 Dec 2021 • Chendi Zhou, Ji Liu, Juncheng Jia, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou

However, the scheduling of devices for multiple jobs with FL remains a critical and open problem.

Bayesian Optimization Fairness +2

Paper
Add Code

Multi-Robot Collaborative Perception with Graph Neural Networks

no code implementations • 5 Jan 2022 • Yang Zhou, Jiuhong Xiao, Yue Zhou, Giuseppe Loianno

Multi-robot systems such as swarms of aerial robots are naturally suited to offer additional flexibility, resilience, and robustness in several tasks compared to a single robot by enabling cooperation among the agents.

Decision Making Monocular Depth Estimation +1

Paper
Add Code

FedDUAP: Federated Learning with Dynamic Update and Adaptive Pruning Using Shared Data on the Server

no code implementations • 25 Apr 2022 • Hong Zhang, Ji Liu, Juncheng Jia, Yang Zhou, Huaiyu Dai, Dejing Dou

Despite achieving remarkable performance, Federated Learning (FL) suffers from two critical challenges, i. e., limited computational resources and low training efficiency.

Federated Learning

Paper
Add Code

Lesion Localization in OCT by Semi-Supervised Object Detection

no code implementations • 24 Apr 2022 • Yue Wu, Yang Zhou, Jianchun Zhao, Jingyuan Yang, Weihong Yu, Youxin Chen, Xirong Li

Over 300 million people worldwide are affected by various retinal diseases.

object-detection Object Detection +1

Paper
Add Code

Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training

no code implementations • 27 Apr 2022 • Guanhong Wang, Keyu Lu, Yang Zhou, Zhanhao He, Gaoang Wang

Recently, much progress has been made for self-supervised action recognition.

Contrastive Learning Human Parsing +5

Paper
Add Code

Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition

no code implementations • 1 May 2022 • Yang Zhou, Zhanhao He, Keyu Lu, Guanhong Wang, Gaoang Wang

Video-based action recognition is one of the most popular topics in computer vision.

Action Recognition Representation Learning +1

Paper
Add Code

The AISP-SJTU Simultaneous Translation System for IWSLT 2022

no code implementations • IWSLT (ACL) 2022 • Qinpei Zhu, Renshou Wu, Guangfeng Liu, Xinyu Zhu, Xingyu Chen, Yang Zhou, Qingliang Miao, Rui Wang, Kai Yu

This paper describes AISP-SJTU’s submissions for the IWSLT 2022 Simultaneous Translation task.

Translation

Paper
Add Code

3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

no code implementations • 18 May 2022 • Yang Zhou, Zhiwen Yang, HUI ZHANG, Eric I-Chao Chang, Yubo Fan, Yan Xu

(2) We adopt a task-driven strategy that couples a segmentation task with a generative adversarial network (GAN) framework to improve the translation performance.

Generative Adversarial Network Translation

Paper
Add Code

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

no code implementations • CVPR 2022 • Zhuoling Li, Zhan Qu, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang

To tackle this problem, we propose a depth solving system that fully explores the visual clues from the subtasks in M3OD and generates multiple estimations for the depth of each target.

Depth Estimation Monocular 3D Object Detection +2

Paper
Add Code

A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains

no code implementations • 25 May 2022 • Guodong Sun, Yang Zhou, Huilin Pan, Bo Wu, Ye Hu, Yang Zhang

In this paper, we propose a lightweight NMS-free framework to achieve real-time detection and high accuracy simultaneously.

Fault Detection

Paper
Add Code

Play It Cool: Dynamic Shifting Prevents Thermal Throttling

no code implementations • 22 Jun 2022 • Yang Zhou, Feng Liang, Ting-Wu Chin, Diana Marculescu

Machine learning (ML) has entered the mobile era where an enormous number of ML models are deployed on edge devices.

Paper
Add Code

Input-agnostic Certified Group Fairness via Gaussian Parameter Smoothing

no code implementations • 22 Jun 2022 • Jiayin Jin, Zeru Zhang, Yang Zhou, Lingfei Wu

Theoretical analysis is conducted to derive that the Nemytskii operator is smooth and induces a Frechet differentiable smooth manifold.

Fairness

Paper
Add Code

Accelerated Federated Learning with Decoupled Adaptive Optimization

no code implementations • 14 Jul 2022 • Jiayin Jin, Jiaxiang Ren, Yang Zhou, Lingjuan Lyu, Ji Liu, Dejing Dou

The federated learning (FL) framework enables edge clients to collaboratively learn a shared inference model while keeping privacy of training data on clients.

Federated Learning

Paper
Add Code

Audio-driven Neural Gesture Reenactment with Video Motion Graphs

no code implementations • CVPR 2022 • Yang Zhou, Jimei Yang, DIngzeyu Li, Jun Saito, Deepali Aneja, Evangelos Kalogerakis

We present a method that reenacts a high-quality video with gestures matching a target speech audio.

valid

Paper
Add Code

Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

no code implementations • 17 Oct 2022 • Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.

Grasp Generation Object +2

Paper
Add Code

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations • 17 Oct 2022 • Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregate both local image features in pixels and geometric features in vertices.

Paper
Add Code

ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning

no code implementations • 10 Nov 2022 • Yang Zhou, Yuda Song, Hui Qian, Xin Du

Image restoration tasks have achieved tremendous performance improvements with the rapid advancement of deep neural networks.

Image Restoration

Paper
Add Code

Multi-Job Intelligent Scheduling with Cross-Device Federated Learning

no code implementations • 24 Nov 2022 • Ji Liu, Juncheng Jia, Beichen Ma, Chendi Zhou, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou

The system model enables a parallel training process of multiple jobs, with a cost model based on the data fairness and the training time of diverse devices during the parallel training process.

Bayesian Optimization Fairness +2

Paper
Add Code

Visual Fault Detection of Multi-scale Key Components in Freight Trains

no code implementations • 26 Nov 2022 • Yang Zhang, Yang Zhou, Huilin Pan, Bo Wu, Guodong Sun

Fault detection for key components in the braking system of freight trains is critical for ensuring railway transportation safety.

Fault Detection

Paper
Add Code

Cost-minimization predictive energy management of a postal-delivery fuel cell electric vehicle with intelligent battery State-of-Charge Planner

no code implementations • 28 Dec 2022 • Yang Zhou, Fuzeng Li, Xianfeng Xu, Zhen Zhang, Alexandre Ravey, Marie-Cécile Péra, Ruiqing Ma

Fuel cell electric vehicles have earned substantial attentions in recent decades due to their high-efficiency and zero-emission features, while the high operating costs remain the major barrier towards their large-scale commercialization.

energy management Management +1

Paper
Add Code

Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking

no code implementations • 16 Feb 2023 • Zichong Wang, Yang Zhou, Meikang Qiu, Israat Haque, Laura Brown, Yi He, Jianwu Wang, David Lo, Wenbin Zhang

The increasing use of Machine Learning (ML) software can lead to unfair and unethical decisions, thus fairness bugs in software are becoming a growing concern.

Benchmarking counterfactual +1

Paper
Add Code

Artificial Intelligence System for Detection and Screening of Cardiac Abnormalities using Electrocardiogram Images

no code implementations • 10 Feb 2023 • Deyun Zhang, Shijia Geng, Yang Zhou, Weilun Xu, Guodong Wei, Kai Wang, Jie Yu, Qiang Zhu, Yongkui Li, Yonghong Zhao, Xingyue Chen, Rui Zhang, Zhaoji Fu, Rongbo Zhou, Yanqi E, Sumei Fan, Qinghao Zhao, Chuandong Cheng, Nan Peng, Liang Zhang, Linlin Zheng, Jianjun Chu, Hongbin Xu, Chen Tan, Jian Liu, Huayue Tao, Tong Liu, Kangyin Chen, Chenyang Jiang, Xingpeng Liu, Shenda Hong

In this study, we present an AI system developed to detect and screen cardiac abnormalities (CAs) from real-world ECG images.

Paper
Add Code

Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator

1 code implementation • 6 Mar 2023 • Qing Song, Yang Zhou, Mengjie Hu, Chun Liu

Temporal action localization in videos presents significant challenges in the field of computer vision.

Temporal Action Localization

Paper
Code

Medical Phrase Grounding with Region-Phrase Context Contrastive Alignment

no code implementations • 14 Mar 2023 • Zhihao Chen, Yang Zhou, Anh Tran, Junting Zhao, Liang Wan, Gideon Ooi, Lionel Cheng, Choon Hua Thng, Xinxing Xu, Yong liu, Huazhu Fu

To enable MedRPG to locate nuanced medical findings with better region-phrase correspondences, we further propose Tri-attention Context contrastive alignment (TaCo).

Phrase Grounding Visual Grounding

Paper
Add Code

L0-norm constraint normalized subband adaptive filtering algorithm: Performance development and AEC application

no code implementations • 10 Apr 2023 • Dongxu Liu, Haiquan Zhao, Yang Zhou

Limited by fixed step-size and sparsity penalty factor, the conventional sparsity-aware normalized subband adaptive filtering (NSAF) type algorithms suffer from trade-off requirements of high filtering accurateness and quicker convergence behavior for sparse system identification.

LEMMA

Paper
Add Code

Single-View View Synthesis with Self-Rectified Pseudo-Stereo

no code implementations • 19 Apr 2023 • Yang Zhou, Hanjie Wu, Wenxi Liu, Zheng Xiong, Jing Qin, Shengfeng He

In this way, the challenging novel view synthesis process is decoupled into two simpler problems of stereo synthesis and 3D reconstruction.

3D Reconstruction Novel View Synthesis

Paper
Add Code

On building machine learning pipelines for Android malware detection: a procedural survey of practices, challenges and opportunities

no code implementations • 12 Jun 2023 • Masoud Mehrabi Koushki, Ibrahim Abualhaol, Anandharaju Durai Raju, Yang Zhou, Ronnie Salvador Giagone, Huang Shengqiang

In this paper, we address this problem with a review of 42 highly-cited papers, spanning a decade of research (from 2011 to 2021).

Android Malware Detection Dimensionality Reduction +1

Paper
Add Code

A Survey on Cross-Architectural IoT Malware Threat Hunting

no code implementations • 9 Jun 2023 • Anandharaju Durai Raju, Ibrahim Abualhaol, Ronnie Salvador Giagone, Yang Zhou, Shengqiang Huang

In recent years, the increase in non-Windows malware threats had turned the focus of the cybersecurity community.

Malware Detection

Paper
Add Code

Efficient Visual Fault Detection for Freight Train Braking System via Heterogeneous Self Distillation in the Wild

no code implementations • 3 Jul 2023 • Yang Zhang, Huilin Pan, Yang Zhou, Mingying Li, Guodong Sun

Efficient visual fault detection of freight trains is a critical part of ensuring the safe operation of railways under the restricted hardware environment.

Fault Detection object-detection +1

Paper
Add Code

Fast algorithms for k-submodular maximization subject to a matroid constraint

no code implementations • 26 Jul 2023 • Shuxian Niu, Qian Liu, Yang Zhou, Min Li

In this paper, we apply a Threshold-Decreasing Algorithm to maximize $k$-submodular functions under a matroid constraint, which reduces the query complexity of the algorithm compared to the greedy algorithm with little loss in approximation ratio.

Paper
Add Code

GRIP: Generating Interaction Poses Using Latent Consistency and Spatial Cues

no code implementations • 22 Aug 2023 • Omid Taheri, Yi Zhou, Dimitrios Tzionas, Yang Zhou, Duygu Ceylan, Soren Pirk, Michael J. Black

In contrast, we introduce GRIP, a learning-based method that takes, as input, the 3D motion of the body and the object, and synthesizes realistic motion for both hands before, during, and after object interaction.

Mixed Reality Object

Paper
Add Code

Graph-Based Interaction-Aware Multimodal 2D Vehicle Trajectory Prediction using Diffusion Graph Convolutional Networks

no code implementations • 5 Sep 2023 • Keshu Wu, Yang Zhou, Haotian Shi, Xiaopeng Li, Bin Ran

Within this framework, vehicles' motions are conceptualized as nodes in a time-varying graph, and the traffic interactions are represented by a dynamic adjacency matrix.

Graph Embedding Intent Detection +1

Paper
Add Code

ContactGen: Generative Contact Modeling for Grasp Generation

no code implementations • ICCV 2023 • Shaowei Liu, Yang Zhou, Jimei Yang, Saurabh Gupta, Shenlong Wang

This paper presents a novel object-centric contact representation ContactGen for hand-object interaction.

Grasp Generation Object

Paper
Add Code

Visual Environment Assessment for Safe Autonomous Quadrotor Landing

no code implementations • 16 Nov 2023 • Mattia Secchiero, Nishanth Bobbili, Yang Zhou, Giuseppe Loianno

Autonomous identification and evaluation of safe landing zones are of paramount importance for ensuring the safety and effectiveness of aerial robots in the event of system failures, low battery, or the successful completion of specific tasks.

Paper
Add Code

Deep Learning for Vascular Segmentation and Applications in Phase Contrast Tomography Imaging

no code implementations • 22 Nov 2023 • Ekin Yagis, Shahab Aslani, Yashvardhan Jain, Yang Zhou, Shahrokh Rahmani, Joseph Brunet, Alexandre Bellier, Christopher Werlein, Maximilian Ackermann, Danny Jonigk, Paul Tafforeau, Peter D Lee, Claire Walsh

Moreover, decreased connectivity in finer vessels and higher segmentation errors at vessel boundaries were observed.

Segmentation

Paper
Add Code

Sequencing-enabled Hierarchical Cooperative CAV On-ramp Merging Control with Enhanced Stability and Feasibility

no code implementations • 25 Nov 2023 • Sixu Li, Yang Zhou, Xinyue Ye, Jiwan Jiang, Meng Wang

Subsequently, the lower-level control employs a longitudinal distributed model predictive control (MPC) supplemented by a virtual car-following (CF) concept to ensure asymptotic local stability, l_2 norm string stability, and safety.

Model Predictive Control

Paper
Add Code

Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

1 code implementation • 10 Dec 2023 • Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

Existing modeling shortcomings of spatial invariance and pooling layers in conventional CNNs often ignore the neglect of crucial global information, resulting in error localization for fault objection tasks of freight trains.

Fault Detection object-detection +1

Paper
Code

Beyond 1D and oversimplified kinematics: A generic analytical framework for surrogate safety measures

no code implementations • 12 Dec 2023 • Sixu Li, Mohammad Anis, Dominique Lord, Hao Zhang, Yang Zhou, Xinyue Ye

This paper presents a generic analytical framework tailored for surrogate safety measures (SSMs) that is versatile across various highway geometries, capable of encompassing vehicle dynamics of differing dimensionality and fidelity, and suitable for dynamic, real-world environments.

Paper
Add Code

A Generic Stochastic Hybrid Car-following Model Based on Approximate Bayesian Computation

no code implementations • 27 Nov 2023 • Jiwan Jiang, Yang Zhou, Xin Wang, Soyoung Ahn

However, the CF behavior of human drivers is highly stochastic and nonlinear.

Paper
Add Code

AEDFL: Efficient Asynchronous Decentralized Federated Learning with Heterogeneous Devices

no code implementations • 18 Dec 2023 • Ji Liu, Tianshi Che, Yang Zhou, Ruoming Jin, Huaiyu Dai, Dejing Dou, Patrick Valduriez

First, we propose an asynchronous FL system model with an efficient model aggregation method for improving the FL convergence.

Federated Learning

Paper
Add Code

In-Hand 3D Object Reconstruction from a Monocular RGB Video

no code implementations • 27 Dec 2023 • Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen

We evaluate our approach on HO3D and HOD datasets and demonstrate that it outperforms the state-of-the-art methods in terms of reconstruction surface quality, with an improvement of $52\%$ on HO3D and $20\%$ on HOD.

3D Object Reconstruction 3D Reconstruction +2

Paper
Add Code

Jump Cut Smoothing for Talking Heads

no code implementations • 9 Jan 2024 • Xiaojuan Wang, Taesung Park, Yang Zhou, Eli Shechtman, Richard Zhang

We leverage the appearance of the subject from the other source frames in the video, fusing it with a mid-level representation driven by DensePose keypoints and face landmarks.

Paper
Add Code

ActAnywhere: Subject-Aware Video Background Generation

no code implementations • 19 Jan 2024 • Boxiao Pan, Zhan Xu, Chun-Hao Paul Huang, Krishna Kumar Singh, Yang Zhou, Leonidas J. Guibas, Jimei Yang

Generating video background that tailors to foreground subject motion is an important problem for the movie industry and visual effects community.

Paper
Add Code

Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM

no code implementations • 22 Jan 2024 • Zhenzhen Weng, Jingyuan Liu, Hao Tan, Zhan Xu, Yang Zhou, Serena Yeung-Levy, Jimei Yang

We present Human-LRM, a diffusion-guided feed-forward model that predicts the implicit field of a human from a single image.

Paper
Add Code

Overview of Sensing Attacks on Autonomous Vehicle Technologies and Impact on Traffic Flow

no code implementations • 26 Jan 2024 • Zihao Li, Sixu Li, Hao Zhang, Yang Zhou, Siyang Xie, Yunlong Zhang

While perception systems in Connected and Autonomous Vehicles (CAVs), which encompass both communication technologies and advanced sensors, promise to significantly reduce human driving errors, they also expose CAVs to various cyberattacks.

Autonomous Vehicles

Paper
Add Code

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

no code implementations • 22 Feb 2024 • Yixuan Ren, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava

With the emergence of text-to-video (T2V) diffusion models, its temporal counterpart, motion customization, has not yet been well investigated.

Video Generation

Paper
Add Code

A Survey of Lottery Ticket Hypothesis

no code implementations • 7 Mar 2024 • Bohan Liu, Zijie Zhang, Peixiong He, Zhensen Wang, Yang Xiao, Ruimeng Ye, Yang Zhou, Wei-Shinn Ku, Bo Hui

The Lottery Ticket Hypothesis (LTH) states that a dense neural network model contains a highly sparse subnetwork (i. e., winning tickets) that can achieve even better performance than the original model when trained in isolation.

Paper
Add Code

CTSM: Combining Trait and State Emotions for Empathetic Response Model

1 code implementation • 22 Mar 2024 • Wang Yufeng, Chen Chao, Yang Zhou, Wang Shuhui, Liao Xiangwen

Specifically, to sufficiently perceive emotions in dialogue, we first construct and encode trait and state emotion embeddings, and then we further enhance emotional perception capability through an emotion guidance module that guides emotion representation.

Contrastive Learning Empathetic Response Generation +1

Paper
Code

MedRG: Medical Report Grounding with Multi-modal Large Language Model

no code implementations • 10 Apr 2024 • Ke Zou, Yang Bai, Zhihao Chen, Yang Zhou, Yidi Chen, Kai Ren, Meng Wang, Xuedong Yuan, Xiaojing Shen, Huazhu Fu

Medical Report Grounding is pivotal in identifying the most relevant regions in medical images based on a given phrase query, a critical aspect in medical image analysis and radiological diagnosis.

Language Modelling Large Language Model +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.