Search Results for author: Jie Chen

Found 362 papers, 140 papers with code

Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection

no code implementations9 May 2025 Zhangchi Hu, Peixi Wu, Jie Chen, Huyue Zhu, Yijun Wang, Yansong Peng, Hebei Li, Xiaoyan Sun

Tiny object detection plays a vital role in drone surveillance, remote sensing, and autonomous systems, enabling the identification of small targets across vast landscapes.

object-detection Object Detection

Efficient Spiking Point Mamba for Point Cloud Analysis

no code implementations19 Apr 2025 Peixi Wu, Bosong Chai, Menghua Zheng, Wei Li, Zhangchi Hu, Jie Chen, Zheyu Zhang, Hebei Li, Xiaoyan Sun

Due to the poor performance of simply transferring Mamba to 3D SNNs, SPM is designed to utilize both the sequence modeling capabilities of Mamba and the temporal feature extraction of SNNs.

Computational Efficiency Mamba

Learning Physics-Informed Color-Aware Transforms for Low-Light Image Enhancement

no code implementations16 Apr 2025 Xingxing Yang, Jie Chen, Zaifeng Yang

To address these challenges, we introduce a Physics-informed Color-aware Transform (PiCat), a learning-based framework that converts low-light images from the sRGB color space into deep illumination-invariant descriptors via our proposed Color-aware Transform (CAT).

Low-Light Image Enhancement

LightFormer: A lightweight and efficient decoder for remote sensing image segmentation

no code implementations15 Apr 2025 Sihang Chen, Lijun Yun, Ze Liu, Jianfeng Zhu, Jie Chen, Hui Wang, Yueping Nie

Deep learning techniques have achieved remarkable success in the semantic segmentation of remote sensing images and in land-use change detection.

Change Detection Decoder +2

Robust Offline Imitation Learning Through State-level Trajectory Stitching

no code implementations28 Mar 2025 Shuze Wang, Yunpeng Mei, Hongjie Cao, Yetian Yuan, Gang Wang, Jian Sun, Jie Chen

Imitation learning (IL) has proven effective for enabling robots to acquire visuomotor skills through expert demonstrations.

Imitation Learning

Fast and Physically-based Neural Explicit Surface for Relightable Human Avatars

no code implementations24 Mar 2025 Jiacheng Wu, Ruiqi Zhang, Jie Chen, HUI ZHANG

Efficiently modeling relightable human avatars from sparse-view videos is crucial for AR/VR applications.

Disentanglement

Model Risk Management for Generative AI In Financial Institutions

no code implementations19 Mar 2025 Anwesha Bhattacharyya, Ye Yu, Hanyu Yang, Rahul Singh, Tarun Joshi, Jie Chen, Kiran Yalavarthy

The success of OpenAI's ChatGPT in 2023 has spurred financial enterprises into exploring Generative AI applications to reduce costs or drive revenue within different lines of businesses in the Financial Industry.

Management

Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy

1 code implementation16 Mar 2025 Jian-Ping Mei, Weibin Zhang, Jie Chen, Xuyun Zhang, Tiantian Zhu

Malicious users attempt to replicate commercial models functionally at low cost by training a clone model with query responses.

Image Classification

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

1 code implementation7 Mar 2025 Huatong Song, Jinhao Jiang, Yingqian Min, Jie Chen, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Ji-Rong Wen

To address this, we propose \textbf{R1-Searcher}, a novel two-stage outcome-based RL approach designed to enhance the search capabilities of LLMs.

RAG Reinforcement Learning (RL)

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

1 code implementation6 Mar 2025 Zhipeng Chen, Yingqian Min, Beichen Zhang, Jie Chen, Jinhao Jiang, Daixuan Cheng, Wayne Xin Zhao, Zheng Liu, Xu Miao, Yang Lu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen

This approach achieves a remarkable accuracy of 86. 67% with greedy search on AIME 2024, underscoring its effectiveness in enhancing model capabilities.

Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence

no code implementations24 Feb 2025 Bolin Chen, Hanwei Zhu, Shanzhi Yin, Lingyu Zhu, Jie Chen, Ru-Ling Liao, Shiqi Wang, Yan Ye

The novel PGen framework leverages scalable representation and layered reconstruction for Generative Face Video Compression (GFVC), in an attempt to imbue the bitstream with intelligence in different granularity.

Video Compression

SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion

no code implementations9 Feb 2025 Qingsong Yan, Qiang Wang, Kaiyong Zhao, Jie Chen, Bo Li, Xiaowen Chu, Fei Deng

Specifically, SphereFusion initially employs 2D image convolution and mesh operations to extract two distinct types of features from the panorama image in both equirectangular and spherical projection domains.

Autonomous Driving Depth Estimation

Every Angle Is Worth A Second Glance: Mining Kinematic Skeletal Structures from Multi-view Joint Cloud

no code implementations5 Feb 2025 Junkun Jiang, Jie Chen, Ho Yin Au, Mingyuan Chen, Wei Xue, Yike Guo

Multi-person motion capture over sparse angular observations is a challenging problem under interference from both self- and mutual-occlusions.

All

Teaching Language Models to Critique via Reinforcement Learning

no code implementations5 Feb 2025 Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong

Teaching large language models (LLMs) to critique and refine their outputs is crucial for building systems that can iteratively improve, yet it is fundamentally limited by the ability to provide accurate judgments and actionable suggestions.

Code Generation reinforcement-learning +1

Hybrid Two-Stage Reconstruction of Multiscale Subsurface Flow with Physics-informed Residual Connected Neural Operator

no code implementations22 Jan 2025 Peiqi Li, Jie Chen

In the first stage, a data-driven model is used to reconstruct the multiscale basis function based on the permeability field to achieve effective dimensionality reduction while preserving the necessary multiscale features.

Dimensionality Reduction

Hierarchical Banzhaf Interaction for General Video-Language Representation Learning

1 code implementation30 Dec 2024 Peng Jin, Hao Li, Li Yuan, Shuicheng Yan, Jie Chen

As an important subfield, video-language representation learning focuses on learning representations using global semantic interactions between pre-defined video-text pairs.

Contrastive Learning Question Answering +4

Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning

no code implementations27 Dec 2024 Xuan Zhou, Xiang Shi, LeLe Zhang, Chen Chen, Hongbo Li, Lin Ma, Fang Deng, Jie Chen

Also, our planner can successfully scale up to hyper scale MRTP instances in RMFS with up to 200 robots and 1000 retrieval racks on unlearned maps while keeping superior performance over other methods.

counterfactual Hierarchical Reinforcement Learning +4

YuLan-Mini: An Open Data-efficient Language Model

2 code implementations23 Dec 2024 Yiwen Hu, Huatong Song, Jia Deng, Jiapeng Wang, Jie Chen, Kun Zhou, Yutao Zhu, Jinhao Jiang, Zican Dong, Wayne Xin Zhao, Ji-Rong Wen

Effective pre-training of large language models (LLMs) has been challenging due to the immense resource demands and the complexity of the technical processes involved.

Language Modeling Language Modelling +1

Seed-CTS: Unleashing the Power of Tree Search for Superior Performance in Competitive Coding Tasks

no code implementations17 Dec 2024 Hao Wang, Boyi Liu, Yufeng Zhang, Jie Chen

Leveraging Qwen2. 5-Coder-32B-Instruct, our approach achieves a pass rate of 0. 305 on LiveCodeBench-Hard, surpassing the pass@100 performance of GPT4o-0513 (0. 245).

Code Generation

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

3 code implementations12 Dec 2024 Yingqian Min, Zhipeng Chen, Jinhao Jiang, Jie Chen, Jia Deng, Yiwen Hu, Yiru Tang, Jiapeng Wang, Xiaoxue Cheng, Huatong Song, Wayne Xin Zhao, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

We introduce an ``imitate, explore, and self-improve'' framework, denoted as \textbf{STILL-2}, as our primary technical approach to train the reasoning model.

Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds

no code implementations11 Dec 2024 Shuhai Zhang, Jiahao Yang, Hui Luo, Jie Chen, Li Wang, Feng Liu, Bo Han, Mingkui Tan

Leveraging this insight, we propose Consistency Model-based Adversarial Purification (CMAP), which optimizes vectors within the latent space of a pre-trained consistency model to generate samples for restoring clean data.

Adversarial Purification

Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning

1 code implementation5 Dec 2024 Shicheng Zhou, Jingju Liu, Yuliang Lu, Jiahai Yang, Yue Zhang, Jie Chen

GAP introduces a Real-to-Sim-to-Real pipeline that (a) enables end-to-end policy learning in unknown real environments while constructing realistic simulations; (b) improves agents' generalization ability by leveraging domain randomization and meta-RL learning. Specially, we are among the first to apply domain randomization in autonomous pentesting and propose a large language model-powered domain randomization method for synthetic environment generation.

Large Language Model Meta Reinforcement Learning +1

RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians

1 code implementation3 Dec 2024 Qiankun Gao, Yanmin Wu, Chengxiang Wen, Jiarui Meng, Luyang Tang, Jie Chen, Ronggang Wang, Jian Zhang

Finally, we jointly learn the scene's temporal motion and refine the canonical Gaussians learned from the first two stages.

3DGS

Conditional Distribution Learning on Graphs

1 code implementation20 Nov 2024 Jie Chen, Hua Mao, Yuanbiao Gou, Zhu Wang, Xi Peng

To avoid the conflict between the MPM and the CL of negative pairs, positive pairs of node representations are retained for measuring the similarity between the original features and the corresponding weakly augmented features.

Contrastive Learning Data Augmentation +3

Adversarial Diffusion Compression for Real-World Image Super-Resolution

2 code implementations20 Nov 2024 Bin Chen, Gehui Li, Rongyuan Wu, Xindong Zhang, Jie Chen, Jian Zhang, Lei Zhang

Real-world image super-resolution (Real-ISR) aims to reconstruct high-resolution images from low-resolution inputs degraded by complex, unknown processes.

Decoder Denoising +1

HiCoM: Hierarchical Coherent Motion for Streamable Dynamic Scene with 3D Gaussian Splatting

1 code implementation12 Nov 2024 Qiankun Gao, Jiarui Meng, Chengxiang Wen, Jie Chen, Jian Zhang

The online reconstruction of dynamic scenes from multi-view streaming videos faces significant challenges in training, rendering and storage efficiency.

3DGS

An Efficient Hierarchical Preconditioner-Learner Architecture for Reconstructing Multi-scale Basis Functions of High-dimensional Subsurface Fluid Flow

no code implementations1 Nov 2024 Peiqi Li, Jie Chen

This model offers a novel method for efficient and accurate subsurface fluid flow modeling, with promising potential for more complex real-world applications.

Computational Efficiency

The D-Subspace Algorithm for Online Learning over Distributed Networks

no code implementations26 Oct 2024 Yitong Chen, Danqi Jin, Jie Chen, Cedric Richard

This material introduces the D-Subspace algorithm derived on the basis of the centralized algorithm [1], which originally addresses parameter estimation problems under a subspace constraint.

parameter estimation

Standardizing Generative Face Video Compression using Supplemental Enhancement Information

no code implementations19 Oct 2024 Bolin Chen, Yan Ye, Jie Chen, Ru-Ling Liao, Shanzhi Yin, Shiqi Wang, Kaifa Yang, Yue Li, Yiling Xu, Ye-kui Wang, Shiv Gehlot, Guan-Ming Su, Peng Yin, Sean McCarthy, Gary J. Sullivan

This paper proposes a Generative Face Video Compression (GFVC) approach using Supplemental Enhancement Information (SEI), where a series of compact spatial and temporal representations of a face video signal (i. e., 2D/3D key-points, facial semantics and compact features) can be coded using SEI message and inserted into the coded video bitstream.

Video Compression

Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens

1 code implementation11 Oct 2024 Bolin Chen, Shanzhi Yin, Zihan Zhang, Jie Chen, Ru-Ling Liao, Lingyu Zhu, Shiqi Wang, Yan Ye

Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities.

Motion Estimation Philosophy +1

Identifying Money Laundering Subgraphs on the Blockchain

1 code implementation10 Oct 2024 Kiwhan Song, Mohamed Ali Dhraief, Muhua Xu, Locke Cai, Xuhao Chen, Arvind, Jie Chen

Furthermore, we demonstrate the effectiveness of RevFilter in discovering new suspicious subgraphs, confirming its utility for practical AML.

Benchmarking

Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion

no code implementations8 Oct 2024 Yice Cao, ChenChen Liu, Zhenhua Wu, Wenxin Yao, Liu Xiong, Jie Chen, Zhixiang Huang

This method designs a cross-scanning visual state space block (CVSSBlock) that uses cross 2D scanning (CS2D) to fully capture global information from multiple directions, while by incorporating convolutional neural network branches to overcome the constraints of Vision Mamba (VMamba) in acquiring local information, this approach facilitates a comprehensive analysis of both global and local features.

Image Segmentation Mamba +2

Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning

1 code implementation5 Oct 2024 Gang Liu, Michael Sun, Wojciech Matusik, Meng Jiang, Jie Chen

While large language models (LLMs) have integrated images, adapting them to graphs remains challenging, limiting their applications in materials and drug design.

Benchmarking Drug Design +2

Tannenbaum's gain-margin optimization meets Polyak's heavy-ball algorithm

no code implementations30 Sep 2024 Wuwei Wu, Jie Chen, Mihailo R. Jovanović, Tryphon T. Georgiou

The link between first-order optimization methods and robust control theory sheds new light into limits of algorithmic performance for such methods, and suggests a new framework where similar computational problems can be systematically studied and algorithms optimized.

Unrolling Plug-and-Play Network for Hyperspectral Unmixing

no code implementations7 Sep 2024 Min Zhao, Linruize Tang, Jie Chen

The carefully designed unfolding deep architecture is used to learn the spectral and spatial information from the hyperspectral image, which we refer to as inner priors.

Hyperspectral Unmixing

Hierarchical Sparse Representation Clustering for High-Dimensional Data Streams

1 code implementation7 Sep 2024 Jie Chen, Hua Mao, Yuanbiao Gou, Xi Peng

Second, these algorithms are highly sensitive to the noise contained in high-dimensional data streams.

Clustering

Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling

no code implementations30 Aug 2024 Guangya Wan, Yuqi Wu, Jie Chen, Sheng Li

Self-Consistency (SC) is a widely used method to mitigate hallucinations in Large Language Models (LLMs) by sampling the LLM multiple times and outputting the most frequent solution.

Do Graph Neural Networks Work for High Entropy Alloys?

1 code implementation29 Aug 2024 Hengrui Zhang, Ruishu Huang, Jie Chen, James M. Rondinelli, Wei Chen

Graph neural networks (GNNs) have excelled in predictive modeling for both crystals and molecules, owing to the expressiveness of graph representations.

Property Prediction

PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification

no code implementations29 Aug 2024 Lei Tan, Pingyang Dai, Jie Chen, Liujuan Cao, Yongjian Wu, Rongrong Ji

Extracting robust feature representation is critical for object re-identification to accurately identify objects across non-overlapping cameras.

Diversity Object +1

CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction

no code implementations25 Aug 2024 Guangya Wan, Yuqi Wu, Jie Chen, Sheng Li

We propose the CoT Rerailer to address these challenges, employing self-consistency and multi-agent debate systems to identify and rectify errors in the reasoning process.

Decision Making Question Answering

Procedural Synthesis of Synthesizable Molecules

1 code implementation24 Aug 2024 Michael Sun, Alston Lo, Minghao Guo, Jie Chen, Connor Coley, Wojciech Matusik

Drawing inspiration from syntax-guided synthesis approaches, we decouple the syntactic skeleton from the semantics of a synthetic tree to create a bilevel framework for reasoning about the combinatorial space of synthesis pathways.

Evolutionary Algorithms Program Synthesis

Towards Effective and Efficient Continual Pre-training of Large Language Models

no code implementations26 Jul 2024 Jie Chen, Zhipeng Chen, Jiapeng Wang, Kun Zhou, Yutao Zhu, Jinhao Jiang, Yingqian Min, Wayne Xin Zhao, Zhicheng Dou, Jiaxin Mao, Yankai Lin, Ruihua Song, Jun Xu, Xu Chen, Rui Yan, Zhewei Wei, Di Hu, Wenbing Huang, Ji-Rong Wen

To make the CPT approach more traceable, this paper presents a technical report for continually pre-training Llama-3 (8B), which significantly enhances the Chinese language ability and scientific reasoning ability of the backbone model.

Math

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

1 code implementation18 Jul 2024 Shoumeng Qiu, Jie Chen, Xinrun Li, Ru Wan, xiangyang xue, Jian Pu

Specifically, for the teacher model training, we propose to noise the label and then incorporate it into input to effectively boost the lightweight teacher performance.

Knowledge Distillation Semantic Segmentation

Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs

no code implementations15 Jul 2024 Rong Ma, Jie Chen, xiangyang xue, Jian Pu

This enables semantic segmentation models to be trained simultaneously on multiple datasets, resulting in performance improvements.

Semantic Segmentation

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

no code implementations15 Jul 2024 Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen

Specifically, we provide an automated method for reference local action sampling and leverage graph attention networks to assess the guiding weight of each local action in the overall motion synthesis.

Graph Attention Motion Generation +1

LLMBox: A Comprehensive Library for Large Language Models

1 code implementation8 Jul 2024 Tianyi Tang, Yiwen Hu, Bingqian Li, Wenyang Luo, Zijing Qin, Haoxiang Sun, Jiapeng Wang, Shiyi Xu, Xiaoxue Cheng, Geyang Guo, Han Peng, Bowen Zheng, Yiru Tang, Yingqian Min, Yushuo Chen, Jie Chen, Yuanqian Zhao, Luran Ding, Yuhao Wang, Zican Dong, Chunxuan Xia, Junyi Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen

To facilitate the research on large language models (LLMs), this paper presents a comprehensive and unified library, LLMBox, to ease the development, use, and evaluation of LLMs.

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

no code implementations18 Jun 2024 Jie Chen, Yupeng Zhang, Bingning Wang, Wayne Xin Zhao, Ji-Rong Wen, WeiPeng Chen

Synthetic data has been proposed as a solution to address the issue of high-quality data scarcity in the training of large language models (LLMs).

Instruction Following

Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models

no code implementations14 Jun 2024 Jie Chen, Xintian Han, Yu Ma, Xun Zhou, Liang Xiang

A large number of work has been conducted to improve the model's performance on code-related benchmarks with either modifications to the algorithm or refinement of the dataset.

Code Generation Reinforcement Learning (RL)

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement

no code implementations12 Jun 2024 Runyi Yu, Tianyu He, Ailing Zhang, Yuchi Wang, Junliang Guo, Xu Tan, Chang Liu, Jie Chen, Jiang Bian

Instead, we propose to disentangle the motion and appearance, and then generate them one by one with a speech-to-motion diffusion model and a motion-conditioned appearance generation model.

Disentanglement Motion Generation

Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

no code implementations4 Jun 2024 Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen

In this paper, through a newly proposed arithmetical puzzle problem, we show that the model can perform well on multi-step reasoning tasks via fine-tuning on high-quality synthetic data.

Mathematical Reasoning Text Generation

Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems

1 code implementation2 Jun 2024 Jie Chen

Empirical evaluation on over 800 matrices suggests that the construction time of these graph neural preconditioners (GNPs) is more predictable and can be much shorter than that of other widely used ones, such as ILU and AMG, while the execution time is faster than using a Krylov method as the preconditioner, such as in inner-outer GMRES.

Learning-Based Intermittent CSI Estimation with Adaptive Intervals in Integrated Sensing and Communication Systems

no code implementations23 May 2024 Jie Chen, Xianbin Wang

Due to the distinct objectives and multipath utilization mechanisms between the communication module and radar module, the system design of integrated sensing and communication (ISAC) necessitates two types of channel state information (CSI), i. e., communication CSI representing the whole channel gain and phase shifts, and radar CSI exclusively focused on target mobility and position information.

Integrated sensing and communication ISAC

HARIS: Human-Like Attention for Reference Image Segmentation

no code implementations17 May 2024 Mengxi Zhang, Heqing Lian, Yiming Liu, Jie Chen

In this paper, we propose a referring image segmentation method called HARIS, which introduces the Human-Like Attention mechanism and uses the parameter-efficient fine-tuning (PEFT) framework.

Image Segmentation parameter-efficient fine-tuning +2

GraCo: Granularity-Controllable Interactive Segmentation

1 code implementation CVPR 2024 Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen

In this work, we introduce Granularity-Controllable Interactive Segmentation (GraCo), a novel approach that allows precise control of prediction granularity by introducing additional parameters to input.

Interactive Segmentation Segmentation

The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset

2 code implementations29 Apr 2024 Claudio Bellei, Muhua Xu, Ross Phillips, Tom Robinson, Mark Weber, Tim Kaler, Charles E. Leiserson, Arvind, Jie Chen

We posit that certain domain applications, such as anti-money laundering (AML), are inherently subgraph problems and mainstream graph techniques have been operating at a suboptimal level of abstraction.

Representation Learning

Generation of Uncorrelated Residual Variables for Chemical Process Fault Diagnosis via Transfer Learning-based Input-Output Decoupled Network

no code implementations29 Apr 2024 Zhuofu Pan, Qingkai Sui, Yalin Wang, Jiang Luo, Jie Chen, Hongtian Chen

However, traditional methods exhibit limited effectiveness in modeling high-dimensional nonlinearity and big data, and the decoupling idea has not been well-valued in data-driven frameworks.

Chemical Process Diagnostic +4

Adaptive Catalyst Discovery Using Multicriteria Bayesian Optimization with Representation Learning

1 code implementation18 Apr 2024 Jie Chen, Pengfei Ou, Yuxin Chang, Hengrui Zhang, Xiao-Yan Li, Edward H. Sargent, Wei Chen

The results demonstrate that our approach achieves high prediction accuracy, facilitates interpretable feature extraction, and enables multicriteria design optimization, leading to significant reduction of computing power and time (10x reduction of required DFT calculations) in high-performance catalyst discovery.

Bayesian Optimization Representation Learning +1

ParCo: Part-Coordinating Text-to-Motion Synthesis

1 code implementation27 Mar 2024 Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji

However, these methods encounter challenges such as the lack of coordination between different part motions and difficulties for networks to understand part concepts.

Motion Synthesis

Invertible Diffusion Models for Compressed Sensing

1 code implementation25 Mar 2024 Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

To enable such memory-intensive end-to-end fine-tuning, we propose a novel two-level invertible design to transform both (1) multi-step sampling process and (2) noise estimation U-Net in each step into invertible networks.

compressed sensing Image Compressed Sensing +2

Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction

no code implementations14 Mar 2024 Yuan Fang, Yipeng Liu, Jie Chen, Zhen Long, Ao Li, Chong-Yung Chi, Ce Zhu

In recent years, the fusion of high spatial resolution multispectral image (HR-MSI) and low spatial resolution hyperspectral image (LR-HSI) has been recognized as an effective method for HSI super-resolution (HSI-SR).

Exposure Correction Hyperspectral Image Super-Resolution +1

Representing Molecules as Random Walks Over Interpretable Grammars

no code implementations13 Mar 2024 Michael Sun, Minghao Guo, Weize Yuan, Veronika Thost, Crystal Elaine Owens, Aristotle Franklin Grosz, Sharvaa Selvan, Katelyn Zhou, Hassan Mohiuddin, Benjamin J Pedretti, Zachary P Smith, Jie Chen, Wojciech Matusik

Recent research in molecular discovery has primarily been devoted to small, drug-like molecules, leaving many similarly important applications in material design without adequate technology.

Property Prediction

Inverse Optimal Control for Linear Quadratic Tracking with Unknown Target States

no code implementations27 Feb 2024 Yao Li, Chengpu Yu, Hao Fang, Jie Chen

A computationally efficient and numerically reliable parameter identification algorithm is proposed by equating optimal control strategies with a system of linear equations, and the associated relative error upper bound is derived in terms of data volume and signal-to-noise ratio.

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints

1 code implementation12 Feb 2024 Yunsheng Tian, Ane Zuniga, Xinwei Zhang, Johannes P. Dürholt, Payel Das, Jie Chen, Wojciech Matusik, Mina Konaković Luković

In this paper, we observe that in such scenarios optimal solution typically lies on the boundary between feasible and infeasible regions of the design space, making it considerably more difficult than that with interior optima.

Bayesian Optimization Gaussian Processes

Consistency Enhancement-Based Deep Multiview Clustering via Contrastive Learning

no code implementations23 Jan 2024 Hao Yang, Hua Mao, Wai Lok Woo, Jie Chen, Xi Peng

Furthermore, the representation process for clustering is enhanced through spectral clustering, and the consistency across multiple views is improved.

Clustering Contrastive Learning +2

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

no code implementations19 Jan 2024 Mayank Agarwal, Yikang Shen, Bailin Wang, Yoon Kim, Jie Chen

In this work, we explore data-efficient adaptation of pre-trained code models by further pre-training and fine-tuning them with program structures.

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation

1 code implementation18 Jan 2024 Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Chang Liu, Xiawu Zheng, Rongrong Ji, Jie Chen

To mold instance queries to follow Brownian bridge and accomplish alignment with class texts, we design Bridge-Text Alignment (BTA) to learn discriminative bridge-level representations of instances via contrastive objectives.

Instance Segmentation Semantic Segmentation +1

Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework

1 code implementation18 Jan 2024 Junkun Jiang, Jie Chen

However, there is still ample room for improvement as these methods often overlook the exploration of correlation between the 2D and 3D joint-level features.

Graph Attention Monocular 3D Human Pose Estimation

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

1 code implementation6 Jan 2024 Junyi Li, Jie Chen, Ruiyang Ren, Xiaoxue Cheng, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

To tackle the LLM hallucination, three key questions should be well studied: how to detect hallucinations (detection), why do LLMs hallucinate (source), and what can be done to mitigate them (mitigation).

Hallucination

MolSets: Molecular Graph Deep Sets Learning for Mixture Property Modeling

1 code implementation27 Dec 2023 Hengrui Zhang, Jie Chen, James M. Rondinelli, Wei Chen

This complexity is particularly evident in molecular mixtures, a frequently explored space for materials such as battery electrolytes.

Graph Neural Network mixture property prediction +1

Multi-scale Progressive Feature Embedding for Accurate NIR-to-RGB Spectral Domain Translation

no code implementations26 Dec 2023 Xingxing Yang, Jie Chen, Zaifeng Yang

To address these challenges, we propose to colorize NIR images via a multi-scale progressive feature embedding network (MPFNet), with the guidance of grayscale image colorization.

Colorization Image Colorization +1

Hyperspectral Image Reconstruction via Combinatorial Embedding of Cross-Channel Spatio-Spectral Clues

1 code implementation18 Dec 2023 Xingxing Yang, Jie Chen, Zaifeng Yang

Existing learning-based hyperspectral reconstruction methods show limitations in fully exploiting the information among the hyperspectral bands.

Image Reconstruction

Prospective Role of Foundation Models in Advancing Autonomous Vehicles

no code implementations8 Dec 2023 Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT, Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision.

Autonomous Driving Scene Understanding +1

Distributed Speech Dereverberation Using Weighted Prediction Error

no code implementations5 Dec 2023 Ziye Yang, Mengfei Zhang, Jie Chen

However, in scenarios where microphone nodes are dispersed, the centralized approach of the WPE method requires aggregating all observations for inverse filtering, resulting in a significant computational burden.

Prediction Speech Dereverberation

Cross-View Graph Consistency Learning for Invariant Graph Representations

1 code implementation20 Nov 2023 Jie Chen, Zhiming Li, Hua Mao, Wai Lok Woo, Xi Peng

In this paper, we propose a cross-view graph consistency learning (CGCL) method that learns invariant graph representations for link prediction.

Attribute Data Augmentation +2

Robust Control of Unknown Switched Linear Systems from Noisy Data

no code implementations19 Nov 2023 Wenjie Liu, Yifei Li, Jian Sun, Gang Wang, Jie Chen

This paper investigates the problem of data-driven stabilization for linear discrete-time switched systems with unknown switching dynamics.

Data-driven Control Against False Data Injection Attacks

no code implementations14 Nov 2023 Wenjie Liu, Lidong Li, Jian Sun, Fang Deng, Gang Wang, Jie Chen

To this end, a general FDI attack model is presented, which imposes minimally constraints on the switching frequency of attack channels and the magnitude of attack matrices.

Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations

1 code implementation13 Nov 2023 Zilu Tang, Mayank Agarwal, Alex Shypula, Bailin Wang, Derry Wijaya, Jie Chen, Yoon Kim

This work explores the use of self-generated natural language explanations as an intermediate step for code-to-code translation with language models.

Code Translation Translation

Generative Face Video Coding Techniques and Standardization Efforts: A Review

1 code implementation5 Nov 2023 Bolin Chen, Jie Chen, Shiqi Wang, Yan Ye

Generative Face Video Coding (GFVC) techniques can exploit the compact representation of facial priors and the strong inference capability of deep generative models, achieving high-quality face video communication in ultra-low bandwidth scenarios.

Self-triggered Consensus Control of Multi-agent Systems from Data

no code implementations19 Oct 2023 Yifei Li, Xin Wang, Jian Sun, Gang Wang, Jie Chen

In the presence of external disturbances, a model-based STC scheme is put forth for $\mathcal{H}_{\infty}$-consensus of MASs, serving as a baseline for the data-driven STC.

Changes-Aware Transformer: Learning Generalized Changes Representation

no code implementations24 Sep 2023 Dan Wang, Licheng Jiao, Jie Chen, Shuyuan Yang, Fang Liu

After refinement, the changed pixels in the difference feature space are closer to each other, which facilitates change detection.

Change Detection

Towards Real-World Burst Image Super-Resolution: Benchmark and Method

1 code implementation ICCV 2023 Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin

Despite substantial advances, single-image super-resolution (SISR) is always in a dilemma to reconstruct high-quality images with limited information from one input image, especially in realistic scenarios.

Burst Image Super-Resolution

Monotone Tree-Based GAMI Models by Adapting XGBoost

no code implementations5 Sep 2023 Linwei Hu, Soroush Aramideh, Jie Chen, Vijayan N. Nair

It is straightforward to fit a monotone model to $f(x)$ using the options in XGBoost.

Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction

1 code implementation4 Sep 2023 Minghao Guo, Veronika Thost, Samuel W Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik

Still, these techniques are faced with a common challenge in practice: Labeled data are limited by the cost of manual extraction from literature and laborious experimentation.

Drug Discovery Molecular Property Prediction +1

Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information

no code implementations31 Aug 2023 Jie Chen, Changhe Song, Deyi Tuo, Xixin Wu, Shiyin Kang, Zhiyong Wu, Helen Meng

For text-to-speech (TTS) synthesis, prosodic structure prediction (PSP) plays an important role in producing natural and intelligible speech.

Decoder Multi-Task Learning +2

Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface

no code implementations7 Aug 2023 Ruiqi Zhang, Jie Chen, Qiang Wang

This paper proposes a technique for efficiently modeling dynamic humans by explicifying the implicit neural fields via a Neural Explicit Surface (NES).

Computational Efficiency

Cooperative Colorization: Exploring Latent Cross-Domain Priors for NIR Image Spectrum Translation

no code implementations7 Aug 2023 Xingxing Yang, Jie Chen, Zaifeng Yang

To address these challenges, we propose a cooperative learning paradigm that colorizes NIR images in parallel with another proxy grayscale colorization task by exploring latent cross-domain priors (i. e., latent spectrum context priors and task domain priors), dubbed CoColor.

Colorization Image Colorization +1

Data-driven Polytopic Output Synchronization of Heterogeneous Multi-agent Systems from Noisy Data

no code implementations14 Jul 2023 Yifei Li, Wenjie Liu, Jian Sun, Gang Wang, Lihua Xie, Jie Chen

This method utilizes measured data and a noise-matrix polytope to ensure near-optimal output synchronization.

Unsupervised Hyperspectral and Multispectral Images Fusion Based on the Cycle Consistency

no code implementations7 Jul 2023 Shuaikai Shi, Lijun Zhang, Yoann Altmann, Jie Chen

In this paper, we propose an unsupervised HSI and MSI fusion model based on the cycle consistency, called CycFusion.

Hyperspectral and Multispectral Image Fusion Using the Conditional Denoising Diffusion Probabilistic Model

1 code implementation7 Jul 2023 Shuaikai Shi, Lijun Zhang, Jie Chen

Specifically, the DDPM-Fus contains the forward diffusion process which gradually adds Gaussian noise to the high spatial resolution HSI (HrHSI) and another reverse denoising process which learns to predict the desired HrHSI from its noisy version conditioning on the corresponding high spatial resolution MSI (HrMSI) and low spatial resolution HSI (LrHSI).

Denoising

Real-time Workload Pattern Analysis for Large-scale Cloud Databases

no code implementations5 Jul 2023 Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao

This has led to the increasing volume of database workloads, which provides the opportunity for pattern analysis.

AE-RED: A Hyperspectral Unmixing Framework Powered by Deep Autoencoder and Regularization by Denoising

no code implementations1 Jul 2023 Min Zhao, Jie Chen, Nicolas Dobigeon

In this way, both the characteristics of the deep autoencoder based unmixing methods and priors provided by denoisers are merged into our well-designed framework to enhance the unmixing performance.

Denoising Hyperspectral Unmixing

Guided Deep Generative Model-based Spatial Regularization for Multiband Imaging Inverse Problems

no code implementations29 Jun 2023 Min Zhao, Nicolas Dobigeon, Jie Chen

More precisely, the regularization is conceived as a deep generative network able to encode spatial semantic features contained in this auxiliary image of high spatial resolution.

Image Inpainting

A Gromov--Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening

1 code implementation15 Jun 2023 Yifan Chen, Rentian Yao, Yun Yang, Jie Chen

The study includes a set of experiments to support the theory and method, including approximating the GW distance, preserving the graph spectrum, classifying graphs using spectral information, and performing regression using graph convolutional networks.

Graph Classification regression

Federated Learning of Models Pre-Trained on Different Features with Consensus Graphs

1 code implementation2 Jun 2023 Tengfei Ma, Trong Nghia Hoang, Jie Chen

Second, we need to learn a consensus graph that captures the high-order interactions between local feature spaces and how to combine them to achieve a better prediction.

Federated Learning Time Series

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

1 code implementation CVPR 2023 Yu Wang, Pengchong Qiao, Chang Liu, Guoli Song, Xiawu Zheng, Jie Chen

We argue that an overlooked problem of robust SSL is its corrupted information on semantic level, practically limiting the development of the field.

GC-Flow: A Graph-Based Flow Network for Effective Clustering

1 code implementation26 May 2023 Tianchun Wang, Farzaneh Mirzazadeh, Xiang Zhang, Jie Chen

Graph convolutional networks (GCNs) are \emph{discriminative models} that directly model the class posterior $p(y|\mathbf{x})$ for semi-supervised classification of graph data.

Clustering Representation Learning

Interpretable Machine Learning based on Functional ANOVA Framework: Algorithms and Comparisons

no code implementations25 May 2023 Linwei Hu, Vijayan N. Nair, Agus Sudjianto, Aijun Zhang, Jie Chen

To understand and explain the model results, one had to rely on post hoc explainability techniques, which are known to have limitations.

Interpretable Machine Learning

Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

4 code implementations20 May 2023 Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen

In this paper, we propose the Disentangled Conceptualization and Set-to-set Alignment (DiCoSA) to simulate the conceptualizing and reasoning process of human beings.

Retrieval Video Retrieval

Coordinated Transformer with Position \& Sample-aware Central Loss for Anatomical Landmark Detection

no code implementations18 May 2023 Qikui Zhu, Yihui Bi, Danxin Wang, Xiangpeng Chu, Jie Chen, Yanqing Wang

Heatmap-based anatomical landmark detection is still facing two unresolved challenges: 1) inability to accurately evaluate the distribution of heatmap; 2) inability to effectively exploit global spatial structure information.

Anatomical Landmark Detection Position +1

TG-VQA: Ternary Game of Video Question Answering

no code implementations17 May 2023 Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen

Video question answering aims at answering a question about the video content by reasoning the alignment semantics within them.

Contrastive Learning Question Answering +2

Multi-view MERA Subspace Clustering

1 code implementation16 May 2023 Zhen Long, Ce Zhu, Jie Chen, Zihan Li, Yazhou Ren, Yipeng Liu

Benefiting from multiple interactions among orthogonal/semi-orthogonal (low-rank) factors, the low-rank MERA has a strong representation power to capture the complex inter/intra-view information in the self-representation tensor.

Clustering Multi-view Subspace Clustering

Long-lead forecasts of wintertime air stagnation index in southern China using oceanic memory effects

no code implementations16 May 2023 Chenhong Zhou, Xiaorui Zhang, Meng Gao, Shanshan Liu, Yike Guo, Jie Chen

Stagnant weather condition is one of the major contributors to air pollution as it is favorable for the formation and accumulation of pollutants.

Management

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

1 code implementation11 May 2023 Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo

In this paper, we propose a "Co"nsistency "Mo"del-based "Speech" synthesis method, CoMoSpeech, which achieve speech synthesis through a single diffusion sampling step while achieving high audio quality.

Denoising Singing Voice Synthesis +3

Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning

1 code implementation9 May 2023 Runqing Wang, Gang Wang, Jian Sun, Fang Deng, Jie Chen

The complex relationships between operations and machines are represented precisely and concisely, for which a dual-attention network (DAN) comprising several interconnected operation message attention blocks and machine message attention blocks is proposed.

Decision Making Deep Reinforcement Learning +3

Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching

2 code implementations4 May 2023 Tim Kaler, Alexandros-Stavros Iliopoulos, Philip Murzynowski, Tao B. Schardl, Charles E. Leiserson, Jie Chen

To significantly reduce the communication volume without compromising prediction accuracy, we propose a policy for caching data associated with frequently accessed vertices in remote partitions.

Recommendation Systems

Learning Robust Data-based LQG Controllers from Noisy Data

no code implementations2 May 2023 Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

In this work, a data-based formulation for computing the steady-state Kalman gain is proposed based on semi-definite programming (SDP) using some noise-free input-state-output data.

Dynamic Video Frame Interpolation with integrated Difficulty Pre-Assessment

no code implementations25 Apr 2023 Ban Chen, Xin Jin, Youxin Chen, Longhai Wu, Jie Chen, Jayoon Koo, Cheul-hee Hahm

Extensive experiments show that easy samples pass through fast models while difficult samples inference with heavy models, and our proposed pipeline can improve the accuracy-efficiency trade-off for VFI.

Video Frame Interpolation

Deep Multiview Clustering by Contrasting Cluster Assignments

1 code implementation ICCV 2023 Jie Chen, Hua Mao, Wai Lok Woo, Xi Peng

Then, a cluster-level CVCL strategy is presented to explore consistent semantic label information among the multiple views in the fine-tuning stage.

Clustering Contrastive Learning +1

DETRs Beat YOLOs on Real-time Object Detection

7 code implementations CVPR 2024 Yian Zhao, Wenyu Lv, Shangliang Xu, Jinman Wei, Guanzhong Wang, Qingqing Dang, Yi Liu, Jie Chen

Our RT-DETR-R50 / R101 achieves 53. 1% / 54. 3% AP on COCO and 108 / 74 FPS on T4 GPU, outperforming previously advanced YOLOs in both speed and accuracy.

Decoder Object +2

Experts' cognition-driven safe noisy labels learning for precise segmentation of residual tumor in breast cancer

no code implementations13 Apr 2023 Yongquan Yang, Jie Chen, Yani Wei, Mohammad Alobaidi, Hong Bu

Precise segmentation of residual tumor in breast cancer (PSRTBC) after neoadjuvant chemotherapy is a fundamental key technique in the treatment process of breast cancer.

Weakly-supervised Learning

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

4 code implementations CVPR 2023 Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen

Contrastive learning-based video-language representation learning approaches, e. g., CLIP, have achieved outstanding performance, which pursue semantic interaction upon pre-defined video-text pairs.

Contrastive Learning Question Answering +5

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

no code implementations ICCV 2023 Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen

Interactive segmentation enables users to segment as needed by providing cues of objects, which introduces human-computer interaction for many fields, such as image editing and medical image analysis.

Interactive Segmentation Medical Image Analysis

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

4 code implementations ICCV 2023 Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen

Existing text-video retrieval solutions are, in essence, discriminant models focused on maximizing the conditional likelihood, i. e., p(candidates|query).

Retrieval Video Retrieval

GLASU: A Communication-Efficient Algorithm for Federated Learning with Vertically Distributed Graph Data

1 code implementation16 Mar 2023 Xinwei Zhang, Mingyi Hong, Jie Chen

In this paper, we propose a model splitting method that splits a backbone GNN across the clients and the server and a communication-efficient algorithm, GLASU, to train such a model.

Graph Neural Network Vertical Federated Learning

Nonlinear Hyperspectral Unmixing based on Multilinear Mixing Model using Convolutional Autoencoders

no code implementations14 Mar 2023 Tingting Fang, Fei Zhu, Jie Chen

Current deep learning-based nonlinear unmixing focuses on the models in additive, bilinear-based formulations.

Hyperspectral Unmixing

Parallel Vertex Diffusion for Unified Visual Grounding

no code implementations13 Mar 2023 Zesen Cheng, Kehan Li, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen

An intuitive materialization of our paradigm is Parallel Vertex Diffusion (PVD) to directly set vertex coordinates as the generation target and use a diffusion model to train and infer.

Visual Grounding

Mastering Strategy Card Game (Hearthstone) with Improved Techniques

no code implementations9 Mar 2023 Changnan Xiao, Yongxin Zhang, Xuefeng Huang, Qinhan Huang, Jie Chen, Peng Sun

Strategy card game is a well-known genre that is demanding on the intelligent game-play and can be an ideal test-bench for AI.

Decision Making

A Coarse to Fine Framework for Object Detection in High Resolution Image

no code implementations2 Mar 2023 Jinyan Liu, Jie Chen

In this paper, we introduce a simple yet efficient approach that improves accuracy of object detection especially for small objects and large scale variance scene while reducing the computational cost in high resolution image.

Object object-detection +1

Self-triggered Resilient Stabilization of Linear Systems with Quantized Output

no code implementations14 Feb 2023 Wenjie Liu, Masashi Wakaiki, Jian Sun, Gang Wang, Jie Chen

If, in addition, the transmission protocols at the controller-to-actuator (C-A) and sensor-to-controller (S-C) channels can be adapted, the self-triggered control architecture can be considerably simplified, leveraging a delicate observer-based deadbeat controller to eliminate the need for running the controller in parallel at the encoder side.

Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning

1 code implementation12 Feb 2023 Zehao Niu, Mihai Anitescu, Jie Chen

Gaussian processes (GPs) are an attractive class of machine learning models because of their simplicity and flexibility as building blocks of more complex Bayesian models.

Gaussian Processes Graph Neural Network +1

GUAP: Graph Universal Attack Through Adversarial Patching

no code implementations4 Jan 2023 Xiao Zang, Jie Chen, Bo Yuan

Graph neural networks (GNNs) are a class of effective deep learning models for node classification tasks; yet their predictive capability may be severely compromised under adversarially designed unnoticeable perturbations to the graph structure and/or node data.

Graph Attention Node Classification

High-Frequency Stereo Matching Network

no code implementations CVPR 2023 Haoliang Zhao, Huizhou Zhou, Yongjun Zhang, Jie Chen, Yitong Yang, Yong Zhao

In the field of binocular stereo matching, remarkable progress has been made by iterative methods like RAFT-Stereo and CREStereo.

Stereo Matching

The Devil is in the Crack Orientation: A New Perspective for Crack Detection

no code implementations ICCV 2023 Zhuangzhuang Chen, Jin Zhang, Zhuonan Lai, Guanming Zhu, Zun Liu, Jie Chen, Jianqiang Li

However, the vanilla adaptation of the existing oriented object detection methods to the crack detection tasks will result in limited performance, due to the boundary discontinuity issue and the ambiguities in sub-crack orientation.

Crack Segmentation object-detection +2

TopoSeg: Topology-Aware Nuclear Instance Segmentation

no code implementations ICCV 2023 Hongliang He, Jun Wang, Pengxu Wei, Fan Xu, Xiangyang Ji, Chang Liu, Jie Chen

Experiments on three nuclear instance segmentation datasets justify the superiority of TopoSeg, which achieves state-of-the-art performance.

Instance Segmentation Segmentation +1

Position Embedding Needs an Independent Layer Normalization

1 code implementation10 Dec 2022 Runyi Yu, Zhennan Wang, Yinhuai Wang, Kehan Li, Yian Zhao, Jian Zhang, Guoli Song, Jie Chen

By analyzing the input and output of each encoder layer in VTs using reparameterization and visualization, we find that the default PE joining method (simply adding the PE and patch embedding together) operates the same affine transformation to token embedding and PE, which limits the expressiveness of PE and hence constrains the performance of VTs.

Position

Tuning-free Plug-and-Play Hyperspectral Image Deconvolution with Deep Priors

1 code implementation28 Nov 2022 Xiuheng Wang, Jie Chen, Cédric Richard

Deconvolution is a widely used strategy to mitigate the blurring and noisy degradation of hyperspectral images~(HSI) generated by the acquisition devices.

Denoising Image Deconvolution

Learnable Blur Kernel for Single-Image Defocus Deblurring in the Wild

no code implementations25 Nov 2022 Jucai Zhai, Pengcheng Zeng, Chihao Ma, Yong Zhao, Jie Chen

The proposed method consists of a learnable blur kernel to estimate the defocus map, which is an unsupervised method, and a single-image defocus deblurring generative adversarial network (DefocusGAN) for the first time.

Deblurring Generative Adversarial Network +2

Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

4 code implementations21 Nov 2022 Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen

Most video-and-language representation learning approaches employ contrastive learning, e. g., CLIP, to project the video and text features into a common latent space according to the semantic similarities of text-video pairs.

Ranked #2 on Video Retrieval on LSMDC (text-to-video Mean Rank metric)

Contrastive Learning Representation Learning +5

Dual Complementary Dynamic Convolution for Image Recognition

no code implementations11 Nov 2022 Longbin Yan, Yunxiao Qin, Shumin Liu, Jie Chen

As a powerful engine, vanilla convolution has promoted huge breakthroughs in various computer tasks.

Image Classification object-detection +2

Robust Manifold Nonnegative Tucker Factorization for Tensor Data Representation

no code implementations8 Nov 2022 Jianyu Wang, Linruize Tang, Jie Chen, Jingdong Chen

Nonnegative Tucker Factorization (NTF) minimizes the euclidean distance or Kullback-Leibler divergence between the original data and its low-rank approximation which often suffers from grossly corruptions or outliers and the neglect of manifold structures of data.

A Unified Pyramid Recurrent Network for Video Frame Interpolation

1 code implementation CVPR 2023 Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-hee Hahm

Cast in a flexible pyramid framework, UPR-Net exploits lightweight recurrent modules for both bi-directional flow estimation and intermediate frame synthesis.

Optical Flow Estimation Video Frame Interpolation

SA-MLP: Distilling Graph Knowledge from GNNs into Structure-Aware MLP

4 code implementations18 Oct 2022 Jie Chen, Shouzhen Chen, Mingyuan Bai, Junbin Gao, Junping Zhang, Jian Pu

Then, we introduce a novel structure-mixing knowledge distillation strategy to enhance the learning ability of MLPs for structure information.

Knowledge Distillation Node Classification

ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation

no code implementations CVPR 2023 Kehan Li, Zhennan Wang, Zesen Cheng, Runyi Yu, Yian Zhao, Guoli Song, Chang Liu, Li Yuan, Jie Chen

Recently, self-supervised large-scale visual pre-training models have shown great promise in representing pixel-level semantic relationships, significantly promoting the development of unsupervised dense prediction tasks, e. g., unsupervised semantic segmentation (USS).

Image Segmentation Unsupervised Semantic Segmentation

Flexible Alignment Super-Resolution Network for Multi-Contrast MRI

1 code implementation7 Oct 2022 Yiming Liu, Mengxi Zhang, Weiqin Zhang, Bo Jiang, Bo Hou, Dan Liu, Jie Chen, Heqing Lian

To tackle this problem, we propose the Flexible Alignment Super-Resolution Network (FASR-Net) for multi-contrast MRI Super-Resolution.

Super-Resolution

Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering

no code implementations21 Sep 2022 Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen

Under this setting, these 2D spatial reasoning approaches cannot distinguish the fine-grain spatial relations between visual objects and scene texts on the same image plane, thereby impairing the interpretability and performance of TextVQA models.

Image Captioning Optical Character Recognition (OCR) +4

Deep Hyperspectral and Multispectral Image Fusion with Inter-image Variability

1 code implementation24 Aug 2022 Xiuheng Wang, Ricardo Augusto Borsoi, Cédric Richard, Jie Chen

The fusion problem is stated as an optimization problem in the maximum a posteriori framework.

Data-Driven Control of Distributed Event-Triggered Network Systems

no code implementations22 Aug 2022 Xin Wang, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

The present paper deals with data-driven event-triggered control of a class of unknown discrete-time interconnected systems (a. k. a.

Audio Deepfake Attribution: An Initial Dataset and Investigation

no code implementations21 Aug 2022 Xinrui Yan, Jiangyan Yi, JianHua Tao, Jie Chen

To address the challenges of attribution of continuously emerging unknown audio generation tools in the real world, we propose the Class-Representation Multi-Center Learning (CRML) method for open-set audio deepfake attribution (OSADA).

Audio Generation Binary Classification +2

Pathway to Future Symbiotic Creativity

no code implementations18 Aug 2022 Yike Guo, Qifeng Liu, Jie Chen, Wei Xue, Jie Fu, Henrik Jensen, Fernando Rosas, Jeffrey Shaw, Xing Wu, Jiji Zhang, Jianliang Xu

This report presents a comprehensive view of our vision on the development path of the human-machine symbiotic art creation.

Philosophy

Unsupervised domain adaptation semantic segmentation of high-resolution remote sensing imagery with invariant domain-level prototype memory

1 code implementation16 Aug 2022 Jingru Zhu, Ya Guo, Geng Sun, Libo Yang, Min Deng, Jie Chen

This study proposes a novel unsupervised domain adaptation semantic segmentation network (MemoryAdaptNet) for the semantic segmentation of HRS imagery.

Pseudo Label Pseudo Label Filtering +3

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark under Heterogeneous AI Computing Platforms

no code implementations11 Aug 2022 Jia-Xin Zhuang, Xiansong Huang, Yang Yang, Jiancong Chen, Yue Yu, Wei Gao, Ge Li, Jie Chen, Tong Zhang

In this paper, we present OpenMedIA, an open-source toolbox library containing a rich set of deep learning methods for medical image analysis under heterogeneous Artificial Intelligence (AI) computing platforms.

Image Classification Medical Image Analysis +3

Neural Optimization Machine: A Neural Network Approach for Optimization

no code implementations8 Aug 2022 Jie Chen, Yongming Liu

The NN objective function can have arbitrary architectures and activation functions.

Multiobjective Optimization

Event-triggered Consensus Control of Heterogeneous Multi-agent Systems: Model- and Data-based Analysis

no code implementations1 Aug 2022 Xin Wang, Jian Sun, Gang Wang, Jie Chen

This article deals with model- and data-based consensus control of heterogenous leader-following multi-agent systems (MASs) under an event-triggering transmission scheme.

Locality Guidance for Improving Vision Transformers on Tiny Datasets

1 code implementation20 Jul 2022 Kehan Li, Runyi Yu, Zhennan Wang, Li Yuan, Guoli Song, Jie Chen

Therefore, our locality guidance approach is very simple and efficient, and can serve as a basic performance enhancement method for VTs on tiny datasets.

NDF: Neural Deformable Fields for Dynamic Human Modelling

1 code implementation19 Jul 2022 Ruiqi Zhang, Jie Chen

However, the learned canonical representation is static and the current design of the deformation fields is not able to represent large movements or detailed geometry changes.

Data-driven Self-triggered Control via Trajectory Prediction

no code implementations18 Jul 2022 Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity.

Model Predictive Control Prediction +1

A Survey of Decision Making in Adversarial Games

no code implementations16 Jul 2022 Xiuxian Li, Min Meng, Yiguang Hong, Jie Chen

Game theory has by now found numerous applications in various fields, including economics, industry, jurisprudence, and artificial intelligence, where each player only cares about its own interest in a noncooperative or cooperative manner, but without obvious malice to other players.

Decision Making Jurisprudence +1

A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion

1 code implementation15 Jul 2022 Junkun Jiang, Jie Chen, Yike Guo

In order to demonstrate the proposed model's capability in dealing with severe data loss scenarios, we contribute a high-accuracy and challenging motion capture dataset of multi-person interactions with severe occlusion.

Using Model-Based Trees with Boosting to Fit Low-Order Functional ANOVA Models

no code implementations14 Jul 2022 Linwei Hu, Jie Chen, Vijayan N. Nair

We propose a new algorithm, called GAMI-Tree, that is similar to EBM, but has a number of features that lead to better performance.

BIG-bench Machine Learning Interpretable Machine Learning

Adaptive Random Fourier Features Kernel LMS

no code implementations14 Jul 2022 Wei Gao, Jie Chen, Cédric Richard, Wentao Shi, Qunfei Zhang

We propose the adaptive random Fourier features Gaussian kernel LMS (ARFF-GKLMS).

Shapley Computations Using Surrogate Model-Based Trees

no code implementations11 Jul 2022 Zhipu Zhou, Jie Chen, Linwei Hu

Shapley-related techniques have gained attention as both global and local interpretation tools because of their desirable properties.

model

$L_2$BN: Enhancing Batch Normalization by Equalizing the $L_2$ Norms of Features

no code implementations6 Jul 2022 Zhennan Wang, Kehan Li, Runyi Yu, Yian Zhao, Pengchong Qiao, Chang Liu, Fan Xu, Xiangyang Ji, Guoli Song, Jie Chen

In this paper, we analyze batch normalization from the perspective of discriminability and find the disadvantages ignored by previous studies: the difference in $l_2$ norms of sample features can hinder batch normalization from obtaining more distinguished inter-class features and more compact intra-class features.

Acoustic Scene Classification Image Classification +1

Bridging Mean-Field Games and Normalizing Flows with Trajectory Regularization

no code implementations30 Jun 2022 Han Huang, Jiajia Yu, Jie Chen, Rongjie Lai

In this work, we unravel the connections between MFGs and NFs by contextualizing the training of an NF as solving the MFG.

Integration of Physics-Based and Data-Driven Models for Hyperspectral Image Unmixing

1 code implementation11 Jun 2022 Jie Chen, Min Zhao, Xiuheng Wang, Cédric Richard, Susanto Rahardja

Spectral unmixing is one of the most important quantitative analysis tasks in hyperspectral data processing.

Hyperspectral Unmixing

Joint learning of object graph and relation graph for visual question answering

no code implementations9 May 2022 Hao Li, Xu Li, Belhal Karimi, Jie Chen, Mingming Sun

Modeling visual question answering(VQA) through scene graphs can significantly improve the reasoning accuracy and interpretability.

Attribute Graph Neural Network +3

Performance and Interpretability Comparisons of Supervised Machine Learning Algorithms: An Empirical Study

no code implementations27 Apr 2022 Alice J. Liu, Arpita Mukherjee, Linwei Hu, Jie Chen, Vijayan N. Nair

Overall, XGB and FFNNs were competitive, with FFNNs showing better performance in smooth models and tree-based boosting algorithms performing better in non-smooth models.

BIG-bench Machine Learning

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

no code implementations CVPR 2022 Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang

Visual appearance is considered to be the most important cue to understand images for cross-modal retrieval, while sometimes the scene text appearing in images can provide valuable information to understand the visual semantics.

Ranked #10 on Cross-Modal Retrieval on Flickr30k (using extra training data)

Contrastive Learning Cross-Modal Retrieval +1

Training-free Transformer Architecture Search

1 code implementation CVPR 2022 Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji

Recently, Vision Transformer (ViT) has achieved remarkable success in several computer vision tasks.

Diversity

Exploiting Neighbor Effect: Conv-Agnostic GNNs Framework for Graphs with Heterophily

1 code implementation19 Mar 2022 Jie Chen, Shouzhen Chen, Junbin Gao, Zengfeng Huang, Junping Zhang, Jian Pu

Moreover, we propose a simple yet effective Conv-Agnostic GNN framework (CAGNNs) to enhance the performance of most GNNs on heterophily datasets by learning the neighbor effect for each node.

Node Classification

Data-Efficient Graph Grammar Learning for Molecular Generation

1 code implementation ICLR 2022 Minghao Guo, Veronika Thost, Beichen Li, Payel Das, Jie Chen, Wojciech Matusik

This is a non-trivial task for neural network-based generative models since the relevant chemical knowledge can only be extracted and generalized from the limited training data.

Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning

no code implementations25 Feb 2022 Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao

To deploy SSDR-AL in a more practical scenario, we design a noise-aware iterative labeling strategy to confront the "noisy annotation" problem introduced by the previous "dominant labeling" strategy in superpoints.

Active Learning Diversity +1

Model-Based and Data-Driven Control of Event- and Self-Triggered Discrete-Time LTI Systems

no code implementations16 Feb 2022 Xin Wang, Julian Berberich, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

To this end, we begin by presenting a dynamic event-triggering scheme (ETS) based on periodic sampling, and a discrete-time looped-functional approach, through which a model-based stability condition is derived.

STS

Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series

4 code implementations ICLR 2022 Enyan Dai, Jie Chen

Anomaly detection is a widely studied task for a broad variety of data types; among them, multiple time series appear frequently in applications, including for example, power grids and traffic networks.

Density Estimation Time Series +2

Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

no code implementations9 Feb 2022 Jie Chen, Chang Liu, Jiawu Xie, Jie An, Nan Huang

In particular, this method breaks through the limitations of the existing methods, not only achieves good results in multivariate separation, but also effectively separates signals when mixed with 40dB Gaussian noise signals.

Temporal Sequences

Memory-based Message Passing: Decoupling the Message for Propogation from Discrimination

1 code implementation1 Feb 2022 Jie Chen, Weiqi Liu, Jian Pu

Based on the homophily assumption, the current message passing always aggregates features of connected nodes, such as the graph Laplacian smoothing process.

Graph Representation Learning

Hyperspectral Image Super-resolution with Deep Priors and Degradation Model Inversion

1 code implementation24 Jan 2022 Xiuheng Wang, Jie Chen, Cédric Richard

To overcome inherent hardware limitations of hyperspectral imaging systems with respect to their spatial resolution, fusion-based hyperspectral image (HSI) super-resolution is attracting increasing attention.

Hyperspectral Image Super-Resolution Image Super-Resolution

Geometry-Aware Guided Loss for Deep Crack Recognition

no code implementations CVPR 2022 Zhuangzhuang Chen, Jin Zhang, Zhuonan Lai, Jie Chen, Zun Liu, Jianqiang Li

Despite the substantial progress of deep models for crack recognition, due to the inconsistent cracks in varying sizes, shapes, and noisy background textures, there still lacks the discriminative power of the deeply learned features when supervised by the cross-entropy loss.

Robust Recommendation with Implicit Feedback via Eliminating the Effects of Unexpected Behaviors

1 code implementation21 Dec 2021 Jie Chen, Lifen Jiang, Chunmei Ma, Huazhi Sun

In this paper, we propose a Multi-Preferences Model (MPM) to eliminate the effects of unexpected behaviors.

Recommendation Systems

Mean-Square Stability and Stabilizability Analyses of LTI Systems Under Spatially Correlated Multiplicative Perturbations

no code implementations10 Dec 2021 Jianqi Chen, Tian Qi, Jie Chen

In this paper, we investigate the mean-square stability and stabilizability problems for linear time-invariant systems under stochastic spatially correlated multiplicative uncertainties.

Distributed Policy Gradient with Variance Reduction in Multi-Agent Reinforcement Learning

no code implementations25 Nov 2021 Xiaoxiao Zhao, Jinlong Lei, Li Li, Jie Chen

This paper studies a distributed policy gradient in collaborative multi-agent reinforcement learning (MARL), where agents over a communication network aim to find the optimal policy to maximize the average of all agents' local returns.

Multi-agent Reinforcement Learning reinforcement-learning +2

Learning Representation for Clustering via Prototype Scattering and Positive Sampling

1 code implementation23 Nov 2021 Zhizhong Huang, Jie Chen, Junping Zhang, Hongming Shan

The strengths of ProPos are avoidable class collision issue, uniform representations, well-separated clusters, and within-cluster compactness.

Clustering Contrastive Learning +3

Traversing the Local Polytopes of ReLU Neural Networks

no code implementations AAAI Workshop AdvML 2022 Shaojie Xu, Joel Vaughan, Jie Chen, Aijun Zhang, Agus Sudjianto

Our polytope traversing algorithm can be adapted to a wide range of applications related to robustness and interpretability.

Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning

no code implementations20 Nov 2021 Wenpeng Xing, Jie Chen

One of the seminal image-based rendering method, the multi-plane image (MPI), produces high novel-view synthesis quality for static scenes.

Novel View Synthesis

Traversing the Local Polytopes of ReLU Neural Networks: A Unified Approach for Network Verification

no code implementations17 Nov 2021 Shaojie Xu, Joel Vaughan, Jie Chen, Aijun Zhang, Agus Sudjianto

Although neural networks (NNs) with ReLU activation functions have found success in a wide range of applications, their adoption in risk-sensitive settings has been limited by the concerns on robustness and interpretability.

Cannot find the paper you are looking for? You can Submit a new open access paper.