Search Results for author: Jun Chen

Found 106 papers, 37 papers with code

Manifold Projection for Adversarial Defense on Face Recognition

no code implementations ECCV 2020 Jianli Zhou, Chao Liang, Jun Chen

We use a variational autoencoder (VAE) to estimate a lower bound on the log-likelihood of an image and explore projecting input images back into the high-probability regions of the image manifold.

Adversarial Defense Face Recognition

GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models

no code implementations 10 Apr 2024 Zewei Zhang, Huan Liu, Jun Chen, Xiangyu Xu

In this paper, we introduce GoodDrag, a novel approach to improve the stability and image quality of drag editing.

Benchmarking Denoising

Output-Constrained Lossy Source Coding With Application to Rate-Distortion-Perception Theory

no code implementations 21 Mar 2024 Li Xie, Liangyan Li, Jun Chen, Zhongshan Zhang

The distortion-rate function of output-constrained lossy source coding with limited common randomness is analyzed for the special case of squared error distortion measure.

AutoDFP: Automatic Data-Free Pruning via Channel Similarity Reconstruction

no code implementations 13 Mar 2024 Siqi Li, Jun Chen, Jingyang Xiang, Chengrui Zhu, Yong Liu

AutoDFP assesses the similarity of channels for each layer and provides this information to the reinforcement learning agent, guiding the pruning and reconstruction process of the network.

Unsupervised Contrastive Learning for Robust RF Device Fingerprinting Under Time-Domain Shift

no code implementations 6 Mar 2024 Jun Chen, Weng-Keen Wong, Bechir Hamdaoui

When applied to RF fingerprinting, our model treats RF signals from the same transmission as positive pairs and those from different transmissions as negative pairs.

Contrastive Learning Self-Supervised Learning

SCNet: Sparse Compression Network for Music Source Separation

no code implementations 24 Jan 2024 Weinan Tong, Jiaxu Zhu, Jun Chen, Shiyin Kang, Tao Jiang, Yang Li, Zhiyong Wu, Helen Meng

We use a higher compression ratio on subbands with less information to improve the information density and focus on modeling subbands with more information.

Music Source Separation

Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

no code implementations 22 Jan 2024 Sadaf Salehkalaibar, Jun Chen, Ashish Khisti, Wei Yu

We derive the RDP function for vector Gaussian sources and propose a waterfilling type solution.

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

no code implementations 22 Jan 2024 Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong Liu

In this paper, we introduce a novel Multimodal, Multi-task CLIP adapting framework named M2-CLIP to address these challenges, preserving both high supervised performance and robust transferability.

Action Recognition Temporal Action Localization

Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation

no code implementations 15 Jan 2024 Zhiwei Lin, Jun Chen, Boshi Tang, Binzhu Sha, Jing Yang, Yaolong Ju, Fan Fan, Shiyin Kang, Zhiyong Wu, Helen Meng

Variational Autoencoders (VAEs) constitute a crucial component of neural symbolic music generation, among which some works have yielded outstanding results and attracted considerable attention.

Music Generation

Learnable Chamfer Distance for Point Cloud Reconstruction

1 code implementation 27 Dec 2023 Tianxin Huang, Qingyao Liu, Xiangrui Zhao, Jun Chen, Yong Liu

As point clouds are 3D signals with permutation invariance, most existing works train their reconstruction networks by measuring shape differences with the average point-to-point distance between point clouds matched with predefined rules.

Point cloud reconstruction

SimCalib: Graph Neural Network Calibration based on Similarity between Nodes

no code implementations 19 Dec 2023 Boshi Tang, Zhiyong Wu, Xixin Wu, Qiaochu Huang, Jun Chen, Shun Lei, Helen Meng

A novel calibration framework, named SimCalib, is accordingly proposed to consider similarity between nodes at global and local levels.

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

no code implementations 18 Dec 2023 Zilin Wang, Haolin Zhuang, Lu Li, Yinmin Zhang, Junjie Zhong, Jun Chen, Yu Yang, Boshi Tang, Zhiyong Wu

This paper presents an Exploratory 3D Dance generation framework, E3D2, designed to address the exploration capability deficiency in existing music-conditioned 3D dance generation models.

CR-SFP: Learning Consistent Representation for Soft Filter Pruning

no code implementations 17 Dec 2023 Jingyang Xiang, Zhuangzhi Chen, Jianbiao Mei, Siqi Li, Jun Chen, Yong Liu

In this paper, we propose to mitigate this gap by learning consistent representation for soft filter pruning, dubbed as CR-SFP.

HB-net: Holistic bursting cell cluster integrated network for occluded multi-objects recognition

1 code implementation 18 Oct 2023 Xudong Gao, Xiao Guang Gao, Jia Rong, Xiaowei Chen, Xiang Liao, Jun Chen

Although in high-noise settings, standard CNNs exhibit slightly greater robustness when compared to HB-net models, the models that combine the HB framework and EA mechanism achieve a comparable level of accuracy and resilience to ResNet50, despite having only three convolutional layers and approximately $1/30$ of the parameters.

Multi-Label Classification

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

1 code implementation 14 Oct 2023 Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu, Pengchuan Zhang, Raghuraman Krishnamoorthi, Vikas Chandra, Yunyang Xiong, Mohamed Elhoseiny

Motivated by this, we aim to build a unified interface for completing many vision-language tasks including image description, visual question answering, and visual grounding, among others.

Language Modelling Large Language Model +4

SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration

1 code implementation 10 Oct 2023 Jingyang Xiang, Siqi Li, Jun Chen, Shipeng Bai, Yukai Ma, Guang Dai, Yong Liu

To overcome them, this paper proposes a novel Soft Uniform Block Pruning (SUBP) approach to train a uniform 1$\times$N sparse structured network from scratch.

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold

no code implementations 21 Aug 2023 Jun Chen, Haishan Ye, Mengmeng Wang, Tianxin Huang, Guang Dai, Ivor W. Tsang, Yong Liu

This paper proposes a decentralized Riemannian conjugate gradient descent (DRCGD) method that aims at minimizing a global function over the Stiefel manifold.

Second-order methods

Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning

no code implementations ICCV 2023 Shipeng Bai, Jun Chen, Xintian Shen, Yixuan Qian, Yong Liu

Therefore, a few data-free methods are proposed to address this problem, but they perform data-free pruning and quantization separately, which does not explore the complementarity of pruning and quantization.

Image Classification Quantization

Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning

no code implementations 2 Jul 2023 Jun Chen, Shipeng Bai, Tianxin Huang, Mengmeng Wang, Guanzhong Tian, Yong Liu

In this paper, we propose a data-free mixed-precision compensation (DF-MPC) method to recover the performance of an ultra-low precision quantized model without any data and fine-tuning process.

Data Free Quantization Model Compression

Impacts of seasonality and parasitism on honey bee population dynamics

no code implementations 23 Jun 2023 Jun Chen, Jordy O Rodriguez Rincon, Gloria DeGrandi-Hoffman, Jennifer Fewell, Jon Harrison, Yun Kang

The honeybee plays an extremely important role in ecosystem stability and diversity and in the production of bee pollinated crops.

MammalNet: A Large-scale Video Benchmark for Mammal Recognition and Behavior Understanding

no code implementations CVPR 2023 Jun Chen, Ming Hu, Darren J. Coker, Michael L. Berumen, Blair Costelloe, Sara Beery, Anna Rohrbach, Mohamed Elhoseiny

Monitoring animal behavior can facilitate conservation efforts by providing key insights into wildlife health, population status, and ecosystem function.

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

no code implementations 1 Jun 2023 Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Mohamed Elhoseiny, Sean Chang Culatana

Although these VL models have acquired extensive knowledge of visual concepts, it is non-trivial to exploit that knowledge for the task of semantic segmentation, as they are usually trained at an image level.

Open Vocabulary Semantic Segmentation Segmentation +3

Learning Global-aware Kernel for Image Harmonization

no code implementations ICCV 2023 Xintian Shen, Jiangning Zhang, Jun Chen, Shipeng Bai, Yue Han, Yabiao Wang, Chengjie Wang, Yong Liu

To address this issue, we propose a novel Global-aware Kernel Network (GKNet) to harmonize local regions with comprehensive consideration of long-distance background references.

Image Harmonization

Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt

1 code implementation 8 May 2023 Han Zhou, Wei Dong, Yangyi Liu, Jun Chen

To tackle these two challenges, we propose a novel two-branch network that leverages the 2D discrete wavelet transform (DWT), a fast Fourier convolution (FFC) residual block and a pretrained ConvNeXt model.

LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition

1 code implementation 8 May 2023 Peng Xia, Di Xu, Lie Ju, Ming Hu, Jun Chen, ZongYuan Ge

The long-tailed multi-label visual recognition (LTML) task is highly challenging due to label co-occurrence and imbalanced data distribution.

 Ranked #1 on Long-tail Learning on COCO-MLT (using extra training data)

Long-tail Learning

SwinFSR: Stereo Image Super-Resolution using SwinIR and Frequency Domain Knowledge

no code implementations 25 Apr 2023 Ke Chen, Liangyan Li, Huan Liu, Yunzhe Li, Congling Tang, Jun Chen

Stereo Image Super-Resolution (stereoSR) has attracted significant attention in recent years due to the extensive deployment of dual cameras in mobile phones, autonomous vehicles and robots.

Autonomous Vehicles Image Restoration +1

MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

5 code implementations 20 Apr 2023 Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny

Our work, for the first time, uncovers that properly aligning the visual features with an advanced large language model can yield numerous advanced multi-modal abilities demonstrated by GPT-4, such as detailed image description generation and website creation from hand-drawn drafts.

Language Modelling Large Language Model +3

LLM as A Robotic Brain: Unifying Egocentric Memory and Control

no code implementations 19 Apr 2023 Jinjie Mai, Jun Chen, Bing Li, Guocheng Qian, Mohamed Elhoseiny, Bernard Ghanem

In this paper, we propose a novel and generalizable framework called LLM-Brain: using a large-scale language model as a robotic brain to unify egocentric memory and control.

Embodied Question Answering Language Modelling +2

A Data-Centric Solution to NonHomogeneous Dehazing via Vision Transformer

1 code implementation 16 Apr 2023 Yangyi Liu, Huan Liu, Liangyan Li, Zijun Wu, Jun Chen

Although it is possible to augment the NH-HAZE23 dataset by leveraging other non-homogeneous dehazing datasets, we observe that it is necessary to design a proper data-preprocessing approach that reduces the distribution gaps between the target dataset and the augmented one.

Image Dehazing

Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions

1 code implementation 9 Apr 2023 Jun Chen, Deyao Zhu, Kilichbek Haydarov, Xiang Li, Mohamed Elhoseiny

Video captioning aims to convey dynamic scenes from videos using natural language, facilitating the understanding of spatiotemporal information within our environment.

Video Captioning

Contrastive Semi-supervised Learning for Underwater Image Restoration via Reliable Bank

1 code implementation CVPR 2023 Shirui Huang, Keyan Wang, Huan Liu, Jun Chen, Yunsong Li

Despite the remarkable achievement of recent underwater image restoration techniques, the lack of labeled data has become a major hurdle for further progress.

NR-IQA Underwater Image Restoration

ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions

1 code implementation 12 Mar 2023 Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny

By continually acquiring new visual information from BLIP-2's answers, ChatCaptioner is able to generate more enriched image descriptions.

Image Captioning Question Answering +1

On the Stability and Generalization of Triplet Learning

no code implementations 20 Feb 2023 Jun Chen, Hong Chen, Xue Jiang, Bin Gu, Weifu Li, Tieliang Gong, Feng Zheng

Triplet learning, i.e., learning from triplet data, has attracted much attention in computer vision tasks with an extremely large number of categories, e.g., face recognition and person re-identification.

Face Recognition Metric Learning +1

Stability-based Generalization Analysis for Mixtures of Pointwise and Pairwise Learning

no code implementations 20 Feb 2023 Jiahuan Wang, Jun Chen, Hong Chen, Bin Gu, Weifu Li, Xin Tang

Recently, some mixture algorithms of pointwise and pairwise learning (PPL) have been formulated by employing the hybrid error metric of "pointwise loss + pairwise loss" and have shown empirical effectiveness on feature selection, ranking and recommendation tasks.

feature selection Generalization Bounds +1

Learning Discretized Neural Networks under Ricci Flow

no code implementations 7 Feb 2023 Jun Chen, Hanwen Chen, Mengmeng Wang, Guang Dai, Ivor W. Tsang, Yong Liu

By introducing a partial differential equation on metrics, i.e., the Ricci flow, we establish the dynamical stability and convergence of the LNE metric with the $L^2$-norm perturbation.

M22: A Communication-Efficient Algorithm for Federated Learning Inspired by Rate-Distortion

no code implementations 23 Jan 2023 Yangyi Liu, Stefano Rini, Sadaf Salehkalaibar, Jun Chen

This paper proposes the "M-magnitude weighted $L_2$ distortion + 2 degrees of freedom" (M22) algorithm, a rate-distortion inspired approach to gradient compression for federated training of deep neural networks (DNNs).

Federated Learning

Towards Grand Unified Representation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation ICCV 2023 Bin Yang, Jun Chen, Mang Ye

The grand unified representation lies in two aspects: 1) GUR adopts a bottom-up domain learning strategy with a cross-memory association embedding module to explore the information of hierarchical domains, i.e., intra-camera, inter-camera, and inter-modality domains, learning a unified and robust representation against hierarchical discrepancy.

Person Re-Identification Representation Learning

Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation ACM MM 2022 Bin Yang, Mang Ye, Jun Chen, Zesen Wu

Visible infrared person re-identification (VI-ReID) aims at searching out the corresponding infrared (visible) images from a gallery set captured by other spectrum cameras.

Contrastive Learning Person Re-Identification

Progressive with Purpose: Guiding Progressive Inpainting DNNs through Context and Structure

no code implementations 21 Sep 2022 Kangdi Shi, Muhammad Alrabeiah, Jun Chen

Stacking GLE modules enables the network to extract image features from different image frequency components.

Benchmarking Image Inpainting

SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud

1 code implementation 3 Aug 2022 Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong Liu

To repetitively extract them as features and perform association between discrete LiDAR frames for registration, we propose the first learning-based feature segmentation and description model for 3D lines in LiDAR point cloud.

Point Cloud Registration Segmentation

Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction

1 code implementation 1 Jun 2022 Jun Chen, Ming Hu, Boyang Li, Mohamed Elhoseiny

After finetuning the pretrained LoMaR on 384$\times$384 images, it can reach 85.4% top-1 accuracy, surpassing MAE by 0.6%.

Image Classification Instance Segmentation +3

Privacy-Preserving Data-Enabled Predictive Leading Cruise Control in Mixed Traffic

no code implementations 22 May 2022 Kaixiang Zhang, Kaian Chen, Zhaojian Li, Jun Chen, Yang Zheng

Data-driven predictive control of connected and automated vehicles (CAVs) has received increasing attention as it can achieve safe and optimal control without relying on explicit dynamical models.

Privacy Preserving

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations 11 May 2022 Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00 dB on the DIV2K validation set.

Image Super-Resolution

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

2 code implementations 23 Mar 2022 Jun Chen, Zilin Wang, Deyi Tuo, Zhiyong Wu, Shiyin Kang, Helen Meng

The previously proposed FullSubNet has achieved outstanding performance in the Deep Noise Suppression (DNS) Challenge and attracted much attention.

Speech Enhancement

A Unified Framework for Campaign Performance Forecasting in Online Display Advertising

no code implementations 24 Feb 2022 Jun Chen, Cheng Chen, Huayue Zhang, Qing Tan

Advertisers usually enjoy the flexibility to choose criteria like target audience, geographic area and bid price when planning a campaign for online display advertising, while they lack forecast information on campaign performance to optimize delivery strategies in advance, resulting in a waste of labour and budget for feedback adjustments.

Multi-Task Learning

An Analysis of Complex-Valued CNNs for RF Data-Driven Wireless Device Classification

no code implementations 20 Feb 2022 Jun Chen, Weng-Keen Wong, Bechir Hamdaoui, Abdurrahman Elmaghbub, Kathiravetpillai Sivanesan, Richard Dorrance, Lily L. Yang

We perform a deep dive into understanding the impact of (i) the input representation/type and (ii) the architectural layer of the neural network.

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

no code implementations 15 Feb 2022 Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L. Berg, Ning Zhang

We introduce CommerceMM - a multimodal model capable of providing a diverse and granular understanding of commerce topics associated with a given piece of content (image, text, image+text), and of generalizing to a wide range of tasks, including Multimodal Categorization, Image-Text Retrieval, Query-to-Product Retrieval, Image-to-Product Retrieval, etc.

Representation Learning Retrieval +1

Towards Multi-Domain Single Image Dehazing via Test-Time Training

no code implementations CVPR 2022 Huan Liu, Zijun Wu, Liangyan Li, Sadaf Salehkalaibar, Jun Chen, Keyan Wang

Motivated by this observation, we propose a test-time training method which leverages a helper network to assist the dehazing model in better adapting to a domain of interest.

Image Dehazing Meta-Learning +1

Dynamically Stable Poincaré Embeddings for Neural Manifolds

no code implementations 21 Dec 2021 Jun Chen, Yuang Liu, Xiangrui Zhao, Mengmeng Wang, Yong Liu

As a result, we prove that, if initial metrics have an $L^2$-norm perturbation which deviates from the Hyperbolic metric on the Poincaré ball, the scaled Ricci-DeTurck flow of such metrics smoothly and exponentially converges to the Hyperbolic metric.

Image Classification

Video Frame Interpolation Transformer

1 code implementation CVPR 2022 Zhihao Shi, Xiangyu Xu, Xiaohong Liu, Jun Chen, Ming-Hsuan Yang

Existing methods for video interpolation heavily rely on deep convolution neural networks, and thus suffer from their intrinsic limitations, such as content-agnostic kernel weights and restricted receptive field.

Video Frame Interpolation

Thoughts on the Consistency between Ricci Flow and Neural Network Behavior

no code implementations 16 Nov 2021 Jun Chen, Tianxin Huang, Wenzhou Chen, Yong Liu

During the training process of the neural network, we observe that its metric will also regularly converge to the linearly nearly Euclidean metric, which is consistent with the convergent behavior of linearly nearly Euclidean metrics under the Ricci-DeTurck flow.

WHU-NERCMS at TRECVID2021: Instance Search Task

no code implementations 30 Oct 2021 Yanrui Niu, Jingyao Yang, Ankang Lu, Baojin Huang, Yue Zhang, Ji Huang, Shishi Wen, Dongshu Xu, Chao Liang, Zhongyuan Wang, Jun Chen

In this paper, we briefly introduce the experimental methods and results of WHU-NERCMS at TRECVID2021.

Action Detection Face Detection +5

Pseudo Supervised Monocular Depth Estimation with Teacher-Student Network

no code implementations 22 Oct 2021 Huan Liu, Junsong Yuan, Chen Wang, Jun Chen

Despite recent improvement of supervised monocular depth estimation, the lack of high quality pixel-wise ground truth annotations has become a major hurdle for further progress.

Knowledge Distillation Monocular Depth Estimation +1

Manifold Micro-Surgery with Linearly Nearly Euclidean Metrics

no code implementations 29 Sep 2021 Jun Chen, Tianxin Huang, Wenzhou Chen, Yong Liu

The Ricci flow is a method of manifold surgery, which can trim manifolds to make them more regular.

Riemannian Manifold Embeddings for Straight-Through Estimator

no code implementations 29 Sep 2021 Jun Chen, Hanwen Chen, Jiangning Zhang, Yuang Liu, Tianxin Huang, Yong Liu

Quantized Neural Networks (QNNs) aim at replacing full-precision weights $\boldsymbol{W}$ with quantized weights $\boldsymbol{\hat{W}}$, which makes it possible to deploy large models to mobile and miniaturized devices easily.

Quantization

Cross-Domain Lossy Compression as Optimal Transport with an Entropy Bottleneck

no code implementations ICLR 2022 Huan Liu, George Zhang, Jun Chen, Ashish J Khisti

We study the problem of cross-domain lossy compression where the reconstruction distribution is different from the source distribution in order to account for distributional shift due to processing.

Denoising Super-Resolution

Adaptive Hierarchical Dual Consistency for Semi-Supervised Left Atrium Segmentation on Cross-Domain Data

1 code implementation 17 Sep 2021 Jun Chen, Heye Zhang, Raad Mohiaddin, Tom Wong, David Firmin, Jennifer Keegan, Guang Yang

For the inter-domain learning, a consistency constraint is applied to the LAs modelled by two dual-modelling networks to exploit the complementary knowledge among different data domains.

Left Atrium Segmentation Segmentation

Universal Rate-Distortion-Perception Representations for Lossy Compression

no code implementations NeurIPS 2021 George Zhang, Jingjing Qian, Jun Chen, Ashish Khisti

In the context of lossy compression, Blau & Michaeli (2019) adopt a mathematical notion of perceptual quality and define the information rate-distortion-perception function, generalizing the classical rate-distortion tradeoff.

Image Compression

JAS-GAN: Generative Adversarial Network Based Joint Atrium and Scar Segmentations on Unbalanced Atrial Targets

no code implementations 1 May 2021 Jun Chen, Guang Yang, Habib Khan, Heye Zhang, Yanping Zhang, Shu Zhao, Raad Mohiaddin, Tom Wong, David Firmin, Jennifer Keegan

In this paper, we propose an inter-cascade generative adversarial network, namely JAS-GAN, to segment the unbalanced atrial targets from LGE CMR images automatically and accurately in an end-to-end way.

Generative Adversarial Network Segmentation

RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition

1 code implementation CVPR 2022 Jun Chen, Aniket Agarwal, Sherif Abdelkarim, Deyao Zhu, Mohamed Elhoseiny

This paper shows that modeling an effective message-passing flow through an attention mechanism can be critical to tackling the compositionality and long-tail challenges in VRR.

Image Captioning Object Recognition +5

DW-GAN: A Discrete Wavelet Transform GAN for NonHomogeneous Dehazing

1 code implementation 18 Apr 2021 Minghan Fu, Huan Liu, Yankun Yu, Jun Chen, Keyan Wang

By utilizing the wavelet transform in the DWT branch, our proposed method can retain more high-frequency knowledge in feature maps.

Bayesian Optimisation for a Biologically Inspired Population Neural Network

no code implementations 13 Apr 2021 Mahak Kothari, Swapna Sasi, Jun Chen, Elham Zareian, Basabdatta Sen Bhattacharya

The 8-dimensional optimal hyper-parameter combination should be such that the network dynamics simulate the resting state alpha rhythm (8 - 13 Hz rhythms in brain signals).

Bayesian Optimisation Time Series +1

Towards a Unified Approach to Single Image Deraining and Dehazing

no code implementations 26 Mar 2021 Xiaohong Liu, Yongrui Ma, Zhihao Shi, Linhui Dai, Jun Chen

We develop a new physical model for the rain effect and show that the well-known atmosphere scattering model (ASM) for the haze effect naturally emerges as its homogeneous continuous limit.

Single Image Deraining

GridDehazeNet+: An Enhanced Multi-Scale Network with Intra-Task Knowledge Transfer for Single Image Dehazing

no code implementations 25 Mar 2021 Xiaohong Liu, Zhihao Shi, Zijun Wu, Jun Chen

We also propose a novel intra-task knowledge transfer mechanism that can memorize and take advantage of synthetic domain knowledge to assist the learning process on the translated data.

Dimensionality Reduction Image Dehazing +2

PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization

1 code implementation 19 Mar 2021 Xiaohong Liu, Yaojie Liu, Jun Chen, Xiaoming Liu

To defend against manipulation of image content, such as splicing, copy-move, and removal, we develop a Progressive Spatio-Channel Correlation Network (PSCC-Net) to detect and localize image manipulations.

Image Manipulation Image Manipulation Detection

Learning for Unconstrained Space-Time Video Super-Resolution

no code implementations 25 Feb 2021 Zhihao Shi, Xiaohong Liu, Chengqi Li, Linhui Dai, Jun Chen, Timothy N. Davidson, Jiying Zhao

Recent years have seen considerable research activities devoted to video enhancement that simultaneously increases temporal frame rate and spatial resolution.

Optical Flow Estimation Space-time Video Super-resolution +2

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

1 code implementation CVPR 2022 Jun Chen, Han Guo, Kai Yi, Boyang Li, Mohamed Elhoseiny

To the best of our knowledge, this is the first work that improves the data efficiency of image captioning by utilizing an LM pretrained on unimodal data.

Image Captioning Language Modelling +1

Indirect Domain Shift for Single Image Dehazing

no code implementations 5 Feb 2021 Huan Liu, Jun Chen

Therefore, it is capable of consolidating the expressibility of different architectures, resulting in a more accurate indirect domain shift (IDS) from the hazy images to that of clear images.

Image Dehazing Single Image Dehazing

Edge-Featured Graph Attention Network

no code implementations 19 Jan 2021 Jun Chen, Haopeng Chen

In this paper, we present edge-featured graph attention networks, namely EGATs, to extend the use of graph neural networks to tasks that learn on graphs with both node and edge features.

Graph Attention Graph Learning +1

Optimizing Quantized Neural Networks with Natural Gradient

no code implementations 1 Jan 2021 Jun Chen, Hanwen Chen, Jiangning Zhang, Wenzhou Chen, Yong Liu, Yunliang Jiang

Quantized Neural Networks (QNNs) have taken an enormous step in improving computational efficiency, making it possible to deploy large models to mobile and miniaturized devices.

Computational Efficiency

APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment

1 code implementation 25 Oct 2020 Jiangning Zhang, Xianfang Zeng, Chao Xu, Jun Chen, Yong Liu, Yunliang Jiang

Audio-guided face reenactment aims to generate a photorealistic face whose facial expression matches the input audio.

Face Reenactment

Video Frame Interpolation via Generalized Deformable Convolution

1 code implementation 24 Aug 2020 Zhihao Shi, Xiaohong Liu, Kangdi Shi, Linhui Dai, Jun Chen

Video frame interpolation aims at synthesizing intermediate frames from nearby source frames while maintaining spatial and temporal consistencies.

Video Frame Interpolation

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

1 code implementation 24 Aug 2020 Xiaohong Liu, Kangdi Shi, Zhe Wang, Jun Chen

Extensive experiments demonstrate that owing to the informativeness of the camera raw data, the effectiveness of the network architecture, and the separation of super-resolution and color correction processes, the proposed method achieves superior VSR results compared to the state-of-the-art and can be adapted to any specific camera-ISP.

Informativeness Video Super-Resolution

AWNet: Attentive Wavelet Network for Image ISP

1 code implementation 20 Aug 2020 Linhui Dai, Xiaohong Liu, Chengqi Li, Jun Chen

In this paper, we introduce a novel network that utilizes the attention mechanism and wavelet transform, dubbed AWNet, to tackle this learnable image ISP problem.

Towards Interpretable Clinical Diagnosis with Bayesian Network Ensembles Stacked on Entity-Aware CNNs

no code implementations ACL 2020 Jun Chen, Xiaoya Dai, Quan Yuan, Chao Lu, Haifeng Huang

The automatic text-based diagnosis remains a challenging task for clinical use because it requires appropriate balance between accuracy and interpretability.

A Learning Framework for n-bit Quantized Neural Networks toward FPGAs

1 code implementation 6 Apr 2020 Jun Chen, Liang Liu, Yong Liu, Xianfang Zeng

Furthermore, we also design a shift vector processing element (SVPE) array to replace all 16-bit multiplications with SHIFT operations in convolution operation on FPGAs.

Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks

no code implementations 4 Mar 2020 Jun Chen, Yong Liu, Hao Zhang, Shengnan Hou, Jian Yang

Meanwhile, we propose an M-bit Inputs and N-bit Weights Network (MINW-Net) trained by AQE, a quantized neural network with 1-3 bit weights and activations.

Simultaneous Left Atrium Anatomy and Scar Segmentations via Deep Learning in Multiview Information with Attention

no code implementations 2 Feb 2020 Guang Yang, Jun Chen, Zhifan Gao, Shuo Li, Hao Ni, Elsa Angelini, Tom Wong, Raad Mohiaddin, Eva Nyktari, Ricardo Wage, Lei Xu, Yanping Zhang, Xiuquan Du, Heye Zhang, David Firmin, Jennifer Keegan

Using our MVTT recursive attention model, both the LA anatomy and scar can be segmented accurately (mean Dice score of 93% for the LA anatomy and 87% for the scar segmentations) and efficiently (~0.27 seconds to simultaneously segment the LA anatomy and scars directly from the 3D LGE CMR dataset with 60-68 2D slices).

Anatomy Segmentation
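
The Dice score quoted in the snippet above is the standard overlap measure between a predicted mask and a reference mask, 2|A∩B| / (|A| + |B|). A minimal sketch follows, assuming flat binary masks given as Python lists; the function name `dice_score` and the convention of returning 1.0 for two empty masks are illustrative assumptions, not taken from the paper.

```python
def dice_score(pred, target):
    """Dice overlap between two flat binary masks of equal length.

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of elements")

    # Count pixels that are foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Count foreground pixels in each mask separately.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)

    # Assumption: two empty masks are treated as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total
```

A score of 0.93 under this definition means that twice the overlapping area equals 93% of the combined foreground area of the two masks.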

Highly fluorescent copper nanoclusters for sensing and bioimaging

no code implementations 29 Dec 2019 Yu An, Ying Ren, Jing Tang, Jun Chen, Baisong Chang

Metal nanoclusters (NCs), typically consisting of a few to tens of metal atoms, bridge the gap between organometallic compounds and crystalline metal nanoparticles.

GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing

1 code implementation ICCV 2019 Xiaohong Liu, Yongrui Ma, Zhihao Shi, Jun Chen

The proposed dehazing method does not rely on the atmosphere scattering model, and we provide an explanation as to why it is not necessarily beneficial to take advantage of the dimension reduction offered by the atmosphere scattering model for image dehazing, even if only the dehazing results on synthetic images are concerned.

Dimensionality Reduction Image Dehazing +2

Discriminative Consistent Domain Generation for Semi-supervised Learning

no code implementations 24 Jul 2019 Jun Chen, Heye Zhang, Yanping Zhang, Shu Zhao, Raad Mohiaddin, Tom Wong, David Firmin, Guang Yang, Jennifer Keegan

Based on the generated discriminative consistent domain, we can use the unlabeled data to learn the task model along with the labeled data via a consistent image generation.

Anatomy Domain Adaptation +1

A Fast Free-viewpoint Video Synthesis Algorithm for Sports Scenes

no code implementations 28 Mar 2019 Jun Chen, Ryosuke Watanabe, Keisuke Nonaka, Tomoaki Konno, Hiroshi Sankoh, Sei Naito

In this paper, we report on a parallel free-viewpoint video synthesis algorithm that can efficiently reconstruct a high-quality 3D scene representation of sports scenes.

Multiview Two-Task Recursive Attention Model for Left Atrium and Atrial Scars Segmentation

no code implementations 12 Jun 2018 Jun Chen, Guang Yang, Zhifan Gao, Hao Ni, Elsa Angelini, Raad Mohiaddin, Tom Wong, Yanping Zhang, Xiuquan Du, Heye Zhang, Jennifer Keegan, David Firmin

Late Gadolinium Enhanced Cardiac MRI (LGE-CMRI) for detecting atrial scars in atrial fibrillation (AF) patients has recently emerged as a promising technique to stratify patients, guide ablation therapy and predict treatment success.

Anatomy Segmentation

Efficient Parallel Connected Components Labeling with a Coarse-to-fine Strategy

no code implementations 28 Dec 2017 Jun Chen, Keisuke Nonaka, Ryosuke Watanabe, Hiroshi Sankoh, Houari Sabirin, Sei Naito

This paper proposes a new parallel approach to connected components labeling on a 2D binary image, implemented with CUDA.

Curve-Structure Segmentation from Depth Maps: A CNN-based Approach and Its Application to Exploring Cultural Heritage Objects

no code implementations 7 Nov 2017 Yuhang Lu, Jun Zhou, Jing Wang, Jun Chen, Karen Smith, Colin Wilder, Song Wang

Motivated by the important archaeological application of exploring cultural heritage objects, in this paper we study the challenging problem of automatically segmenting curve structures that are very weakly stamped or carved on an object surface in the form of a highly noisy depth map.

Image Segmentation Semantic Segmentation

An Optimized Union-Find Algorithm for Connected Components Labeling Using GPUs

no code implementations 28 Aug 2017 Jun Chen, Qiang Yao, Houari Sabirin, Keisuke Nonaka, Hiroshi Sankoh, Sei Naito

In this paper, we report an optimized union-find (UF) algorithm that can label the connected components on a 2D image efficiently by employing the GPU architecture.
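
For readers unfamiliar with union-find labeling, the basic idea can be sketched sequentially as below. This is a plain CPU illustration under stated assumptions (4-connectivity, a list-of-lists binary image, labels assigned in raster order); it is not the paper's optimized GPU algorithm, and the function name `label_components` is hypothetical.

```python
def label_components(image):
    """Label 4-connected foreground pixels of a 2D binary image.

    Sequential union-find sketch for illustration only; background is 0
    and components are numbered 1, 2, ... in raster-scan order.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    parent = list(range(rows * cols))  # each pixel starts as its own set

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            # Keep the smaller index as the root for deterministic results.
            if ra < rb:
                parent[rb] = ra
            else:
                parent[ra] = rb

    # Pass 1: merge each foreground pixel with its upper and left neighbors.
    for r in range(rows):
        for c in range(cols):
            if not image[r][c]:
                continue
            idx = r * cols + c
            if r > 0 and image[r - 1][c]:
                union(idx, idx - cols)
            if c > 0 and image[r][c - 1]:
                union(idx, idx - 1)

    # Pass 2: map each root to a compact label.
    labels = [[0] * cols for _ in range(rows)]
    root_to_label = {}
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                root = find(r * cols + c)
                if root not in root_to_label:
                    root_to_label[root] = len(root_to_label) + 1
                labels[r][c] = root_to_label[root]
    return labels
```

The two-pass structure (merge, then relabel) is the common starting point that parallel variants build on; how the merge step is distributed across GPU threads is specific to each paper and is not shown here.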

A straightforward method to assess motion blur for different types of displays

no code implementations 8 Aug 2015 Fuhao Chen, Jun Chen, Feng Huang

A simulation method based on the liquid crystal response and the human visual system is suitable to characterize motion blur for LCDs but not other display types.
