Search Results for author: Fan Zhang

Found 241 papers, 64 papers with code

Real-time Decolorization using Dominant Colors

no code implementations10 Apr 2014 Wei Hu, Wei Li, Fan Zhang, Qian Du

Decolorization is the process to convert a color image or video to its grayscale version, and it has received great attention in recent years.

Parallax-tolerant Image Stitching

no code implementations CVPR 2014 Fan Zhang, Feng Liu

We then pre-align input images using the optimal homography and further use content-preserving warping to locally refine the alignment.

Image Stitching Local Distortion

Casual Stereoscopic Panorama Stitching

no code implementations CVPR 2015 Fan Zhang, Feng Liu

The stitching of the right views is formulated as a labeling problem that is constrained by the stitching of the left views to make the left- and right-view panorama consistent to avoid retinal rivalry.

Image Stitching

Fusing Subcategory Probabilities for Texture Classification

no code implementations CVPR 2015 Yang Song, Weidong Cai, Qing Li, Fan Zhang, David Dagan Feng, Heng Huang

Texture, as a fundamental characteristic of objects, has attracted much attention in computer vision research.

Classification Clustering +2

Stealing Machine Learning Models via Prediction APIs

1 code implementation9 Sep 2016 Florian Tramèr, Fan Zhang, Ari Juels, Michael K. Reiter, Thomas Ristenpart

In such attacks, an adversary with black-box access, but no prior knowledge of an ML model's parameters or training data, aims to duplicate the functionality of (i. e., "steal") the model.

BIG-bench Machine Learning Learning Theory +1

Indoor Space Recognition using Deep Convolutional Neural Network: A Case Study at MIT Campus

no code implementations7 Oct 2016 Fan Zhang, Fabio Duarte, Ruixian Ma, Dimitrios Milioris, Hui Lin, Carlo Ratti

In this paper, we propose a robust and parsimonious approach using Deep Convolutional Neural Network (DCNN) to recognize and interpret interior space.

Scene Recognition

A global optimization algorithm for sparse mixed membership matrix factorization

no code implementations19 Oct 2016 Fan Zhang, Chuangqi Wang, Andrew Trapp, Patrick Flaherty

Mixed membership factorization is a popular approach for analyzing data sets that have within-sample heterogeneity.

Inferring Discourse Relations from PDTB-style Discourse Labels for Argumentative Revision Classification

no code implementations COLING 2016 Fan Zhang, Diane Litman, Katherine Forbes Riley

Penn Discourse Treebank (PDTB)-style annotation focuses on labeling local discourse relations between text spans and typically ignores larger discourse contexts.

Classification General Classification

A multi-task convolutional neural network for mega-city analysis using very high resolution satellite imagery and geospatial data

no code implementations26 Feb 2017 Fan Zhang, Bo Du, Liangpei Zhang

For the second target, a novel CNN-based universal framework is proposed to process the VHR satellite images and generate the land-use, urban density, and population distribution maps.

A Joint Identification Approach for Argumentative Writing Revisions

no code implementations28 Feb 2017 Fan Zhang, Diane Litman

This paper proposes an approach that identifies the revision location and the revision type jointly to solve the issue of error propagation.

Classification General Classification

Doubly Robust Data-Driven Distributionally Robust Optimization

no code implementations19 May 2017 Jose Blanchet, Yang Kang, Fan Zhang, Fei He, Zhangyi Hu

Data-driven Distributionally Robust Optimization (DD-DRO) via optimal transport has been shown to encompass a wide range of popular machine learning algorithms.

Data-driven Optimal Cost Selection for Distributionally Robust Optimization

no code implementations19 May 2017 Jose Blanchet, Yang Kang, Fan Zhang, Karthyek Murthy

Recently, (Blanchet, Kang, and Murhy 2016, and Blanchet, and Kang 2017) showed that several machine learning algorithms, such as square-root Lasso, Support Vector Machines, and regularized logistic regression, among many others, can be represented exactly as distributionally robust optimization (DRO) problems.

BIG-bench Machine Learning regression

SAR Target Recognition Using the Multi-aspect-aware Bidirectional LSTM Recurrent Neural Networks

no code implementations25 Jul 2017 Fan Zhang, Chen Hu, Qiang Yin, Wei Li, Heng-Chao Li, Wen Hong

However, there is a limitation in current deep learning based ATR solution that each learning process only handle one SAR image, namely learning the static scattering information, while missing the space-varying information.

Dimensionality Reduction

Parameter-free $\ell_p$-Box Decoding of LDPC Codes

1 code implementation29 Nov 2017 Qiong Wu, Fan Zhang, Hao Wang, Jun Lin, Yang Liu

The Alternating Direction Method of Multipliers (ADMM) decoding of Low Density Parity Check (LDPC) codes has received many attentions due to its excellent performance at the error floor region.

Information Theory Information Theory

SeqFace: Make full use of sequence information for face recognition

1 code implementation17 Mar 2018 Wei Hu, Yangyu Huang, Fan Zhang, Ruirui Li, Wei Li, Guodong Yuan

Deep convolutional neural networks (CNNs) have greatly improved the Face Recognition (FR) performance in recent years.

Face Recognition Face Verification

TreeSegNet: Adaptive Tree CNNs for Subdecimeter Aerial Image Segmentation

no code implementations29 Apr 2018 Kai Yue, Lei Yang, Ruirui Li, Wei Hu, Fan Zhang, Wei Li

For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions.

Image Segmentation Segmentation +1

Hyperspectral image classification via a random patches network

1 code implementation ISPRS Journal of Photogrammetry and Remote Sensing 2018 Yonghao Xu, Bo Du, Fan Zhang, Liangpei Zhang

Due to the remarkable achievements obtained by deep learning methods in the fields of computer vision, an increasing number of researches have been made to apply these powerful tools into hyperspectral image (HSI) classification.

Classification Few-Shot Image Classification +1

Memristor-based Deep Convolution Neural Network: A Case Study

no code implementations14 Sep 2018 Fan Zhang, Miao Hu

In this paper, we firstly introduce a method to efficiently implement large-scale high-dimensional convolution with realistic memristor-based circuit components.

Optimal Transport Based Distributionally Robust Optimization: Structural Properties and Iterative Schemes

1 code implementation4 Oct 2018 Jose Blanchet, Karthyek Murthy, Fan Zhang

We consider optimal transport based distributionally robust optimization (DRO) problems with locally strongly convex transport cost functions and affine decision rules.

Optimization and Control Primary: 90C15, Secondary: 65K05, 90C47

Nonconvex and Nonsmooth Sparse Optimization via Adaptively Iterative Reweighted Methods

no code implementations24 Oct 2018 Hao Wang, Fan Zhang, Yuanming Shi, Yaohua Hu

We propose a general formulation of nonconvex and nonsmooth sparse optimization problems with convex set constraint, which can take into account most existing types of nonconvex sparsity-inducing terms, bringing strong applicability to a wide range of applications.

Quantifying Legibility of Indoor Spaces Using Deep Convolutional Neural Networks: Case Studies in Train Stations

no code implementations22 Jan 2019 Zhoutong Wang, Qianhui Liang, Fabio Duarte, Fan Zhang, Louis Charron, Lenna Johnsen, Bill Cai, Carlo Ratti

Evaluating legibility is particularly desirable in indoor spaces, since it has a large impact on human behavior and the efficiency of space utilization.

Noise-Tolerant Paradigm for Training Face Recognition CNNs

2 code implementations CVPR 2019 Wei Hu, Yangyu Huang, Fan Zhang, Ruirui Li

Benefit from large-scale training datasets, deep Convolutional Neural Networks(CNNs) have achieved impressive results in face recognition(FR).

Face Recognition

A Distributionally Robust Boosting Algorithm

no code implementations20 May 2019 Jose Blanchet, Yang Kang, Fan Zhang, Zhangyi Hu

Distributionally Robust Optimization (DRO) has been shown to provide a flexible framework for decision making under uncertainty and statistical estimation.

Decision Making Decision Making Under Uncertainty

MediaPipe: A Framework for Building Perception Pipelines

2 code implementations14 Jun 2019 Camillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris McClanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-Teh Chang, Wei Hua, Manfred Georg, Matthias Grundmann

A developer can use MediaPipe to build prototypes by combining existing perception components, to advance them to polished cross-platform applications and measure system performance and resource consumption on target platforms.

Distributed, Parallel, and Cluster Computing

Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model

no code implementations1 Jul 2019 Fan Zhang, Tong Wu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

Many computational models were proposed to extract temporal patterns from clinical time series for each patient and among patient group for predictive healthcare.

Time Series Time Series Analysis

A Survey of Deep Learning-based Object Detection

no code implementations11 Jul 2019 Licheng Jiao, Fan Zhang, Fang Liu, Shuyuan Yang, Lingling Li, Zhixi Feng, Rong Qu

Object detection is one of the most important and challenging branches of computer vision, which has been widely applied in peoples life, such as monitoring security, autonomous driving and so on, with the purpose of locating instances of semantic objects of a certain class.

Autonomous Driving Object +2

Edge AIBench: Towards Comprehensive End-to-end Edge Computing Benchmarking

no code implementations6 Aug 2019 Tianshu Hao, Yunyou Huang, Xu Wen, Wanling Gao, Fan Zhang, Chen Zheng, Lei Wang, Hainan Ye, Kai Hwang, Zujie Ren, Jianfeng Zhan

In edge computing scenarios, the distribution of data and collaboration of workloads on different layers are serious concerns for performance, privacy, and security issues.

Performance Distributed, Parallel, and Cluster Computing

ACFNet: Attentional Class Feature Network for Semantic Segmentation

1 code implementation ICCV 2019 Fan Zhang, Yanqin Chen, Zhihang Li, Zhibin Hong, Jingtuo Liu, Feifei Ma, Junyu Han, Errui Ding

Recent works have made great progress in semantic segmentation by exploiting richer context, most of which are designed from a spatial perspective.

Segmentation Semantic Segmentation

ViSTRA2: Video Coding using Spatial Resolution and Effective Bit Depth Adaptation

no code implementations7 Nov 2019 Fan Zhang, Mariana Afonso, David R. Bull

Our results show consistent and significant compression gains against HM and VVC based on Bj{\o}negaard Delta measurements, with average BD-rate savings of 12. 6% (PSNR) and 19. 5% (VMAF) over HM and 5. 5% (PSNR) and 8. 6% (VMAF) over VTM.

Decoder Video Compression

Inexact Primal-Dual Gradient Projection Methods for Nonlinear Optimization on Convex Set

no code implementations18 Nov 2019 Fan Zhang, Hao Wang, Jiashan Wang, Kai Yang

In this paper, we propose a novel primal-dual inexact gradient projection method for nonlinear optimization problems with convex-set constraint.

Hepatocellular Carcinoma Intra-arterial Treatment Response Prediction for Improved Therapeutic Decision-Making

no code implementations1 Dec 2019 Junlin Yang, Nicha C. Dvornek, Fan Zhang, Julius Chapiro, MingDe Lin, Aaron Abajian, James S. Duncan

This work proposes a pipeline to predict treatment response to intra-arterial therapy of patients with Hepatocellular Carcinoma (HCC) for improved therapeutic decision-making.

Decision Making

Mitigate Parasitic Resistance in Resistive Crossbar-based Convolutional Neural Networks

no code implementations17 Dec 2019 Fan Zhang, Miao Hu

We demonstrated the proposed methods with implementations of a 4-layer CNN on MNIST and ResNet(20, 32, and 56) on CIFAR-10.

Defects Mitigation in Resistive Crossbars for Analog Vector Matrix Multiplication

no code implementations17 Dec 2019 Fan Zhang, Miao Hu

With storage and computation happening at the same place, computing in resistive crossbars minimizes data movement and avoids the memory bottleneck issue.

Concurrently Extrapolating and Interpolating Networks for Continuous Model Generation

1 code implementation12 Jan 2020 Lijun Zhao, Jinjing Zhang, Fan Zhang, Anhong Wang, Huihui Bai, Yao Zhao

Most deep image smoothing operators are always trained repetitively when different explicit structure-texture pairs are employed as label images for each algorithm configured with different parameters.

image smoothing

Residual-Recursion Autoencoder for Shape Illustration Images

no code implementations6 Feb 2020 Qianwei Zhou, Peng Tao, Xiaoxin Li, Sheng-Yong Chen, Fan Zhang, Haigen Hu

Shape illustration images (SIIs) are common and important in describing the cross-sections of industrial products.

Efficient Scenario Generation for Heavy-tailed Chance Constrained Optimization

no code implementations6 Feb 2020 Jose Blanchet, Fan Zhang, Bert Zwart

We consider a generic class of chance-constrained optimization problems with heavy-tailed (i. e., power-law type) risk factors.

Optimization and Control Probability

BVI-CC: A Dataset for Research on Video Compression and Quality Assessment

no code implementations23 Mar 2020 Angeliki V. Katsenou, Fan Zhang, Mariana Afonso, Goce Dimitrov, David R. Bull

The compression efficiency of the codecs was evaluated with commonly used objective quality metrics, and the subjective quality of their reconstructed content was also evaluated through psychophysical experiments.

Video Compression

BVI-DVC: A Training Database for Deep Video Compression

no code implementations30 Mar 2020 Di Ma, Fan Zhang, David R. Bull

Deep learning methods are increasingly being applied in the optimisation of video compression algorithms and can achieve significantly enhanced coding gains, compared to conventional approaches.

Video Compression

TRAKO: Efficient Transmission of Tractography Data for Visualization

1 code implementation26 Apr 2020 Daniel Haehn, Loraine Franke, Fan Zhang, Suheyla Cetin Karayumak, Steve Pieper, Lauren O'Donnell, Yogesh Rathi

Fiber tracking produces large tractography datasets that are tens of gigabytes in size consisting of millions of streamlines.

Encoding in the Dark Grand Challenge: An Overview

no code implementations7 May 2020 Nantheera Anantrasirichai, Fan Zhang, Alexandra Malyugina, Paul Hill, Angeliki Katsenou

In this paper, we present an overview of the proposed challenge, and test state-of-the-art methods that will be part of the benchmark methods at the stage of the participants' deliverable assessment.

Denoising Image Enhancement

Defending Model Inversion and Membership Inference Attacks via Prediction Purification

no code implementations8 May 2020 Ziqi Yang, Bin Shao, Bohan Xuan, Ee-Chien Chang, Fan Zhang

Neural networks are susceptible to data inference attacks such as the model inversion attack and the membership inference attack, where the attacker could infer the reconstruction and the membership of a data sample from the confidence scores predicted by the target classifier.

Inference Attack Membership Inference Attack

Active Fuzzing for Testing and Securing Cyber-Physical Systems

1 code implementation28 May 2020 Yuqi Chen, Bohan Xuan, Christopher M. Poskitt, Jun Sun, Fan Zhang

Cyber-physical systems (CPSs) in critical infrastructure face a pervasive threat from attackers, motivating research into a variety of countermeasures for securing them.

Active Learning

Integrating global spatial features in CNN based Hyperspectral/SAR imagery classification

no code implementations30 May 2020 Fan Zhang, MinChao Yan, Chen Hu, Jun Ni, Fei Ma

In addition, a dual-branch convolutional neural network (CNN) classification method is designed in combination with the global information to mine the pixel features of the image.

Classification General Classification +3

Distributionally Robust Batch Contextual Bandits

no code implementations10 Jun 2020 Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

Leveraging this evaluation scheme, we further propose a novel learning algorithm that is able to learn a policy that is robust to adversarial perturbations and unknown covariate shifts with a performance guarantee based on the theory of uniform convergence.

Multi-Armed Bandits

BlazePose: On-device Real-time Body Pose tracking

7 code implementations17 Jun 2020 Valentin Bazarevsky, Ivan Grishchenko, Karthik Raveendran, Tyler Zhu, Fan Zhang, Matthias Grundmann

We present BlazePose, a lightweight convolutional neural network architecture for human pose estimation that is tailored for real-time inference on mobile devices.

2D Human Pose Estimation 3D Human Pose Estimation +4

MediaPipe Hands: On-device Real-time Hand Tracking

4 code implementations18 Jun 2020 Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann

We present a real-time on-device hand tracking pipeline that predicts hand skeleton from single RGB camera for AR/VR applications.

MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering

no code implementations14 Jul 2020 Di Ma, Fan Zhang, David R. Bull

Each MFRB extracts features from multiple convolutional layers using dense connections and a multi-level residual learning structure.

Video Compression

Video compression with low complexity CNN-based spatial resolution adaptation

no code implementations29 Jul 2020 Di Ma, Fan Zhang, David R. Bull

It has recently been demonstrated that spatial resolution adaptation can be integrated within video compression to improve overall coding performance by spatially down-sampling before encoding and super-resolving at the decoder.

Decoder Super-Resolution +1

Joint Bandwidth Allocation and Path Selection in WANs with Path Cardinality Constraints

no code implementations10 Aug 2020 Jinxin Wang, Fan Zhang, Zhonglin Xie, Gong Zhang, Zaiwen Wen

Almost all existing works deal with such a problem using relaxation techniques to transform it to be a convex optimization problem.

Fairness

PDAM: A Panoptic-Level Feature Alignment Framework for Unsupervised Domain Adaptive Instance Segmentation in Microscopy Images

1 code implementation11 Sep 2020 Dongnan Liu, Donghao Zhang, Yang song, Fan Zhang, Lauren O'Donnell, Heng Huang, Mei Chen, Weidong Cai

In this work, we present an unsupervised domain adaptation (UDA) method, named Panoptic Domain Adaptive Mask R-CNN (PDAM), for unsupervised instance segmentation in microscopy images.

Instance Segmentation Segmentation +3

P-DIFF: Learning Classifier with Noisy Labels based on Probability Difference Distributions

1 code implementation14 Sep 2020 Wei Hu, QiHao Zhao, Yangyu Huang, Fan Zhang

Learning deep neural network (DNN) classifier with noisy labels is a challenging task because the DNN can easily over-fit on these noisy labels due to its high capability.

Video Compression with CNN-based Post Processing

no code implementations16 Sep 2020 Fan Zhang, Di Ma, Chen Feng, David R. Bull

In recent years, video compression techniques have been significantly challenged by the rapidly increased demands associated with high quality and immersive video content.

Video Compression

A simulation environment for drone cinematography

no code implementations3 Oct 2020 Fan Zhang, David Hall, Tao Xu, Stephen Boyle, David Bull

Methods for environmental image capture, 3D reconstruction (photogrammetry) and the creation of foreground assets are presented along with a flexible and user-friendly simulation interface.

3D Reconstruction

Distributionally Robust Local Non-parametric Conditional Estimation

no code implementations NeurIPS 2020 Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

Conditional estimation given specific covariate values (i. e., local conditional estimation or functional estimation) is ubiquitously useful with applications in engineering, social and natural sciences.

Hole-Doped Room-Temperature Superconductivity in H$_{3}$S$_{1-x}$Z$_x$ (Z=C, Si)

no code implementations25 Nov 2020 Yanfeng Ge, Fan Zhang, Ranga P. Dias, Russell J. Hemley, Yugui Yao

We examine the effects of the low-level substitution of S atoms by C and Si atoms on the superconductivity of H$_3$S with the $Im\bar{3}m$ structure at megabar pressure.

Superconductivity Materials Science

Two-fluid Modeling of Acoustic Wave Propagation in Gravitationally Stratified Isothermal Media

no code implementations26 Nov 2020 Fan Zhang, Stefaan Poedts, Andrea Lani, Błażej Kuźma, Kris Murawski

In the present numerical simulations, the initial density is specified to reach hydrostatic equilibrium, and as a comparison, chemical equilibrium is also taken into account to provide a density profile that differs from typical hydrostatic equilibrium profiles.

Plasma Physics Solar and Stellar Astrophysics

Room-Temperature Superconductivity in Boron-Nitrogen Doped Lanthanum Superhydride

no code implementations24 Dec 2020 Yanfeng Ge, Fan Zhang, Russell J. Hemley

Recent theoretical and experimental studies of hydrogen-rich materials at megabar pressures (i. e., >100 GPa) have led to the discovery of very high-temperature superconductivity in these materials.

Superconductivity Materials Science

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation

no code implementations3 Feb 2021 Mingke Xu, Fan Zhang, Xiaodong Cui, Wei zhang

In this paper, we apply multiscale area attention in a deep convolutional neural network to attend emotional characteristics with varied granularities and therefore the classifier can benefit from an ensemble of attentions with different scales.

Data Augmentation Speech Emotion Recognition

Sensing population distribution from satellite imagery via deep learning: model selection, neighboring effect, and systematic biases

no code implementations3 Mar 2021 Xiao Huang, Di Zhu, Fan Zhang, Tao Liu, Xiao Li, Lei Zou

The rapid development of remote sensing techniques provides rich, large-coverage, and high-temporal information of the ground, which can be coupled with the emerging deep learning approaches that enable latent features and hidden geographical patterns to be extracted.

Model Selection

Three-dimensional charge density wave and robust zero-bias conductance peak inside the superconducting vortex core of a kagome superconductor CsV$_3$Sb$_5$

no code implementations8 Mar 2021 Zuowei Liang, Xingyuan Hou, Wanru Ma, Fan Zhang, Ping Wu, Zongyuan Zhang, Fanghang Yu, J. -J. Ying, Kun Jiang, Lei Shan, Zhenyu Wang, X. -H. Chen

The transition-metal-based kagome metals provide a versatile platform for correlated topological phases hosting various electronic instabilities.

Superconductivity Strongly Correlated Electrons

Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning

no code implementations8 Mar 2021 Yan Jiao, Xiaocheng Tang, Zhiwei Qin, Shuaiji Li, Fan Zhang, Hongtu Zhu, Jieping Ye

We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms.

reinforcement-learning Reinforcement Learning (RL)

Enhancing VMAF through New Feature Integration and Model Combination

no code implementations10 Mar 2021 Fan Zhang, Angeliki Katsenou, Christos Bampis, Lukas Krasula, Zhi Li, David Bull

VMAF is a machine learning based video quality assessment method, originally designed for streaming applications, which combines multiple quality metrics and video features through SVM regression.

regression Video Quality Assessment

VMAF-based Bitrate Ladder Estimation for Adaptive Streaming

no code implementations12 Mar 2021 Angeliki V. Katsenou, Fan Zhang, Kyle Swanson, Mariana Afonso, Joel Sole, David R. Bull

In HTTP Adaptive Streaming, video content is conventionally encoded by adapting its spatial resolution and quantization level to best match the prevailing network state and display characteristics.

Quantization

A Subjective Study on Videos at Various Bit Depths

no code implementations18 Mar 2021 Alex Mackin, Di Ma, Fan Zhang, David Bull

Bit depth adaptation, where the bit depth of a video sequence is reduced before transmission and up-sampled during display, can potentially reduce data rates with limited impact on perceptual quality.

Robustifying Conditional Portfolio Decisions via Optimal Transport

1 code implementation30 Mar 2021 Viet Anh Nguyen, Fan Zhang, Shanshan Wang, Jose Blanchet, Erick Delage, Yinyu Ye

Despite the non-linearity of the objective function in the probability measure, we show that the distributionally robust portfolio allocation with side information problem can be reformulated as a finite-dimensional optimization problem.

Individually Fair Gradient Boosting

no code implementations ICLR 2021 Alexander Vargo, Fan Zhang, Mikhail Yurochkin, Yuekai Sun

Gradient boosting is a popular method for machine learning from tabular data, which arise often in applications where algorithmic fairness is a concern.

Fairness

Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection

no code implementations NAACL 2021 Sihao Chen, Fan Zhang, Kazoo Sone, Dan Roth

Despite significant progress in neural abstractive summarization, recent studies have shown that the current models are prone to generating summaries that are unfaithful to the original context.

Abstractive Text Summarization Hallucination

Quantitative mapping of the brain's structural connectivity using diffusion MRI tractography: a review

no code implementations23 Apr 2021 Fan Zhang, Alessandro Daducci, Yong He, Simona Schiavi, Caio Seguin, Robert Smith, Chun-Hung Yeh, Tengda Zhao, Lauren J. O'Donnell

Diffusion magnetic resonance imaging (dMRI) tractography is an advanced imaging technique that enables in vivo mapping of the brain's white matter connections at macro scale.

Deception Detection in Videos using the Facial Action Coding System

no code implementations28 May 2021 Hammad Ud Din Ahmed, Usama Ijaz Bajwa, Fan Zhang, Muhammad Waqas Anwar

We specifically use long short-term memory (LSTM) which we trained using the real-life trial dataset and it provided one of the best facial only approaches to deception detection.

Deception Detection In Videos Decision Making

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

no code implementations8 Jun 2021 Xiaocheng Tang, Zhiwei Qin, Fan Zhang, Zhaodong Wang, Zhe Xu, Yintai Ma, Hongtu Zhu, Jieping Ye

In this work, we propose a deep reinforcement learning based solution for order dispatching and we conduct large scale online A/B tests on DiDi's ride-dispatching platform to show that the proposed method achieves significant improvement on both total driver income and user experience related metrics.

reinforcement-learning Reinforcement Learning (RL) +1

On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness, and Semantic Evaluation

no code implementations9 Jun 2021 Wei zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

Perceptually-inspired super-resolution of compressed videos

no code implementations15 Jun 2021 Di Ma, Mariana Afonso, Fan Zhang, David R. Bull

Spatial resolution adaptation is a technique which has often been employed in video compression to enhance coding efficiency.

Generative Adversarial Network Super-Resolution +1

An adaptive Lagrange multiplier determination method for rate-distortion optimisation in hybrid video codecs

no code implementations15 Jun 2021 Fan Zhang, David R. Bull

This paper describes an adaptive Lagrange multiplier determination method for rate-quality optimisation in video compression.

Video Compression

Quality assessment methods for perceptual video compression

no code implementations15 Jun 2021 Fan Zhang, David R. Bull

This paper describes a quality assessment model for perceptual video compression applications (PVM), which stimulates visual masking and distortion-artefact perception using an adaptive combination of noticeable distortions and blurring artefacts.

Video Compression

Learning Temporal Consistency for Low Light Video Enhancement From Single Images

1 code implementation CVPR 2021 Fan Zhang, Yu Li, ShaoDi You, Ying Fu

Based on this idea, we propose our method which can infer motion prior for single image low light video enhancement and enforce temporal consistency.

Optical Flow Estimation Video Enhancement

Deep Fiber Clustering: Anatomically Informed Unsupervised Deep Learning for Fast and Effective White Matter Parcellation

no code implementations11 Jul 2021 Yuqian Chen, Chaoyi Zhang, Yang song, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

White matter fiber clustering (WMFC) enables parcellation of white matter tractography for applications such as disease classification and anatomical tract segmentation.

Clustering Segmentation +1

An explainable two-dimensional single model deep learning approach for Alzheimer's disease diagnosis and brain atrophy localization

no code implementations28 Jul 2021 Fan Zhang, Bo Pan, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

In this research, we propose a novel end-to-end deep learning approach for automated diagnosis of AD and localization of important brain regions related to the disease from sMRI data.

Data Augmentation

On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation

no code implementations ACL 2021 Wei zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

SR-HetGNN:Session-based Recommendation with Heterogeneous Graph Neural Network

no code implementations12 Aug 2021 Jinpeng Chen, Haiyang Li, Xudong Zhang, Fan Zhang, Senzhang Wang, Kaimin Wei, Jiaqi Ji

The current studies generally learn user preferences according to the transitions of items in the user's session sequence.

Session-Based Recommendations

Ultralow complexity long short-term memory network for fiber nonlinearity mitigation in coherent optical communication systems

no code implementations12 Aug 2021 Hao Ming, Xinyu Chen, Xiansong Fang, Lei Zhang, Chenjia Li, Fan Zhang

In this paper, we propose a center-oriented long short-term memory network (Co-LSTM) incorporating a simplified mode with a recycling mechanism in the equalization operation, which can mitigate fiber nonlinearity in coherent optical communication systems with ultralow complexity.

DSNet: A Dual-Stream Framework for Weakly-Supervised Gigapixel Pathology Image Analysis

no code implementations13 Sep 2021 Tiange Xiang, Yang song, Chaoyi Zhang, Dongnan Liu, Mei Chen, Fan Zhang, Heng Huang, Lauren O'Donnell, Weidong Cai

With image-level labels only, patch-wise classification would be sub-optimal due to inconsistency between the patch appearance and image-level label.

Classification whole slide images

Efficient Context-Aware Network for Abdominal Multi-organ Segmentation

1 code implementation22 Sep 2021 Fan Zhang, Yu Wang, Hua Yang

For the context block, we propose strip pooling module to capture anisotropic and long-range contextual information, which exists in abdominal scene.

Decoder Organ Segmentation

Coded Computation across Shared Heterogeneous Workers with Communication Delay

no code implementations23 Sep 2021 Yuxuan Sun, Fan Zhang, Junlin Zhao, Sheng Zhou, Zhisheng Niu, Deniz Gündüz

In this work, we consider a multi-master heterogeneous-worker distributed computing scenario, where multiple matrix multiplication tasks are encoded and allocated to workers for parallel computation.

Distributed Computing

Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision

no code implementations6 Oct 2021 Ruijie Ren, Mohit Gurnani Rajesh, Jordi Sanchez-Riera, Fan Zhang, Yurun Tian, Antonio Agudo, Yiannis Demiris, Krystian Mikolajczyk, Francesc Moreno-Noguer

We show that training our network solely with synthetic data and the proposed DA yields results competitive with models trained on real data.

Domain Adaptation

Branch and Bound in Mixed Integer Linear Programming Problems: A Survey of Techniques and Trends

no code implementations5 Nov 2021 Lingying Huang, Xiaomeng Chen, Wei Huo, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi

In order to improve the speed of B&B algorithms, learning techniques have been introduced in this algorithm recently.

Variable Selection

Can Graph Neural Networks Learn to Solve MaxSAT Problem?

no code implementations15 Nov 2021 Minghao Liu, Fuqi Jia, Pei Huang, Fan Zhang, Yuchen Sun, Shaowei Cai, Feifei Ma, Jian Zhang

With the rapid development of deep learning techniques, various recent work has tried to apply graph neural networks (GNNs) to solve NP-hard problems such as Boolean Satisfiability (SAT), which shows the potential in bridging the gap between machine learning and symbolic reasoning.

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

3 code implementations CVPR 2022 Duolikun Danier, Fan Zhang, David Bull

Video frame interpolation (VFI) is currently a very active research topic, with applications spanning computer vision, post production and video encoding.

Texture Synthesis Video Frame Interpolation

SupWMA: Consistent and Efficient Tractography Parcellation of Superficial White Matter with Deep Learning

1 code implementation29 Jan 2022 Tengfei Xue, Fan Zhang, Chaoyi Zhang, Yuqian Chen, Yang song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'Donnell

Most parcellation methods focus on the deep white matter (DWM), while fewer methods address the superficial white matter (SWM) due to its complexity.

Contrastive Learning

Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN

no code implementations15 Feb 2022 Duolikun Danier, Fan Zhang, David Bull

This paper presents a new deformable convolution-based video frame interpolation (VFI) method, using a coarse to fine 3D CNN to enhance the multi-flow prediction.

Video Frame Interpolation

A Subjective Quality Study for Video Frame Interpolation

no code implementations15 Feb 2022 Duolikun Danier, Fan Zhang, David Bull

Video frame interpolation (VFI) is one of the fundamental research areas in video processing and there has been extensive research on novel and enhanced interpolation algorithms.

SSIM Video Frame Interpolation

RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training

no code implementations17 Feb 2022 Chen Feng, Duolikun Danier, Fan Zhang, David Bull

In recent years, deep learning techniques have shown significant potential for improving video quality assessment (VQA), achieving higher correlation with subjective opinions compared to conventional approaches.

Video Quality Assessment Visual Question Answering (VQA)

Double Thompson Sampling in Finite stochastic Games

no code implementations21 Feb 2022 Shuqing Shi, Xiaobin Wang, Zhiyou Yang, Fan Zhang, Hong Qu

This algorithm achieves a total regret bound of $\tilde{\mathcal{O}}(D\sqrt{SAT})$in time horizon $T$ with $S$ states, $A$ actions and diameter $D$.

Thompson Sampling

A CNN-based Post-Processor for Perceptually-Optimized Immersive Media Compression

no code implementations25 Feb 2022 Angeliki Katsenou, Fan Zhang, David Bull

In recent years, resolution adaptation based on deep neural networks has enabled significant performance gains for conventional (2D) video codecs.

Mixed Reality Depth Contour Occlusion Using Binocular Similarity Matching and Three-dimensional Contour Optimisation

no code implementations4 Mar 2022 Naye Ji, Fan Zhang, Haoxiang Zhang, Youbing Zhao, Dingguo Yu

To evaluate the effectiveness of the algorithm, we demonstrate a time con-sumption statistical analysis for each stage of the DCO algorithm execution.

Mixed Reality Optical Flow Estimation +1

STICC: A multivariate spatial clustering method for repeated geographic pattern discovery with consideration of spatial contiguity

1 code implementation17 Mar 2022 Yuhao Kang, Kunlin Wu, Song Gao, Ignavier Ng, Jinmeng Rao, Shan Ye, Fan Zhang, Teng Fei

In this paper, we propose a Spatial Toeplitz Inverse Covariance-Based Clustering (STICC) method that considers both attributes and spatial relationships of geographic objects for multivariate spatial clustering.

Attribute Clustering

Global Attitude Synchronization of Networked Rigid Bodies Under Directed Topologies

no code implementations30 Mar 2022 Fan Zhang, Deyuan Meng, Jingyao Zhang

Simulations for networked spacecraft are presented to show the global synchronization performances under different directed topologies.

Enhancing Non-mass Breast Ultrasound Cancer Classification With Knowledge Transfer

no code implementations18 Apr 2022 Yangrun Hu, Yuanfan Guo, Fan Zhang, Mingda Wang, Tiancheng Lin, Rong Wu, Yi Xu

Based on the insight that mass data is sufficient and shares the same knowledge structure with non-mass data of identifying the malignancy of a lesion based on the ultrasound image, we propose a novel transfer learning framework to enhance the generalizability of the DNN model for non-mass BUS with the help of mass BUS.

Classification Transfer Learning

SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

no code implementations26 Apr 2022 Junwei Liao, Duyu Tang, Fan Zhang, Shuming Shi

We present SkillNet-NLG, a sparsely activated approach that handles many natural language generation tasks with one model.

Multi-Task Learning Text Generation

Deep fiber clustering: Anatomically informed fiber clustering with self-supervised deep learning for fast and effective tractography parcellation

1 code implementation2 May 2022 Yuqian Chen, Chaoyi Zhang, Tengfei Xue, Yang song, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

In this work, we propose a novel deep learning framework for white matter fiber clustering, Deep Fiber Clustering (DFC), which solves the unsupervised clustering problem as a self-supervised learning task with a domain-specific pretext task to predict pairwise fiber distances.

Anatomy Clustering +3

Multi-Graph based Multi-Scenario Recommendation in Large-scale Online Video Services

no code implementations5 May 2022 Fan Zhang, Qiuying Peng, Yulin Wu, Zheng Pan, Rong Zeng, Da Lin, Yue Qi

Recently, industrial recommendation services have been boosted by the continual upgrade of deep learning methods.

Data Integration Graph Learning

One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code

no code implementations12 May 2022 Yong Dai, Duyu Tang, Liangxin Liu, Minghuan Tan, Cong Zhou, Jingquan Wang, Zhangyin Feng, Fan Zhang, Xueyu Hu, Shuming Shi

Moreover, our model supports self-supervised pretraining with the same sparsely activated way, resulting in better initialized parameters for different modalities.

Image Retrieval Retrieval

A Saliency-Guided Street View Image Inpainting Framework for Efficient Last-Meters Wayfinding

1 code implementation14 May 2022 Chuanbo Hu, Shan Jia, Fan Zhang, Xin Li

However, due to the large diversity of geographic context and acquisition conditions, the captured SVI always contains various distracting objects (e. g., pedestrians and vehicles), which will distract human visual attention from efficiently finding the destination in the last few meters.

Image Inpainting object-detection +2

Enhancing VVC with Deep Learning based Multi-Frame Post-Processing

no code implementations19 May 2022 Duolikun Danier, Chen Feng, Fan Zhang, David Bull

This paper describes a CNN-based multi-frame post-processing approach based on a perceptually-inspired Generative Adversarial Network architecture, CVEGAN.

Generative Adversarial Network Image Compression

Phased Progressive Learning with Coupling-Regulation-Imbalance Loss for Imbalanced Data Classification

no code implementations24 May 2022 Liang Xu, Yi Cheng, Fan Zhang, Bingxuan Wu, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

This loss is effective in addressing quantity imbalances and outliers, while regulating the focus of attention on samples with varying classification difficulties.

Classification imbalanced classification +1

CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks

no code implementations4 Jun 2022 Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jinrong Jiang, Qianjin Guo, Linghan Zheng

Detection of object anomalies is crucial in industrial processes, but unsupervised anomaly detection and localization is particularly important due to the difficulty of obtaining a large number of defective samples and the unpredictable types of anomalies in real life.

Unsupervised Anomaly Detection

TractoFormer: A Novel Fiber-level Whole Brain Tractography Analysis Framework Using Spectral Embedding and Vision Transformers

no code implementations5 Jul 2022 Fan Zhang, Tengfei Xue, Weidong Cai, Yogesh Rathi, Carl-Fredrik Westin, Lauren J O'Donnell

Whole brain tractography (WBT) data contains over hundreds of thousands of individual fiber streamlines (estimated brain connections), and this data is usually parcellated to create compact representations for data analysis applications such as disease classification.

Data Augmentation Ensemble Learning

White Matter Tracts are Point Clouds: Neuropsychological Score Prediction and Critical Region Localization via Geometric Deep Learning

no code implementations6 Jul 2022 Yuqian Chen, Fan Zhang, Chaoyi Zhang, Tengfei Xue, Leo R. Zekelman, Jianzhong He, Yang song, Nikos Makris, Yogesh Rathi, Alexandra J. Golby, Weidong Cai, Lauren J. O'Donnell

In this paper, we propose a deep-learning-based framework for neuropsychological score prediction using microstructure measurements estimated from diffusion magnetic resonance imaging (dMRI) tractography, focusing on predicting performance on a receptive vocabulary assessment task based on a critical fiber tract for language, the arcuate fasciculus (AF).

FD-GATDR: A Federated-Decentralized-Learning Graph Attention Network for Doctor Recommendation Using EHR

no code implementations11 Jul 2022 Luning Bi, Yunlong Wang, Fan Zhang, Zhuqing Liu, Yong Cai, Emily Zhao

In the past decade, with the development of big data technology, an increasing amount of patient information has been stored as electronic health records (EHRs).

Graph Attention Recommendation Systems

Enhancing HDR Video Compression through CNN-based Effective Bit Depth Adaptation

1 code implementation18 Jul 2022 Chen Feng, Zihao Qi, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull

In this work, we modify the MFRNet network architecture to enable multiple frame processing, and the new network, multi-frame MFRNet, has been integrated into the EBDA framework using two Versatile Video Coding (VVC) host codecs: VTM 16. 2 and the Fraunhofer Versatile Video Encoder (VVenC 1. 4. 0).

Decoder Video Compression

Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction

no code implementations30 Sep 2022 Jianyi Zhang, Ang Li, Minxue Tang, Jingwei Sun, Xiang Chen, Fan Zhang, Changyou Chen, Yiran Chen, Hai Li

Based on this measure, we also design a computation-efficient client sampling strategy, such that the actively selected clients will generate a more class-balanced grouped dataset with theoretical guarantees.

Federated Learning Privacy Preserving

BVI-VFI: A Video Quality Database for Video Frame Interpolation

2 code implementations3 Oct 2022 Duolikun Danier, Fan Zhang, David Bull

In order to narrow this research gap, we have developed a new video quality database named BVI-VFI, which contains 540 distorted sequences generated by applying five commonly used VFI algorithms to 36 diverse source videos with various spatial resolutions and frame rates.

Video Frame Interpolation

GTAV-NightRain: Photometric Realistic Large-scale Dataset for Night-time Rain Streak Removal

1 code implementation10 Oct 2022 Fan Zhang, ShaoDi You, Yu Li, Ying Fu

In this paper, we propose GTAV-NightRain dataset, which is a large-scale synthetic night-time rain streak removal dataset.

Unsupervised Graph Outlier Detection: Problem Revisit, New Insight, and Superior Method

1 code implementation24 Oct 2022 Yihong Huang, Liping Wang, Fan Zhang, Xuemin Lin

In addition, we observe that existing algorithms have a performance drop with the mitigated data leakage issue.

Attribute Graph Outlier Detection

Memory recall by controlling chaos

no code implementations10 Nov 2022 Fan Zhang

By incorporating feedback loops, that engender amplification and damping so that output is not proportional to input, the biological neural networks become highly nonlinear and thus very likely chaotic in nature.

Line Drawing Guided Progressive Inpainting of Mural Damages

1 code implementation12 Nov 2022 Luxi Li, Qin Zou, Fan Zhang, Hongkai Yu, Long Chen, Chengfang Song, Xianfeng Huang, Xiaoguang Wang

Mural image inpainting refers to repairing the damage or missing areas in a mural image to restore the visual appearance.

Image Inpainting

Tractography-Based Parcellation of Cerebellar Dentate Nuclei via a Deep Nonnegative Matrix Factorization Clustering Method

no code implementations18 Nov 2022 Xiao Xu, Yuqian Chen, Leo Zekelman, Yogesh Rathi, Nikos Makris, Fan Zhang, Lauren J. O'Donnell

In this paper, we investigate a deep nonnegative matrix factorization clustering method (DNMFC) for parcellation of the human DN based on its structural connectivity using diffusion MRI tractography.

Clustering

Purifier: Defending Data Inference Attacks via Transforming Confidence Scores

no code implementations1 Dec 2022 Ziqi Yang, Lijin Wang, Da Yang, Jie Wan, Ziming Zhao, Ee-Chien Chang, Fan Zhang, Kui Ren

Besides, our further experiments show that PURIFIER is also effective in defending adversarial model inversion attacks and attribute inference attacks.

Attribute Inference Attack +1

Text-Guided Mask-free Local Image Retouching

no code implementations15 Dec 2022 Zerun Liu, Fan Zhang, Jingxuan He, Jin Wang, Zhangye Wang, Lechao Cheng

In the realm of multi-modality, text-guided image retouching techniques emerged with the advent of deep learning.

Image Retouching

Biomedical image analysis competitions: The state of current participation practice

no code implementations16 Dec 2022 Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Vivek Singh Bawa, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Jinwook Choi, Olivier Commowick, Marie Daum, Adrien Depeursinge, Reuben Dorent, Jan Egger, Hannah Eichhorn, Sandy Engelhardt, Melanie Ganz, Gabriel Girard, Lasse Hansen, Mattias Heinrich, Nicholas Heller, Alessa Hering, Arnaud Huaulmé, Hyunjeong Kim, Bennett Landman, Hongwei Bran Li, Jianning Li, Jun Ma, Anne Martel, Carlos Martín-Isla, Bjoern Menze, Chinedu Innocent Nwoye, Valentin Oreiller, Nicolas Padoy, Sarthak Pati, Kelly Payette, Carole Sudre, Kimberlin Van Wijnen, Armine Vardazaryan, Tom Vercauteren, Martin Wagner, Chuanbo Wang, Moi Hoon Yap, Zeyun Yu, Chun Yuan, Maximilian Zenk, Aneeq Zia, David Zimmerer, Rina Bao, Chanyeol Choi, Andrew Cohen, Oleh Dzyubachyk, Adrian Galdran, Tianyuan Gan, Tianqi Guo, Pradyumna Gupta, Mahmood Haithami, Edward Ho, Ikbeom Jang, Zhili Li, Zhengbo Luo, Filip Lux, Sokratis Makrogiannis, Dominik Müller, Young-tack Oh, Subeen Pang, Constantin Pape, Gorkem Polat, Charlotte Rosalie Reed, Kanghyun Ryu, Tim Scherr, Vajira Thambawita, Haoyu Wang, Xinliang Wang, Kele Xu, Hung Yeh, Doyeob Yeo, Yixuan Yuan, Yan Zeng, Xin Zhao, Julian Abbing, Jannes Adam, Nagesh Adluru, Niklas Agethen, Salman Ahmed, Yasmina Al Khalil, Mireia Alenyà, Esa Alhoniemi, Chengyang An, Talha Anwar, Tewodros Weldebirhan Arega, Netanell Avisdris, Dogu Baran Aydogan, Yingbin Bai, Maria Baldeon Calisto, Berke Doga Basaran, Marcel Beetz, Cheng Bian, Hao Bian, Kevin Blansit, Louise Bloch, Robert Bohnsack, Sara Bosticardo, Jack Breen, Mikael Brudfors, Raphael Brüngel, Mariano Cabezas, Alberto Cacciola, Zhiwei Chen, Yucong Chen, Daniel Tianming Chen, Minjeong Cho, Min-Kook Choi, Chuantao Xie Chuantao Xie, Dana Cobzas, Julien Cohen-Adad, Jorge Corral Acero, Sujit Kumar Das, Marcela de Oliveira, Hanqiu Deng, Guiming Dong, Lars Doorenbos, Cory Efird, Sergio Escalera, Di Fan, Mehdi Fatan Serj, Alexandre Fenneteau, Lucas Fidon, Patryk Filipiak, René Finzel, Nuno R. Freitas, Christoph M. Friedrich, Mitchell Fulton, Finn Gaida, Francesco Galati, Christoforos Galazis, Chang Hee Gan, Zheyao Gao, Shengbo Gao, Matej Gazda, Beerend Gerats, Neil Getty, Adam Gibicar, Ryan Gifford, Sajan Gohil, Maria Grammatikopoulou, Daniel Grzech, Orhun Güley, Timo Günnemann, Chunxu Guo, Sylvain Guy, Heonjin Ha, Luyi Han, Il Song Han, Ali Hatamizadeh, Tian He, Jimin Heo, Sebastian Hitziger, SeulGi Hong, Seungbum Hong, Rian Huang, Ziyan Huang, Markus Huellebrand, Stephan Huschauer, Mustaffa Hussain, Tomoo Inubushi, Ece Isik Polat, Mojtaba Jafaritadi, SeongHun Jeong, Bailiang Jian, Yuanhong Jiang, Zhifan Jiang, Yueming Jin, Smriti Joshi, Abdolrahim Kadkhodamohammadi, Reda Abdellah Kamraoui, Inha Kang, Junghwa Kang, Davood Karimi, April Khademi, Muhammad Irfan Khan, Suleiman A. Khan, Rishab Khantwal, Kwang-Ju Kim, Timothy Kline, Satoshi Kondo, Elina Kontio, Adrian Krenzer, Artem Kroviakov, Hugo Kuijf, Satyadwyoom Kumar, Francesco La Rosa, Abhi Lad, Doohee Lee, Minho Lee, Chiara Lena, Hao Li, Ling Li, Xingyu Li, Fuyuan Liao, Kuanlun Liao, Arlindo Limede Oliveira, Chaonan Lin, Shan Lin, Akis Linardos, Marius George Linguraru, Han Liu, Tao Liu, Di Liu, Yanling Liu, João Lourenço-Silva, Jingpei Lu, Jiangshan Lu, Imanol Luengo, Christina B. Lund, Huan Minh Luu, Yi Lv, Uzay Macar, Leon Maechler, Sina Mansour L., Kenji Marshall, Moona Mazher, Richard McKinley, Alfonso Medela, Felix Meissen, Mingyuan Meng, Dylan Miller, Seyed Hossein Mirjahanmardi, Arnab Mishra, Samir Mitha, Hassan Mohy-ud-Din, Tony Chi Wing Mok, Gowtham Krishnan Murugesan, Enamundram Naga Karthik, Sahil Nalawade, Jakub Nalepa, Mohamed Naser, Ramin Nateghi, Hammad Naveed, Quang-Minh Nguyen, Cuong Nguyen Quoc, Brennan Nichyporuk, Bruno Oliveira, David Owen, Jimut Bahan Pal, Junwen Pan, Wentao Pan, Winnie Pang, Bogyu Park, Vivek Pawar, Kamlesh Pawar, Michael Peven, Lena Philipp, Tomasz Pieciak, Szymon Plotka, Marcel Plutat, Fattaneh Pourakpour, Domen Preložnik, Kumaradevan Punithakumar, Abdul Qayyum, Sandro Queirós, Arman Rahmim, Salar Razavi, Jintao Ren, Mina Rezaei, Jonathan Adam Rico, ZunHyan Rieu, Markus Rink, Johannes Roth, Yusely Ruiz-Gonzalez, Numan Saeed, Anindo Saha, Mostafa Salem, Ricardo Sanchez-Matilla, Kurt Schilling, Wei Shao, Zhiqiang Shen, Ruize Shi, Pengcheng Shi, Daniel Sobotka, Théodore Soulier, Bella Specktor Fadida, Danail Stoyanov, Timothy Sum Hon Mun, Xiaowu Sun, Rong Tao, Franz Thaler, Antoine Théberge, Felix Thielke, Helena Torres, Kareem A. Wahid, Jiacheng Wang, Yifei Wang, Wei Wang, Xiong Wang, Jianhui Wen, Ning Wen, Marek Wodzinski, Ye Wu, Fangfang Xia, Tianqi Xiang, Chen Xiaofei, Lizhan Xu, Tingting Xue, Yuxuan Yang, Lin Yang, Kai Yao, Huifeng Yao, Amirsaeed Yazdani, Michael Yip, Hwanseung Yoo, Fereshteh Yousefirizi, Shunkai Yu, Lei Yu, Jonathan Zamora, Ramy Ashraf Zeineldin, Dewen Zeng, Jianpeng Zhang, Bokai Zhang, Jiapeng Zhang, Fan Zhang, Huahong Zhang, Zhongchen Zhao, Zixuan Zhao, Jiachen Zhao, Can Zhao, Qingshuo Zheng, Yuheng Zhi, Ziqi Zhou, Baosheng Zou, Klaus Maier-Hein, Paul F. Jäger, Annette Kopp-Schneider, Lena Maier-Hein

Of these, 84% were based on standard architectures.

Benchmarking

Learning Rain Location Prior for Nighttime Deraining

1 code implementation ICCV 2023 Fan Zhang, ShaoDi You, Yu Li, Ying Fu

This learned prior contains location information of rain streaks and, when injected into deraining models, can significantly improve their performance.

Rain Removal

Urban Visual Intelligence: Studying Cities with AI and Street-level Imagery

no code implementations2 Jan 2023 Fan Zhang, Arianna Salazar Miranda, Fábio Duarte, Lawrence Vale, Gary Hack, Min Chen, Yu Liu, Michael Batty, Carlo Ratti

The visual dimension of cities has been a fundamental subject in urban studies, since the pioneering work of scholars such as Sitte, Lynch, Arnheim, and Jacobs.

TractGraphCNN: anatomically informed graph CNN for classification using diffusion MRI tractography

no code implementations5 Jan 2023 Yuqian Chen, Fan Zhang, Leo R. Zekelman, Tengfei Xue, Chaoyi Zhang, Yang song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'Donnell

This work shows the potential of incorporating anatomical information, especially known anatomical similarities between input features, to guide convolutions in neural networks.

DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model

1 code implementation24 Jan 2023 Fan Zhang, Naye Ji, Fuxing Gao, Yongping Li

Speech-driven gesture synthesis is a field of growing interest in virtual human creation.

Denoising

SimCGNN: Simple Contrastive Graph Neural Network for Session-based Recommendation

no code implementations8 Feb 2023 Yuan Cao, Xudong Zhang, Fan Zhang, Feifei Kou, Josiah Poon, Xiongnan Jin, Yongheng Wang, Jinpeng Chen

Session-based recommendation (SBR) problem, which focuses on next-item prediction for anonymous users, has received increasingly more attention from researchers.

Contrastive Learning Session-Based Recommendations

ST-MFNet Mini: Knowledge Distillation-Driven Frame Interpolation

1 code implementation16 Feb 2023 Crispian Morris, Duolikun Danier, Fan Zhang, Nantheera Anantrasirichai, David R. Bull

Currently, one of the major challenges in deep learning-based video frame interpolation (VFI) is the large model sizes and high computational complexity associated with many high performance VFI approaches.

Knowledge Distillation Network Pruning +1

Few-shots Portrait Generation with Style Enhancement and Identity Preservation

1 code implementation1 Mar 2023 Runchuan Zhu, Naye Ji, Youbing Zhao, Fan Zhang

Nowadays, the wide application of virtual digital human promotes the comprehensive prosperity and development of digital culture supported by digital economy.

Cultural Vocal Bursts Intensity Prediction

GeoLab: Geometry-based Tractography Parcellation of Superficial White Matter

1 code implementation2 Mar 2023 Nabil Vindas, Nicole Labra Avila, Fan Zhang, Tengfei Xue, Lauren J. O'Donnell, Jean-François Mangin

Superficial white matter (SWM) has been less studied than long-range connections despite being of interest to clinical research, andfew tractography parcellation methods have been adapted to SWM.

Efficient Self-supervised Continual Learning with Progressive Task-correlated Layer Freezing

no code implementations13 Mar 2023 Li Yang, Sen Lin, Fan Zhang, Junshan Zhang, Deliang Fan

Inspired by the success of Self-supervised learning (SSL) in learning visual representations from unlabeled data, a few recent works have studied SSL in the context of continual learning (CL), where multiple tasks are learned sequentially, giving rise to a new paradigm, namely self-supervised continual learning (SSCL).

Continual Learning Self-Supervised Learning

LDMVFI: Video Frame Interpolation with Latent Diffusion Models

2 code implementations16 Mar 2023 Duolikun Danier, Fan Zhang, David Bull

Existing works on video frame interpolation (VFI) mostly employ deep neural networks that are trained by minimizing the L1, L2, or deep feature space distance (e. g. VGG loss) between their outputs and ground-truth frames.

Video Frame Interpolation

Fiber Tract Shape Measures Inform Prediction of Non-Imaging Phenotypes

no code implementations16 Mar 2023 Wan Liu, Yuqian Chen, Chuyang Ye, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

In this paper, we investigate the potential of fiber tract shape features for predicting non-imaging phenotypes, both individually and in combination with traditional features.

Mpox-AISM: AI-Mediated Super Monitoring for Mpox and Like-Mpox

no code implementations17 Mar 2023 Yubiao Yue, Minghua Jiang, Xinyue Zhang, Jialong Xu, Huacong Ye, Fan Zhang, Zhenzhang Li, Yang Li

With the help of the Internet and communication terminal, Mpox-AISM can perform a real-time, low-cost, and convenient diagnosis for earlier-stage mpox in various real-world settings, thereby effectively curbing the spread of mpox virus.

Data Augmentation Decision Making +2

Co-GRU Enhanced End-to-End Design for Long-haul Coherent Transmission Systems

no code implementations23 Apr 2023 Jiayu Zheng, Tianhong Zhang, Yu Wenjing, Weiqin Zhou, Chuanchuan Yang, Fan Zhang

In recent years, the end-to-end (E2E) scheme based on deep learning (DL) has been proposed as a potential scheme to jointly optimize the encoder and the decoder parameters of the optical communication system.

Decoder

A fast and flexible algorithm for microstructure reconstruction combining simulated annealing and deep learning

1 code implementation25 Apr 2023 Zhenchuan Ma, Xiaohai He, Pengcheng Yan, Fan Zhang, Qizhi Teng

The proposed algorithm is flexible and can complete training and reconstruction in a short time with only one two-dimensional image.

UPDExplainer: an Interpretable Transformer-based Framework for Urban Physical Disorder Detection Using Street View Imagery

no code implementations4 May 2023 Chuanbo Hu, Shan Jia, Fan Zhang, Changjiang Xiao, Mindi Ruan, Jacob Thrasher, Xin Li

Experimental results on the re-annotated Place Pulse 2. 0 dataset demonstrate promising detection performance of the proposed method, with an accuracy of 79. 9%.

Semantic Segmentation

Understand Waiting Time in Transaction Fee Mechanism: An Interdisciplinary Perspective

1 code implementation4 May 2023 Luyao Zhang, Fan Zhang

Our study identified NFT drops as a unique source of market congestion -- holiday effects -- beyond trend and season effects.

Causal Inference Computer Security +2

Label-Free Multi-Domain Machine Translation with Stage-wise Training

no code implementations6 May 2023 Fan Zhang, Mei Tu, Sangha Kim, Song Liu, Jinyao Yan

Our model is composed of three parts: a backbone model, a domain discriminator taking responsibility to discriminate data from different domains, and a set of experts that transfer the decoded features from generic to specific.

Machine Translation Translation

Doppler-Resilient Design of CAZAC Sequences for mmWave/THz Sensing Applications

no code implementations12 May 2023 Fan Zhang, Tianqi Mao, Zhaocheng Wang

For an arbitrary-length ZC sequence, a feasible range of the root index is derived to satisfy the requirement of PSLR within the scope of RoI.

FGAM:Fast Adversarial Malware Generation Method Based on Gradient Sign

no code implementations22 May 2023 Kun Li, Fan Zhang, Wei Guo

Adversarial attacks are to deceive the deep learning model by generating adversarial samples.

Malware Detection

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

1 code implementation NeurIPS 2023 Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

Learning-based video compression is currently a popular research topic, offering the potential to compete with conventional standard video codecs.

Model Compression Quantization +1

SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills

no code implementations28 Jun 2023 Zhangyin Feng, Yong Dai, Fan Zhang, Duyu Tang, Xiaocheng Feng, Shuangzhi Wu, Bing Qin, Yunbo Cao, Shuming Shi

Traditional multitask learning methods basically can only exploit common knowledge in task- or language-wise, which lose either cross-language or cross-task knowledge.

Natural Language Understanding

TractGeoNet: A geometric deep learning framework for pointwise analysis of tract microstructure to predict language assessment performance

no code implementations8 Jul 2023 Yuqian Chen, Leo R. Zekelman, Chaoyi Zhang, Tengfei Xue, Yang song, Nikos Makris, Yogesh Rathi, Alexandra J. Golby, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

We evaluate the effectiveness of the proposed method by predicting individual performance on two neuropsychological assessments of language using a dataset of 20 association white matter fiber tracts from 806 subjects from the Human Connectome Project.

regression

Generative Pretraining in Multimodality

2 code implementations11 Jul 2023 Quan Sun, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Yueze Wang, Hongcheng Gao, Jingjing Liu, Tiejun Huang, Xinlong Wang

We present Emu, a Transformer-based multimodal foundation model, which can seamlessly generate images and texts in multimodal context.

Image Captioning Temporal/Casual QA +4

ATWM: Defense against adversarial malware based on adversarial training

no code implementations11 Jul 2023 Kun Li, Fan Zhang, Wei Guo

In order to defend against malware attacks, researchers have proposed many Windows malware detection models based on deep learning.

Adversarial Defense Malware Detection

Data-Driven Optimal Control of Tethered Space Robot Deployment with Learning Based Koopman Operator

no code implementations15 Jul 2023 Ao Jin, Fan Zhang, Panfeng Huang

To avoid complex constraints of the traditional nonlinear method for tethered space robot (TSR) deployment, this paper proposes a data-driven optimal control framework with an improved deep learning based Koopman operator that could be applied to complex environments.

TractCloud: Registration-free tractography parcellation with a novel local-global streamline point cloud representation

no code implementations18 Jul 2023 Tengfei Xue, Yuqian Chen, Chaoyi Zhang, Alexandra J. Golby, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

TractCloud achieves efficient and consistent whole-brain white matter parcellation across the lifespan (from neonates to elderly subjects, including brain tumor patients) without the need for registration.

Anatomy

Deep neural networks from the perspective of ergodic theory

no code implementations4 Aug 2023 Fan Zhang

The design of deep neural networks remains somewhat of an art rather than precise science.

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model

no code implementations11 Aug 2023 Fan Zhang, Naye Ji, Fuxing Gao, Siyuan Zhao, Zhaohan Wang, Shunman Li

Firstly, considering that speech audio not only contains acoustic and semantic features but also conveys personality traits, emotions, and more subtle information related to accompanying gestures, we pioneer the adaptation of WavLM, a large-scale pre-trained model, to extract low-level and high-level audio information.

Gesture Generation

MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

1 code implementation ICCV 2023 QiHao Zhao, Chen Jiang, Wei Hu, Fan Zhang, Jun Liu

In the analysis and ablation study, we demonstrate that our method compared with previous work can effectively increase the diversity of experts, significantly reduce the variance of the model, and improve recognition accuracy.

Long-tail Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.