Search Results for author: Jian Zhang

Found 280 papers, 99 papers with code

Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark

no code implementations4 Jun 2018 Cody Coleman, Daniel Kang, Deepak Narayanan, Luigi Nardi, Tian Zhao, Jian Zhang, Peter Bailis, Kunle Olukotun, Chris Re, Matei Zaharia

In this work, we analyze the entries from DAWNBench, which received optimized submissions from multiple industrial groups, to investigate the behavior of TTA as a metric as well as trends in the best-performing entries.

Benchmarking BIG-bench Machine Learning

Feature Affinity based Pseudo Labeling for Semi-supervised Person Re-identification

no code implementations16 May 2018 Guodong Ding, Shanshan Zhang, Salman Khan, Zhenmin Tang, Jian Zhang, Fatih Porikli

Our approach measures the affinity of unlabeled samples with the underlying clusters of labeled data samples using the intermediate feature representations from deep networks.

Data Augmentation Representation Learning +1

Globally Variance-Constrained Sparse Representation and Its Application in Image Set Coding

no code implementations17 Aug 2016 Xiang Zhang, Jiarui Sun, Siwei Ma, Zhouchen Lin, Jian Zhang, Shiqi Wang, Wen Gao

Therefore, introducing an accurate rate-constraint in sparse coding and dictionary learning becomes meaningful, which has not been fully exploited in the context of sparse representation.

Data Compression Dictionary Learning

Deep Reinforcement Learning for Image Hashing

no code implementations7 Feb 2018 Yuxin Peng, Jian Zhang, Zhaoda Ye

Inspired by the sequential decision ability of deep reinforcement learning, we propose a new Deep Reinforcement Learning approach for Image Hashing (DRLIH).

Deep Hashing reinforcement-learning +1

SCH-GAN: Semi-supervised Cross-modal Hashing by Generative Adversarial Network

no code implementations7 Feb 2018 Jian Zhang, Yuxin Peng, Mingkuan Yuan

(2) Ignore the rich information contained in the large amount of unlabeled data across different modalities, especially the margin examples that are easily to be incorrectly retrieved, which can help to model the correlations.

Generative Adversarial Network Retrieval

Multi-pseudo Regularized Label for Generated Data in Person Re-Identification

no code implementations21 Jan 2018 Yan Huang, Jinsong Xu, Qiang Wu, Zhedong Zheng, Zhao-Xiang Zhang, Jian Zhang

Unlike the traditional label which usually is a single integral number, the virtual label proposed in this work is a set of weight-based values each individual of which is a number in (0, 1] called multi-pseudo label and reflects the degree of relation between each generated data to every pre-defined class of real data.

Generative Adversarial Network Person Re-Identification +1

Unsupervised Generative Adversarial Cross-modal Hashing

no code implementations1 Dec 2017 Jian Zhang, Yuxin Peng, Mingkuan Yuan

To address the above problem, in this paper we propose an Unsupervised Generative Adversarial Cross-modal Hashing approach (UGACH), which makes full use of GAN's ability for unsupervised representation learning to exploit the underlying manifold structure of cross-modal data.

Cross-Modal Retrieval Generative Adversarial Network +2

Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations21 Nov 2017 Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge.

Informativeness Question Generation +2

Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement

no code implementations19 Nov 2017 Jun-Jie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu

These comments can be a description of the image, or some objects, attributes, scenes in it, which are normally used as the user-provided tags.

Retrieval TAG

SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

no code implementations28 Jul 2016 Jian Zhang, Yuxin Peng

(2) A semi-supervised deep hashing network is designed to extensively exploit both labeled and unlabeled data, in which we propose an online graph construction method to benefit from the evolving deep features during training to better capture semantic neighbors.

Deep Hashing graph construction +3

Towards Automatic Construction of Diverse, High-quality Image Dataset

no code implementations22 Aug 2017 Yazhou Yao, Jian Zhang, Fumin Shen, Li Liu, Fan Zhu, Dongxiang Zhang, Heng-Tao Shen

To eliminate manual annotation, in this work, we propose a novel image dataset construction framework by employing multiple textual queries.

Image Classification object-detection +2

Place recognition: An Overview of Vision Perspective

no code implementations17 Jun 2017 Zhiqiang Zeng, Jian Zhang, Xiaodong Wang, Yuming Chen, Chaoyang Zhu

Place recognition is one of the most fundamental topics in computer vision and robotics communities, where the task is to accurately and efficiently recognize the location of a given query image.

Image Classification Image Retrieval +4

A New Probabilistic Algorithm for Approximate Model Counting

no code implementations13 Jun 2017 Cunjing Ge, Feifei Ma, Tian Liu, Jian Zhang

Constrained counting is important in domains ranging from artificial intelligence to software analysis.

Query-adaptive Image Retrieval by Deep Weighted Hashing

no code implementations8 Dec 2016 Jian Zhang, Yuxin Peng

On the other hand, different hash bits actually contribute to the image retrieval differently, and treating them equally greatly affects the retrieval accuracy of image.

Deep Hashing Image Retrieval

Exploiting Web Images for Dataset Construction: A Domain Robust Approach

no code implementations22 Nov 2016 Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Jingsong Xu, Zhenmin Tang

To reduce the cost of manual labelling, there has been increased research interest in automatically constructing image datasets by exploiting web images.

Domain Adaptation Image Classification +2

Refining Image Categorization by Exploiting Web Images and General Corpus

no code implementations16 Mar 2017 Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang

To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantic rich (sub-)categories.

Image Categorization

Multi-Label Image Classification with Regional Latent Semantic Dependencies

no code implementations4 Dec 2016 Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu

Recent state-of-the-art approaches to multi-label image classification exploit the label dependencies in an image, at global level, largely improving the labeling capacity.

Classification General Classification +1

A coarse-to-fine algorithm for registration in 3D street-view cross-source point clouds

no code implementations24 Oct 2016 Xiaoshui Huang, Jian Zhang, Qiang Wu, Lixin Fan, Chun Yuan

In this paper, different from previous ICP-based methods, and from a statistic view, we propose a effective coarse-to-fine algorithm to detect and register a small scale SFM point cloud in a large scale Lidar point cloud.

Parallel SGD: When does averaging help?

no code implementations23 Jun 2016 Jian Zhang, Christopher De Sa, Ioannis Mitliagkas, Christopher Ré

Consider a number of workers running SGD independently on the same pool of data and averaging the models every once in a while -- a common but not well understood practice.

A Tool for Computing and Estimating the Volume of the Solution Space of SMT(LA)

no code implementations1 Jul 2015 Cunjing Ge, Feifei Ma, Jian Zhang

There are already quite a few tools for solving the Satisfiability Modulo Theories (SMT) problems.

Abrupt Motion Tracking via Nearest Neighbor Field Driven Stochastic Sampling

no code implementations28 Oct 2014 Tianfei Zhou, Yao Lu, Feng Lv, Huijun Di, Qingjie Zhao, Jian Zhang

Stochastic sampling based trackers have shown good performance for abrupt motion tracking so that they have gained popularity in recent years.

Motion Detection

Image Restoration Using Joint Statistical Modeling in Space-Transform Domain

no code implementations11 May 2014 Jian Zhang, Debin Zhao, Ruiqin Xiong, Siwei Ma, Wen Gao

This paper presents a novel strategy for high-fidelity image restoration by characterizing both local smoothness and nonlocal self-similarity of natural images in a unified statistical manner.

Deblurring Image Deblurring +3

Image Compressive Sensing Recovery Using Adaptively Learned Sparsifying Basis via L0 Minimization

no code implementations30 Apr 2014 Jian Zhang, Chen Zhao, Debin Zhao, Wen Gao

From many fewer acquired measurements than suggested by the Nyquist sampling theory, compressive sensing (CS) theory demonstrates that, a signal can be reconstructed with high probability when it exhibits sparsity in some domain.

Blocking Compressive Sensing

Structural Group Sparse Representation for Image Compressive Sensing Recovery

no code implementations29 Apr 2014 Jian Zhang, Debin Zhao, Feng Jiang, Wen Gao

Compressive Sensing (CS) theory shows that a signal can be decoded from many fewer measurements than suggested by the Nyquist sampling theory, when the signal is sparse in some domain.

Compressive Sensing

Spatially Directional Predictive Coding for Block-based Compressive Sensing of Natural Images

no code implementations29 Apr 2014 Jian Zhang, Debin Zhao, Feng Jiang

At the encoder, for each block of compressive sensing (CS) measurements, the optimal pre-diction is selected from a set of prediction candidates that are generated by four designed directional predictive modes.

Compressive Sensing Decoder

Characterizing A Database of Sequential Behaviors with Latent Dirichlet Hidden Markov Models

no code implementations24 May 2013 Yin Song, Longbing Cao, Xuhui Fan, Wei Cao, Jian Zhang

These sequence-level latent parameters for each sequence are modeled as latent Dirichlet random variables and parameterized by a set of deterministic database-level hyper-parameters.

General Classification

Scene Text Recognition from Two-Dimensional Perspective

no code implementations18 Sep 2018 Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai

Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recognition as a sequence prediction problem.

Scene Text Recognition Semantic Segmentation +4

Performance assessment of the deep learning technologies in grading glaucoma severity

no code implementations31 Oct 2018 Yi Zhen, Lei Wang, Han Liu, Jian Zhang, Jiantao Pu

Among these CNNs, the DenseNet had the highest classification accuracy (i. e., 75. 50%) based on pre-trained weights when using global ROIs, as compared to 65. 50% when using local ROIs.

Face Detection with Effective Feature Extraction

no code implementations29 Sep 2010 Sakrapee Paisitkriangkrai, Chunhua Shen, Jian Zhang

There is an abundant literature on face detection due to its important role in many vision applications.

Face Detection

Fast Gated Neural Domain Adaptation: Language Model as a Case Study

no code implementations COLING 2016 Jian Zhang, Xiaofeng Wu, Andy Way, Qun Liu

We show that the neural LM perplexity can be reduced by 7. 395 and 12. 011 using the proposed domain adaptation mechanism on the Penn Treebank and News data, respectively.

Domain Adaptation Language Modelling +2

Topic-Informed Neural Machine Translation

no code implementations COLING 2016 Jian Zhang, Liangyou Li, Andy Way, Qun Liu

In recent years, neural machine translation (NMT) has demonstrated state-of-the-art machine translation (MT) performance.

Machine Translation NMT +2

Message Passing Inference for Large Scale Graphical Models with High Order Potentials

no code implementations NeurIPS 2014 Jian Zhang, Alex Schwing, Raquel Urtasun

To keep up with the Big Data challenge, parallelized algorithms based on dual decomposition have been proposed to perform inference in Markov random fields.

Semantic Segmentation

Fine-Grained Video Captioning for Sports Narrative

no code implementations CVPR 2018 Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang

First, to facilitate this novel research of fine-grained video caption, we collected a novel dataset called Fine-grained Sports Narrative dataset (FSN) that contains 2K sports videos with ground-truth narratives from YouTube. com.

2k Video Captioning

Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations ECCV 2018 Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge.

Informativeness Question Generation +2

Assessment of central serous chorioretinopathy (CSC) depicted on color fundus photographs using deep Learning

no code implementations14 Jan 2019 Yi Zhen, Hang Chen, Xu Zhang, Meng Liu, Xin Meng, Jian Zhang, Jiantao Pu

To investigate whether and to what extent central serous chorioretinopathy (CSC) depicted on color fundus photographs can be assessed using deep learning technology.

Image Denoising via Adaptive Soft-Thresholding Based on Non-Local Samples

no code implementations CVPR 2015 Hangfan Liu, Ruiqin Xiong, Jian Zhang, Wen Gao

To estimate the expectation and variance parameters for the transform bands of a particular patch, we exploit the non-local correlation of image and collect a set of similar patches as data samples to form the distribution.

Image Denoising

Unsupervised Part Mining for Fine-grained Image Classification

no code implementations26 Feb 2019 Runsheng Zhang, Jian Zhang, Yaping Huang, Qi Zou

To tackle this issue, we propose a fully unsupervised part mining (UPM) approach to localize the discriminative parts without even image-level annotations, which largely improves the fine-grained classification performance.

Classification Fine-Grained Image Classification +2

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

no code implementations25 Feb 2019 Fan Fei, Zhan Tu, Jian Zhang, Xinyan Deng

Inspired by the hummingbirds' near-maximal performance during such extreme maneuvers, we developed a flight control strategy and experimentally demonstrated that such maneuverability can be achieved by an at-scale 12-gram hummingbird robot equipped with just two actuators.

reinforcement-learning Reinforcement Learning (RL)

Fast Registration for cross-source point clouds by using weak regional affinity and pixel-wise refinement

no code implementations11 Mar 2019 Xiaoshui Huang, Lixin Fan, Qiang Wu, Jian Zhang, Chun Yuan

Accurate and fast registration of cross-source 3D point clouds from different sensors is an emerged research problem in computer vision.

Point Cloud Registration

Towards Locally Consistent Object Counting with Constrained Multi-stage Convolutional Neural Networks

no code implementations6 Apr 2019 Muming Zhao, Jian Zhang, Chongyang Zhang, Wenjun Zhang

Towards this problem, in this paper we propose a constrained multi-stage Convolutional Neural Networks (CNNs) to jointly pursue locally consistent density map from two aspects.

Object Object Counting

Single Pixel Reconstruction for One-stage Instance Segmentation

no code implementations16 Apr 2019 Jun Yu, Jinghan Yao, Jian Zhang, Zhou Yu, DaCheng Tao

In this paper, we propose a one-stage framework, SPRNet, which performs efficient instance segmentation by introducing a single pixel reconstruction (SPR) branch to off-the-shelf one-stage detectors.

Instance Segmentation Region Proposal +2

Local Deep-Feature Alignment for Unsupervised Dimension Reduction

no code implementations22 Apr 2019 Jian Zhang, Jun Yu, DaCheng Tao

Next, we exploit an affine transformation to align the local deep features of each neighbourhood with the global features.

Clustering Data Visualization +1

Inferring the Importance of Product Appearance: A Step Towards the Screenless Revolution

no code implementations1 May 2019 Yongshun Gong, Jin-Feng Yi, Dong-Dong Chen, Jian Zhang, Jiayu Zhou, Zhihua Zhou

In this paper, we aim to infer the significance of every item's appearance in consumer decision making and identify the group of items that are suitable for screenless shopping.

Decision Making

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation

no code implementations11 May 2019 Songsong Wu, Yan Yan, Hao Tang, Jianjun Qian, Jian Zhang, Xiao-Yuan Jing

However, the number of labeled source samples are always limited due to expensive annotation cost in practice, making sub-optimal performance been observed.

Dictionary Learning Pseudo Label +1

Glioma Grade Prediction Using Wavelet Scattering-Based Radiomics

no code implementations23 May 2019 Qijian Chen, Lihui Wang, Li Wang, Zeyu Deng, Jian Zhang, Yuemin Zhu

Glioma grading before surgery is very critical for the prognosis prediction and treatment plan making.

Dimensionality Reduction regression

Extracting Visual Knowledge from the Internet: Making Sense of Image Data

no code implementations7 Jun 2019 Yazhou Yao, Jian Zhang, Xian-Sheng Hua, Fumin Shen, Zhenmin Tang

Recent successes in visual recognition can be primarily attributed to feature representation, learning algorithms, and the ever-increasing size of labeled training data.

Representation Learning

Sample Adaptive Multiple Kernel Learning for Failure Prediction of Railway Points

no code implementations2 Jul 2019 Zhibin Li, Jian Zhang, Qiang Wu, Yongshun Gong, Jin-Feng Yi, Christina Kirsch

In this paper, we formulate our prediction task as a multiple kernel learning problem with missing kernels.

Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification

no code implementations4 Aug 2019 Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Jingsong Xu, Qiang Wu

A novel low-rank pairwise bilinear pooling operation is proposed to capture the nuanced differences between the support and query images for learning an effective distance metric.

Classification Few-Shot Learning +2

Exploiting Channel Similarity for Accelerating Deep Convolutional Neural Networks

no code implementations6 Aug 2019 Yunxiang Zhang, Chenglong Zhao, Bingbing Ni, Jian Zhang, Haoran Deng

To address the limitations of existing magnitude-based pruning algorithms in cases where model weights or activations are of large and similar magnitude, we propose a novel perspective to discover parameter redundancy among channels and accelerate deep CNNs via channel pruning.

Clustering

C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks

no code implementations19 Aug 2019 Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li

Although two-stage detectors like Faster R-CNN achieved big successes in object detection due to the strategy of extracting region proposals by region proposal network, they show their poor adaption in real-world object detection as a result of without considering mining hard samples during extracting region proposals.

Object object-detection +2

PipeMare: Asynchronous Pipeline Parallel DNN Training

no code implementations9 Oct 2019 Bowen Yang, Jian Zhang, Jonathan Li, Christopher Ré, Christopher R. Aberger, Christopher De Sa

Pipeline parallelism (PP) when training neural networks enables larger models to be partitioned spatially, leading to both lower network communication and overall higher hardware utilization.

Worst Cases Policy Gradients

no code implementations9 Nov 2019 Yichuan Charlie Tang, Jian Zhang, Ruslan Salakhutdinov

Recent advances in deep reinforcement learning have demonstrated the capability of learning complex control policies from many types of environments.

reinforcement-learning Reinforcement Learning (RL)

Time-aware Gradient Attack on Dynamic Network Link Prediction

no code implementations24 Nov 2019 Jinyin Chen, Jian Zhang, Zhi Chen, Min Du, Qi Xuan

In this work, we present the first study of adversarial attack on dynamic network link prediction (DNLP).

Adversarial Attack Link Prediction +1

Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development

no code implementations7 Dec 2019 Yongshun Gong, Zhibin Li, Jian Zhang, Wei Liu, Jin-Feng Yi

In this paper, this specific problem is termed as potential passenger flow (PPF) prediction, which is a novel and important study connected with urban computing and intelligent transportation systems.

MULTI-VIEW LEARNING Recommendation Systems

Adversarial AutoAugment

no code implementations ICLR 2020 Xin-Yu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

The augmentation policy network attempts to increase the training loss of a target network through generating adversarial augmentation policies, while the target network can learn more robust features from harder examples to improve the generalization.

Data Augmentation Image Classification

PMC-GANs: Generating Multi-Scale High-Quality Pedestrian with Multimodal Cascaded GANs

no code implementations30 Dec 2019 Jie Wu, Ying Peng, Chenghao Zheng, Zongbo Hao, Jian Zhang

Recently, generative adversarial networks (GANs) have shown great advantages in synthesizing images, leading to a boost of explorations of using faked images to augment data.

Data Augmentation Pedestrian Detection

Multi-factorial Optimization for Large-scale Virtual Machine Placement in Cloud Computing

no code implementations18 Jan 2020 Zhengping Liang, Jian Zhang, Liang Feng, Zexuan Zhu

However, as growing demand for cloud services, the existing EAs fail to implement in large-scale virtual machine placement (LVMP) problem due to the high time complexity and poor scalability.

Cloud Computing Evolutionary Algorithms

GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency

no code implementations11 Mar 2020 Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li

GID block breaks through the local neighborhoods and captures long-range dependency of pixels both in global-level and instance-level from the scene to help detecting interactions between instances.

Human-Object Interaction Detection Object

DyNet: Dynamic Convolution for Accelerating Convolutional Neural Networks

no code implementations22 Apr 2020 Yikang Zhang, Jian Zhang, Qiang Wang, Zhao Zhong

On one hand, we can reduce the computation cost remarkably while maintaining the performance.

Towards Better Graph Representation: Two-Branch Collaborative Graph Neural Networks for Multimodal Marketing Intention Detection

no code implementations13 May 2020 Lu Zhang, Jian Zhang, Zhibin Li, Jingsong Xu

Inspired by the fact that spreading and collecting information through the Internet becomes the norm, more and more people choose to post for-profit contents (images and texts) in social networks.

Graph Classification Marketing

Contextual Embeddings: When Are They Worth It?

no code implementations ACL 2020 Simran Arora, Avner May, Jian Zhang, Christopher Ré

We study the settings for which deep contextual embeddings (e. g., BERT) give large improvements in performance relative to classic pretrained embeddings (e. g., GloVe), and an even simpler baseline---random word embeddings---focusing on the impact of the training set size and the linguistic properties of the task.

Word Embeddings

TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization with Few Labeled Samples

no code implementations28 May 2020 Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Qiang Wu, Chang Xu

The challenges of high intra-class variance yet low inter-class fluctuations in fine-grained visual categorization are more severe with few labeled samples, \textit{i. e.,} Fine-Grained categorization problems under the Few-Shot setting (FGFS).

Fine-Grained Visual Categorization

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

no code implementations27 Jun 2020 Qian Li, Qingyuan Hu, Yong Qi, Saiyu Qi, Jie Ma, Jian Zhang

SBA stochastically decides whether to augment at iterations controlled by the batch scheduler and in which a ''distilled'' dynamic soft label regularization is introduced by incorporating the similarity in the vicinity distribution respect to raw samples.

Data Augmentation

A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification

no code implementations3 Jul 2020 Mengxi Jia, Yunpeng Zhai, Shijian Lu, Siwei Ma, Jian Zhang

RGB-Infrared (IR) cross-modality person re-identification (re-ID), which aims to search an IR image in RGB gallery or vice versa, is a challenging task due to the large discrepancy between IR and RGB modalities.

Cross-Modality Person Re-identification Person Re-Identification

Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks

no code implementations3 Aug 2020 Haoqiang Guo, Lu Peng, Jian Zhang, Fang Qi, Lide Duan

Recent studies identify that Deep learning Neural Networks (DNNs) are vulnerable to subtle perturbations, which are not perceptible to human visual system but can fool the DNN models and lead to wrong outputs.

Adversarial Attack Computational Efficiency

Salvage Reusable Samples from Noisy Data for Robust Learning

1 code implementation6 Aug 2020 Zeren Sun, Xian-Sheng Hua, Yazhou Yao, Xiu-Shen Wei, Guosheng Hu, Jian Zhang

To this end, we propose a certainty-based reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.

Memorization

Scalar Coupling Constant Prediction Using Graph Embedding Local Attention Encoder

no code implementations7 Sep 2020 Caiqing Jian, Xinyu Cheng, Jian Zhang, Lihui Wang

The experimental results demonstrate that, compared to the traditional chemical bond structure representations, the rotation and translation invariant structure representations proposed in this work can improve the SCC prediction accuracy; with the graph embedded local self-attention, the mean absolute error (MAE) of the prediction model in the validation set decreases from 0. 1603 Hz to 0. 1067 Hz; using the classification based loss function instead of the scaled regression loss, the MAE of the predicted SCC can be decreased to 0. 0963 HZ, which is close to the quantum chemistry standard on CHAMPS dataset.

Graph Embedding

Multi-Representation Ensemble in Few-Shot Learning

no code implementations1 Jan 2021 Qing Chen, Jian Zhang

Deep neural networks (DNNs) compute representations in a layer by layer fashion, producing a final representation at the top layer of the pipeline, and classification or regression is made using the final representation.

Few-Shot Learning

Revisiting BFfloat16 Training

no code implementations1 Jan 2021 Pedram Zamirai, Jian Zhang, Christopher R Aberger, Christopher De Sa

We ask can we do pure 16-bit training which requires only 16-bit compute units, while still matching the model accuracy attained by 32-bit training.

Uncertainty Weighted Offline Reinforcement Learning

no code implementations1 Jan 2021 Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.

Offline RL Q-Learning +2

Revisiting BFloat16 Training

no code implementations13 Oct 2020 Pedram Zamirai, Jian Zhang, Christopher R. Aberger, Christopher De Sa

State-of-the-art generic low-precision training algorithms use a mix of 16-bit and 32-bit precision, creating the folklore that 16-bit hardware compute units alone are not enough to maximize model accuracy.

Face Mask Assistant: Detection of Face Mask Service Stage Based on Mobile Phone

no code implementations9 Oct 2020 Yuzhen Chen, Menghan Hu, Chunjun Hua, Guangtao Zhai, Jian Zhang, Qingli Li, Simon X. Yang

Aimed at solving the problem that we don't know which service stage of the mask belongs to, we propose a detection system based on the mobile phone.

AutoBSS: An Efficient Algorithm for Block Stacking Style Search

no code implementations NeurIPS 2020 Yikang Zhang, Jian Zhang, Zhao Zhong

Neural network architecture design mostly focuses on the new convolutional operator or special topological structure of network block, little attention is drawn to the configuration of stacking each block, called Block Stacking Style (BSS).

AutoML Bayesian Optimization +6

Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

no code implementations20 Oct 2020 Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera.

Distribution-aware Margin Calibration for Medical Image Segmentation

no code implementations3 Nov 2020 Zhibin Li, Litao Yu, Jian Zhang

In this paper, we present a novel data-distribution-aware margin calibration method for a better generalization of the mIoU over the whole data-distribution, underpinned by a rigid lower bound.

Image Segmentation Medical Image Segmentation +2

Dual Attention on Pyramid Feature Maps for Image Captioning

no code implementations2 Nov 2020 Litao Yu, Jian Zhang, Qiang Wu

In this paper, we propose to apply dual attention on pyramid image feature maps to fully explore the visual-semantic correlations and improve the quality of generated sentences.

Descriptive Image Captioning

Multi-layer Feature Aggregation for Deep Scene Parsing Models

no code implementations4 Nov 2020 Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang, Qiang Wu

The proposed module can auto-select the intermediate visual features to correlate the spatial and semantic information.

Scene Parsing Semantic Segmentation

Time-Series Snapshot Network for Partner Recommendation: A Case Study on OSS

no code implementations18 Nov 2020 Jinyin Chen, Yunyi Xie, Jian Zhang, Xincheng Shu, Qi Xuan

In this paper, we introduce time-series snapshot network (TSSN) which is a mixture network to model the interactions among users and developers.

Social and Information Networks

Rigid and Articulated Point Registration with Expectation Conditional Maximization

no code implementations9 Dec 2020 Radu Horaud, Florence Forbes, Manuel Yguel, Guillaume Dewaele, Jian Zhang

This paper addresses the issue of matching rigid and articulated shapes through probabilistic point registration.

PTN: A Poisson Transfer Network for Semi-supervised Few-shot Learning

no code implementations20 Dec 2020 Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Chang Xu

Second, the extra unlabeled samples are employed to transfer the knowledge from base classes to novel classes through contrastive learning.

Contrastive Learning Few-Shot Learning

Multiple Instance Segmentation in Brachial Plexus Ultrasound Image Using BPMSegNet

no code implementations22 Dec 2020 Yi Ding, Qiqi Yang, Guozheng Wu, Jian Zhang, Zhiguang Qin

In this paper, a network called Brachial Plexus Multi-instance Segmentation Network (BPMSegNet) is proposed to identify different tissues (nerves, arteries, veins, muscles) in ultrasound images.

Instance Segmentation Semantic Segmentation

Counting the Number of Solutions to Constraints

no code implementations28 Dec 2020 Jian Zhang, Cunjing Ge, Feifei Ma

Compared with constraint satisfaction problems, counting problems have received less attention.

Temporal-Amount Snapshot MultiGraph for Ethereum Transaction Tracking

no code implementations16 Feb 2021 Yunyi Xie, Jie Jin, Jian Zhang, Shanqing Yu, Qi Xuan

With the wide application of blockchain in the financial field, the rise of various types of cybercrimes has brought great challenges to the security of blockchain.

Link Prediction

Temperature dependent coherence properties of NV ensemble in diamond up to 600K

no code implementations25 Feb 2021 Shengran Lin, Changfeng Weng, Yuanjie Yang, Jiaxin Zhao, Yuhang Guo, Jian Zhang, Liren Lou, Wei Zhu, Guanzhong Wang

Nitrogen-vacancy (NV) center in diamond is an ideal candidate for quantum sensors because of its excellent optical and coherence property.

Quantum Physics Mesoscale and Nanoscale Physics

A comprehensive survey on point cloud registration

no code implementations3 Mar 2021 Xiaoshui Huang, Guofeng Mei, Jian Zhang, Rana Abbas

This survey conducts a comprehensive survey, including both same-source and cross-source registration methods, and summarize the connections between optimization-based and deep learning methods, to provide further research insight.

3D Reconstruction Point Cloud Registration

MACD R-CNN: An Abnormal Cell Nucleus Detection Method

no code implementations28 Jul 2020 Baoyan Ma, Jian Zhang, Feng Cao, Yongjun He

We design a fixed proposal module to generate fixed-sized feature maps of nuclei, which allows the new information of nucleus is used for classification.

Cell Detection Classification +4

Thousand to One: Semantic Prior Modeling for Conceptual Coding

no code implementations12 Mar 2021 Jianhui Chang, Zhenghui Zhao, Lingbo Yang, Chuanmin Jia, Jian Zhang, Siwei Ma

To this end, we propose a novel end-to-end semantic prior modeling-based conceptual coding scheme towards extremely low bitrate image compression, which leverages semantic-wise deep representations as a unified prior for entropy estimation and texture synthesis.

Image Compression Semantic Segmentation +1

Jo-SRC: A Contrastive Approach for Combating Noisy Labels

no code implementations CVPR 2021 Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang

Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance.

Contrastive Learning Memorization

Super-Resolving Compressed Video in Coding Chain

no code implementations26 Mar 2021 Dewang Hou, Yang Zhao, Yuyao Ye, Jiayu Yang, Jian Zhang, Ronggang Wang

Scaling and lossy coding are widely used in video transmission and storage.

Decoder

Max-min Fairness of K-user Cooperative Rate-Splitting in MISO Broadcast Channel with User Relaying

no code implementations17 Oct 2019 Yijie Mao, Bruno Clerckx, Jian Zhang, Victor O. K. Li, Mohammed Arafah

Cooperative Rate-Splitting (CRS) strategy, relying on linearly precoded rate-splitting at the transmitter and opportunistic transmission of the common message by the relaying user, has recently been shown to outperform typical Non-cooperative Rate-Splitting (NRS), Cooperative Non-Orthogonal Multiple Access (C-NOMA) and Space Division Multiple Access (SDMA) in a two-user Multiple Input Single Output (MISO) Broadcast Channel (BC) with user relaying.

Fairness Scheduling

Underwater Target Recognition based on Multi-Decision LOFAR Spectrum Enhancement: A Deep Learning Approach

no code implementations26 Apr 2021 Jie Chen, Jie Liu, Chang Liu, Jian Zhang, Bing Han

To overcome this issue and to further improve the recognition performance, we adopt a deep learning approach for underwater target recognition and propose a LOFAR spectrum enhancement (LSE)-based underwater target recognition scheme, which consists of preprocessing, offline training, and online testing.

Unsupervised Sentiment Analysis by Transferring Multi-source Knowledge

no code implementations9 May 2021 Yong Dai, Jian Liu, Jian Zhang, Hongguang Fu, Zenglin Xu

The first mechanism is a selective domain adaptation (SDA) method, which transfers knowledge from the closest source domain.

Domain Adaptation Sentiment Analysis

A Multi-Branch Hybrid Transformer Networkfor Corneal Endothelial Cell Segmentation

no code implementations21 May 2021 Yinglin Zhang, Risa Higashita, Huazhu Fu, Yanwu Xu, Yang Zhang, Haofeng Liu, Jian Zhang, Jiang Liu

Corneal endothelial cell segmentation plays a vital role inquantifying clinical indicators such as cell density, coefficient of variation, and hexagonality.

Cell Segmentation

Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream

no code implementations CVPR 2021 Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang

Different from the conventional digital cameras that compact the photoelectric information within the exposure interval into a single snapshot, the spike camera produces a continuous spike stream to record the dynamic light intensity variation process.

Image Reconstruction

Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification

no code implementations6 Jul 2021 Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang

To better eliminate interference from occlusions, we design a contrast feature learning technique (CFL) for better separation of occlusion features and discriminative ID features.

Decoder Person Re-Identification +1

EGC2: Enhanced Graph Classification with Easy Graph Compression

1 code implementation16 Jul 2021 Jinyin Chen, Haiyang Xiong, Haibin Zhenga, Dunjie Zhang, Jian Zhang, Mingwei Jia, Yi Liu

To achieve lower-complexity defense applied to graph classification models, EGC2 utilizes a centrality-based edge-importance index to compress the graphs, filtering out trivial structures and adversarial perturbations in the input graphs, thus improving the model's robustness.

Graph Classification

Multi-Level Contrastive Learning for Few-Shot Problems

no code implementations15 Jul 2021 Qing Chen, Jian Zhang

Most current applications of contrastive learning benefit only a single representation from the last layer of an encoder. In this paper, we propose a multi-level contrasitive learning approach which applies contrastive losses at different layers of an encoder to learn multiple representations from the encoder.

Contrastive Learning Few-Shot Learning

Structure Destruction and Content Combination for Face Anti-Spoofing

no code implementations22 Jul 2021 Ke-Yue Zhang, Taiping Yao, Jian Zhang, Shice Liu, Bangjie Yin, Shouhong Ding, Jilin Li

In pursuit of consolidating the face verification systems, prior face anti-spoofing studies excavate the hidden cues in original images to discriminate real persons and diverse attack types with the assistance of auxiliary supervision.

Face Anti-Spoofing Face Verification +1

Super Resolve Dynamic Scene From Continuous Spike Streams

no code implementations ICCV 2021 Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang

In this paper, we properly exploit the relative motion and derive the relationship between light intensity and each spike, so as to recover the external scene with both high temporal and high spatial resolution.

Super-Resolution

Dyn-Backdoor: Backdoor Attack on Dynamic Link Prediction

no code implementations8 Oct 2021 Jinyin Chen, Haiyang Xiong, Haibin Zheng, Jian Zhang, Guodong Jiang, Yi Liu

Backdoor attacks induce the DLP methods to make wrong prediction by the malicious training data, i. e., generating a subgraph sequence as the trigger and embedding it to the training data.

Backdoor Attack Dynamic Link Prediction +1

Research on the Inverse Kinematics Prediction of a Soft Biomimetic Actuator via BP Neural Network

no code implementations26 Oct 2021 Huichen Ma, Junjie Zhou, Jian Zhang, Lingyu Zhang

After training with sample data, the BP neural network model can represent the relation between the manipulator tip position and the pressure applied to the chambers.

Motion Planning Position

ε-weakened Robustness of Deep Neural Networks

no code implementations29 Oct 2021 Pei Huang, Yuting Yang, Minghao Liu, Fuqi Jia, Feifei Ma, Jian Zhang

This paper introduces a notation of $\varepsilon$-weakened robustness for analyzing the reliability and stability of deep neural networks (DNNs).

Can Graph Neural Networks Learn to Solve MaxSAT Problem?

no code implementations15 Nov 2021 Minghao Liu, Fuqi Jia, Pei Huang, Fan Zhang, Yuchen Sun, Shaowei Cai, Feifei Ma, Jian Zhang

With the rapid development of deep learning techniques, various recent work has tried to apply graph neural networks (GNNs) to solve NP-hard problems such as Boolean Satisfiability (SAT), which shows the potential in bridging the gap between machine learning and symbolic reasoning.

GenReg: Deep Generative Method for Fast Point Cloud Registration

no code implementations23 Nov 2021 Xiaoshui Huang, Zongyi Xu, Guofeng Mei, Sheng Li, Jian Zhang, Yifan Zuo, Yucheng Wang

To solve this challenge, we propose a new data-driven registration algorithm by investigating deep generative neural networks to point cloud registration.

Point Cloud Registration

DyNet: Dynamic Convolution for Accelerating Convolution Neural Networks

no code implementations25 Sep 2019 Kane Zhang, Jian Zhang, Qiang Wang, Zhao Zhong

To verify the scalability, we also apply DyNet on segmentation task, the results show that DyNet can reduces 69. 3% FLOPs while maintaining the Mean IoU on segmentation task.

COTReg:Coupled Optimal Transport based Point Cloud Registration

no code implementations29 Dec 2021 Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun

Generating a set of high-quality correspondences or matches is one of the most critical steps in point cloud registration.

Point Cloud Registration

Quantifying Robustness to Adversarial Word Substitutions

no code implementations11 Jan 2022 Yuting Yang, Pei Huang, Feifei Ma, Juan Cao, Meishan Zhang, Jian Zhang, Jintao Li

Deep-learning-based NLP models are found to be vulnerable to word substitution perturbations.

Unsupervised Learning on 3D Point Clouds by Clustering and Contrasting

no code implementations5 Feb 2022 Guofeng Mei, Litao Yu, Qiang Wu, Jian Zhang, Mohammed Bennamoun

This paper proposes a general unsupervised approach, named \textbf{ConClu}, to perform the learning of point-wise and global features by jointly leveraging point-level clustering and instance-level contrasting.

3D Object Classification Clustering +2

Series Photo Selection via Multi-view Graph Learning

no code implementations18 Mar 2022 Jin Huang, Lu Zhang, Yongshun Gong, Jian Zhang, Xiushan Nie, Yilong Yin

Series photo selection (SPS) is an important branch of the image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos.

Aesthetics Quality Assessment Graph Learning

A Prompting-based Approach for Adversarial Example Generation and Robustness Enhancement

no code implementations21 Mar 2022 Yuting Yang, Pei Huang, Juan Cao, Jintao Li, Yun Lin, Jin Song Dong, Feifei Ma, Jian Zhang

Our attack technique targets the inherent vulnerabilities of NLP models, allowing us to generate samples even without interacting with the victim NLP model, as long as it is based on pre-trained language models (PLMs).

Adversarial Attack

Robust Invertible Image Steganography

no code implementations CVPR 2022 Youmin Xu, Chong Mou, Yujie Hu, Jingfen Xie, Jian Zhang

Previous image steganography methods are limited in hiding capacity and robustness, commonly vulnerable to distortion on container images such as Gaussian noise, Poisson noise, and lossy compression.

Image Steganography

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

no code implementations7 Jun 2022 Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.

Image Super-Resolution

Using EBGAN for Anomaly Intrusion Detection

no code implementations21 Jun 2022 Yi Cui, Wenfeng Shen, Jian Zhang, Weijia Lu, Chuang Liu, Lin Sun, Si Chen

The generator in IDS-EBGAN is responsible for converting the original malicious network traffic in the training set into adversarial malicious examples.

Intrusion Detection

Eco-driving for Electric Connected Vehicles at Signalized Intersections: A Parameterized Reinforcement Learning approach

no code implementations24 Jun 2022 Xia Jiang, Jian Zhang, Dan Li

This paper proposes an eco-driving framework for electric connected vehicles (CVs) based on reinforcement learning (RL) to improve vehicle energy efficiency at signalized intersections.

Reinforcement Learning (RL)

Learning the policy for mixed electric platoon control of automated and human-driven vehicles at signalized intersection: a random search approach

no code implementations24 Jun 2022 Xia Jiang, Jian Zhang, Xiaoyu Shi, Jian Cheng

Meanwhile, the simulation results demonstrate the effectiveness of the delay reward, which is designed to outperform distributed reward mechanism} Compared with normal car-following behavior, the sensitivity analysis reveals that the energy can be saved to different extends (39. 27%-82. 51%) by adjusting the relative importance of the optimization goal.

reinforcement-learning Reinforcement Learning (RL)

Horizontal and Vertical Attention in Transformers

no code implementations10 Jul 2022 Litao Yu, Jian Zhang

Transformers are built upon multi-head scaled dot-product attention and positional encoding, which aim to learn the feature representations and token dependencies.

Dimensionality Reduction

Spatial-temporal Analysis for Automated Concrete Workability Estimation

no code implementations24 Jul 2022 Litao Yu, Jian Zhang, Mohammed Bennamoun, Xiaojun Chang, Vute Sirivivatnanon, Ali Nezhad

Concrete workability measure is mostly determined based on subjective assessment of a certified assessor with visual inspections.

regression

D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing

no code implementations27 Jul 2022 Weiqi Li, Bin Chen, Jian Zhang

By unfolding the proposed framework into deep neural networks, we further design a novel Dual-Domain Deep Convolutional Coding Network (D3C2-Net) for CS imaging with the capability of transmitting high-throughput feature-level image representation through all the unfolded stages.

Compressive Sensing

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

no code implementations27 Sep 2022 Yao-Hung Hubert Tsai, Hanlin Goh, Ali Farhadi, Jian Zhang

The perception system in personalized mobile agents requires developing indoor scene understanding models, which can understand 3D geometries, capture objectiveness, analyze human behaviors, etc.

3D Object Detection Autonomous Driving +9

Multi-Agent Automated Machine Learning

no code implementations CVPR 2023 Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu

In this paper, we propose multi-agent automated machine learning (MA2ML) with the aim to effectively handle joint optimization of modules in automated machine learning (AutoML).

Data Augmentation Multi-agent Reinforcement Learning +1

Complementary Labels Learning with Augmented Classes

no code implementations19 Nov 2022 Zhongnian Li, Jian Zhang, Mengting Xu, Xinzheng Xu, Daoqiang Zhang

In this paper, we propose a novel problem setting called Complementary Labels Learning with Augmented Classes (CLLAC), which brings the challenge that classifiers trained by complementary labels should not only be able to classify the instances from observed classes accurately, but also recognize the instance from the Augmented Classes in the testing phase.

Self-Supervised Object Goal Navigation with In-Situ Finetuning

no code implementations9 Dec 2022 So Yeon Min, Yao-Hung Hubert Tsai, Wei Ding, Ali Farhadi, Ruslan Salakhutdinov, Yonatan Bisk, Jian Zhang

In contrast, our LocCon shows the most robust transfer in the real world among the set of models we compare to, and that the real-world performance of all models can be further improved with self-supervised LocCon in-situ training.

Contrastive Learning Navigate +2

Latent Evolution Model for Change Point Detection in Time-varying Networks

no code implementations17 Dec 2022 Yongshun Gong, Xue Dong, Jian Zhang, Meng Chen

Our method focuses on learning the low-dimensional representations of networks and capturing the evolving patterns of these learned latent representations simultaneously.

Change Point Detection

Cross-domain recommendation via user interest alignment

no code implementations26 Jan 2023 Chuang Zhao, Hongke Zhao, Ming He, Jian Zhang, Jianping Fan

Specifically, we first construct a unified cross-domain heterogeneous graph and redefine the message passing mechanism of graph convolutional networks to capture high-order similarity of users and items across domains.

Recommendation Systems

Progressive Content-aware Coded Hyperspectral Compressive Imaging

no code implementations17 Mar 2023 Xuanyu Zhang, Bin Chen, Wenzhen Zou, Shuai Liu, Yongbing Zhang, Ruiqin Xiong, Jian Zhang

Hyperspectral imaging plays a pivotal role in a wide range of applications, like remote sensing, medicine, and cytology.

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations26 Apr 2023 Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360{\deg} image positional information, while Model B further focuses on the high-frequency information of 360{\deg} images.

Image Super-Resolution Position

Single Node Injection Label Specificity Attack on Graph Neural Networks via Reinforcement Learning

no code implementations4 May 2023 Dayuan Chen, Jian Zhang, Yuqian Lv, Jinhuan Wang, Hongjie Ni, Shanqing Yu, Zhen Wang, Qi Xuan

Furthermore, most methods concentrate on a single attack goal and lack a generalizable adversary to develop distinct attack strategies for diverse goals, thus limiting precise control over victim model behavior in real-world scenarios.

Specificity

An Object SLAM Framework for Association, Mapping, and High-Level Tasks

no code implementations12 May 2023 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Zhiqiang Deng, Wenkai Sun, Xin Chen, Jian Zhang

Taking into consideration the semantic invariance of objects, we convert the object map to a topological map to provide semantic descriptors to enable multi-map matching.

Decision Making Object +2

Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning

no code implementations CVPR 2023 Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang

In this work, we attempt to learn an object-compositional neural implicit representation for editable scene rendering by leveraging labels inferred from the off-the-shelf 2D panoptic segmentation networks instead of the ground truth annotations.

Metric Learning Novel View Synthesis +1

Cross-source Point Cloud Registration: Challenges, Progress and Prospects

no code implementations23 May 2023 Xiaoshui Huang, Guofeng Mei, Jian Zhang

The emerging topic of cross-source point cloud (CSPC) registration has attracted increasing attention with the fast development background of 3D sensor technologies.

Point Cloud Registration

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement

no code implementations18 Aug 2023 Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang

Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.

Denoising Low-Light Image Enhancement

Masked Cross-image Encoding for Few-shot Segmentation

no code implementations22 Aug 2023 Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.

Few-Shot Semantic Segmentation

sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation

no code implementations5 Sep 2023 Shunyang Zhang, Senzhang Wang, Xianzhen Tan, Ruochen Liu, Jian Zhang, Jianxin Wang

Spatial time series imputation is critically important to many real applications such as intelligent transportation and air quality monitoring.

Imputation Time Series

Exploring Flat Minima for Domain Generalization with Large Learning Rates

no code implementations12 Sep 2023 Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Instead, we observe that leveraging a large learning rate can simultaneously promote weight diversity and facilitate the identification of flat regions in the loss landscape.

Domain Generalization Semantic Segmentation

TextCLIP: Text-Guided Face Image Generation And Manipulation Without Adversarial Training

no code implementations21 Sep 2023 Xiaozhou You, Jian Zhang

Text-guided image generation aimed to generate desired images conditioned on given texts, while text-guided image manipulation refers to semantically edit parts of a given image based on specified texts.

Image Generation Image Manipulation +1

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network

no code implementations26 Sep 2023 Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Infrared and visible image fusion (IVIF) is used to generate fusion images with comprehensive features of both images, which is beneficial for downstream vision tasks.

Infrared And Visible Image Fusion

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion

no code implementations26 Sep 2023 Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Most existing learning-based infrared and visible image fusion (IVIF) methods exhibit massive redundant information in the fusion images, i. e., yielding edge-blurring effect or unrecognizable for object detectors.

Infrared And Visible Image Fusion

Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation

no code implementations ICCV 2023 Wenjie Wei, Malu Zhang, Hong Qu, Ammar Belatreche, Jian Zhang, Hong Chen

As a temporal encoding scheme for SNNs, Time-To-First-Spike (TTFS) encodes information using the timing of a single spike, which allows spiking neurons to transmit information through sparse spike trains and results in lower power consumption and higher computational efficiency compared to traditional rate-based encoding counterparts.

Computational Efficiency Image Classification

Multimodal Large Language Model for Visual Navigation

no code implementations12 Oct 2023 Yao-Hung Hubert Tsai, Vansh Dhar, Jialu Li, BoWen Zhang, Jian Zhang

Recent efforts to enable visual navigation using large language models have mainly focused on developing complex prompt systems.

Language Modelling Large Language Model +2

Deep Unfolding Network for Image Compressed Sensing by Content-adaptive Gradient Updating and Deformation-invariant Non-local Modeling

no code implementations16 Oct 2023 Wenxue Cui, Xiaopeng Fan, Jian Zhang, Debin Zhao

In this paper, inspired by the traditional Proximal Gradient Descent (PGD) algorithm, a novel DUN for image compressed sensing (dubbed DUN-CSNet) is proposed to solve the above two issues.

Image Compressed Sensing

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

no code implementations18 Oct 2023 Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan

Recent text-to-3D generation methods achieve impressive 3D content creation capacity thanks to the advances in image diffusion models and optimizing strategies.

3D Generation Text to 3D

Multilevel Perception Boundary-guided Network for Breast Lesion Segmentation in Ultrasound Images

no code implementations23 Oct 2023 Xing Yang, Jian Zhang, Qijian Chen, Li Wang, Lihui Wang

Moreover, to improve the segmentation performance for tumor boundaries, a multi-level boundary-enhanced segmentation (BS) loss is proposed.

Lesion Segmentation Segmentation +1

Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning

1 code implementation31 Oct 2023 Fuyuan Hu, Jian Zhang, Fan Lyu, Linyan Li, Fenglei Xu

Moreover, we design a multi-stage strategy for training S2C model, which mitigates the training challenges posed by limited data in the incremental process.

Few-Shot Class-Incremental Learning Graph Learning +1

SecureCut: Federated Gradient Boosting Decision Trees with Efficient Machine Unlearning

no code implementations22 Nov 2023 Jian Zhang, Bowen Li Jie Li, Chentao Wu

In response to legislation mandating companies to honor the \textit{right to be forgotten} by erasing user data, it has become imperative to enable data removal in Vertical Federated Learning (VFL) where multiple parties provide private features for model training.

Machine Unlearning Vertical Federated Learning

PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

no code implementations7 Dec 2023 Yinhuai Wang, Jing Lin, Ailing Zeng, Zhengyi Luo, Jian Zhang, Lei Zhang

To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills.

Human-Object Interaction Detection Object

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

no code implementations12 Dec 2023 Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, Jian Zhang

In the era where AI-generated content (AIGC) models can produce stunning and lifelike images, the lingering shadow of unauthorized reproductions and malicious tampering poses imminent threats to copyright integrity and information security.

Image Steganography

Neural Video Fields Editing

no code implementations12 Dec 2023 Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang

Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.

Video Editing

Language-Assisted 3D Scene Understanding

no code implementations18 Dec 2023 Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang

The scale and quality of point cloud datasets constrain the advancement of point cloud learning.

3D Object Detection 3D Semantic Segmentation +5

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model

no code implementations12 Jan 2024 Qian Wang, Weiqi Li, Chong Mou, Xinhua Cheng, Jian Zhang

Recently, the emerging text-to-video (T2V) diffusion methods demonstrate notable effectiveness in standard video generation.

Video Generation

Are Large Language Models Rational Investors?

no code implementations20 Feb 2024 YuHang Zhou, Yuchen Ni, Xiang Liu, Jian Zhang, Sen Liu, Guangnan Ye, Hongfeng Chai

Large Language Models (LLMs) are progressively being adopted in financial analysis to harness their extensive knowledge base for interpreting complex market data and trends.

Decision Making Navigate

MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models

no code implementations8 Mar 2024 Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang

To tackle this challenge, we propose a MamMIL framework for WSI classification by cooperating the selective structured state space model (i. e., Mamba) with MIL for the first time, enabling the modeling of instance dependencies while maintaining linear complexity.

Multiple Instance Learning whole slide images

Invertible Diffusion Models for Compressed Sensing

no code implementations25 Mar 2024 Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

To enable such memory-intensive end-to-end finetuning, we propose a novel two-level invertible design to transform both (1) the multi-step sampling process and (2) the noise estimation U-Net in each step into invertible networks.

Image Compressed Sensing Image Reconstruction +1

InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds

no code implementations29 Mar 2024 Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

This pre-processing is usually conducted via a Structure-from-Motion (SfM) pipeline, a procedure that can be slow and unreliable, particularly in sparse-view scenarios with insufficient matched features for accurate reconstruction.

Novel View Synthesis SSIM

Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting

no code implementations1 Apr 2024 Jiarui Meng, Haijie Li, Yanmin Wu, Qiankun Gao, Shuzhou Yang, Jian Zhang, Siwei Ma

3D Gaussian Splatting (3DGS) has marked a significant breakthrough in the realm of 3D scene reconstruction and novel view synthesis.

3D Scene Reconstruction Novel View Synthesis

Automated Polyp Segmentation in Colonoscopy Images

no code implementations6 Apr 2024 Swagat Ranjit, Jian Zhang, Bijaya B. Karki

The combination of dilated convolution module, RCCA, and global average pooling was found to be effective for irregular shapes.

Data Augmentation Medical Diagnosis

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection

no code implementations13 Apr 2024 Jian Zhang, Ruiteng Zhang, Xinyue Yan, Xiting Zhuang, Ruicheng Cao

When training the enhancement branch, the object detection subnet in the enhancement branch guides the image enhancement subnet to be optimized towards the direction that is most conducive to the detection task.

Image Enhancement Object +2

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

no code implementations16 Apr 2024 Runyi Li, Xuhan Sheng, Weiqi Li, Jian Zhang

Omnidirectional images (ODIs) are commonly used in real-world visual tasks, and high-resolution ODIs help improve the performance of related visual tasks.

Denoising Domain Generalization +4

Face2Face: Label-driven Facial Retouching Restoration

no code implementations22 Apr 2024 Guanhua Zhao, Yu Gu, Xuhan Sheng, Yujie Hu, Jian Zhang

This poses challenges for fields that place high demands on the authenticity of photographs, such as identity verification and social media.

Image Restoration

ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images

no code implementations25 Apr 2024 Weiqi Li, Shijie Zhao, Bin Chen, Xinhua Cheng, Junlin Li, Li Zhang, Jian Zhang

With the advent of virtual reality technology, omnidirectional image (ODI) rescaling techniques are increasingly embraced for reducing transmitted and stored file sizes while preserving high image quality.

ERP

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

no code implementations25 Apr 2024 Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

Meanwhile, we introduce a sample-level audio localization method and a cross-modal copyright extraction mechanism to couple the information of audio and video frames.

Video Editing

F$^3$low: Frame-to-Frame Coarse-grained Molecular Dynamics with SE(3) Guided Flow Matching

no code implementations1 May 2024 Shaoning Li, Yusong Wang, Mingyu Li, Jian Zhang, Bin Shao, Nanning Zheng, Jian Tang

Molecular dynamics (MD) is a crucial technique for simulating biological systems, enabling the exploration of their dynamic nature and fostering an understanding of their functions and properties.

Structured Control Nets for Deep Reinforcement Learning

1 code implementation ICML 2018 Mario Srouji, Jian Zhang, Ruslan Salakhutdinov

The proposed Structured Control Net (SCN) splits the generic MLP into two separate sub-modules: a nonlinear control module and a linear control module.

Decision Making reinforcement-learning +1

Parameter Efficient Deep Neural Networks with Bilinear Projections

1 code implementation3 Nov 2020 Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang

Recent research on deep neural networks (DNNs) has primarily focused on improving the model accuracy.

Learning Weighting Map for Bit-Depth Expansion within a Rational Range

1 code implementation26 Apr 2022 Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.

SSIM

Self-attention on Multi-Shifted Windows for Scene Segmentation

1 code implementation10 Jul 2022 Litao Yu, Zhibin Li, Jian Zhang, Qiang Wu

Scene segmentation in images is a fundamental yet challenging problem in visual content understanding, which is to learn a model to assign every image pixel to a categorical label.

Descriptive Scene Segmentation +1

Deep Bi-Dense Networks for Image Super-Resolution

1 code implementation11 Oct 2018 Yucheng Wang, Jialiang Shen, Jian Zhang

In this way, feature information propagates from a single dense block to all subsequent blocks, instead of to a single successor.

Image Super-Resolution

ASDN: A Deep Convolutional Network for Arbitrary Scale Image Super-Resolution

1 code implementation6 Oct 2020 Jialiang Shen, Yucheng Wang, Jian Zhang

For SR of small-scales (between 1 and 2), images are constructed by interpolation from a sparse set of precalculated Laplacian pyramid levels.

Image Super-Resolution

NetInfoF Framework: Measuring and Exploiting Network Usable Information

1 code implementation12 Feb 2024 Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well?

Link Prediction Node Classification

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones

1 code implementation23 Jan 2021 Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiushen Wei, Fumin Shen, Jian Zhang, Zhenmin Tang

Labeling objects at a subordinate level typically requires expert knowledge, which is not always available when using random annotators.

Fine-Grained Visual Recognition

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

1 code implementation26 Apr 2022 Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

While conventional CF models are known for facing the challenges of the popularity bias that favors popular items, one may wonder "Whether the existing graph-based CF models alleviate or exacerbate popularity bias of recommender systems?"

Collaborative Filtering Recommendation Systems

Compare More Nuanced:Pairwise Alignment Bilinear Network For Few-shot Fine-grained Learning

1 code implementation7 Apr 2019 Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu

Unlike traditional deep bilinear networks for fine-grained classification, which adopt the self-bilinear pooling to capture the subtle features of images, the proposed model uses a novel pairwise bilinear pooling to compare the nuanced differences between base images and query images for learning a deep distance metric.

General Classification Meta-Learning

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

1 code implementation13 Apr 2024 Qinghe Ma, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

To fully utilize the information within the intermediate domain, we propose a symmetric Guidance training strategy (SymGD), which additionally offers direct guidance to unlabeled data by merging pseudo labels from intermediate samples.

Image Segmentation Segmentation +4

Implicit Multidimensional Projection of Local Subspaces

1 code implementation7 Sep 2020 Rongzheng Bian, Yumeng Xue, Liang Zhou, Jian Zhang, Baoquan Chen, Daniel Weiskopf, Yunhai Wang

We propose a visualization method to understand the effect of multidimensional projection on local subspaces, using implicit function differentiation.

Field-wise Learning for Multi-field Categorical Data

1 code implementation NeurIPS 2020 Zhibin Li, Jian Zhang, Yongshun Gong, Yazhou Yao, Qiang Wu

We present a model that utilizes linear models with variance and low-rank constraints, to help it generalize better and reduce the number of parameters.

Cannot find the paper you are looking for? You can Submit a new open access paper.