Search Results for author: Jian Zhang

Found 280 papers, 99 papers with code

Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark

no code implementations • 4 Jun 2018 • Cody Coleman, Daniel Kang, Deepak Narayanan, Luigi Nardi, Tian Zhao, Jian Zhang, Peter Bailis, Kunle Olukotun, Chris Re, Matei Zaharia

In this work, we analyze the entries from DAWNBench, which received optimized submissions from multiple industrial groups, to investigate the behavior of TTA as a metric as well as trends in the best-performing entries.

Benchmarking BIG-bench Machine Learning

Paper
Add Code

Feature Affinity based Pseudo Labeling for Semi-supervised Person Re-identification

no code implementations • 16 May 2018 • Guodong Ding, Shanshan Zhang, Salman Khan, Zhenmin Tang, Jian Zhang, Fatih Porikli

Our approach measures the affinity of unlabeled samples with the underlying clusters of labeled data samples using the intermediate feature representations from deep networks.

Data Augmentation Representation Learning +1

Paper
Add Code

Globally Variance-Constrained Sparse Representation and Its Application in Image Set Coding

no code implementations • 17 Aug 2016 • Xiang Zhang, Jiarui Sun, Siwei Ma, Zhouchen Lin, Jian Zhang, Shiqi Wang, Wen Gao

Therefore, introducing an accurate rate-constraint in sparse coding and dictionary learning becomes meaningful, which has not been fully exploited in the context of sparse representation.

Data Compression Dictionary Learning

Paper
Add Code

Global Pose Estimation with an Attention-based Recurrent Network

no code implementations • 19 Feb 2018 • Emilio Parisotto, Devendra Singh Chaplot, Jian Zhang, Ruslan Salakhutdinov

The ability for an agent to localize itself within an environment is crucial for many real-world applications.

Feature Engineering Pose Estimation +2

Paper
Add Code

Deep Reinforcement Learning for Image Hashing

no code implementations • 7 Feb 2018 • Yuxin Peng, Jian Zhang, Zhaoda Ye

Inspired by the sequential decision ability of deep reinforcement learning, we propose a new Deep Reinforcement Learning approach for Image Hashing (DRLIH).

Deep Hashing reinforcement-learning +1

Paper
Add Code

SCH-GAN: Semi-supervised Cross-modal Hashing by Generative Adversarial Network

no code implementations • 7 Feb 2018 • Jian Zhang, Yuxin Peng, Mingkuan Yuan

(2) Ignore the rich information contained in the large amount of unlabeled data across different modalities, especially the margin examples that are easily to be incorrectly retrieved, which can help to model the correlations.

Generative Adversarial Network Retrieval

Paper
Add Code

Multi-pseudo Regularized Label for Generated Data in Person Re-Identification

no code implementations • 21 Jan 2018 • Yan Huang, Jinsong Xu, Qiang Wu, Zhedong Zheng, Zhao-Xiang Zhang, Jian Zhang

Unlike the traditional label which usually is a single integral number, the virtual label proposed in this work is a set of weight-based values each individual of which is a number in (0, 1] called multi-pseudo label and reflects the degree of relation between each generated data to every pre-defined class of real data.

Generative Adversarial Network Person Re-Identification +1

Paper
Add Code

Unsupervised Generative Adversarial Cross-modal Hashing

no code implementations • 1 Dec 2017 • Jian Zhang, Yuxin Peng, Mingkuan Yuan

To address the above problem, in this paper we propose an Unsupervised Generative Adversarial Cross-modal Hashing approach (UGACH), which makes full use of GAN's ability for unsupervised representation learning to exploit the underlying manifold structure of cross-modal data.

Cross-Modal Retrieval Generative Adversarial Network +2

Paper
Add Code

Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations • 21 Nov 2017 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge.

Informativeness Question Generation +2

Paper
Add Code

Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement

no code implementations • 19 Nov 2017 • Jun-Jie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu

These comments can be a description of the image, or some objects, attributes, scenes in it, which are normally used as the user-provided tags.

Retrieval TAG

Paper
Add Code

SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

no code implementations • 28 Jul 2016 • Jian Zhang, Yuxin Peng

(2) A semi-supervised deep hashing network is designed to extensively exploit both labeled and unlabeled data, in which we propose an online graph construction method to benefit from the evolving deep features during training to better capture semantic neighbors.

Deep Hashing graph construction +3

Paper
Add Code

Towards Automatic Construction of Diverse, High-quality Image Dataset

no code implementations • 22 Aug 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Li Liu, Fan Zhu, Dongxiang Zhang, Heng-Tao Shen

To eliminate manual annotation, in this work, we propose a novel image dataset construction framework by employing multiple textual queries.

Image Classification object-detection +2

Paper
Add Code

Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data

no code implementations • 17 Aug 2017 • Thorsten Kurth, Jian Zhang, Nadathur Satish, Ioannis Mitliagkas, Evan Racah, Mostofa Ali Patwary, Tareq Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey

This paper presents the first, 15-PetaFLOP Deep Learning system for solving scientific pattern classification problems on contemporary HPC architectures.

General Classification

Paper
Add Code

Place recognition: An Overview of Vision Perspective

no code implementations • 17 Jun 2017 • Zhiqiang Zeng, Jian Zhang, Xiaodong Wang, Yuming Chen, Chaoyang Zhu

Place recognition is one of the most fundamental topics in computer vision and robotics communities, where the task is to accurately and efficiently recognize the location of a given query image.

Image Classification Image Retrieval +4

Paper
Add Code

A New Probabilistic Algorithm for Approximate Model Counting

no code implementations • 13 Jun 2017 • Cunjing Ge, Feifei Ma, Tian Liu, Jian Zhang

Constrained counting is important in domains ranging from artificial intelligence to software analysis.

Paper
Add Code

Query-adaptive Image Retrieval by Deep Weighted Hashing

no code implementations • 8 Dec 2016 • Jian Zhang, Yuxin Peng

On the other hand, different hash bits actually contribute to the image retrieval differently, and treating them equally greatly affects the retrieval accuracy of image.

Deep Hashing Image Retrieval

Paper
Add Code

Exploiting Web Images for Dataset Construction: A Domain Robust Approach

no code implementations • 22 Nov 2016 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Jingsong Xu, Zhenmin Tang

To reduce the cost of manual labelling, there has been increased research interest in automatically constructing image datasets by exploiting web images.

Domain Adaptation Image Classification +2

Paper
Add Code

Refining Image Categorization by Exploiting Web Images and General Corpus

no code implementations • 16 Mar 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang

To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantic rich (sub-)categories.

Image Categorization

Paper
Add Code

Multi-Label Image Classification with Regional Latent Semantic Dependencies

no code implementations • 4 Dec 2016 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu

Recent state-of-the-art approaches to multi-label image classification exploit the label dependencies in an image, at global level, largely improving the labeling capacity.

Classification General Classification +1

Paper
Add Code

A coarse-to-fine algorithm for registration in 3D street-view cross-source point clouds

no code implementations • 24 Oct 2016 • Xiaoshui Huang, Jian Zhang, Qiang Wu, Lixin Fan, Chun Yuan

In this paper, different from previous ICP-based methods, and from a statistic view, we propose a effective coarse-to-fine algorithm to detect and register a small scale SFM point cloud in a large scale Lidar point cloud.

Paper
Add Code

A Systematic Approach for Cross-source Point Cloud Registration by Preserving Macro and Micro Structures

no code implementations • 18 Aug 2016 • Xiaoshui Huang, Jian Zhang, Lixin Fan, Qiang Wu, Chun Yuan

We propose a systematic approach for registering cross-source point clouds.

graph construction Graph Matching +1

Paper
Add Code

Parallel SGD: When does averaging help?

no code implementations • 23 Jun 2016 • Jian Zhang, Christopher De Sa, Ioannis Mitliagkas, Christopher Ré

Consider a number of workers running SGD independently on the same pool of data and averaging the models every once in a while -- a common but not well understood practice.

Paper
Add Code

A Tool for Computing and Estimating the Volume of the Solution Space of SMT(LA)

no code implementations • 1 Jul 2015 • Cunjing Ge, Feifei Ma, Jian Zhang

There are already quite a few tools for solving the Satisfiability Modulo Theories (SMT) problems.

Paper
Add Code

Abrupt Motion Tracking via Nearest Neighbor Field Driven Stochastic Sampling

no code implementations • 28 Oct 2014 • Tianfei Zhou, Yao Lu, Feng Lv, Huijun Di, Qingjie Zhao, Jian Zhang

Stochastic sampling based trackers have shown good performance for abrupt motion tracking so that they have gained popularity in recent years.

Motion Detection

Paper
Add Code

Image Restoration Using Joint Statistical Modeling in Space-Transform Domain

no code implementations • 11 May 2014 • Jian Zhang, Debin Zhao, Ruiqin Xiong, Siwei Ma, Wen Gao

This paper presents a novel strategy for high-fidelity image restoration by characterizing both local smoothness and nonlocal self-similarity of natural images in a unified statistical manner.

Deblurring Image Deblurring +3

Paper
Add Code

Image Compressive Sensing Recovery Using Adaptively Learned Sparsifying Basis via L0 Minimization

no code implementations • 30 Apr 2014 • Jian Zhang, Chen Zhao, Debin Zhao, Wen Gao

From many fewer acquired measurements than suggested by the Nyquist sampling theory, compressive sensing (CS) theory demonstrates that, a signal can be reconstructed with high probability when it exhibits sparsity in some domain.

Blocking Compressive Sensing

Paper
Add Code

Structural Group Sparse Representation for Image Compressive Sensing Recovery

no code implementations • 29 Apr 2014 • Jian Zhang, Debin Zhao, Feng Jiang, Wen Gao

Compressive Sensing (CS) theory shows that a signal can be decoded from many fewer measurements than suggested by the Nyquist sampling theory, when the signal is sparse in some domain.

Compressive Sensing

Paper
Add Code

Spatially Directional Predictive Coding for Block-based Compressive Sensing of Natural Images

no code implementations • 29 Apr 2014 • Jian Zhang, Debin Zhao, Feng Jiang

At the encoder, for each block of compressive sensing (CS) measurements, the optimal pre-diction is selected from a set of prediction candidates that are generated by four designed directional predictive modes.

Compressive Sensing Decoder

Paper
Add Code

Characterizing A Database of Sequential Behaviors with Latent Dirichlet Hidden Markov Models

no code implementations • 24 May 2013 • Yin Song, Longbing Cao, Xuhui Fan, Wei Cao, Jian Zhang

These sequence-level latent parameters for each sequence are modeled as latent Dirichlet random variables and parameterized by a set of deterministic database-level hyper-parameters.

General Classification

Paper
Add Code

Scene Text Recognition from Two-Dimensional Perspective

no code implementations • 18 Sep 2018 • Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai

Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recognition as a sequence prediction problem.

Ranked #32 on Scene Text Recognition on SVT

Scene Text Recognition Semantic Segmentation +4

Paper
Add Code

Performance assessment of the deep learning technologies in grading glaucoma severity

no code implementations • 31 Oct 2018 • Yi Zhen, Lei Wang, Han Liu, Jian Zhang, Jiantao Pu

Among these CNNs, the DenseNet had the highest classification accuracy (i. e., 75. 50%) based on pre-trained weights when using global ROIs, as compared to 65. 50% when using local ROIs.

Paper
Add Code

Face Detection with Effective Feature Extraction

no code implementations • 29 Sep 2010 • Sakrapee Paisitkriangkrai, Chunhua Shen, Jian Zhang

There is an abundant literature on face detection due to its important role in many vision applications.

Face Detection

Paper
Add Code

Fast Gated Neural Domain Adaptation: Language Model as a Case Study

no code implementations • COLING 2016 • Jian Zhang, Xiaofeng Wu, Andy Way, Qun Liu

We show that the neural LM perplexity can be reduced by 7. 395 and 12. 011 using the proposed domain adaptation mechanism on the Penn Treebank and News data, respectively.

Domain Adaptation Language Modelling +2

Paper
Add Code

Topic-Informed Neural Machine Translation

no code implementations • COLING 2016 • Jian Zhang, Liangyou Li, Andy Way, Qun Liu

In recent years, neural machine translation (NMT) has demonstrated state-of-the-art machine translation (MT) performance.

Machine Translation NMT +2

Paper
Add Code

Message Passing Inference for Large Scale Graphical Models with High Order Potentials

no code implementations • NeurIPS 2014 • Jian Zhang, Alex Schwing, Raquel Urtasun

To keep up with the Big Data challenge, parallelized algorithms based on dual decomposition have been proposed to perform inference in Markov random fields.

Semantic Segmentation

Paper
Add Code

Multiple Instance Learning on Structured Data

no code implementations • NeurIPS 2011 • Dan Zhang, Yan Liu, Luo Si, Jian Zhang, Richard D. Lawrence

Ignoring this structure information limits the performance of existing MIL algorithms.

Multiple Instance Learning

Paper
Add Code

Fine-Grained Video Captioning for Sports Narrative

no code implementations • CVPR 2018 • Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang

First, to facilitate this novel research of fine-grained video caption, we collected a novel dataset called Fine-grained Sports Narrative dataset (FSN) that contains 2K sports videos with ground-truth narratives from YouTube. com.

2k Video Captioning

Paper
Add Code

Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations • ECCV 2018 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Informativeness Question Generation +2

Paper
Add Code

Assessment of central serous chorioretinopathy (CSC) depicted on color fundus photographs using deep Learning

no code implementations • 14 Jan 2019 • Yi Zhen, Hang Chen, Xu Zhang, Meng Liu, Xin Meng, Jian Zhang, Jiantao Pu

To investigate whether and to what extent central serous chorioretinopathy (CSC) depicted on color fundus photographs can be assessed using deep learning technology.

Paper
Add Code

Image Denoising via Adaptive Soft-Thresholding Based on Non-Local Samples

no code implementations • CVPR 2015 • Hangfan Liu, Ruiqin Xiong, Jian Zhang, Wen Gao

To estimate the expectation and variance parameters for the transform bands of a particular patch, we exploit the non-local correlation of image and collect a set of similar patches as data samples to form the distribution.

Image Denoising

Paper
Add Code

Higher-Order Inference for Multi-Class Log-Supermodular Models

no code implementations • ICCV 2015 • Jian Zhang, Josip Djolonga, Andreas Krause

Higher-order models have been shown to be very useful for a plethora of computer vision tasks.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Experiments in Medical Translation Shared Task at WMT 2014

no code implementations • WS 2014 • Jian Zhang

Language Modelling Machine Translation +1

Paper
Add Code

Target-Centric Features for Translation Quality Estimation

no code implementations • WS 2014 • Chris Hokamp, Iacer Calixto, Joachim Wagner, Jian Zhang

Language Modelling Machine Translation +2

Paper
Add Code

PLUTO: Automated Solutions for Patent Translation

no code implementations • WS 2012 • John Tinsley, Alex Ceausu, ru, Jian Zhang

Domain Adaptation Machine Translation +1

Paper
Add Code

Unsupervised Part Mining for Fine-grained Image Classification

no code implementations • 26 Feb 2019 • Runsheng Zhang, Jian Zhang, Yaping Huang, Qi Zou

To tackle this issue, we propose a fully unsupervised part mining (UPM) approach to localize the discriminative parts without even image-level annotations, which largely improves the fine-grained classification performance.

Classification Fine-Grained Image Classification +2

Paper
Add Code

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

no code implementations • 25 Feb 2019 • Fan Fei, Zhan Tu, Jian Zhang, Xinyan Deng

Inspired by the hummingbirds' near-maximal performance during such extreme maneuvers, we developed a flight control strategy and experimentally demonstrated that such maneuverability can be achieved by an at-scale 12-gram hummingbird robot equipped with just two actuators.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Fast Registration for cross-source point clouds by using weak regional affinity and pixel-wise refinement

no code implementations • 11 Mar 2019 • Xiaoshui Huang, Lixin Fan, Qiang Wu, Jian Zhang, Chun Yuan

Accurate and fast registration of cross-source 3D point clouds from different sensors is an emerged research problem in computer vision.

Point Cloud Registration

Paper
Add Code

Towards Locally Consistent Object Counting with Constrained Multi-stage Convolutional Neural Networks

no code implementations • 6 Apr 2019 • Muming Zhao, Jian Zhang, Chongyang Zhang, Wenjun Zhang

Towards this problem, in this paper we propose a constrained multi-stage Convolutional Neural Networks (CNNs) to jointly pursue locally consistent density map from two aspects.

Object Object Counting

Paper
Add Code

Single Pixel Reconstruction for One-stage Instance Segmentation

no code implementations • 16 Apr 2019 • Jun Yu, Jinghan Yao, Jian Zhang, Zhou Yu, DaCheng Tao

In this paper, we propose a one-stage framework, SPRNet, which performs efficient instance segmentation by introducing a single pixel reconstruction (SPR) branch to off-the-shelf one-stage detectors.

Instance Segmentation Region Proposal +2

Paper
Add Code

Local Deep-Feature Alignment for Unsupervised Dimension Reduction

no code implementations • 22 Apr 2019 • Jian Zhang, Jun Yu, DaCheng Tao

Next, we exploit an affine transformation to align the local deep features of each neighbourhood with the global features.

Clustering Data Visualization +1

Paper
Add Code

Low-Memory Neural Network Training: A Technical Report

no code implementations • 24 Apr 2019 • Nimit S. Sohoni, Christopher R. Aberger, Megan Leszczynski, Jian Zhang, Christopher Ré

In this paper we study a fundamental question: How much memory is actually needed to train a neural network?

Image Classification Machine Translation +1

Paper
Add Code

Inferring the Importance of Product Appearance: A Step Towards the Screenless Revolution

no code implementations • 1 May 2019 • Yongshun Gong, Jin-Feng Yi, Dong-Dong Chen, Jian Zhang, Jiayu Zhou, Zhihua Zhou

In this paper, we aim to infer the significance of every item's appearance in consumer decision making and identify the group of items that are suitable for screenless shopping.

Decision Making

Paper
Add Code

Structured Discriminative Tensor Dictionary Learning for Unsupervised Domain Adaptation

no code implementations • 11 May 2019 • Songsong Wu, Yan Yan, Hao Tang, Jianjun Qian, Jian Zhang, Xiao-Yuan Jing

However, the number of labeled source samples are always limited due to expensive annotation cost in practice, making sub-optimal performance been observed.

Dictionary Learning Pseudo Label +1

Paper
Add Code

Glioma Grade Prediction Using Wavelet Scattering-Based Radiomics

no code implementations • 23 May 2019 • Qijian Chen, Lihui Wang, Li Wang, Zeyu Deng, Jian Zhang, Yuemin Zhu

Glioma grading before surgery is very critical for the prognosis prediction and treatment plan making.

Dimensionality Reduction regression

Paper
Add Code

Extracting Visual Knowledge from the Internet: Making Sense of Image Data

no code implementations • 7 Jun 2019 • Yazhou Yao, Jian Zhang, Xian-Sheng Hua, Fumin Shen, Zhenmin Tang

Recent successes in visual recognition can be primarily attributed to feature representation, learning algorithms, and the ever-increasing size of labeled training data.

Representation Learning

Paper
Add Code

Sample Adaptive Multiple Kernel Learning for Failure Prediction of Railway Points

no code implementations • 2 Jul 2019 • Zhibin Li, Jian Zhang, Qiang Wu, Yongshun Gong, Jin-Feng Yi, Christina Kirsch

In this paper, we formulate our prediction task as a multiple kernel learning problem with missing kernels.

Paper
Add Code

Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification

no code implementations • 4 Aug 2019 • Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Jingsong Xu, Qiang Wu

A novel low-rank pairwise bilinear pooling operation is proposed to capture the nuanced differences between the support and query images for learning an effective distance metric.

Classification Few-Shot Learning +2

Paper
Add Code

Exploiting Channel Similarity for Accelerating Deep Convolutional Neural Networks

no code implementations • 6 Aug 2019 • Yunxiang Zhang, Chenglong Zhao, Bingbing Ni, Jian Zhang, Haoran Deng

To address the limitations of existing magnitude-based pruning algorithms in cases where model weights or activations are of large and similar magnitude, we propose a novel perspective to discover parameter redundancy among channels and accelerate deep CNNs via channel pruning.

Clustering

Paper
Add Code

C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks

no code implementations • 19 Aug 2019 • Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li

Although two-stage detectors like Faster R-CNN achieved big successes in object detection due to the strategy of extracting region proposals by region proposal network, they show their poor adaption in real-world object detection as a result of without considering mining hard samples during extracting region proposals.

Object object-detection +2

Paper
Add Code

PipeMare: Asynchronous Pipeline Parallel DNN Training

no code implementations • 9 Oct 2019 • Bowen Yang, Jian Zhang, Jonathan Li, Christopher Ré, Christopher R. Aberger, Christopher De Sa

Pipeline parallelism (PP) when training neural networks enables larger models to be partitioned spatially, leading to both lower network communication and overall higher hardware utilization.

Paper
Add Code

Variational Few-Shot Learning

no code implementations • ICCV 2019 • Jian Zhang, Chenglong Zhao, Bingbing Ni, Minghao Xu, Xiaokang Yang

We propose a variational Bayesian framework for enhancing few-shot learning performance.

Few-Shot Learning Metric Learning +1

Paper
Add Code

Worst Cases Policy Gradients

no code implementations • 9 Nov 2019 • Yichuan Charlie Tang, Jian Zhang, Ruslan Salakhutdinov

Recent advances in deep reinforcement learning have demonstrated the capability of learning complex control policies from many types of environments.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Time-aware Gradient Attack on Dynamic Network Link Prediction

no code implementations • 24 Nov 2019 • Jinyin Chen, Jian Zhang, Zhi Chen, Min Du, Qi Xuan

In this work, we present the first study of adversarial attack on dynamic network link prediction (DNLP).

Adversarial Attack Link Prediction +1

Paper
Add Code

Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development

no code implementations • 7 Dec 2019 • Yongshun Gong, Zhibin Li, Jian Zhang, Wei Liu, Jin-Feng Yi

In this paper, this specific problem is termed as potential passenger flow (PPF) prediction, which is a novel and important study connected with urban computing and intelligent transportation systems.

MULTI-VIEW LEARNING Recommendation Systems

Paper
Add Code

Relational Mimic for Visual Adversarial Imitation Learning

no code implementations • 18 Dec 2019 • Lionel Blondé, Yichuan Charlie Tang, Jian Zhang, Russ Webb

In this work, we introduce a new method for imitation learning from video demonstrations.

Imitation Learning Relational Reasoning

Paper
Add Code

Adversarial AutoAugment

no code implementations • ICLR 2020 • Xin-Yu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

The augmentation policy network attempts to increase the training loss of a target network through generating adversarial augmentation policies, while the target network can learn more robust features from harder examples to improve the generalization.

Ranked #594 on Image Classification on ImageNet

Data Augmentation Image Classification

Paper
Add Code

PMC-GANs: Generating Multi-Scale High-Quality Pedestrian with Multimodal Cascaded GANs

no code implementations • 30 Dec 2019 • Jie Wu, Ying Peng, Chenghao Zheng, Zongbo Hao, Jian Zhang

Recently, generative adversarial networks (GANs) have shown great advantages in synthesizing images, leading to a boost of explorations of using faked images to augment data.

Data Augmentation Pedestrian Detection

Paper
Add Code

Multi-factorial Optimization for Large-scale Virtual Machine Placement in Cloud Computing

no code implementations • 18 Jan 2020 • Zhengping Liang, Jian Zhang, Liang Feng, Zexuan Zhu

However, as growing demand for cloud services, the existing EAs fail to implement in large-scale virtual machine placement (LVMP) problem due to the high time complexity and poor scalability.

Cloud Computing Evolutionary Algorithms

Paper
Add Code

GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency

no code implementations • 11 Mar 2020 • Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li

GID block breaks through the local neighborhoods and captures long-range dependency of pixels both in global-level and instance-level from the scene to help detecting interactions between instances.

Human-Object Interaction Detection Object

Paper
Add Code

DyNet: Dynamic Convolution for Accelerating Convolutional Neural Networks

no code implementations • 22 Apr 2020 • Yikang Zhang, Jian Zhang, Qiang Wang, Zhao Zhong

On one hand, we can reduce the computation cost remarkably while maintaining the performance.

Paper
Add Code

Towards Better Graph Representation: Two-Branch Collaborative Graph Neural Networks for Multimodal Marketing Intention Detection

no code implementations • 13 May 2020 • Lu Zhang, Jian Zhang, Zhibin Li, Jingsong Xu

Inspired by the fact that spreading and collecting information through the Internet becomes the norm, more and more people choose to post for-profit contents (images and texts) in social networks.

Graph Classification Marketing

Paper
Add Code

Contextual Embeddings: When Are They Worth It?

no code implementations • ACL 2020 • Simran Arora, Avner May, Jian Zhang, Christopher Ré

We study the settings for which deep contextual embeddings (e. g., BERT) give large improvements in performance relative to classic pretrained embeddings (e. g., GloVe), and an even simpler baseline---random word embeddings---focusing on the impact of the training set size and the linguistic properties of the task.

Word Embeddings

Paper
Add Code

TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization with Few Labeled Samples

no code implementations • 28 May 2020 • Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Qiang Wu, Chang Xu

The challenges of high intra-class variance yet low inter-class fluctuations in fine-grained visual categorization are more severe with few labeled samples, \textit{i. e.,} Fine-Grained categorization problems under the Few-Shot setting (FGFS).

Fine-Grained Visual Categorization

Paper
Add Code

Automated Radiological Report Generation For Chest X-Rays With Weakly-Supervised End-to-End Deep Learning

no code implementations • 18 Jun 2020 • Shuai Zhang, Xiaoyan Xin, Yang Wang, Yachong Guo, Qiuqiao Hao, Xianfeng Yang, Jun Wang, Jian Zhang, Bing Zhang, Wei Wang

The model provides automated recognition of given scans and generation of reports.

Paper
Add Code

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

no code implementations • 27 Jun 2020 • Qian Li, Qingyuan Hu, Yong Qi, Saiyu Qi, Jie Ma, Jian Zhang

SBA stochastically decides whether to augment at iterations controlled by the batch scheduler and in which a ''distilled'' dynamic soft label regularization is introduced by incorporating the similarity in the vicinity distribution respect to raw samples.

Data Augmentation

Paper
Add Code

A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification

no code implementations • 3 Jul 2020 • Mengxi Jia, Yunpeng Zhai, Shijian Lu, Siwei Ma, Jian Zhang

RGB-Infrared (IR) cross-modality person re-identification (re-ID), which aims to search an IR image in RGB gallery or vice versa, is a challenging task due to the large discrepancy between IR and RGB modalities.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks

no code implementations • 3 Aug 2020 • Haoqiang Guo, Lu Peng, Jian Zhang, Fang Qi, Lide Duan

Recent studies identify that Deep learning Neural Networks (DNNs) are vulnerable to subtle perturbations, which are not perceptible to human visual system but can fool the DNN models and lead to wrong outputs.

Adversarial Attack Computational Efficiency

Paper
Add Code

Salvage Reusable Samples from Noisy Data for Robust Learning

1 code implementation • 6 Aug 2020 • Zeren Sun, Xian-Sheng Hua, Yazhou Yao, Xiu-Shen Wei, Guosheng Hu, Jian Zhang

To this end, we propose a certainty-based reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.

Memorization

Paper
Code

Face Anti-Spoofing Via Disentangled Representation Learning

no code implementations • ECCV 2020 • Ke-Yue Zhang, Taiping Yao, Jian Zhang, Ying Tai, Shouhong Ding, Jilin Li, Feiyue Huang, Haichuan Song, Lizhuang Ma

Face anti-spoofing is crucial to security of face recognition systems.

Disentanglement Face Anti-Spoofing +1

Paper
Add Code

Computational prediction of RNA tertiary structures using machine learning methods

no code implementations • 3 Sep 2020 • Bin Huang, Yuanyang Du, Shuai Zhang, Wenfei Li, Jun Wang, Jian Zhang

RNAs play crucial and versatile roles in biological processes.

BIG-bench Machine Learning

Paper
Add Code

Scalar Coupling Constant Prediction Using Graph Embedding Local Attention Encoder

no code implementations • 7 Sep 2020 • Caiqing Jian, Xinyu Cheng, Jian Zhang, Lihui Wang

The experimental results demonstrate that, compared to the traditional chemical bond structure representations, the rotation and translation invariant structure representations proposed in this work can improve the SCC prediction accuracy; with the graph embedded local self-attention, the mean absolute error (MAE) of the prediction model in the validation set decreases from 0. 1603 Hz to 0. 1067 Hz; using the classification based loss function instead of the scaled regression loss, the MAE of the predicted SCC can be decreased to 0. 0963 HZ, which is close to the quantum chemistry standard on CHAMPS dataset.

Graph Embedding

Paper
Add Code

Multi-Representation Ensemble in Few-Shot Learning

no code implementations • 1 Jan 2021 • Qing Chen, Jian Zhang

Deep neural networks (DNNs) compute representations in a layer by layer fashion, producing a final representation at the top layer of the pipeline, and classification or regression is made using the final representation.

Few-Shot Learning

Paper
Add Code

Revisiting BFfloat16 Training

no code implementations • 1 Jan 2021 • Pedram Zamirai, Jian Zhang, Christopher R Aberger, Christopher De Sa

We ask can we do pure 16-bit training which requires only 16-bit compute units, while still matching the model accuracy attained by 32-bit training.

Paper
Add Code

Uncertainty Weighted Offline Reinforcement Learning

no code implementations • 1 Jan 2021 • Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.

Offline RL Q-Learning +2

Paper
Add Code

Revisiting BFloat16 Training

no code implementations • 13 Oct 2020 • Pedram Zamirai, Jian Zhang, Christopher R. Aberger, Christopher De Sa

State-of-the-art generic low-precision training algorithms use a mix of 16-bit and 32-bit precision, creating the folklore that 16-bit hardware compute units alone are not enough to maximize model accuracy.

Paper
Add Code

Face Mask Assistant: Detection of Face Mask Service Stage Based on Mobile Phone

no code implementations • 9 Oct 2020 • Yuzhen Chen, Menghan Hu, Chunjun Hua, Guangtao Zhai, Jian Zhang, Qingli Li, Simon X. Yang

Aimed at solving the problem that we don't know which service stage of the mask belongs to, we propose a detection system based on the mobile phone.

Paper
Add Code

AutoBSS: An Efficient Algorithm for Block Stacking Style Search

no code implementations • NeurIPS 2020 • Yikang Zhang, Jian Zhang, Zhao Zhong

Neural network architecture design mostly focuses on the new convolutional operator or special topological structure of network block, little attention is drawn to the configuration of stacking each block, called Block Stacking Style (BSS).

AutoML Bayesian Optimization +6

Paper
Add Code

Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

no code implementations • 20 Oct 2020 • Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera.

Paper
Add Code

Distribution-aware Margin Calibration for Medical Image Segmentation

no code implementations • 3 Nov 2020 • Zhibin Li, Litao Yu, Jian Zhang

In this paper, we present a novel data-distribution-aware margin calibration method for a better generalization of the mIoU over the whole data-distribution, underpinned by a rigid lower bound.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

Dual Attention on Pyramid Feature Maps for Image Captioning

no code implementations • 2 Nov 2020 • Litao Yu, Jian Zhang, Qiang Wu

In this paper, we propose to apply dual attention on pyramid image feature maps to fully explore the visual-semantic correlations and improve the quality of generated sentences.

Descriptive Image Captioning

Paper
Add Code

Multi-layer Feature Aggregation for Deep Scene Parsing Models

no code implementations • 4 Nov 2020 • Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang, Qiang Wu

The proposed module can auto-select the intermediate visual features to correlate the spatial and semantic information.

Ranked #47 on Semantic Segmentation on NYU Depth v2

Scene Parsing Semantic Segmentation

Paper
Add Code

Time-Series Snapshot Network for Partner Recommendation: A Case Study on OSS

no code implementations • 18 Nov 2020 • Jinyin Chen, Yunyi Xie, Jian Zhang, Xincheng Shu, Qi Xuan

In this paper, we introduce time-series snapshot network (TSSN) which is a mixture network to model the interactions among users and developers.

Social and Information Networks

Paper
Add Code

Rigid and Articulated Point Registration with Expectation Conditional Maximization

no code implementations • 9 Dec 2020 • Radu Horaud, Florence Forbes, Manuel Yguel, Guillaume Dewaele, Jian Zhang

This paper addresses the issue of matching rigid and articulated shapes through probabilistic point registration.

Paper
Add Code

PTN: A Poisson Transfer Network for Semi-supervised Few-shot Learning

no code implementations • 20 Dec 2020 • Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Chang Xu

Second, the extra unlabeled samples are employed to transfer the knowledge from base classes to novel classes through contrastive learning.

Contrastive Learning Few-Shot Learning

Paper
Add Code

Multiple Instance Segmentation in Brachial Plexus Ultrasound Image Using BPMSegNet

no code implementations • 22 Dec 2020 • Yi Ding, Qiqi Yang, Guozheng Wu, Jian Zhang, Zhiguang Qin

In this paper, a network called Brachial Plexus Multi-instance Segmentation Network (BPMSegNet) is proposed to identify different tissues (nerves, arteries, veins, muscles) in ultrasound images.

Instance Segmentation Semantic Segmentation

Paper
Add Code

Counting the Number of Solutions to Constraints

no code implementations • 28 Dec 2020 • Jian Zhang, Cunjing Ge, Feifei Ma

Compared with constraint satisfaction problems, counting problems have received less attention.

Paper
Add Code

Aurora Guard: Reliable Face Anti-Spoofing via Mobile Lighting System

no code implementations • 1 Feb 2021 • Jian Zhang, Ying Tai, Taiping Yao, Jia Meng, Shouhong Ding, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

Face authentication on mobile end has been widely applied in various scenarios.

Face Anti-Spoofing

Paper
Add Code

Temporal-Amount Snapshot MultiGraph for Ethereum Transaction Tracking

no code implementations • 16 Feb 2021 • Yunyi Xie, Jie Jin, Jian Zhang, Shanqing Yu, Qi Xuan

With the wide application of blockchain in the financial field, the rise of various types of cybercrimes has brought great challenges to the security of blockchain.

Link Prediction

Paper
Add Code

Temperature dependent coherence properties of NV ensemble in diamond up to 600K

no code implementations • 25 Feb 2021 • Shengran Lin, Changfeng Weng, Yuanjie Yang, Jiaxin Zhao, Yuhang Guo, Jian Zhang, Liren Lou, Wei Zhu, Guanzhong Wang

Nitrogen-vacancy (NV) center in diamond is an ideal candidate for quantum sensors because of its excellent optical and coherence property.

Quantum Physics Mesoscale and Nanoscale Physics

Paper
Add Code

A comprehensive survey on point cloud registration

no code implementations • 3 Mar 2021 • Xiaoshui Huang, Guofeng Mei, Jian Zhang, Rana Abbas

This survey conducts a comprehensive survey, including both same-source and cross-source registration methods, and summarize the connections between optimization-based and deep learning methods, to provide further research insight.

3D Reconstruction Point Cloud Registration

Paper
Add Code

MACD R-CNN: An Abnormal Cell Nucleus Detection Method

no code implementations • 28 Jul 2020 • Baoyan Ma, Jian Zhang, Feng Cao, Yongjun He

We design a fixed proposal module to generate fixed-sized feature maps of nuclei, which allows the new information of nucleus is used for classification.

Cell Detection Classification +4

Paper
Add Code

Thousand to One: Semantic Prior Modeling for Conceptual Coding

no code implementations • 12 Mar 2021 • Jianhui Chang, Zhenghui Zhao, Lingbo Yang, Chuanmin Jia, Jian Zhang, Siwei Ma

To this end, we propose a novel end-to-end semantic prior modeling-based conceptual coding scheme towards extremely low bitrate image compression, which leverages semantic-wise deep representations as a unified prior for entropy estimation and texture synthesis.

Image Compression Semantic Segmentation +1

Paper
Add Code

Jo-SRC: A Contrastive Approach for Combating Noisy Labels

no code implementations • CVPR 2021 • Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang

Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance.

Contrastive Learning Memorization

Paper
Add Code

Super-Resolving Compressed Video in Coding Chain

no code implementations • 26 Mar 2021 • Dewang Hou, Yang Zhao, Yuyao Ye, Jiayu Yang, Jian Zhang, Ronggang Wang

Scaling and lossy coding are widely used in video transmission and storage.

Decoder

Paper
Add Code

Max-min Fairness of K-user Cooperative Rate-Splitting in MISO Broadcast Channel with User Relaying

no code implementations • 17 Oct 2019 • Yijie Mao, Bruno Clerckx, Jian Zhang, Victor O. K. Li, Mohammed Arafah

Cooperative Rate-Splitting (CRS) strategy, relying on linearly precoded rate-splitting at the transmitter and opportunistic transmission of the common message by the relaying user, has recently been shown to outperform typical Non-cooperative Rate-Splitting (NRS), Cooperative Non-Orthogonal Multiple Access (C-NOMA) and Space Division Multiple Access (SDMA) in a two-user Multiple Input Single Output (MISO) Broadcast Channel (BC) with user relaying.

Fairness Scheduling

Paper
Add Code

Underwater Target Recognition based on Multi-Decision LOFAR Spectrum Enhancement: A Deep Learning Approach

no code implementations • 26 Apr 2021 • Jie Chen, Jie Liu, Chang Liu, Jian Zhang, Bing Han

To overcome this issue and to further improve the recognition performance, we adopt a deep learning approach for underwater target recognition and propose a LOFAR spectrum enhancement (LSE)-based underwater target recognition scheme, which consists of preprocessing, offline training, and online testing.

Paper
Add Code

Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching

no code implementations • 18 May 2021 • Bofeng Wu, guocheng niu, Jun Yu, Xinyan Xiao, Jian Zhang, Hua Wu

This paper proposes an approach to Dense Video Captioning (DVC) without pairwise event-sentence annotation.

Caption Generation Cross-Modal Retrieval +4

Paper
Add Code

Unsupervised Sentiment Analysis by Transferring Multi-source Knowledge

no code implementations • 9 May 2021 • Yong Dai, Jian Liu, Jian Zhang, Hongguang Fu, Zenglin Xu

The first mechanism is a selective domain adaptation (SDA) method, which transfers knowledge from the closest source domain.

Domain Adaptation Sentiment Analysis

Paper
Add Code

A Multi-Branch Hybrid Transformer Networkfor Corneal Endothelial Cell Segmentation

no code implementations • 21 May 2021 • Yinglin Zhang, Risa Higashita, Huazhu Fu, Yanwu Xu, Yang Zhang, Haofeng Liu, Jian Zhang, Jiang Liu

Corneal endothelial cell segmentation plays a vital role inquantifying clinical indicators such as cell density, coefficient of variation, and hexagonality.

Cell Segmentation

Paper
Add Code

Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream

no code implementations • CVPR 2021 • Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang

Different from the conventional digital cameras that compact the photoelectric information within the exposure interval into a single snapshot, the spike camera produces a continuous spike stream to record the dynamic light intensity variation process.

Image Reconstruction

Paper
Add Code

Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification

no code implementations • 6 Jul 2021 • Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang

To better eliminate interference from occlusions, we design a contrast feature learning technique (CFL) for better separation of occlusion features and discriminative ID features.

Decoder Person Re-Identification +1

Paper
Add Code

EGC2: Enhanced Graph Classification with Easy Graph Compression

1 code implementation • 16 Jul 2021 • Jinyin Chen, Haiyang Xiong, Haibin Zhenga, Dunjie Zhang, Jian Zhang, Mingwei Jia, Yi Liu

To achieve lower-complexity defense applied to graph classification models, EGC2 utilizes a centrality-based edge-importance index to compress the graphs, filtering out trivial structures and adversarial perturbations in the input graphs, thus improving the model's robustness.

Graph Classification

Paper
Code

Multi-Level Contrastive Learning for Few-Shot Problems

no code implementations • 15 Jul 2021 • Qing Chen, Jian Zhang

Most current applications of contrastive learning benefit only a single representation from the last layer of an encoder. In this paper, we propose a multi-level contrasitive learning approach which applies contrastive losses at different layers of an encoder to learn multiple representations from the encoder.

Contrastive Learning Few-Shot Learning

Paper
Add Code

Structure Destruction and Content Combination for Face Anti-Spoofing

no code implementations • 22 Jul 2021 • Ke-Yue Zhang, Taiping Yao, Jian Zhang, Shice Liu, Bangjie Yin, Shouhong Ding, Jilin Li

In pursuit of consolidating the face verification systems, prior face anti-spoofing studies excavate the hidden cues in original images to discriminate real persons and diverse attack types with the assistance of auxiliary supervision.

Face Anti-Spoofing Face Verification +1

Paper
Add Code

Serving DNN Models with Multi-Instance GPUs: A Case of the Reconfigurable Machine Scheduling Problem

no code implementations • 18 Sep 2021 • Cheng Tan, Zhichao Li, Jian Zhang, Yu Cao, Sikai Qi, Zherui Liu, Yibo Zhu, Chuanxiong Guo

With MIG, A100 can be the most cost-efficient GPU ever for serving Deep Neural Networks (DNNs).

Scheduling

Paper
Add Code

Super Resolve Dynamic Scene From Continuous Spike Streams

no code implementations • ICCV 2021 • Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang

In this paper, we properly exploit the relative motion and derive the relationship between light intensity and each spike, so as to recover the external scene with both high temporal and high spatial resolution.

Super-Resolution

Paper
Add Code

Dyn-Backdoor: Backdoor Attack on Dynamic Link Prediction

no code implementations • 8 Oct 2021 • Jinyin Chen, Haiyang Xiong, Haibin Zheng, Jian Zhang, Guodong Jiang, Yi Liu

Backdoor attacks induce the DLP methods to make wrong prediction by the malicious training data, i. e., generating a subgraph sequence as the trigger and embedding it to the training data.

Backdoor Attack Dynamic Link Prediction +1

Paper
Add Code

Research on the Inverse Kinematics Prediction of a Soft Biomimetic Actuator via BP Neural Network

no code implementations • 26 Oct 2021 • Huichen Ma, Junjie Zhou, Jian Zhang, Lingyu Zhang

After training with sample data, the BP neural network model can represent the relation between the manipulator tip position and the pressure applied to the chambers.

Motion Planning Position

Paper
Add Code

ε-weakened Robustness of Deep Neural Networks

no code implementations • 29 Oct 2021 • Pei Huang, Yuting Yang, Minghao Liu, Fuqi Jia, Feifei Ma, Jian Zhang

This paper introduces a notation of $\varepsilon$-weakened robustness for analyzing the reliability and stability of deep neural networks (DNNs).

Paper
Add Code

Can Graph Neural Networks Learn to Solve MaxSAT Problem?

no code implementations • 15 Nov 2021 • Minghao Liu, Fuqi Jia, Pei Huang, Fan Zhang, Yuchen Sun, Shaowei Cai, Feifei Ma, Jian Zhang

With the rapid development of deep learning techniques, various recent work has tried to apply graph neural networks (GNNs) to solve NP-hard problems such as Boolean Satisfiability (SAT), which shows the potential in bridging the gap between machine learning and symbolic reasoning.

Paper
Add Code

GenReg: Deep Generative Method for Fast Point Cloud Registration

no code implementations • 23 Nov 2021 • Xiaoshui Huang, Zongyi Xu, Guofeng Mei, Sheng Li, Jian Zhang, Yifan Zuo, Yucheng Wang

To solve this challenge, we propose a new data-driven registration algorithm by investigating deep generative neural networks to point cloud registration.

Point Cloud Registration

Paper
Add Code

DyNet: Dynamic Convolution for Accelerating Convolution Neural Networks

no code implementations • 25 Sep 2019 • Kane Zhang, Jian Zhang, Qiang Wang, Zhao Zhong

To verify the scalability, we also apply DyNet on segmentation task, the results show that DyNet can reduces 69. 3% FLOPs while maintaining the Mean IoU on segmentation task.

Paper
Add Code

COTReg:Coupled Optimal Transport based Point Cloud Registration

no code implementations • 29 Dec 2021 • Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun

Generating a set of high-quality correspondences or matches is one of the most critical steps in point cloud registration.

Point Cloud Registration

Paper
Add Code

Quantifying Robustness to Adversarial Word Substitutions

no code implementations • 11 Jan 2022 • Yuting Yang, Pei Huang, Feifei Ma, Juan Cao, Meishan Zhang, Jian Zhang, Jintao Li

Deep-learning-based NLP models are found to be vulnerable to word substitution perturbations.

Paper
Add Code

Unsupervised Learning on 3D Point Clouds by Clustering and Contrasting

no code implementations • 5 Feb 2022 • Guofeng Mei, Litao Yu, Qiang Wu, Jian Zhang, Mohammed Bennamoun

This paper proposes a general unsupervised approach, named \textbf{ConClu}, to perform the learning of point-wise and global features by jointly leveraging point-level clustering and instance-level contrasting.

3D Object Classification Clustering +2

Paper
Add Code

Series Photo Selection via Multi-view Graph Learning

no code implementations • 18 Mar 2022 • Jin Huang, Lu Zhang, Yongshun Gong, Jian Zhang, Xiushan Nie, Yilong Yin

Series photo selection (SPS) is an important branch of the image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos.

Aesthetics Quality Assessment Graph Learning

Paper
Add Code

A Prompting-based Approach for Adversarial Example Generation and Robustness Enhancement

no code implementations • 21 Mar 2022 • Yuting Yang, Pei Huang, Juan Cao, Jintao Li, Yun Lin, Jin Song Dong, Feifei Ma, Jian Zhang

Our attack technique targets the inherent vulnerabilities of NLP models, allowing us to generate samples even without interacting with the victim NLP model, as long as it is based on pre-trained language models (PLMs).

Adversarial Attack

Paper
Add Code

Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation

no code implementations • CVPR 2022 • Jian Zhang, Yuanqing Zhang, Huan Fu, Xiaowei Zhou, Bowen Cai, Jinchi Huang, Rongfei Jia, Binqiang Zhao, Xing Tang

Neural Radiance Fields (NeRF) have emerged as a potent paradigm for representing scenes and synthesizing photo-realistic images.

Image Generation

Paper
Add Code

Robust Invertible Image Steganography

no code implementations • CVPR 2022 • Youmin Xu, Chong Mou, Yujie Hu, Jingfen Xie, Jian Zhang

Previous image steganography methods are limited in hiding capacity and robustness, commonly vulnerable to distortion on container images such as Gaussian noise, Poisson noise, and lossy compression.

Image Steganography

Paper
Add Code

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.

Image Super-Resolution

Paper
Add Code

Using EBGAN for Anomaly Intrusion Detection

no code implementations • 21 Jun 2022 • Yi Cui, Wenfeng Shen, Jian Zhang, Weijia Lu, Chuang Liu, Lin Sun, Si Chen

The generator in IDS-EBGAN is responsible for converting the original malicious network traffic in the training set into adversarial malicious examples.

Intrusion Detection

Paper
Add Code

Eco-driving for Electric Connected Vehicles at Signalized Intersections: A Parameterized Reinforcement Learning approach

no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Dan Li

This paper proposes an eco-driving framework for electric connected vehicles (CVs) based on reinforcement learning (RL) to improve vehicle energy efficiency at signalized intersections.

Reinforcement Learning (RL)

Paper
Add Code

Learning the policy for mixed electric platoon control of automated and human-driven vehicles at signalized intersection: a random search approach

no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Xiaoyu Shi, Jian Cheng

Meanwhile, the simulation results demonstrate the effectiveness of the delay reward, which is designed to outperform distributed reward mechanism} Compared with normal car-following behavior, the sensitivity analysis reveals that the energy can be saved to different extends (39. 27%-82. 51%) by adjusting the relative importance of the optimization goal.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Horizontal and Vertical Attention in Transformers

no code implementations • 10 Jul 2022 • Litao Yu, Jian Zhang

Transformers are built upon multi-head scaled dot-product attention and positional encoding, which aim to learn the feature representations and token dependencies.

Dimensionality Reduction

Paper
Add Code

Spatial-temporal Analysis for Automated Concrete Workability Estimation

no code implementations • 24 Jul 2022 • Litao Yu, Jian Zhang, Mohammed Bennamoun, Xiaojun Chang, Vute Sirivivatnanon, Ali Nezhad

Concrete workability measure is mostly determined based on subjective assessment of a certified assessor with visual inspections.

regression

Paper
Add Code

D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing

no code implementations • 27 Jul 2022 • Weiqi Li, Bin Chen, Jian Zhang

By unfolding the proposed framework into deep neural networks, we further design a novel Dual-Domain Deep Convolutional Coding Network (D3C2-Net) for CS imaging with the capability of transmitting high-throughput feature-level image representation through all the unfolded stages.

Compressive Sensing

Paper
Add Code

SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning

no code implementations • 23 Sep 2022 • Mario Srouji, Hugues Thomas, Hubert Tsai, Ali Farhadi, Jian Zhang

Collision avoidance is key for mobile robots and agents to operate safely in the real world.

Collision Avoidance reinforcement-learning +2

Paper
Add Code

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

no code implementations • 27 Sep 2022 • Yao-Hung Hubert Tsai, Hanlin Goh, Ali Farhadi, Jian Zhang

The perception system in personalized mobile agents requires developing indoor scene understanding models, which can understand 3D geometries, capture objectiveness, analyze human behaviors, etc.

3D Object Detection Autonomous Driving +9

Paper
Add Code

Multi-Agent Automated Machine Learning

no code implementations • CVPR 2023 • Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu

In this paper, we propose multi-agent automated machine learning (MA2ML) with the aim to effectively handle joint optimization of modules in automated machine learning (AutoML).

Data Augmentation Multi-agent Reinforcement Learning +1

Paper
Add Code

Complementary Labels Learning with Augmented Classes

no code implementations • 19 Nov 2022 • Zhongnian Li, Jian Zhang, Mengting Xu, Xinzheng Xu, Daoqiang Zhang

In this paper, we propose a novel problem setting called Complementary Labels Learning with Augmented Classes (CLLAC), which brings the challenge that classifiers trained by complementary labels should not only be able to classify the instances from observed classes accurately, but also recognize the instance from the Augmented Classes in the testing phase.

Paper
Add Code

Self-Supervised Object Goal Navigation with In-Situ Finetuning

no code implementations • 9 Dec 2022 • So Yeon Min, Yao-Hung Hubert Tsai, Wei Ding, Ali Farhadi, Ruslan Salakhutdinov, Yonatan Bisk, Jian Zhang

In contrast, our LocCon shows the most robust transfer in the real world among the set of models we compare to, and that the real-world performance of all models can be further improved with self-supervised LocCon in-situ training.

Contrastive Learning Navigate +2

Paper
Add Code

Latent Evolution Model for Change Point Detection in Time-varying Networks

no code implementations • 17 Dec 2022 • Yongshun Gong, Xue Dong, Jian Zhang, Meng Chen

Our method focuses on learning the low-dimensional representations of networks and capturing the evolving patterns of these learned latent representations simultaneously.

Change Point Detection

Paper
Add Code

Cross-domain recommendation via user interest alignment

no code implementations • 26 Jan 2023 • Chuang Zhao, Hongke Zhao, Ming He, Jian Zhang, Jianping Fan

Specifically, we first construct a unified cross-domain heterogeneous graph and redefine the message passing mechanism of graph convolutional networks to capture high-order similarity of users and items across domains.

Recommendation Systems

Paper
Add Code

Progressive Content-aware Coded Hyperspectral Compressive Imaging

no code implementations • 17 Mar 2023 • Xuanyu Zhang, Bin Chen, Wenzhen Zou, Shuai Liu, Yongbing Zhang, Ruiqin Xiong, Jian Zhang

Hyperspectral imaging plays a pivotal role in a wide range of applications, like remote sensing, medicine, and cytology.

Paper
Add Code

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration

no code implementations • CVPR 2023 • Guofeng Mei, Hao Tang, Xiaoshui Huang, Weijie Wang, Juan Liu, Jian Zhang, Luc van Gool, Qiang Wu

Deep point cloud registration methods face challenges to partial overlaps and rely on labeled data.

Point Cloud Registration

Paper
Add Code

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations • 26 Apr 2023 • Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360{\deg} image positional information, while Model B further focuses on the high-frequency information of 360{\deg} images.

Image Super-Resolution Position

Paper
Add Code

Single Node Injection Label Specificity Attack on Graph Neural Networks via Reinforcement Learning

no code implementations • 4 May 2023 • Dayuan Chen, Jian Zhang, Yuqian Lv, Jinhuan Wang, Hongjie Ni, Shanqing Yu, Zhen Wang, Qi Xuan

Furthermore, most methods concentrate on a single attack goal and lack a generalizable adversary to develop distinct attack strategies for diverse goals, thus limiting precise control over victim model behavior in real-world scenarios.

Specificity

Paper
Add Code

An Object SLAM Framework for Association, Mapping, and High-Level Tasks

no code implementations • 12 May 2023 • Yanmin Wu, Yunzhou Zhang, Delong Zhu, Zhiqiang Deng, Wenkai Sun, Xin Chen, Jian Zhang

Taking into consideration the semantic invariance of objects, we convert the object map to a topological map to provide semantic descriptors to enable multi-map matching.

Decision Making Object +2

Paper
Add Code

Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning

no code implementations • CVPR 2023 • Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang

In this work, we attempt to learn an object-compositional neural implicit representation for editable scene rendering by leveraging labels inferred from the off-the-shelf 2D panoptic segmentation networks instead of the ground truth annotations.

Metric Learning Novel View Synthesis +1

Paper
Add Code

Cross-source Point Cloud Registration: Challenges, Progress and Prospects

no code implementations • 23 May 2023 • Xiaoshui Huang, Guofeng Mei, Jian Zhang

The emerging topic of cross-source point cloud (CSPC) registration has attracted increasing attention with the fast development background of 3D sensor technologies.

Point Cloud Registration

Paper
Add Code

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement

no code implementations • 18 Aug 2023 • Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang

Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.

Denoising Low-Light Image Enhancement

Paper
Add Code

Masked Cross-image Encoding for Few-shot Segmentation

no code implementations • 22 Aug 2023 • Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.

Ranked #24 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

Paper
Add Code

sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation

no code implementations • 5 Sep 2023 • Shunyang Zhang, Senzhang Wang, Xianzhen Tan, Ruochen Liu, Jian Zhang, Jianxin Wang

Spatial time series imputation is critically important to many real applications such as intelligent transportation and air quality monitoring.

Imputation Time Series

Paper
Add Code

Exploring Flat Minima for Domain Generalization with Large Learning Rates

no code implementations • 12 Sep 2023 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Instead, we observe that leveraging a large learning rate can simultaneously promote weight diversity and facilitate the identification of flat regions in the loss landscape.

Domain Generalization Semantic Segmentation

Paper
Add Code

TextCLIP: Text-Guided Face Image Generation And Manipulation Without Adversarial Training

no code implementations • 21 Sep 2023 • Xiaozhou You, Jian Zhang

Text-guided image generation aimed to generate desired images conditioned on given texts, while text-guided image manipulation refers to semantically edit parts of a given image based on specified texts.

Image Generation Image Manipulation +1

Paper
Add Code

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network

no code implementations • 26 Sep 2023 • Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Infrared and visible image fusion (IVIF) is used to generate fusion images with comprehensive features of both images, which is beneficial for downstream vision tasks.

Infrared And Visible Image Fusion

Paper
Add Code

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion

no code implementations • 26 Sep 2023 • Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Most existing learning-based infrared and visible image fusion (IVIF) methods exhibit massive redundant information in the fusion images, i. e., yielding edge-blurring effect or unrecognizable for object detectors.

Infrared And Visible Image Fusion

Paper
Add Code

Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation

no code implementations • ICCV 2023 • Wenjie Wei, Malu Zhang, Hong Qu, Ammar Belatreche, Jian Zhang, Hong Chen

As a temporal encoding scheme for SNNs, Time-To-First-Spike (TTFS) encodes information using the timing of a single spike, which allows spiking neurons to transmit information through sparse spike trains and results in lower power consumption and higher computational efficiency compared to traditional rate-based encoding counterparts.

Computational Efficiency Image Classification

Paper
Add Code

Multimodal Large Language Model for Visual Navigation

no code implementations • 12 Oct 2023 • Yao-Hung Hubert Tsai, Vansh Dhar, Jialu Li, BoWen Zhang, Jian Zhang

Recent efforts to enable visual navigation using large language models have mainly focused on developing complex prompt systems.

Language Modelling Large Language Model +2

Paper
Add Code

Deep Unfolding Network for Image Compressed Sensing by Content-adaptive Gradient Updating and Deformation-invariant Non-local Modeling

no code implementations • 16 Oct 2023 • Wenxue Cui, Xiaopeng Fan, Jian Zhang, Debin Zhao

In this paper, inspired by the traditional Proximal Gradient Descent (PGD) algorithm, a novel DUN for image compressed sensing (dubbed DUN-CSNet) is proposed to solve the above two issues.

Image Compressed Sensing

Paper
Add Code

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

no code implementations • 18 Oct 2023 • Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan

Recent text-to-3D generation methods achieve impressive 3D content creation capacity thanks to the advances in image diffusion models and optimizing strategies.

3D Generation Text to 3D

Paper
Add Code

Multilevel Perception Boundary-guided Network for Breast Lesion Segmentation in Ultrasound Images

no code implementations • 23 Oct 2023 • Xing Yang, Jian Zhang, Qijian Chen, Li Wang, Lihui Wang

Moreover, to improve the segmentation performance for tumor boundaries, a multi-level boundary-enhanced segmentation (BS) loss is proposed.

Lesion Segmentation Segmentation +1

Paper
Add Code

Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning

1 code implementation • 31 Oct 2023 • Fuyuan Hu, Jian Zhang, Fan Lyu, Linyan Li, Fenglei Xu

Moreover, we design a multi-stage strategy for training S2C model, which mitigates the training challenges posed by limited data in the incremental process.

Few-Shot Class-Incremental Learning Graph Learning +1

Paper
Code

SecureCut: Federated Gradient Boosting Decision Trees with Efficient Machine Unlearning

no code implementations • 22 Nov 2023 • Jian Zhang, Bowen Li Jie Li, Chentao Wu

In response to legislation mandating companies to honor the \textit{right to be forgotten} by erasing user data, it has become imperative to enable data removal in Vertical Federated Learning (VFL) where multiple parties provide private features for model training.

Machine Unlearning Vertical Federated Learning

Paper
Add Code

PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

no code implementations • 7 Dec 2023 • Yinhuai Wang, Jing Lin, Ailing Zeng, Zhengyi Luo, Jian Zhang, Lei Zhang

To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills.

Human-Object Interaction Detection Object

Paper
Add Code

GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization

no code implementations • 8 Dec 2023 • Yahao Shi, Yanmin Wu, Chenming Wu, Xing Liu, Chen Zhao, Haocheng Feng, Jingtuo Liu, Liangjun Zhang, Jian Zhang, Bin Zhou, Errui Ding, Jingdong Wang

This paper presents GIR, a 3D Gaussian Inverse Rendering method for relightable scene factorization.

Inverse Rendering

Paper
Add Code

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

no code implementations • 12 Dec 2023 • Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, Jian Zhang

In the era where AI-generated content (AIGC) models can produce stunning and lifelike images, the lingering shadow of unauthorized reproductions and malicious tampering poses imminent threats to copyright integrity and information security.

Image Steganography

Paper
Add Code

Neural Video Fields Editing

no code implementations • 12 Dec 2023 • Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang

Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.

Video Editing

Paper
Add Code

Language-Assisted 3D Scene Understanding

no code implementations • 18 Dec 2023 • Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang

The scale and quality of point cloud datasets constrain the advancement of point cloud learning.

3D Object Detection 3D Semantic Segmentation +5

Paper
Add Code

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model

no code implementations • 12 Jan 2024 • Qian Wang, Weiqi Li, Chong Mou, Xinhua Cheng, Jian Zhang

Recently, the emerging text-to-video (T2V) diffusion methods demonstrate notable effectiveness in standard video generation.

Video Generation

Paper
Add Code

Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

no code implementations • 1 Feb 2024 • Yao-Hung Hubert Tsai, Walter Talbott, Jian Zhang

This paper focuses on decision planning with uncertainty estimation to address the hallucination problem in language models.

Decision Making Hallucination +1

Paper
Add Code

Are Large Language Models Rational Investors?

no code implementations • 20 Feb 2024 • YuHang Zhou, Yuchen Ni, Xiang Liu, Jian Zhang, Sen Liu, Guangnan Ye, Hongfeng Chai

Large Language Models (LLMs) are progressively being adopted in financial analysis to harness their extensive knowledge base for interpreting complex market data and trends.

Decision Making Navigate

Paper
Add Code

Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation

no code implementations • 3 Mar 2024 • Dipesh Gyawali, Jian Zhang, BB Karki

Attention-based networks have succeeded in many previous methods of performing point cloud segmentation.

Autonomous Vehicles Point Cloud Segmentation +1

Paper
Add Code

MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models

no code implementations • 8 Mar 2024 • Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang

To tackle this challenge, we propose a MamMIL framework for WSI classification by cooperating the selective structured state space model (i. e., Mamba) with MIL for the first time, enabling the modeling of instance dependencies while maintaining linear complexity.

Multiple Instance Learning whole slide images

Paper
Add Code

Invertible Diffusion Models for Compressed Sensing

no code implementations • 25 Mar 2024 • Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

To enable such memory-intensive end-to-end finetuning, we propose a novel two-level invertible design to transform both (1) the multi-step sampling process and (2) the noise estimation U-Net in each step into invertible networks.

Image Compressed Sensing Image Reconstruction +1

Paper
Add Code

InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds

no code implementations • 29 Mar 2024 • Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

This pre-processing is usually conducted via a Structure-from-Motion (SfM) pipeline, a procedure that can be slow and unreliable, particularly in sparse-view scenarios with insufficient matched features for accurate reconstruction.

Novel View Synthesis SSIM

Paper
Add Code

Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting

no code implementations • 1 Apr 2024 • Jiarui Meng, Haijie Li, Yanmin Wu, Qiankun Gao, Shuzhou Yang, Jian Zhang, Siwei Ma

3D Gaussian Splatting (3DGS) has marked a significant breakthrough in the realm of 3D scene reconstruction and novel view synthesis.

3D Scene Reconstruction Novel View Synthesis

Paper
Add Code

Automated Polyp Segmentation in Colonoscopy Images

no code implementations • 6 Apr 2024 • Swagat Ranjit, Jian Zhang, Bijaya B. Karki

The combination of dilated convolution module, RCCA, and global average pooling was found to be effective for irregular shapes.

Data Augmentation Medical Diagnosis

Paper
Add Code

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection

no code implementations • 13 Apr 2024 • Jian Zhang, Ruiteng Zhang, Xinyue Yan, Xiting Zhuang, Ruicheng Cao

When training the enhancement branch, the object detection subnet in the enhancement branch guides the image enhancement subnet to be optimized towards the direction that is most conducive to the detection task.

Image Enhancement Object +2

Paper
Add Code

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

no code implementations • 16 Apr 2024 • Runyi Li, Xuhan Sheng, Weiqi Li, Jian Zhang

Omnidirectional images (ODIs) are commonly used in real-world visual tasks, and high-resolution ODIs help improve the performance of related visual tasks.

Denoising Domain Generalization +4

Paper
Add Code

Face2Face: Label-driven Facial Retouching Restoration

no code implementations • 22 Apr 2024 • Guanhua Zhao, Yu Gu, Xuhan Sheng, Yujie Hu, Jian Zhang

This poses challenges for fields that place high demands on the authenticity of photographs, such as identity verification and social media.

Image Restoration

Paper
Add Code

ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images

no code implementations • 25 Apr 2024 • Weiqi Li, Shijie Zhao, Bin Chen, Xinhua Cheng, Junlin Li, Li Zhang, Jian Zhang

With the advent of virtual reality technology, omnidirectional image (ODI) rescaling techniques are increasingly embraced for reducing transmitted and stored file sizes while preserving high image quality.

ERP

Paper
Add Code

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

no code implementations • 25 Apr 2024 • Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

Meanwhile, we introduce a sample-level audio localization method and a cross-modal copyright extraction mechanism to couple the information of audio and video frames.

Video Editing

Paper
Add Code

F$^3$low: Frame-to-Frame Coarse-grained Molecular Dynamics with SE(3) Guided Flow Matching

no code implementations • 1 May 2024 • Shaoning Li, Yusong Wang, Mingyu Li, Jian Zhang, Bin Shao, Nanning Zheng, Jian Tang

Molecular dynamics (MD) is a crucial technique for simulating biological systems, enabling the exploration of their dynamic nature and fostering an understanding of their functions and properties.

Paper
Add Code

Structured Control Nets for Deep Reinforcement Learning

1 code implementation • ICML 2018 • Mario Srouji, Jian Zhang, Ruslan Salakhutdinov

The proposed Structured Control Net (SCN) splits the generic MLP into two separate sub-modules: a nonlinear control module and a linear control module.

Decision Making reinforcement-learning +1

Paper
Code

Parameter Efficient Deep Neural Networks with Bilinear Projections

1 code implementation • 3 Nov 2020 • Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang

Recent research on deep neural networks (DNNs) has primarily focused on improving the model accuracy.

Paper
Code

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation

1 code implementation • 22 Feb 2021 • Tao Chen, GuoSen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang

Then we utilize the fused prototype to guide the final segmentation of the query image.

Image Segmentation Segmentation +1

Paper
Code

Learning Weighting Map for Bit-Depth Expansion within a Rational Range

1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.

SSIM

Paper
Code

Self-attention on Multi-Shifted Windows for Scene Segmentation

1 code implementation • 10 Jul 2022 • Litao Yu, Zhibin Li, Jian Zhang, Qiang Wu

Scene segmentation in images is a fundamental yet challenging problem in visual content understanding, which is to learn a model to assign every image pixel to a categorical label.

Descriptive Scene Segmentation +1

Paper
Code

Deep Bi-Dense Networks for Image Super-Resolution

1 code implementation • 11 Oct 2018 • Yucheng Wang, Jialiang Shen, Jian Zhang

In this way, feature information propagates from a single dense block to all subsequent blocks, instead of to a single successor.

Image Super-Resolution

Paper
Code

ASDN: A Deep Convolutional Network for Arbitrary Scale Image Super-Resolution

1 code implementation • 6 Oct 2020 • Jialiang Shen, Yucheng Wang, Jian Zhang

For SR of small-scales (between 1 and 2), images are constructed by interpolation from a sparse set of precalculated Laplacian pyramid levels.

Image Super-Resolution

Paper
Code

Generative Structural Design Integrating BIM and Diffusion Model

1 code implementation • 7 Nov 2023 • Zhili He, Yu-Hsing Wang, Jian Zhang

This study proposes a comprehensive solution.

Generative Adversarial Network

Paper
Code

NetInfoF Framework: Measuring and Exploiting Network Usable Information

1 code implementation • 12 Feb 2024 • Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well?

Link Prediction Node Classification

Paper
Code

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones

1 code implementation • 23 Jan 2021 • Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiushen Wei, Fumin Shen, Jian Zhang, Zhenmin Tang

Labeling objects at a subordinate level typically requires expert knowledge, which is not always available when using random annotators.

Fine-Grained Visual Recognition

Paper
Code

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

1 code implementation • 26 Apr 2022 • Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

While conventional CF models are known for facing the challenges of the popularity bias that favors popular items, one may wonder "Whether the existing graph-based CF models alleviate or exacerbate popularity bias of recommender systems?"

Collaborative Filtering Recommendation Systems

Paper
Code

Compare More Nuanced:Pairwise Alignment Bilinear Network For Few-shot Fine-grained Learning

1 code implementation • 7 Apr 2019 • Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu

Unlike traditional deep bilinear networks for fine-grained classification, which adopt the self-bilinear pooling to capture the subtle features of images, the proposed model uses a novel pairwise bilinear pooling to compare the nuanced differences between base images and query images for learning a deep distance metric.

General Classification Meta-Learning

Paper
Code

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

1 code implementation • 13 Apr 2024 • Qinghe Ma, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

To fully utilize the information within the intermediate domain, we propose a symmetric Guidance training strategy (SymGD), which additionally offers direct guidance to unlabeled data by merging pseudo labels from intermediate samples.

Image Segmentation Segmentation +4

Paper
Code

Implicit Multidimensional Projection of Local Subspaces

1 code implementation • 7 Sep 2020 • Rongzheng Bian, Yumeng Xue, Liang Zhou, Jian Zhang, Baoquan Chen, Daniel Weiskopf, Yunhai Wang

We propose a visualization method to understand the effect of multidimensional projection on local subspaces, using implicit function differentiation.

Paper
Code

A Prompt Learning Framework for Source Code Summarization

1 code implementation • 26 Dec 2023 • Weisong Sun, Chunrong Fang, Yudu You, Yuchen Chen, Yi Liu, Chong Wang, Jian Zhang, Quanjun Zhang, Hanwei Qian, Wei Zhao, Yang Liu, Zhenyu Chen

PromptCS trains a prompt agent that can generate continuous prompts to unleash the potential for LLMs in code summarization.

Code Summarization Few-Shot Learning +2

Paper
Code

Field-wise Learning for Multi-field Categorical Data

1 code implementation • NeurIPS 2020 • Zhibin Li, Jian Zhang, Yongshun Gong, Yazhou Yao, Qiang Wu

We present a model that utilizes linear models with variance and low-rank constraints, to help it generalize better and reduce the number of parameters.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.