Search Results for author: Jian Zhang

Found 279 papers, 98 papers with code

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

no code implementations • 25 Apr 2024 • Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

Meanwhile, we introduce a sample-level audio localization method and a cross-modal copyright extraction mechanism to couple the information of audio and video frames.

Paper
Add Code

ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images

no code implementations • 25 Apr 2024 • Weiqi Li, Shijie Zhao, Bin Chen, Xinhua Cheng, Junlin Li, Li Zhang, Jian Zhang

With the advent of virtual reality technology, omnidirectional image (ODI) rescaling techniques are increasingly embraced for reducing transmitted and stored file sizes while preserving high image quality.

Paper
Add Code

Face2Face: Label-driven Facial Retouching Restoration

no code implementations • 22 Apr 2024 • Guanhua Zhao, Yu Gu, Xuhan Sheng, Yujie Hu, Jian Zhang

This poses challenges for fields that place high demands on the authenticity of photographs, such as identity verification and social media.

Image Restoration

Paper
Add Code

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

no code implementations • 16 Apr 2024 • Runyi Li, Xuhan Sheng, Weiqi Li, Jian Zhang

Omnidirectional images (ODIs) are commonly used in real-world visual tasks, and high-resolution ODIs help improve the performance of related visual tasks.

Denoising Domain Generalization +4

Paper
Add Code

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

1 code implementation • 13 Apr 2024 • Qinghe Ma, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

To fully utilize the information within the intermediate domain, we propose a symmetric Guidance training strategy (SymGD), which additionally offers direct guidance to unlabeled data by merging pseudo labels from intermediate samples.

Image Segmentation Segmentation +4

Paper
Code

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection

no code implementations • 13 Apr 2024 • Jian Zhang, Ruiteng Zhang, Xinyue Yan, Xiting Zhuang, Ruicheng Cao

When training the enhancement branch, the object detection subnet in the enhancement branch guides the image enhancement subnet to be optimized towards the direction that is most conducive to the detection task.

Image Enhancement Object +2

Paper
Add Code

Automated Polyp Segmentation in Colonoscopy Images

no code implementations • 6 Apr 2024 • Swagat Ranjit, Jian Zhang, Bijaya B. Karki

The combination of dilated convolution module, RCCA, and global average pooling was found to be effective for irregular shapes.

Data Augmentation Medical Diagnosis

Paper
Add Code

Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting

no code implementations • 1 Apr 2024 • Jiarui Meng, Haijie Li, Yanmin Wu, Qiankun Gao, Shuzhou Yang, Jian Zhang, Siwei Ma

3D Gaussian Splatting (3DGS) has marked a significant breakthrough in the realm of 3D scene reconstruction and novel view synthesis.

3D Scene Reconstruction Novel View Synthesis

Paper
Add Code

InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds

no code implementations • 29 Mar 2024 • Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

This pre-processing is usually conducted via a Structure-from-Motion (SfM) pipeline, a procedure that can be slow and unreliable, particularly in sparse-view scenarios with insufficient matched features for accurate reconstruction.

Novel View Synthesis SSIM

Paper
Add Code

Invertible Diffusion Models for Compressed Sensing

no code implementations • 25 Mar 2024 • Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

To enable such memory-intensive end-to-end finetuning, we propose a novel two-level invertible design to transform both (1) the multi-step sampling process and (2) the noise estimation U-Net in each step into invertible networks.

Image Compressed Sensing Image Reconstruction +1

Paper
Add Code

BadEdit: Backdooring large language models by model editing

no code implementations • 20 Mar 2024 • Yanzhou Li, Tianlin Li, Kangjie Chen, Jian Zhang, Shangqing Liu, Wenhan Wang, Tianwei Zhang, Yang Liu

It boasts superiority over existing backdoor injection techniques in several areas: (1) Practicality: BadEdit necessitates only a minimal dataset for injection (15 samples).

Backdoor Attack knowledge editing

Paper
Add Code

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

1 code implementation • 17 Mar 2024 • Dian Zheng, Xiao-Ming Wu, Shuzhou Yang, Jian Zhang, Jian-Fang Hu, Wei-Shi Zheng

Universal image restoration is a practical and potential computer vision task for real-world applications.

Image Restoration Zero-shot Generalization

Paper
Code

MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models

no code implementations • 8 Mar 2024 • Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang

To tackle this challenge, we propose a MamMIL framework for WSI classification by cooperating the selective structured state space model (i. e., Mamba) with MIL for the first time, enabling the modeling of instance dependencies while maintaining linear complexity.

Multiple Instance Learning whole slide images

Paper
Add Code

Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation

no code implementations • 3 Mar 2024 • Dipesh Gyawali, Jian Zhang, BB Karki

Attention-based networks have succeeded in many previous methods of performing point cloud segmentation.

Autonomous Vehicles Point Cloud Segmentation +1

Paper
Add Code

Are Large Language Models Rational Investors?

no code implementations • 20 Feb 2024 • YuHang Zhou, Yuchen Ni, Xiang Liu, Jian Zhang, Sen Liu, Guangnan Ye, Hongfeng Chai

Large Language Models (LLMs) are progressively being adopted in financial analysis to harness their extensive knowledge base for interpreting complex market data and trends.

Decision Making Navigate

Paper
Add Code

NetInfoF Framework: Measuring and Exploiting Network Usable Information

1 code implementation • 12 Feb 2024 • Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well?

Link Prediction Node Classification

Paper
Code

DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

1 code implementation • 4 Feb 2024 • Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang

Large-scale Text-to-Image (T2I) diffusion models have revolutionized image generation over the last few years.

Image Generation

645

Paper
Code

Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

no code implementations • 1 Feb 2024 • Yao-Hung Hubert Tsai, Walter Talbott, Jian Zhang

This paper focuses on decision planning with uncertainty estimation to address the hallucination problem in language models.

Decision Making Hallucination +1

Paper
Add Code

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model

no code implementations • 12 Jan 2024 • Qian Wang, Weiqi Li, Chong Mou, Xinhua Cheng, Jian Zhang

Recently, the emerging text-to-video (T2V) diffusion methods demonstrate notable effectiveness in standard video generation.

Video Generation

Paper
Add Code

Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy

1 code implementation • 31 Dec 2023 • Weijian Mai, Jian Zhang, Pengfei Fang, Zhijun Zhang

This survey comprehensively examines the emerging field of AIGC-based Brain-conditional Multimodal Synthesis, termed AIGC-Brain, to delineate the current landscape and future directions.

Brain Computer Interface

Paper
Code

A Prompt Learning Framework for Source Code Summarization

1 code implementation • 26 Dec 2023 • Weisong Sun, Chunrong Fang, Yudu You, Yuchen Chen, Yi Liu, Chong Wang, Jian Zhang, Quanjun Zhang, Hanwei Qian, Wei Zhao, Yang Liu, Zhenyu Chen

PromptCS trains a prompt agent that can generate continuous prompts to unleash the potential for LLMs in code summarization.

Code Summarization Few-Shot Learning +2

Paper
Code

Language-Assisted 3D Scene Understanding

no code implementations • 18 Dec 2023 • Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang

The scale and quality of point cloud datasets constrain the advancement of point cloud learning.

3D Object Detection 3D Semantic Segmentation +5

Paper
Add Code

Neural Video Fields Editing

no code implementations • 12 Dec 2023 • Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang

Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.

Video Editing

Paper
Add Code

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

no code implementations • 12 Dec 2023 • Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, Jian Zhang

In the era where AI-generated content (AIGC) models can produce stunning and lifelike images, the lingering shadow of unauthorized reproductions and malicious tampering poses imminent threats to copyright integrity and information security.

Image Steganography

Paper
Add Code

GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization

no code implementations • 8 Dec 2023 • Yahao Shi, Yanmin Wu, Chenming Wu, Xing Liu, Chen Zhao, Haocheng Feng, Jingtuo Liu, Liangjun Zhang, Jian Zhang, Bin Zhou, Errui Ding, Jingdong Wang

This paper presents GIR, a 3D Gaussian Inverse Rendering method for relightable scene factorization.

Inverse Rendering

Paper
Add Code

PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

no code implementations • 7 Dec 2023 • Yinhuai Wang, Jing Lin, Ailing Zeng, Zhengyi Luo, Jian Zhang, Lei Zhang

To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills.

Human-Object Interaction Detection Object

Paper
Add Code

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators

1 code implementation • 6 Dec 2023 • Jiwen Yu, Xiaodong Cun, Chenyang Qi, Yong Zhang, Xintao Wang, Ying Shan, Jian Zhang

For appearance control, we borrow intermediate latents and their features from the text-to-image (T2I) generation for ensuring the generated first frame is equal to the given generated image.

Image Animation Video Generation

345

Paper
Code

SecureCut: Federated Gradient Boosting Decision Trees with Efficient Machine Unlearning

no code implementations • 22 Nov 2023 • Jian Zhang, Bowen Li Jie Li, Chentao Wu

In response to legislation mandating companies to honor the \textit{right to be forgotten} by erasing user data, it has become imperative to enable data removal in Vertical Federated Learning (VFL) where multiple parties provide private features for model training.

Machine Unlearning Vertical Federated Learning

Paper
Add Code

Generative Structural Design Integrating BIM and Diffusion Model

1 code implementation • 7 Nov 2023 • Zhili He, Yu-Hsing Wang, Jian Zhang

This study proposes a comprehensive solution.

Generative Adversarial Network

Paper
Code

Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning

1 code implementation • 31 Oct 2023 • Fuyuan Hu, Jian Zhang, Fan Lyu, Linyan Li, Fenglei Xu

Moreover, we design a multi-stage strategy for training S2C model, which mitigates the training challenges posed by limited data in the incremental process.

Few-Shot Class-Incremental Learning Graph Learning +1

Paper
Code

Multilevel Perception Boundary-guided Network for Breast Lesion Segmentation in Ultrasound Images

no code implementations • 23 Oct 2023 • Xing Yang, Jian Zhang, Qijian Chen, Li Wang, Lihui Wang

Moreover, to improve the segmentation performance for tumor boundaries, a multi-level boundary-enhanced segmentation (BS) loss is proposed.

Lesion Segmentation Segmentation +1

Paper
Add Code

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

no code implementations • 18 Oct 2023 • Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan

Recent text-to-3D generation methods achieve impressive 3D content creation capacity thanks to the advances in image diffusion models and optimizing strategies.

3D Generation Text to 3D

Paper
Add Code

Compatible Transformer for Irregularly Sampled Multivariate Time Series

1 code implementation • 17 Oct 2023 • Yuxi Wei, Juntong Peng, Tong He, Chenxin Xu, Jian Zhang, Shirui Pan, Siheng Chen

To analyze multivariate time series, most previous methods assume regular subsampling of time series, where the interval between adjacent measurements and the number of samples remain unchanged.

Time Series

Paper
Code

Deep Unfolding Network for Image Compressed Sensing by Content-adaptive Gradient Updating and Deformation-invariant Non-local Modeling

no code implementations • 16 Oct 2023 • Wenxue Cui, Xiaopeng Fan, Jian Zhang, Debin Zhao

In this paper, inspired by the traditional Proximal Gradient Descent (PGD) algorithm, a novel DUN for image compressed sensing (dubbed DUN-CSNet) is proposed to solve the above two issues.

Image Compressed Sensing

Paper
Add Code

Empirical Study of Zero-Shot NER with ChatGPT

1 code implementation • 16 Oct 2023 • Tingyu Xie, Qi Li, Jian Zhang, Yan Zhang, Zuozhu Liu, Hongwei Wang

Large language models (LLMs) exhibited powerful capability in various natural language processing tasks.

Arithmetic Reasoning named-entity-recognition +3

Paper
Code

Multimodal Large Language Model for Visual Navigation

no code implementations • 12 Oct 2023 • Yao-Hung Hubert Tsai, Vansh Dhar, Jialu Li, BoWen Zhang, Jian Zhang

Recent efforts to enable visual navigation using large language models have mainly focused on developing complex prompt systems.

Language Modelling Large Language Model +2

Paper
Add Code

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion

no code implementations • 26 Sep 2023 • Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Most existing learning-based infrared and visible image fusion (IVIF) methods exhibit massive redundant information in the fusion images, i. e., yielding edge-blurring effect or unrecognizable for object detectors.

Infrared And Visible Image Fusion

Paper
Add Code

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network

no code implementations • 26 Sep 2023 • Qiao Yang, Yu Zhang, Jian Zhang, Zijing Zhao, Shunli Zhang, Jinqiao Wang, Junzhe Chen

Infrared and visible image fusion (IVIF) is used to generate fusion images with comprehensive features of both images, which is beneficial for downstream vision tasks.

Infrared And Visible Image Fusion

Paper
Add Code

TextCLIP: Text-Guided Face Image Generation And Manipulation Without Adversarial Training

no code implementations • 21 Sep 2023 • Xiaozhou You, Jian Zhang

Text-guided image generation aimed to generate desired images conditioned on given texts, while text-guided image manipulation refers to semantically edit parts of a given image based on specified texts.

Image Generation Image Manipulation +1

Paper
Add Code

Exploring Flat Minima for Domain Generalization with Large Learning Rates

no code implementations • 12 Sep 2023 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Instead, we observe that leveraging a large learning rate can simultaneously promote weight diversity and facilitate the identification of flat regions in the loss landscape.

Domain Generalization Semantic Segmentation

Paper
Add Code

sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation

no code implementations • 5 Sep 2023 • Shunyang Zhang, Senzhang Wang, Xianzhen Tan, Ruochen Liu, Jian Zhang, Jianxin Wang

Spatial time series imputation is critically important to many real applications such as intelligent transportation and air quality monitoring.

Imputation Time Series

Paper
Add Code

Self-Supervised Scalable Deep Compressed Sensing

1 code implementation • 26 Aug 2023 • Bin Chen, Xuanyu Zhang, Shuai Liu, Yongbing Zhang, Jian Zhang

Compressed sensing (CS) is a promising tool for reducing sampling costs.

Paper
Code

Masked Cross-image Encoding for Few-shot Segmentation

no code implementations • 22 Aug 2023 • Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.

Ranked #24 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

Paper
Add Code

DomainAdaptor: A Novel Approach to Test-time Adaptation

1 code implementation • ICCV 2023 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

To deal with the domain shift between training and test samples, current methods have primarily focused on learning generalizable features during training and ignore the specificity of unseen samples that are also critical during the test.

Specificity Test-time Adaptation

Paper
Code

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement

no code implementations • 18 Aug 2023 • Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang

Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.

Denoising Low-Light Image Enhancement

Paper
Add Code

Generalizable Decision Boundaries: Dualistic Meta-Learning for Open Set Domain Generalization

1 code implementation • ICCV 2023 • Xiran Wang, Jian Zhang, Lei Qi, Yinghuan Shi

Domain generalization (DG) is proposed to deal with the issue of domain shift, which occurs when statistical differences exist between source and target domains.

Domain Generalization Meta-Learning

Paper
Code

EFLNet: Enhancing Feature Learning for Infrared Small Target Detection

1 code implementation • 27 Jul 2023 • Bo Yang, Xinyu Zhang, Jian Zhang, Jun Luo, Mingliang Zhou, Yangjun Pi

To address this problem, we propose a new adaptive threshold focal loss (ATFL) function that decouples the target and the background, and utilizes the adaptive mechanism to adjust the loss weight to force the model to allocate more attention to target features.

regression

Paper
Code

Deep Physics-Guided Unrolling Generalization for Compressed Sensing

1 code implementation • 18 Jul 2023 • Bin Chen, Jiechong Song, Jingfen Xie, Jian Zhang

By absorbing the merits of both the model- and data-driven methods, deep physics-engaged learning scheme achieves high-accuracy and interpretable image reconstruction.

Image Compressed Sensing Image Reconstruction

Paper
Code

PKU-GoodsAD: A Supermarket Goods Dataset for Unsupervised Anomaly Detection and Segmentation

1 code implementation • 11 Jul 2023 • Jian Zhang, Runwei Ding, Miaoju Ban, Ge Yang

It follows the unsupervised setting and only normal (defect-free) images are used for training.

Unsupervised Anomaly Detection

Paper
Code

DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models

1 code implementation • 5 Jul 2023 • Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang

Specifically, we construct classifier guidance based on the strong correspondence of intermediate features in the diffusion model.

Object

645

Paper
Code

HVTSurv: Hierarchical Vision Transformer for Patient-Level Survival Prediction from Whole Slide Image

1 code implementation • 30 Jun 2023 • Zhuchen Shao, Yang Chen, Hao Bian, Jian Zhang, Guojun Liu, Yongbing Zhang

Many studies adopt random sampling pre-processing strategy and WSI-level aggregation models, which inevitably lose critical prognostic information in the patient-level bag.

Multiple Instance Learning Survival Prediction +1

Paper
Code

Dynamic Path-Controllable Deep Unfolding Network for Compressive Sensing

1 code implementation • 28 Jun 2023 • Jiechong Song, Bin Chen, Jian Zhang

Deep unfolding network (DUN) that unfolds the optimization algorithm into a deep neural network has achieved great success in compressive sensing (CS) due to its good interpretability and high performance.

Compressive Sensing

Paper
Code

Infrastructure Crack Segmentation: Boundary Guidance Method and Benchmark Dataset

1 code implementation • 15 Jun 2023 • Zhili He, Wang Chen, Jian Zhang, Yu-Hsing Wang

Cracks provide an essential indicator of infrastructure performance degradation, and achieving high-precision pixel-level crack segmentation is an issue of concern.

Crack Segmentation Segmentation

Paper
Code

On the Tool Manipulation Capability of Open-source Large Language Models

1 code implementation • 25 May 2023 • Qiantong Xu, Fenglu Hong, Bo Li, Changran Hu, Zhengyu Chen, Jian Zhang

In this paper, we ask can we enhance open-source LLMs to be competitive to leading closed LLM APIs in tool manipulation, with practical amount of human supervision.

122

Paper
Code

Cross-source Point Cloud Registration: Challenges, Progress and Prospects

no code implementations • 23 May 2023 • Xiaoshui Huang, Guofeng Mei, Jian Zhang

The emerging topic of cross-source point cloud (CSPC) registration has attracted increasing attention with the fast development background of 3D sensor technologies.

Point Cloud Registration

Paper
Add Code

An Object SLAM Framework for Association, Mapping, and High-Level Tasks

no code implementations • 12 May 2023 • Yanmin Wu, Yunzhou Zhang, Delong Zhu, Zhiqiang Deng, Wenkai Sun, Xin Chen, Jian Zhang

Taking into consideration the semantic invariance of objects, we convert the object map to a topological map to provide semantic descriptors to enable multi-map matching.

Decision Making Object +2

Paper
Add Code

Single Node Injection Label Specificity Attack on Graph Neural Networks via Reinforcement Learning

no code implementations • 4 May 2023 • Dayuan Chen, Jian Zhang, Yuqian Lv, Jinhuan Wang, Hongjie Ni, Shanqing Yu, Zhen Wang, Qi Xuan

Furthermore, most methods concentrate on a single attack goal and lack a generalizable adversary to develop distinct attack strategies for diverse goals, thus limiting precise control over victim model behavior in real-world scenarios.

Specificity

Paper
Add Code

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention

1 code implementation • Tiny Papers @ ICLR 2023 • Xiao Liu, Jian Zhang, Heng Zhang, Fuzhao Xue, Yang You

We evaluate our model on various dialogue understanding tasks including dialogue relation extraction, dialogue emotion recognition, and dialogue act classification.

Ranked #1 on Dialog Relation Extraction on DialogRE

Dialogue Act Classification Dialogue Understanding +2

Paper
Code

Optimization-Inspired Cross-Attention Transformer for Compressive Sensing

1 code implementation • CVPR 2023 • Jiechong Song, Chong Mou, Shiqi Wang, Siwei Ma, Jian Zhang

And, PGCA block achieves an enhanced information interaction, which introduces the inertia force into the gradient descent step through a cross attention block.

Compressive Sensing

Paper
Code

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations • 26 Apr 2023 • Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360{\deg} image positional information, while Model B further focuses on the high-frequency information of 360{\deg} images.

Image Super-Resolution Position

Paper
Add Code

Large-capacity and Flexible Video Steganography via Invertible Neural Network

1 code implementation • CVPR 2023 • Chong Mou, Youmin Xu, Jiechong Song, Chen Zhao, Bernard Ghanem, Jian Zhang

For large-capacity, we present a reversible pipeline to perform multiple videos hiding and recovering through a single invertible neural network (INN).

Paper
Code

Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration

no code implementations • CVPR 2023 • Guofeng Mei, Hao Tang, Xiaoshui Huang, Weijie Wang, Juan Liu, Jian Zhang, Luc van Gool, Qiang Wu

Deep point cloud registration methods face challenges to partial overlaps and rely on labeled data.

Point Cloud Registration

Paper
Add Code

Implicit Neural Representation for Cooperative Low-light Image Enhancement

1 code implementation • ICCV 2023 • Shuzhou Yang, Moxuan Ding, Yanmin Wu, Zihan Li, Jian Zhang

Finally, extensive experiments demonstrate the robustness and superior effectiveness of our proposed NeRCo.

Language Modelling Low-Light Image Enhancement

195

Paper
Code

Progressive Content-aware Coded Hyperspectral Compressive Imaging

no code implementations • 17 Mar 2023 • Xuanyu Zhang, Bin Chen, Wenzhen Zou, Shuai Liu, Yongbing Zhang, Ruiqin Xiong, Jian Zhang

Hyperspectral imaging plays a pivotal role in a wide range of applications, like remote sensing, medicine, and cytology.

Paper
Add Code

A Unified Continual Learning Framework with General Parameter-Efficient Tuning

1 code implementation • ICCV 2023 • Qiankun Gao, Chen Zhao, Yifan Sun, Teng Xi, Gang Zhang, Bernard Ghanem, Jian Zhang

1) Learning: the pre-trained model adapts to the new task by tuning an online PET module, along with our adaptation speed calibration to align different PET modules, 2) Accumulation: the task-specific knowledge learned by the online PET module is accumulated into an offline PET module through momentum update, 3) Ensemble: During inference, we respectively construct two experts with online/offline PET modules (which are favored by the novel/historical tasks) for prediction ensemble.

Continual Learning

Paper
Code

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

1 code implementation • ICCV 2023 • Jiwen Yu, Yinhuai Wang, Chen Zhao, Bernard Ghanem, Jian Zhang

In this work, we propose a training-Free conditional Diffusion Model (FreeDoM) used for various conditions.

Face Detection

245

Paper
Code

Unlimited-Size Diffusion Restoration

1 code implementation • 1 Mar 2023 • Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang

Our simple, parameter-free approaches can be used not only for image restoration but also for image generation of unlimited sizes, with the potential to be a general tool for diffusion models.

Image Generation Image Restoration

1,022

Paper
Code

T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models

2 code implementations • 16 Feb 2023 • Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, Ying Shan, XiaoHu Qie

In this paper, we aim to ``dig out" the capabilities that T2I models have implicitly learned, and then explicitly use them to control the generation more granularly.

Image Generation Style Transfer

3,157

Paper
Code

Cross-domain recommendation via user interest alignment

no code implementations • 26 Jan 2023 • Chuang Zhao, Hongke Zhao, Ming He, Jian Zhang, Jianping Fan

Specifically, we first construct a unified cross-domain heterogeneous graph and redefine the message passing mechanism of graph convolutional networks to capture high-order similarity of users and items across domains.

Recommendation Systems

Paper
Add Code

Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over Text

1 code implementation • 8 Jan 2023 • Fangzhi Xu, Jun Liu, Qika Lin, Tianzhe Zhao, Jian Zhang, Lingling Zhang

(2) How to enhance the perception of reasoning types for the models?

Contrastive Learning Logical Reasoning +3

Paper
Code

Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation

no code implementations • ICCV 2023 • Wenjie Wei, Malu Zhang, Hong Qu, Ammar Belatreche, Jian Zhang, Hong Chen

As a temporal encoding scheme for SNNs, Time-To-First-Spike (TTFS) encodes information using the timing of a single spike, which allows spiking neurons to transmit information through sparse spike trains and results in lower power consumption and higher computational efficiency compared to traditional rate-based encoding counterparts.

Computational Efficiency Image Classification

Paper
Add Code

Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning

no code implementations • CVPR 2023 • Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang

In this work, we attempt to learn an object-compositional neural implicit representation for editable scene rendering by leveraging labels inferred from the off-the-shelf 2D panoptic segmentation networks instead of the ground truth annotations.

Metric Learning Novel View Synthesis +1

Paper
Add Code

Latent Evolution Model for Change Point Detection in Time-varying Networks

no code implementations • 17 Dec 2022 • Yongshun Gong, Xue Dong, Jian Zhang, Meng Chen

Our method focuses on learning the low-dimensional representations of networks and capturing the evolving patterns of these learned latent representations simultaneously.

Change Point Detection

Paper
Add Code

Position Embedding Needs an Independent Layer Normalization

1 code implementation • 10 Dec 2022 • Runyi Yu, Zhennan Wang, Yinhuai Wang, Kehan Li, Yian Zhao, Jian Zhang, Guoli Song, Jie Chen

By analyzing the input and output of each encoder layer in VTs using reparameterization and visualization, we find that the default PE joining method (simply adding the PE and patch embedding together) operates the same affine transformation to token embedding and PE, which limits the expressiveness of PE and hence constrains the performance of VTs.

Position

Paper
Code

Self-Supervised Object Goal Navigation with In-Situ Finetuning

no code implementations • 9 Dec 2022 • So Yeon Min, Yao-Hung Hubert Tsai, Wei Ding, Ali Farhadi, Ruslan Salakhutdinov, Yonatan Bisk, Jian Zhang

In contrast, our LocCon shows the most robust transfer in the real world among the set of models we compare to, and that the real-world performance of all models can be further improved with self-supervised LocCon in-situ training.

Contrastive Learning Navigate +2

Paper
Add Code

Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

3 code implementations • 1 Dec 2022 • Yinhuai Wang, Jiwen Yu, Jian Zhang

Most existing Image Restoration (IR) models are task-specific, which can not be generalized to different degradation operators.

Ranked #1 on Image Compressed Sensing on CelebA

Colorization Deblurring +7

1,022

Paper
Code

GAN Prior based Null-Space Learning for Consistent Super-Resolution

1 code implementation • 24 Nov 2022 • Yinhuai Wang, Yujie Hu, Jiwen Yu, Jian Zhang

Consistency and realness have always been the two critical issues of image super-resolution.

Image Super-Resolution

Paper
Code

Complementary Labels Learning with Augmented Classes

no code implementations • 19 Nov 2022 • Zhongnian Li, Jian Zhang, Mengting Xu, Xinzheng Xu, Daoqiang Zhang

In this paper, we propose a novel problem setting called Complementary Labels Learning with Augmented Classes (CLLAC), which brings the challenge that classifiers trained by complementary labels should not only be able to classify the instances from observed classes accurately, but also recognize the instance from the Augmented Classes in the testing phase.

Paper
Add Code

Masked Vision-Language Transformers for Scene Text Recognition

1 code implementation • 9 Nov 2022 • Jie Wu, Ying Peng, Shengming Zhang, Weigang Qi, Jian Zhang

MVLT is trained in two stages: in the first stage, we design a STR-tailored pretraining method based on a masking strategy; in the second stage, we fine-tune our model and adopt an iterative correction method to improve the performance.

Scene Text Recognition

Paper
Code

Overlap-guided Gaussian Mixture Models for Point Cloud Registration

1 code implementation • 17 Oct 2022 • Guofeng Mei, Fabio Poiesi, Cristiano Saltori, Jian Zhang, Elisa Ricci, Nicu Sebe

Probabilistic 3D point cloud registration methods have shown competitive performance in overcoming noise, outliers, and density variations.

Point Cloud Registration

Paper
Code

Multi-Agent Automated Machine Learning

no code implementations • CVPR 2023 • Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu

In this paper, we propose multi-agent automated machine learning (MA2ML) with the aim to effectively handle joint optimization of modules in automated machine learning (AutoML).

Data Augmentation Multi-agent Reinforcement Learning +1

Paper
Add Code

Data Augmentation-free Unsupervised Learning for 3D Point Cloud Understanding

1 code implementation • 6 Oct 2022 • Guofeng Mei, Cristiano Saltori, Fabio Poiesi, Jian Zhang, Elisa Ricci, Nicu Sebe, Qiang Wu

Unsupervised learning on 3D point clouds has undergone a rapid evolution, especially thanks to data augmentation-based contrastive methods.

3D Object Classification Contrastive Learning +3

Paper
Code

EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding

2 code implementations • CVPR 2023 • Yanmin Wu, Xinhua Cheng, Renrui Zhang, Zesen Cheng, Jian Zhang

3D visual grounding aims to find the object within point clouds mentioned by free-form natural language descriptions with rich semantic cues.

Object Sentence +1

Paper
Code

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

no code implementations • 27 Sep 2022 • Yao-Hung Hubert Tsai, Hanlin Goh, Ali Farhadi, Jian Zhang

The perception system in personalized mobile agents requires developing indoor scene understanding models, which can understand 3D geometries, capture objectiveness, analyze human behaviors, etc.

3D Object Detection Autonomous Driving +9

Paper
Add Code

SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning

no code implementations • 23 Sep 2022 • Mario Srouji, Hugues Thomas, Hubert Tsai, Ali Farhadi, Jian Zhang

Collision avoidance is key for mobile robots and agents to operate safely in the real world.

Collision Avoidance reinforcement-learning +2

Paper
Add Code

D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing

no code implementations • 27 Jul 2022 • Weiqi Li, Bin Chen, Jian Zhang

By unfolding the proposed framework into deep neural networks, we further design a novel Dual-Domain Deep Convolutional Coding Network (D3C2-Net) for CS imaging with the capability of transmitting high-throughput feature-level image representation through all the unfolded stages.

Compressive Sensing

Paper
Add Code

TransCL: Transformer Makes Strong and Flexible Compressive Learning

1 code implementation • 25 Jul 2022 • Chong Mou, Jian Zhang

Compressive learning (CL) is an emerging framework that integrates signal acquisition via compressed sensing (CS) and machine learning for inference tasks directly on a small number of measurements.

Computational Efficiency Image Classification +1

Paper
Code

Spatial-temporal Analysis for Automated Concrete Workability Estimation

no code implementations • 24 Jul 2022 • Litao Yu, Jian Zhang, Mohammed Bennamoun, Xiaojun Chang, Vute Sirivivatnanon, Ali Nezhad

Concrete workability measure is mostly determined based on subjective assessment of a certified assessor with visual inspections.

regression

Paper
Add Code

Content-aware Scalable Deep Compressed Sensing

1 code implementation • 19 Jul 2022 • Bin Chen, Jian Zhang

To more efficiently address image compressed sensing (CS) problems, we present a novel content-aware scalable network dubbed CASNet which collectively achieves adaptive sampling rate allocation, fine granular scalability and high-quality reconstruction.

Ranked #1 on Image Compressed Sensing on CBSD68

Blocking Image Compressed Sensing +1

Paper
Code

Frequency Domain Model Augmentation for Adversarial Attack

2 code implementations • 12 Jul 2022 • Yuyang Long, Qilong Zhang, Boheng Zeng, Lianli Gao, Xianglong Liu, Jian Zhang, Jingkuan Song

Specifically, we apply a spectrum transformation to the input and thus perform the model augmentation in the frequency domain.

Adversarial Attack

136

Paper
Code

Self-attention on Multi-Shifted Windows for Scene Segmentation

1 code implementation • 10 Jul 2022 • Litao Yu, Zhibin Li, Jian Zhang, Qiang Wu

Scene segmentation in images is a fundamental yet challenging problem in visual content understanding, which is to learn a model to assign every image pixel to a categorical label.

Descriptive Scene Segmentation +1

Paper
Code

Horizontal and Vertical Attention in Transformers

no code implementations • 10 Jul 2022 • Litao Yu, Jian Zhang

Transformers are built upon multi-head scaled dot-product attention and positional encoding, which aim to learn the feature representations and token dependencies.

Dimensionality Reduction

Paper
Add Code

Measuring and Improving the Use of Graph Information in Graph Neural Networks

1 code implementation • ICLR 2020 • Yifan Hou, Jian Zhang, James Cheng, Kaili Ma, Richard T. B. Ma, Hongzhi Chen, Ming-Chang Yang

Graph neural networks (GNNs) have been widely used for representation learning on graph data.

Representation Learning

Paper
Code

Multiple Instance Learning with Mixed Supervision in Gleason Grading

1 code implementation • 26 Jun 2022 • Hao Bian, Zhuchen Shao, Yang Chen, Yifeng Wang, Haoqian Wang, Jian Zhang, Yongbing Zhang

We achieve the state-of-the-art performance on the SICAPv2 dataset, and the visual analysis shows the accurate prediction results of instance level.

Multiple Instance Learning whole slide images

Paper
Code

Learning the policy for mixed electric platoon control of automated and human-driven vehicles at signalized intersection: a random search approach

no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Xiaoyu Shi, Jian Cheng

Meanwhile, the simulation results demonstrate the effectiveness of the delay reward, which is designed to outperform distributed reward mechanism} Compared with normal car-following behavior, the sensitivity analysis reveals that the energy can be saved to different extends (39. 27%-82. 51%) by adjusting the relative importance of the optimization goal.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Eco-driving for Electric Connected Vehicles at Signalized Intersections: A Parameterized Reinforcement Learning approach

no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Dan Li

This paper proposes an eco-driving framework for electric connected vehicles (CVs) based on reinforcement learning (RL) to improve vehicle energy efficiency at signalized intersections.

Reinforcement Learning (RL)

Paper
Add Code

Using EBGAN for Anomaly Intrusion Detection

no code implementations • 21 Jun 2022 • Yi Cui, Wenfeng Shen, Jian Zhang, Weijia Lu, Chuang Liu, Lin Sun, Si Chen

The generator in IDS-EBGAN is responsible for converting the original malicious network traffic in the training set into adversarial malicious examples.

Intrusion Detection

Paper
Add Code

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.

Image Super-Resolution

Paper
Add Code

Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation

no code implementations • CVPR 2022 • Jian Zhang, Yuanqing Zhang, Huan Fu, Xiaowei Zhou, Bowen Cai, Jinchi Huang, Rongfei Jia, Binqiang Zhao, Xing Tang

Neural Radiance Fields (NeRF) have emerged as a potent paradigm for representing scenes and synthesizing photo-realistic images.

Image Generation

Paper
Add Code

MM-RealSR: Metric Learning based Interactive Modulation for Real-World Super-Resolution

1 code implementation • 10 May 2022 • Chong Mou, Yanze Wu, Xintao Wang, Chao Dong, Jian Zhang, Ying Shan

Instead of using known degradation levels as explicit supervision to the interactive mechanism, we propose a metric learning strategy to map the unquantifiable degradation levels in real-world scenarios to a metric space, which is trained in an unsupervised manner.

Image Restoration Metric Learning +1

143

Paper
Code

Deep Generalized Unfolding Networks for Image Restoration

1 code implementation • CVPR 2022 • Chong Mou, Qian Wang, Jian Zhang

Concretely, without loss of interpretability, we integrate a gradient estimation strategy into the gradient descent step of the Proximal Gradient Descent (PGD) algorithm, driving it to deal with complex and real-world image degradation.

Image Restoration

112

Paper
Code

Learning Weighting Map for Bit-Depth Expansion within a Rational Range

1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.

SSIM

Paper
Code

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

1 code implementation • 26 Apr 2022 • Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

While conventional CF models are known for facing the challenges of the popularity bias that favors popular items, one may wonder "Whether the existing graph-based CF models alleviate or exacerbate popularity bias of recommender systems?"

Collaborative Filtering Recommendation Systems

Paper
Code

PUERT: Probabilistic Under-sampling and Explicable Reconstruction Network for CS-MRI

1 code implementation • 24 Apr 2022 • Jingfen Xie, Jian Zhang, Yongbing Zhang, Xiangyang Ji

Compressed Sensing MRI (CS-MRI) aims at reconstructing de-aliased images from sub-Nyquist sampling k-space data to accelerate MR Imaging, thus presenting two basic issues, i. e., where to sample and how to reconstruct.

Binarization

Paper
Code

R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning

1 code implementation • 24 Mar 2022 • Qiankun Gao, Chen Zhao, Bernard Ghanem, Jian Zhang

After RRL, the classification head is refined with global class-balanced classification loss to address the data imbalance issue as well as learn the decision boundaries between new and previous classes.

Class Incremental Learning Incremental Learning +3

Paper
Code

A Prompting-based Approach for Adversarial Example Generation and Robustness Enhancement

no code implementations • 21 Mar 2022 • Yuting Yang, Pei Huang, Juan Cao, Jintao Li, Yun Lin, Jin Song Dong, Feifei Ma, Jian Zhang

Our attack technique targets the inherent vulnerabilities of NLP models, allowing us to generate samples even without interacting with the victim NLP model, as long as it is based on pre-trained language models (PLMs).

Adversarial Attack

Paper
Add Code

Series Photo Selection via Multi-view Graph Learning

no code implementations • 18 Mar 2022 • Jin Huang, Lu Zhang, Yongshun Gong, Jian Zhang, Xiushan Nie, Yilong Yin

Series photo selection (SPS) is an important branch of the image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos.

Aesthetics Quality Assessment Graph Learning

Paper
Add Code

Panini-Net: GAN Prior Based Degradation-Aware Feature Interpolation for Face Restoration

1 code implementation • 16 Mar 2022 • Yinhuai Wang, Yujie Hu, Jian Zhang

Emerging high-quality face restoration (FR) methods often utilize pre-trained GAN models (\textit{i. e.}, StyleGAN2) as GAN Prior.

Representation Learning Super-Resolution

109

Paper
Code

NeRFocus: Neural Radiance Field for 3D Synthetic Defocus

1 code implementation • 10 Mar 2022 • Yinhuai Wang, Shuzhou Yang, Yujie Hu, Jian Zhang

Unlike the pinhole, the thin lens refracts rays of a scene point, so its imaging on the sensor plane is scattered as a circle of confusion (CoC).

Paper
Code

Unsupervised Learning on 3D Point Clouds by Clustering and Contrasting

no code implementations • 5 Feb 2022 • Guofeng Mei, Litao Yu, Qiang Wu, Jian Zhang, Mohammed Bennamoun

This paper proposes a general unsupervised approach, named \textbf{ConClu}, to perform the learning of point-wise and global features by jointly leveraging point-level clustering and instance-level contrasting.

3D Object Classification Clustering +2

Paper
Add Code

Quantifying Robustness to Adversarial Word Substitutions

no code implementations • 11 Jan 2022 • Yuting Yang, Pei Huang, Feifei Ma, Juan Cao, Meishan Zhang, Jian Zhang, Jintao Li

Deep-learning-based NLP models are found to be vulnerable to word substitution perturbations.

Paper
Add Code

Image Disentanglement Autoencoder for Steganography Without Embedding

1 code implementation • CVPR 2022 • Xiyao Liu, Ziping Ma, Junxing Ma, Jian Zhang, Gerald Schaefer, Hui Fang

Conventional steganography approaches embed a secret message into a carrier for concealed communication but are prone to attack by recent advanced steganalysis tools.

Disentanglement Steganalysis

Paper
Code

Robust Invertible Image Steganography

no code implementations • CVPR 2022 • Youmin Xu, Chong Mou, Yujie Hu, Jingfen Xie, Jian Zhang

Previous image steganography methods are limited in hiding capacity and robustness, commonly vulnerable to distortion on container images such as Gaussian noise, Poisson noise, and lossy compression.

Image Steganography

Paper
Add Code

CSformer: Bridging Convolution and Transformer for Compressive Sensing

1 code implementation • 31 Dec 2021 • Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong

The proposed approach is an end-to-end compressive image sensing method, composed of adaptive sampling and recovery.

Compressive Sensing Inductive Bias +1

Paper
Code

COTReg:Coupled Optimal Transport based Point Cloud Registration

no code implementations • 29 Dec 2021 • Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun

Generating a set of high-quality correspondences or matches is one of the most critical steps in point cloud registration.

Point Cloud Registration

Paper
Add Code

MVDG: A Unified Multi-view Framework for Domain Generalization

1 code implementation • 23 Dec 2021 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Beyond the training stage, overfitting could also cause unstable prediction in the test stage.

Domain Generalization Meta-Learning

Paper
Code

HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging

1 code implementation • CVPR 2022 • Xuanyu Zhang, Yongbing Zhang, Ruiqin Xiong, Qilin Sun, Jian Zhang

Hyperspectral imaging is an essential imaging modality for a wide range of applications, especially in remote sensing, agriculture, and medicine.

Compressive Sensing

Paper
Code

GenReg: Deep Generative Method for Fast Point Cloud Registration

no code implementations • 23 Nov 2021 • Xiaoshui Huang, Zongyi Xu, Guofeng Mei, Sheng Li, Jian Zhang, Yifan Zuo, Yucheng Wang

To solve this challenge, we propose a new data-driven registration algorithm by investigating deep generative neural networks to point cloud registration.

Point Cloud Registration

Paper
Add Code

Can Graph Neural Networks Learn to Solve MaxSAT Problem?

no code implementations • 15 Nov 2021 • Minghao Liu, Fuqi Jia, Pei Huang, Fan Zhang, Yuchen Sun, Shaowei Cai, Feifei Ma, Jian Zhang

With the rapid development of deep learning techniques, various recent work has tried to apply graph neural networks (GNNs) to solve NP-hard problems such as Boolean Satisfiability (SAT), which shows the potential in bridging the gap between machine learning and symbolic reasoning.

Paper
Add Code

ε-weakened Robustness of Deep Neural Networks

no code implementations • 29 Oct 2021 • Pei Huang, Yuting Yang, Minghao Liu, Fuqi Jia, Feifei Ma, Jian Zhang

This paper introduces a notation of $\varepsilon$-weakened robustness for analyzing the reliability and stability of deep neural networks (DNNs).

Paper
Add Code

Research on the Inverse Kinematics Prediction of a Soft Biomimetic Actuator via BP Neural Network

no code implementations • 26 Oct 2021 • Huichen Ma, Junjie Zhou, Jian Zhang, Lingyu Zhang

After training with sample data, the BP neural network model can represent the relation between the manipulator tip position and the pressure applied to the chambers.

Motion Planning Position

Paper
Add Code

Memory-Augmented Deep Unfolding Network for Compressive Sensing

1 code implementation • 19 Oct 2021 • Jiechong Song, Bin Chen, Jian Zhang

By understanding DUNs from the perspective of the human brain's memory processing, we find there exists two issues in existing DUNs.

Compressive Sensing

Paper
Code

Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation

1 code implementation • 17 Oct 2021 • Yinghuan Shi, Jian Zhang, Tong Ling, Jiwen Lu, Yefeng Zheng, Qian Yu, Lei Qi, Yang Gao

In semi-supervised medical image segmentation, most previous works draw on the common assumption that higher entropy means higher uncertainty.

Image Segmentation Segmentation +2

Paper
Code

Dyn-Backdoor: Backdoor Attack on Dynamic Link Prediction

no code implementations • 8 Oct 2021 • Jinyin Chen, Haiyang Xiong, Haibin Zheng, Jian Zhang, Guodong Jiang, Yi Liu

Backdoor attacks induce the DLP methods to make wrong prediction by the malicious training data, i. e., generating a subgraph sequence as the trigger and embedding it to the training data.

Backdoor Attack Dynamic Link Prediction +1

Paper
Add Code

Serving DNN Models with Multi-Instance GPUs: A Case of the Reconfigurable Machine Scheduling Problem

no code implementations • 18 Sep 2021 • Cheng Tan, Zhichao Li, Jian Zhang, Yu Cao, Sikai Qi, Zherui Liu, Yibo Zhu, Chuanxiong Guo

With MIG, A100 can be the most cost-efficient GPU ever for serving Deep Neural Networks (DNNs).

Scheduling

Paper
Add Code

Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging

1 code implementation • ICCV 2021 • Zhuoyuan Wu, Jian Zhang, Chong Mou

To better exploit the spatial-temporal correlation among frames and address the problem of information loss between adjacent phases in existing DUNs, we propose to adopt the 3D-CNN prior in our proximal mapping module and develop a novel dense feature map (DFM) strategy, respectively.

Paper
Code

Dynamic Attentive Graph Learning for Image Restoration

1 code implementation • ICCV 2021 • Chong Mou, Jian Zhang, Zhuoyuan Wu

Specifically, we propose an improved graph model to perform patch-wise graph convolution with a dynamic and adaptive number of neighbors for each node.

Demosaicking Graph Learning +1

Paper
Code

Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach

1 code implementation • ICCV 2021 • Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Yongshun Zhang, Fumin Shen, Jianxin Wu, Jian Zhang, Heng-Tao Shen

Learning from the web can ease the extreme dependence of deep learning on large-scale manually labeled datasets.

Benchmarking

Paper
Code

Structure Destruction and Content Combination for Face Anti-Spoofing

no code implementations • 22 Jul 2021 • Ke-Yue Zhang, Taiping Yao, Jian Zhang, Shice Liu, Bangjie Yin, Shouhong Ding, Jilin Li

In pursuit of consolidating the face verification systems, prior face anti-spoofing studies excavate the hidden cues in original images to discriminate real persons and diverse attack types with the assistance of auxiliary supervision.

Face Anti-Spoofing Face Verification +1

Paper
Add Code

EGC2: Enhanced Graph Classification with Easy Graph Compression

1 code implementation • 16 Jul 2021 • Jinyin Chen, Haiyang Xiong, Haibin Zhenga, Dunjie Zhang, Jian Zhang, Mingwei Jia, Yi Liu

To achieve lower-complexity defense applied to graph classification models, EGC2 utilizes a centrality-based edge-importance index to compress the graphs, filtering out trivial structures and adversarial perturbations in the input graphs, thus improving the model's robustness.

Graph Classification

Paper
Code

Multi-Level Contrastive Learning for Few-Shot Problems

no code implementations • 15 Jul 2021 • Qing Chen, Jian Zhang

Most current applications of contrastive learning benefit only a single representation from the last layer of an encoder. In this paper, we propose a multi-level contrasitive learning approach which applies contrastive losses at different layers of an encoder to learn multiple representations from the encoder.

Contrastive Learning Few-Shot Learning

Paper
Add Code

COAST: COntrollable Arbitrary-Sampling NeTwork for Compressive Sensing

1 code implementation • 15 Jul 2021 • Di You, Jian Zhang, Jingfen Xie, Bin Chen, Siwei Ma

In this paper, we propose a novel COntrollable Arbitrary-Sampling neTwork, dubbed COAST, to solve CS problems of arbitrary-sampling matrices (including unseen sampling matrices) with one single model.

Blocking Compressive Sensing

Paper
Code

Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification

no code implementations • 6 Jul 2021 • Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang

To better eliminate interference from occlusions, we design a contrast feature learning technique (CFL) for better separation of occlusion features and discriminative ID features.

Person Re-Identification Representation Learning

Paper
Add Code

Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream

no code implementations • CVPR 2021 • Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang

Different from the conventional digital cameras that compact the photoelectric information within the exposure interval into a single snapshot, the spike camera produces a continuous spike stream to record the dynamic light intensity variation process.

Image Reconstruction

Paper
Add Code

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification

3 code implementations • NeurIPS 2021 • Zhuchen Shao, Hao Bian, Yang Chen, Yifeng Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang

Multiple instance learning (MIL) is a powerful tool to solve the weakly supervised classification in whole slide image (WSI) based pathology diagnosis.

Classification Image Classification +2

297

Paper
Code

A Multi-Branch Hybrid Transformer Networkfor Corneal Endothelial Cell Segmentation

no code implementations • 21 May 2021 • Yinglin Zhang, Risa Higashita, Huazhu Fu, Yanwu Xu, Yang Zhang, Haofeng Liu, Jian Zhang, Jiang Liu

Corneal endothelial cell segmentation plays a vital role inquantifying clinical indicators such as cell density, coefficient of variation, and hexagonality.

Cell Segmentation

Paper
Add Code

Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching

no code implementations • 18 May 2021 • Bofeng Wu, guocheng niu, Jun Yu, Xinyan Xiao, Jian Zhang, Hua Wu

This paper proposes an approach to Dense Video Captioning (DVC) without pairwise event-sentence annotation.

Caption Generation Cross-Modal Retrieval +4

Paper
Add Code

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

2 code implementations • 17 May 2021 • Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.

Offline RL Q-Learning +2

Paper
Code

Unsupervised Sentiment Analysis by Transferring Multi-source Knowledge

no code implementations • 9 May 2021 • Yong Dai, Jian Liu, Jian Zhang, Hongguang Fu, Zenglin Xu

The first mechanism is a selective domain adaptation (SDA) method, which transfers knowledge from the closest source domain.

Domain Adaptation Sentiment Analysis

Paper
Add Code

Underwater Target Recognition based on Multi-Decision LOFAR Spectrum Enhancement: A Deep Learning Approach

no code implementations • 26 Apr 2021 • Jie Chen, Jie Liu, Chang Liu, Jian Zhang, Bing Han

To overcome this issue and to further improve the recognition performance, we adopt a deep learning approach for underwater target recognition and propose a LOFAR spectrum enhancement (LSE)-based underwater target recognition scheme, which consists of preprocessing, offline training, and online testing.

Paper
Add Code

Super-Resolving Compressed Video in Coding Chain

no code implementations • 26 Mar 2021 • Dewang Hou, Yang Zhao, Yuyao Ye, Jiayu Yang, Jian Zhang, Ronggang Wang

Scaling and lossy coding are widely used in video transmission and storage.

Paper
Add Code

Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation

1 code implementation • CVPR 2021 • Yazhou Yao, Tao Chen, GuoSen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang

To further mine the non-salient region objects, we propose to exert the segmentation network's self-correction ability.

Object Segmentation +2

Paper
Code

Jo-SRC: A Contrastive Approach for Combating Noisy Labels

no code implementations • CVPR 2021 • Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang

Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance.

Contrastive Learning Memorization

Paper
Add Code

ISTA-Net++: Flexible Deep Unfolding Network for Compressive Sensing

1 code implementation • 22 Mar 2021 • Di You, Jingfen Xie, Jian Zhang

While deep neural networks have achieved impressive success in image compressive sensing (CS), most of them lack flexibility when dealing with multi-ratio tasks and multi-scene images in practical applications.

Blocking Compressive Sensing

Paper
Code

Thousand to One: Semantic Prior Modeling for Conceptual Coding

no code implementations • 12 Mar 2021 • Jianhui Chang, Zhenghui Zhao, Lingbo Yang, Chuanmin Jia, Jian Zhang, Siwei Ma

To this end, we propose a novel end-to-end semantic prior modeling-based conceptual coding scheme towards extremely low bitrate image compression, which leverages semantic-wise deep representations as a unified prior for entropy estimation and texture synthesis.

Image Compression Semantic Segmentation +1

Paper
Add Code

COLA-Net: Collaborative Attention Network for Image Restoration

2 code implementations • 10 Mar 2021 • Chong Mou, Jian Zhang, Xiaopeng Fan, Hangfan Liu, Ronggang Wang

Local and non-local attention-based methods have been well studied in various image restoration tasks while leading to promising performance.

CoLA Image Denoising +1

Paper
Code

A comprehensive survey on point cloud registration

no code implementations • 3 Mar 2021 • Xiaoshui Huang, Guofeng Mei, Jian Zhang, Rana Abbas

This survey conducts a comprehensive survey, including both same-source and cross-source registration methods, and summarize the connections between optimization-based and deep learning methods, to provide further research insight.

3D Reconstruction Point Cloud Registration

Paper
Add Code

Temperature dependent coherence properties of NV ensemble in diamond up to 600K

no code implementations • 25 Feb 2021 • Shengran Lin, Changfeng Weng, Yuanjie Yang, Jiaxin Zhao, Yuhang Guo, Jian Zhang, Liren Lou, Wei Zhu, Guanzhong Wang

Nitrogen-vacancy (NV) center in diamond is an ideal candidate for quantum sensors because of its excellent optical and coherence property.

Quantum Physics Mesoscale and Nanoscale Physics

Paper
Add Code

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation

1 code implementation • 22 Feb 2021 • Tao Chen, GuoSen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang

Then we utilize the fused prototype to guide the final segmentation of the query image.

Image Segmentation Segmentation +1

Paper
Code

Temporal-Amount Snapshot MultiGraph for Ethereum Transaction Tracking

no code implementations • 16 Feb 2021 • Yunyi Xie, Jie Jin, Jian Zhang, Shanqing Yu, Qi Xuan

With the wide application of blockchain in the financial field, the rise of various types of cybercrimes has brought great challenges to the security of blockchain.

Link Prediction

Paper
Add Code

Aurora Guard: Reliable Face Anti-Spoofing via Mobile Lighting System

no code implementations • 1 Feb 2021 • Jian Zhang, Ying Tai, Taiping Yao, Jia Meng, Shouhong Ding, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

Face authentication on mobile end has been widely applied in various scenarios.

Face Anti-Spoofing

Paper
Add Code

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones

1 code implementation • 23 Jan 2021 • Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiushen Wei, Fumin Shen, Jian Zhang, Zhenmin Tang

Labeling objects at a subordinate level typically requires expert knowledge, which is not always available when using random annotators.

Fine-Grained Visual Recognition

Paper
Code

Uncertainty Weighted Offline Reinforcement Learning

no code implementations • 1 Jan 2021 • Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.

Offline RL Q-Learning +2

Paper
Add Code

Multi-Representation Ensemble in Few-Shot Learning

no code implementations • 1 Jan 2021 • Qing Chen, Jian Zhang

Deep neural networks (DNNs) compute representations in a layer by layer fashion, producing a final representation at the top layer of the pipeline, and classification or regression is made using the final representation.

Few-Shot Learning

Paper
Add Code

Super Resolve Dynamic Scene From Continuous Spike Streams

no code implementations • ICCV 2021 • Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang

In this paper, we properly exploit the relative motion and derive the relationship between light intensity and each spike, so as to recover the external scene with both high temporal and high spatial resolution.

Super-Resolution

Paper
Add Code

Revisiting BFfloat16 Training

no code implementations • 1 Jan 2021 • Pedram Zamirai, Jian Zhang, Christopher R Aberger, Christopher De Sa

We ask can we do pure 16-bit training which requires only 16-bit compute units, while still matching the model accuracy attained by 32-bit training.

Paper
Add Code

Counting the Number of Solutions to Constraints

no code implementations • 28 Dec 2020 • Jian Zhang, Cunjing Ge, Feifei Ma

Compared with constraint satisfaction problems, counting problems have received less attention.

Paper
Add Code

Multiple Instance Segmentation in Brachial Plexus Ultrasound Image Using BPMSegNet

no code implementations • 22 Dec 2020 • Yi Ding, Qiqi Yang, Guozheng Wu, Jian Zhang, Zhiguang Qin

In this paper, a network called Brachial Plexus Multi-instance Segmentation Network (BPMSegNet) is proposed to identify different tissues (nerves, arteries, veins, muscles) in ultrasound images.

Instance Segmentation Semantic Segmentation

Paper
Add Code

PTN: A Poisson Transfer Network for Semi-supervised Few-shot Learning

no code implementations • 20 Dec 2020 • Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Chang Xu

Second, the extra unlabeled samples are employed to transfer the knowledge from base classes to novel classes through contrastive learning.

Contrastive Learning Few-Shot Learning

Paper
Add Code

Self-Supervised Learning of Lidar Segmentation for Autonomous Indoor Navigation

2 code implementations • 10 Dec 2020 • Hugues Thomas, Ben Agro, Mona Gridseth, Jian Zhang, Timothy D. Barfoot

We provide insights into our network predictions and show that our approach can also improve the performances of common localization techniques.

Navigate Point Cloud Segmentation +4

Paper
Code

Rigid and Articulated Point Registration with Expectation Conditional Maximization

no code implementations • 9 Dec 2020 • Radu Horaud, Florence Forbes, Manuel Yguel, Guillaume Dewaele, Jian Zhang

This paper addresses the issue of matching rigid and articulated shapes through probabilistic point registration.

Paper
Add Code

Field-wise Learning for Multi-field Categorical Data

1 code implementation • NeurIPS 2020 • Zhibin Li, Jian Zhang, Yongshun Gong, Yazhou Yao, Qiang Wu

We present a model that utilizes linear models with variance and low-rank constraints, to help it generalize better and reduce the number of parameters.

Paper
Code

Time-Series Snapshot Network for Partner Recommendation: A Case Study on OSS

no code implementations • 18 Nov 2020 • Jinyin Chen, Yunyi Xie, Jian Zhang, Xincheng Shu, Qi Xuan

In this paper, we introduce time-series snapshot network (TSSN) which is a mixture network to model the interactions among users and developers.

Social and Information Networks

Paper
Add Code

Conceptual Compression via Deep Structure and Texture Synthesis

2 code implementations • 10 Nov 2020 • Jianhui Chang, Zhenghui Zhao, Chuanmin Jia, Shiqi Wang, Lingbo Yang, Qi Mao, Jian Zhang, Siwei Ma

To this end, we propose a novel conceptual compression framework that encodes visual data into compact structure and texture representations, then decodes in a deep synthesis fashion, aiming to achieve better visual reconstruction quality, flexible content manipulation, and potential support for various vision tasks.

Texture Synthesis

Paper
Code

Multi-layer Feature Aggregation for Deep Scene Parsing Models

no code implementations • 4 Nov 2020 • Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang, Qiang Wu

The proposed module can auto-select the intermediate visual features to correlate the spatial and semantic information.

Ranked #47 on Semantic Segmentation on NYU Depth v2

Scene Parsing Semantic Segmentation

Paper
Add Code

Distribution-aware Margin Calibration for Medical Image Segmentation

no code implementations • 3 Nov 2020 • Zhibin Li, Litao Yu, Jian Zhang

In this paper, we present a novel data-distribution-aware margin calibration method for a better generalization of the mIoU over the whole data-distribution, underpinned by a rigid lower bound.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

Parameter Efficient Deep Neural Networks with Bilinear Projections

1 code implementation • 3 Nov 2020 • Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang

Recent research on deep neural networks (DNNs) has primarily focused on improving the model accuracy.

Paper
Code

Dual Attention on Pyramid Feature Maps for Image Captioning

no code implementations • 2 Nov 2020 • Litao Yu, Jian Zhang, Qiang Wu

In this paper, we propose to apply dual attention on pyramid image feature maps to fully explore the visual-semantic correlations and improve the quality of generated sentences.

Descriptive Image Captioning

Paper
Add Code

AutoBSS: An Efficient Algorithm for Block Stacking Style Search

no code implementations • NeurIPS 2020 • Yikang Zhang, Jian Zhang, Zhao Zhong

Neural network architecture design mostly focuses on the new convolutional operator or special topological structure of network block, little attention is drawn to the configuration of stacking each block, called Block Stacking Style (BSS).

AutoML Bayesian Optimization +6

Paper
Add Code

Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

no code implementations • 20 Oct 2020 • Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera.

Paper
Add Code

Revisiting BFloat16 Training

no code implementations • 13 Oct 2020 • Pedram Zamirai, Jian Zhang, Christopher R. Aberger, Christopher De Sa

State-of-the-art generic low-precision training algorithms use a mix of 16-bit and 32-bit precision, creating the folklore that 16-bit hardware compute units alone are not enough to maximize model accuracy.

Paper
Add Code

Face Mask Assistant: Detection of Face Mask Service Stage Based on Mobile Phone

no code implementations • 9 Oct 2020 • Yuzhen Chen, Menghan Hu, Chunjun Hua, Guangtao Zhai, Jian Zhang, Qingli Li, Simon X. Yang

Aimed at solving the problem that we don't know which service stage of the mask belongs to, we propose a detection system based on the mobile phone.

Paper
Add Code

ASDN: A Deep Convolutional Network for Arbitrary Scale Image Super-Resolution

1 code implementation • 6 Oct 2020 • Jialiang Shen, Yucheng Wang, Jian Zhang

For SR of small-scales (between 1 and 2), images are constructed by interpolation from a sparse set of precalculated Laplacian pyramid levels.

Image Super-Resolution

Paper
Code

Scalar Coupling Constant Prediction Using Graph Embedding Local Attention Encoder

no code implementations • 7 Sep 2020 • Caiqing Jian, Xinyu Cheng, Jian Zhang, Lihui Wang

The experimental results demonstrate that, compared to the traditional chemical bond structure representations, the rotation and translation invariant structure representations proposed in this work can improve the SCC prediction accuracy; with the graph embedded local self-attention, the mean absolute error (MAE) of the prediction model in the validation set decreases from 0. 1603 Hz to 0. 1067 Hz; using the classification based loss function instead of the scaled regression loss, the MAE of the predicted SCC can be decreased to 0. 0963 HZ, which is close to the quantum chemistry standard on CHAMPS dataset.

Graph Embedding

Paper
Add Code

Implicit Multidimensional Projection of Local Subspaces

1 code implementation • 7 Sep 2020 • Rongzheng Bian, Yumeng Xue, Liang Zhou, Jian Zhang, Baoquan Chen, Daniel Weiskopf, Yunhai Wang

We propose a visualization method to understand the effect of multidimensional projection on local subspaces, using implicit function differentiation.

Paper
Code

Computational prediction of RNA tertiary structures using machine learning methods

no code implementations • 3 Sep 2020 • Bin Huang, Yuanyang Du, Shuai Zhang, Wenfei Li, Jun Wang, Jian Zhang

RNAs play crucial and versatile roles in biological processes.

BIG-bench Machine Learning

Paper
Add Code

Face Anti-Spoofing Via Disentangled Representation Learning

no code implementations • ECCV 2020 • Ke-Yue Zhang, Taiping Yao, Jian Zhang, Ying Tai, Shouhong Ding, Jilin Li, Feiyue Huang, Haichuan Song, Lizhuang Ma

Face anti-spoofing is crucial to security of face recognition systems.

Disentanglement Face Anti-Spoofing +1

Paper
Add Code

Salvage Reusable Samples from Noisy Data for Robust Learning

1 code implementation • 6 Aug 2020 • Zeren Sun, Xian-Sheng Hua, Yazhou Yao, Xiu-Shen Wei, Guosheng Hu, Jian Zhang

To this end, we propose a certainty-based reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.

Memorization

Paper
Code

Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks

no code implementations • 3 Aug 2020 • Haoqiang Guo, Lu Peng, Jian Zhang, Fang Qi, Lide Duan

Recent studies identify that Deep learning Neural Networks (DNNs) are vulnerable to subtle perturbations, which are not perceptible to human visual system but can fool the DNN models and lead to wrong outputs.

Adversarial Attack Computational Efficiency

Paper
Add Code

MACD R-CNN: An Abnormal Cell Nucleus Detection Method

no code implementations • 28 Jul 2020 • Baoyan Ma, Jian Zhang, Feng Cao, Yongjun He

We design a fixed proposal module to generate fixed-sized feature maps of nuclei, which allows the new information of nucleus is used for classification.

Cell Detection Classification +4

Paper
Add Code

A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification

no code implementations • 3 Jul 2020 • Mengxi Jia, Yunpeng Zhai, Shijian Lu, Siwei Ma, Jian Zhang

RGB-Infrared (IR) cross-modality person re-identification (re-ID), which aims to search an IR image in RGB gallery or vice versa, is a challenging task due to the large discrepancy between IR and RGB modalities.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

no code implementations • 27 Jun 2020 • Qian Li, Qingyuan Hu, Yong Qi, Saiyu Qi, Jie Ma, Jian Zhang

SBA stochastically decides whether to augment at iterations controlled by the batch scheduler and in which a ''distilled'' dynamic soft label regularization is introduced by incorporating the similarity in the vicinity distribution respect to raw samples.

Data Augmentation

Paper
Add Code

Automated Radiological Report Generation For Chest X-Rays With Weakly-Supervised End-to-End Deep Learning

no code implementations • 18 Jun 2020 • Shuai Zhang, Xiaoyan Xin, Yang Wang, Yachong Guo, Qiuqiao Hao, Xianfeng Yang, Jun Wang, Jian Zhang, Bing Zhang, Wei Wang

The model provides automated recognition of given scans and generation of reports.

Paper
Add Code

Understanding Graph Neural Networks from Graph Signal Denoising Perspectives

1 code implementation • 8 Jun 2020 • Guoji Fu, Yifan Hou, Jian Zhang, Kaili Ma, Barakeel Fanseu Kamhoua, James Cheng

This paper aims to provide a theoretical framework to understand GNNs, specifically, spectral graph convolutional networks and graph attention networks, from graph signal denoising perspectives.

Denoising Graph Attention +2

Paper
Code

TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization with Few Labeled Samples

no code implementations • 28 May 2020 • Huaxi Huang, Jun-Jie Zhang, Jian Zhang, Qiang Wu, Chang Xu

The challenges of high intra-class variance yet low inter-class fluctuations in fine-grained visual categorization are more severe with few labeled samples, \textit{i. e.,} Fine-Grained categorization problems under the Few-Shot setting (FGFS).

Fine-Grained Visual Categorization

Paper
Add Code

Iterative Network for Image Super-Resolution

1 code implementation • 20 May 2020 • Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao

A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.

Image Super-Resolution SSIM

Paper
Code

Contextual Embeddings: When Are They Worth It?

no code implementations • ACL 2020 • Simran Arora, Avner May, Jian Zhang, Christopher Ré

We study the settings for which deep contextual embeddings (e. g., BERT) give large improvements in performance relative to classic pretrained embeddings (e. g., GloVe), and an even simpler baseline---random word embeddings---focusing on the impact of the training set size and the linguistic properties of the task.

Word Embeddings

Paper
Add Code

Towards Better Graph Representation: Two-Branch Collaborative Graph Neural Networks for Multimodal Marketing Intention Detection

no code implementations • 13 May 2020 • Lu Zhang, Jian Zhang, Zhibin Li, Jingsong Xu

Inspired by the fact that spreading and collecting information through the Internet becomes the norm, more and more people choose to post for-profit contents (images and texts) in social networks.

Graph Classification Marketing

Paper
Add Code

Feature-metric Registration: A Fast Semi-supervised Approach for Robust Point Cloud Registration without Correspondences

1 code implementation • CVPR 2020 • Xiaoshui Huang, Guofeng Mei, Jian Zhang

We present a fast feature-metric point cloud registration framework, which enforces the optimisation of registration by minimising a feature-metric projection error without correspondences.

Point Cloud Registration

143

Paper
Code

DyNet: Dynamic Convolution for Accelerating Convolutional Neural Networks

no code implementations • 22 Apr 2020 • Yikang Zhang, Jian Zhang, Qiang Wang, Zhao Zhong

On one hand, we can reduce the computation cost remarkably while maintaining the performance.

Paper
Add Code

Generalizable Model-agnostic Semantic Segmentation via Target-specific Normalization

1 code implementation • 27 Mar 2020 • Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Semantic segmentation in a supervised learning manner has achieved significant progress in recent years.

Domain Generalization Segmentation +1

Paper
Code

GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency

no code implementations • 11 Mar 2020 • Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li

GID block breaks through the local neighborhoods and captures long-range dependency of pixels both in global-level and instance-level from the scene to help detecting interactions between instances.

Human-Object Interaction Detection Object

Paper
Add Code

Understanding the Downstream Instability of Word Embeddings

1 code implementation • 29 Feb 2020 • Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Christopher R. Aberger, Christopher Ré

To theoretically explain this tradeoff, we introduce a new measure of embedding instability---the eigenspace instability measure---which we prove bounds the disagreement in downstream predictions introduced by the change in word embeddings.

Word Embeddings

Paper
Code

Multi-factorial Optimization for Large-scale Virtual Machine Placement in Cloud Computing

no code implementations • 18 Jan 2020 • Zhengping Liang, Jian Zhang, Liang Feng, Zexuan Zhu

However, as growing demand for cloud services, the existing EAs fail to implement in large-scale virtual machine placement (LVMP) problem due to the high time complexity and poor scalability.

Cloud Computing Evolutionary Algorithms

Paper
Add Code

PMC-GANs: Generating Multi-Scale High-Quality Pedestrian with Multimodal Cascaded GANs

no code implementations • 30 Dec 2019 • Jie Wu, Ying Peng, Chenghao Zheng, Zongbo Hao, Jian Zhang

Recently, generative adversarial networks (GANs) have shown great advantages in synthesizing images, leading to a boost of explorations of using faked images to augment data.

Data Augmentation Pedestrian Detection

Paper
Add Code

Adversarial AutoAugment

no code implementations • ICLR 2020 • Xin-Yu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

The augmentation policy network attempts to increase the training loss of a target network through generating adversarial augmentation policies, while the target network can learn more robust features from harder examples to improve the generalization.

Ranked #594 on Image Classification on ImageNet

Data Augmentation Image Classification

Paper
Add Code

Relational Mimic for Visual Adversarial Imitation Learning

no code implementations • 18 Dec 2019 • Lionel Blondé, Yichuan Charlie Tang, Jian Zhang, Russ Webb

In this work, we introduce a new method for imitation learning from video demonstrations.

Imitation Learning Relational Reasoning

Paper
Add Code

Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development

no code implementations • 7 Dec 2019 • Yongshun Gong, Zhibin Li, Jian Zhang, Wei Liu, Jin-Feng Yi

In this paper, this specific problem is termed as potential passenger flow (PPF) prediction, which is a novel and important study connected with urban computing and intelligent transportation systems.

MULTI-VIEW LEARNING Recommendation Systems

Paper
Add Code

Adversarial Domain Adaptation with Domain Mixup

1 code implementation • 4 Dec 2019 • Minghao Xu, Jian Zhang, Bingbing Ni, Teng Li, Chengjie Wang, Qi Tian, Wenjun Zhang

In this paper, we present adversarial domain adaptation with domain mixup (DM-ADA), which guarantees domain-invariance in a more continuous latent space and guides the domain discriminator in judging samples' difference relative to source and target domains.

Domain Adaptation

159

Paper
Code

Time-aware Gradient Attack on Dynamic Network Link Prediction

no code implementations • 24 Nov 2019 • Jinyin Chen, Jian Zhang, Zhi Chen, Min Du, Qi Xuan

In this work, we present the first study of adversarial attack on dynamic network link prediction (DNLP).

Adversarial Attack Link Prediction +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.