Search Results for author: Bo Zhang

Found 260 papers, 129 papers with code

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

8 code implementations • NeurIPS 2021 • Xiangxiang Chu, Zhi Tian, Yuqing Wang, Bo Zhang, Haibing Ren, Xiaolin Wei, Huaxia Xia, Chunhua Shen

Very recently, a variety of vision transformer architectures for dense prediction tasks have been proposed and they show that the design of spatial attention is critical to their success in these tasks.

Ranked #48 on Semantic Segmentation on ADE20K val

Image Classification Semantic Segmentation

29,648

Paper
Code

Bringing Old Photos Back to Life

7 code implementations • CVPR 2020 • Zi-Yu Wan, Bo Zhang, Dong-Dong Chen, Pan Zhang, Dong Chen, Jing Liao, Fang Wen

Unlike conventional restoration tasks that can be solved through supervised learning, the degradation in real photos is complex and the domain gap between synthetic images and real old photos makes the network fail to generalize.

Image Restoration Translation

14,413

Paper
Code

Old Photo Restoration via Deep Latent Space Translation

8 code implementations • 14 Sep 2020 • Zi-Yu Wan, Bo Zhang, Dong-Dong Chen, Pan Zhang, Dong Chen, Jing Liao, Fang Wen

Image Restoration Translation

14,413

Paper
Code

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

7 code implementations • 7 Sep 2022 • Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng, Yifei Geng, Liang Li, Zaidan Ke, Qingyuan Li, Meng Cheng, Weiqiang Nie, Yiduo Li, Bo Zhang, Yufei Liang, Linyuan Zhou, Xiaoming Xu, Xiangxiang Chu, Xiaoming Wei, Xiaolin Wei

The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios.

Ranked #14 on Object Detection on COCO-O

Object Detection Quantization

12,022

Paper
Code

YOLOv6 v3.0: A Full-Scale Reloading

5 code implementations • 13 Jan 2023 • Chuyi Li, Lulu Li, Yifei Geng, Hongliang Jiang, Meng Cheng, Bo Zhang, Zaidan Ke, Xiaoming Xu, Xiangxiang Chu

For a glimpse of performance, our YOLOv6-N hits 37. 5% AP on the COCO dataset at a throughput of 1187 FPS tested with an NVIDIA Tesla T4 GPU.

Ranked #1 on Object Detection on COCO 2017 val

Real-Time Object Detection

12,022

Paper
Code

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

1 code implementation • 25 Oct 2023 • Jingxiang Sun, Bo Zhang, Ruizhi Shao, Lizhen Wang, Wen Liu, Zhenda Xie, Yebin Liu

The score distillation from this 3D-aware diffusion prior provides view-consistent guidance for the scene.

3D Generation

1,788

Paper
Code

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

2 code implementations • ICCV 2023 • Junshu Tang, Tengfei Wang, Bo Zhang, Ting Zhang, Ran Yi, Lizhuang Ma, Dong Chen

In this work, we investigate the problem of creating high-fidelity 3D content from only a single image.

Text to 3D

1,681

Paper
Code

DeepSeek-VL: Towards Real-World Vision-Language Understanding

2 code implementations • 8 Mar 2024 • Haoyu Lu, Wen Liu, Bo Zhang, Bingxuan Wang, Kai Dong, Bo Liu, Jingxiang Sun, Tongzheng Ren, Zhuoshu Li, Hao Yang, Yaofeng Sun, Chengqi Deng, Hanwei Xu, Zhenda Xie, Chong Ruan

The DeepSeek-VL family (both 1. 3B and 7B models) showcases superior user experiences as a vision-language chatbot in real-world applications, achieving state-of-the-art or competitive performance across a wide range of visual-language benchmarks at the same model size while maintaining robust performance on language-centric benchmarks.

Ranked #28 on Visual Question Answering on MM-Vet

Chatbot Language Modelling +3

1,466

Paper
Code

Paint by Example: Exemplar-based Image Editing with Diffusion Models

2 code implementations • CVPR 2023 • Binxin Yang, Shuyang Gu, Bo Zhang, Ting Zhang, Xuejin Chen, Xiaoyan Sun, Dong Chen, Fang Wen

Language-guided image editing has achieved great success recently.

Image Generation Image Manipulation

956

Paper
Code

Vector Quantized Diffusion Model for Text-to-Image Synthesis

2 code implementations • CVPR 2022 • Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo

Our experiments indicate that the VQ-Diffusion model with the reparameterization is fifteen times faster than traditional AR methods while achieving a better image quality.

Ranked #1 on Text-to-Image Generation on Oxford 102 Flowers (using extra training data)

Denoising Text-to-Image Generation

832

Paper
Code

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

1 code implementation • 19 Mar 2024 • Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou

In this work, we emphasize the importance of structure information in Visual Document Understanding and propose the Unified Structure Learning to boost the performance of MLLMs.

document understanding Optical Character Recognition (OCR)

830

Paper
Code

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices

1 code implementation • 28 Dec 2023 • Xiangxiang Chu, Limeng Qiao, Xinyang Lin, Shuang Xu, Yang Yang, Yiming Hu, Fei Wei, Xinyu Zhang, Bo Zhang, Xiaolin Wei, Chunhua Shen

We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices.

AutoML Language Modelling

753

Paper
Code

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

1 code implementation • 6 Feb 2024 • Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

We introduce MobileVLM V2, a family of significantly improved vision language models upon MobileVLM, which proves that a delicate orchestration of novel architectural design, an improved training scheme tailored for mobile VLMs, and rich high-quality dataset curation can substantially benefit VLMs' performance.

AutoML Language Modelling

753

Paper
Code

Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search

2 code implementations • arXiv 2019 • Xiangxiang Chu, Bo Zhang, Hailong Ma, Ruijun Xu, Qingyuan Li

Deep convolutional neural networks demonstrate impressive results in the super-resolution domain.

Ranked #16 on Image Super-Resolution on BSD100 - 2x upscaling

Neural Architecture Search Reinforcement Learning (RL) +1

683

Paper
Code

Making Images Real Again: A Comprehensive Survey on Deep Image Composition

4 code implementations • 28 Jun 2021 • Li Niu, Wenyan Cong, Liu Liu, Yan Hong, Bo Zhang, Jing Liang, Liqing Zhang

We have also contributed the first image composition toolbox: libcom https://github. com/bcmi/libcom, which assembles 10+ image composition related functions (e. g., image blending, image harmonization, object placement, shadow generation, generative composition).

Image Harmonization

587

Paper
Code

Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection

1 code implementation • CVPR 2023 • Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao

Unsupervised Domain Adaptation (UDA) technique has been explored in 3D cross-domain tasks recently.

3D Object Detection Active Learning +2

564

Paper
Code

Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection

1 code implementation • CVPR 2023 • Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao

In this paper, we study the task of training a unified 3D detector from multiple datasets.

3D Object Detection object-detection

564

Paper
Code

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification

2 code implementations • 16 May 2023 • Siyuan Huang, Bo Zhang, Botian Shi, Peng Gao, Yikang Li, Hongsheng Li

In this paper, different from previous 2D DG works, we focus on the 3D DG problem and propose a Single-dataset Unified Generalization (SUG) framework that only leverages a single source dataset to alleviate the unforeseen domain differences faced by a well-trained source model.

3D Point Cloud Classification Domain Generalization +2

564

Paper
Code

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation

2 code implementations • 11 Sep 2023 • Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao

Domain shifts such as sensor type changes and geographical situation variations are prevalent in Autonomous Driving (AD), which poses a challenge since AD model relying on the previous domain knowledge can be hardly directly deployed to a new domain without additional costs.

Autonomous Driving Domain Generalization

564

Paper
Code

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving

1 code implementation • 19 Sep 2023 • Xiangchao Yan, Runjian Chen, Bo Zhang, Jiakang Yuan, Xinyu Cai, Botian Shi, Wenqi Shao, Junchi Yan, Ping Luo, Yu Qiao

Our contributions are threefold: (1) Occupancy prediction is shown to be promising for learning general representations, which is demonstrated by extensive experiments on plenty of datasets and tasks.

3D Object Detection Autonomous Driving +3

564

Paper
Code

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset

1 code implementation • NeurIPS 2023 • Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao

It is a long-term vision for Autonomous Driving (AD) community that the perception models can learn from a large-scale point cloud dataset, to obtain unified representations that can achieve promising results on different tasks or benchmarks.

Autonomous Driving Point Cloud Pre-training

564

Paper
Code

MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation

1 code implementation • CVPR 2023 • BoWen Zhang, Chenyang Qi, Pan Zhang, Bo Zhang, HsiangTao Wu, Dong Chen, Qifeng Chen, Yong Wang, Fang Wen

In this work, we propose an ID-preserving talking head generation framework, which advances previous methods in two aspects.

Face Swapping Meta-Learning +1

489

Paper
Code

Bringing Old Films Back to Life

1 code implementation • CVPR 2022 • Ziyu Wan, Bo Zhang, Dongdong Chen, Jing Liao

We present a learning-based framework, recurrent transformer network (RTN), to restore heavily degraded old films.

Ranked #6 on Analog Video Restoration on TAPE

Analog Video Restoration

485

Paper
Code

StyleSwin: Transformer-based GAN for High-resolution Image Generation

1 code implementation • CVPR 2022 • BoWen Zhang, Shuyang Gu, Bo Zhang, Jianmin Bao, Dong Chen, Fang Wen, Yong Wang, Baining Guo

To this end, we believe that local attention is crucial to strike the balance between computational efficiency and modeling capacity.

Ranked #1 on Image Generation on CelebA 256x256 (FID metric)

Blocking Computational Efficiency +3

473

Paper
Code

Pretraining is All You Need for Image-to-Image Translation

2 code implementations • 25 May 2022 • Tengfei Wang, Ting Zhang, Bo Zhang, Hao Ouyang, Dong Chen, Qifeng Chen, Fang Wen

We propose to use pretraining to boost general image-to-image translation.

Ranked #1 on Sketch-to-Image Translation on COCO-Stuff

Image-to-Image Translation Sketch-to-Image Translation +2

470

Paper
Code

MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction

2 code implementations • NAACL 2022 • Yue Zhang, Zhenghua Li, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang

This paper presents MuCGEC, a multi-reference multi-source evaluation dataset for Chinese Grammatical Error Correction (CGEC), consisting of 7, 063 sentences collected from three Chinese-as-a-Second-Language (CSL) learner sources.

Grammatical Error Correction Sentence

455

Paper
Code

Mining Error Templates for Grammatical Error Correction

2 code implementations • 23 Jun 2022 • Yue Zhang, Haochen Jiang, Zuyi Bao, Bo Zhang, Chen Li, Zhenghua Li

We have accumulated 1, 119 error templates for Chinese GEC based on this method.

Grammatical Error Correction Language Modelling

455

Paper
Code

Cross-domain Correspondence Learning for Exemplar-based Image Translation

3 code implementations • CVPR 2020 • Pan Zhang, Bo Zhang, Dong Chen, Lu Yuan, Fang Wen

The output has the style (e. g., color, texture) in consistency with the semantically corresponding objects in the exemplar.

Ranked #1 on Image-to-Image Translation on ADE20K-Outdoor Labels-to-Photos (FID metric)

Image-to-Image Translation Translation

386

Paper
Code

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

1 code implementation • CVPR 2021 • Xingran Zhou, Bo Zhang, Ting Zhang, Pan Zhang, Jianmin Bao, Dong Chen, Zhongfei Zhang, Fang Wen

We present the full-resolution correspondence learning for cross-domain images, which aids image translation.

Image-to-Image Translation Semantic correspondence +1

334

Paper
Code

Deep Exemplar-based Video Colorization

1 code implementation • CVPR 2019 • Bo Zhang, Mingming He, Jing Liao, Pedro V. Sander, Lu Yuan, Amine Bermak, Dong Chen

This paper presents the first end-to-end network for exemplar-based video colorization.

Colorization Semantic correspondence

330

Paper
Code

Document Rectification and Illumination Correction using a Patch-based CNN

1 code implementation • 20 Sep 2019 • Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander

We propose a novel learning method to rectify document images with various distortion types from a single input image.

Optical Character Recognition (OCR)

312

Paper
Code

FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search

2 code implementations • ICCV 2021 • Xiangxiang Chu, Bo Zhang, Ruijun Xu

We demonstrate that this is crucial for improving the confidence of models' ranking.

Ranked #3 on Neural Architecture Search on CIFAR-10 (using extra training data)

Fairness Image Classification +1

300

Paper
Code

Towards Knowledge-driven Autonomous Driving

1 code implementation • 7 Dec 2023 • Xin Li, Yeqi Bai, Pinlong Cai, Licheng Wen, Daocheng Fu, Bo Zhang, Xuemeng Yang, Xinyu Cai, Tao Ma, Jianfei Guo, Xing Gao, Min Dou, Yikang Li, Botian Shi, Yong liu, Liang He, Yu Qiao

This paper explores the emerging knowledge-driven autonomous driving technologies.

Autonomous Driving Neural Rendering

295

Paper
Code

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

1 code implementation • 1 Mar 2024 • Xiangxiang Chu, Jianlin Su, Bo Zhang, Chunhua Shen

Large language models are built on top of a transformer-based architecture to process textual inputs.

Image Classification Image Generation +2

278

Paper
Code

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation

2 code implementations • CVPR 2021 • Pan Zhang, Bo Zhang, Ting Zhang, Dong Chen, Yong Wang, Fang Wen

In this paper, we rely on representative prototypes, the feature centroids of classes, to address the two issues for unsupervised domain adaptation.

Ranked #10 on Semantic Segmentation on GTAV-to-Cityscapes Labels

Pseudo Label Semantic Segmentation +2

276

Paper
Code

MoGA: Searching Beyond MobileNetV3

2 code implementations • 4 Aug 2019 • Xiangxiang Chu, Bo Zhang, Ruijun Xu

Bearing the target hardware in mind, we propose the first Mobile GPU-Aware (MoGA) neural architecture search in order to be precisely tailored for real-world applications.

Ranked #854 on Image Classification on ImageNet

Image Classification Neural Architecture Search

228

Paper
Code

Triple Generative Adversarial Nets

1 code implementation • NeurIPS 2017 • Chongxuan Li, Kun Xu, Jun Zhu, Bo Zhang

Generative Adversarial Nets (GANs) have shown promise in image generation and semi-supervised learning (SSL).

Image Generation

224

Paper
Code

3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation

1 code implementation • 12 Sep 2022 • Junshu Tang, Bo Zhang, Binxin Yang, Ting Zhang, Dong Chen, Lizhuang Ma, Fang Wen

In contrast to the traditional avatar creation pipeline which is a costly process, contemporary generative approaches directly learn the data distribution from photographs.

3D Face Animation Disentanglement +3

205

Paper
Code

Conditional Positional Encodings for Vision Transformers

2 code implementations • 22 Feb 2021 • Xiangxiang Chu, Zhi Tian, Bo Zhang, Xinlong Wang, Chunhua Shen

Built on PEG, we present Conditional Position encoding Vision Transformer (CPVT).

AutoML Classification +4

177

Paper
Code

Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search

1 code implementation • ECCV 2020 • Xiangxiang Chu, Tianbao Zhou, Bo Zhang, Jixiang Li

Differentiable Architecture Search (DARTS) is now a widely disseminated weight-sharing neural architecture search method.

Ranked #24 on Neural Architecture Search on CIFAR-10

Neural Architecture Search

168

Paper
Code

Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models

2 code implementations • ICLR 2022 • Fan Bao, Chongxuan Li, Jun Zhu, Bo Zhang

In this work, we present a surprising result that both the optimal reverse variance and the corresponding optimal KL divergence of a DPM have analytic forms w. r. t.

166

Paper
Code

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

1 code implementation • 6 Feb 2024 • Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi

With the development of implicit rendering technology and in-depth research on using generative models to produce data at scale, we propose OASim, an open and adaptive simulator and autonomous driving data generator based on implicit neural rendering.

Autonomous Driving Neural Rendering +1

164

Paper
Code

Blind Geometric Distortion Correction on Images Through Deep Learning

1 code implementation • CVPR 2019 • Xiaoyu Li, Bo Zhang, Pedro V. Sander, Jing Liao

We propose the first general framework to automatically correct different types of geometric distortion in a single input image.

159

Paper
Code

SCARLET-NAS: Bridging the Gap between Stability and Scalability in Weight-sharing Neural Architecture Search

1 code implementation • 16 Aug 2019 • Xiangxiang Chu, Bo Zhang, Qingyuan Li, Ruijun Xu, Xudong Li

To discover powerful yet compact models is an important goal of neural architecture search.

Ranked #76 on Neural Architecture Search on ImageNet

Image Classification Neural Architecture Search

141

Paper
Code

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion

1 code implementation • CVPR 2021 • Chulin Xie, Chuxin Wang, Bo Zhang, Hao Yang, Dong Chen, Fang Wen

In this paper, we proposed a novel Style-based Point Generator with Adversarial Rendering (SpareNet) for point cloud completion.

Ranked #1 on Point Cloud Completion on ShapeNet (Earth Mover's Distance metric)

Point Cloud Completion

134

Paper
Code

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

1 code implementation • 19 Feb 2024 • Renqiu Xia, Bo Zhang, Hancheng Ye, Xiangchao Yan, Qi Liu, Hongbin Zhou, Zijun Chen, Min Dou, Botian Shi, Junchi Yan, Yu Qiao

Recently, many versatile Multi-modal Large Language Models (MLLMs) have emerged continuously.

125

Paper
Code

FastPillars: A Deployment-friendly Pillar-based 3D Detector

1 code implementation • 5 Feb 2023 • Sifan Zhou, Zhi Tian, Xiangxiang Chu, Xinyu Zhang, Bo Zhang, Xiaobo Lu, Chengjian Feng, Zequn Jie, Patrick Yin Chiang, Lin Ma

The deployment of 3D detectors strikes one of the major challenges in real-world self-driving scenarios.

3D Object Detection object-detection

121

Paper
Code

Delving into Shape-aware Zero-shot Semantic Segmentation

1 code implementation • CVPR 2023 • Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy.

Image Segmentation Segmentation +2

108

Paper
Code

Image Composition Assessment with Saliency-augmented Multi-pattern Pooling

1 code implementation • 7 Apr 2021 • Bo Zhang, Li Niu, Liqing Zhang

Image composition assessment is crucial in aesthetic assessment, which aims to assess the overall composition quality of a given image.

Ranked #1 on Aesthetics Quality Assessment on CADB

Aesthetics Quality Assessment

103

Paper
Code

ControlCom: Controllable Image Composition using Diffusion Model

1 code implementation • 19 Aug 2023 • Bo Zhang, Yuxuan Duan, Jun Lan, Yan Hong, Huijia Zhu, Weiqiang Wang, Li Niu

To address these challenges, we propose a controllable image composition method that unifies four tasks in one diffusion model: image blending, image harmonization, view synthesis, and generative composition.

Image Harmonization

101

Paper
Code

Performance-aware Approximation of Global Channel Pruning for Multitask CNNs

1 code implementation • 21 Mar 2023 • Hancheng Ye, Bo Zhang, Tao Chen, Jiayuan Fan, Bin Wang

Global channel pruning (GCP) aims to remove a subset of channels (filters) across different layers from a deep model without hurting the performance.

Model Compression

100

Paper
Code

Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models

1 code implementation • 15 Jun 2022 • Fan Bao, Chongxuan Li, Jiacheng Sun, Jun Zhu, Bo Zhang

Thus, the generation performance on a subset of timesteps is crucial, which is greatly influenced by the covariance design in DPMs.

Computational Efficiency

Paper
Code

Disentangled Inference for GANs with Latently Invertible Autoencoder

3 code implementations • 19 Jun 2019 • Jiapeng Zhu, Deli Zhao, Bo Zhang, Bolei Zhou

In this paper, we show that the entanglement of the latent space for the VAE/GAN framework poses the main challenge for encoder learning.

Paper
Code

SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser

1 code implementation • 22 Oct 2022 • Yue Zhang, Bo Zhang, Zhenghua Li, Zuyi Bao, Chen Li, Min Zhang

Then, we obtain parse trees of the source incorrect sentences by projecting trees of the target correct sentences.

Ranked #5 on Grammatical Error Correction on CoNLL-2014 Shared Task

Grammatical Error Correction Syntax Representation

Paper
Code

OPA: Object Placement Assessment Dataset

3 code implementations • 5 Jul 2021 • Liu Liu, Zhenchen Liu, Bo Zhang, Jiangtong Li, Li Niu, Qingyang Liu, Liqing Zhang

Image composition aims to generate realistic composite image by inserting an object from one image into another background image, where the placement (e. g., location, size, occlusion) of inserted object may be unreasonable, which would significantly degrade the quality of the composite image.

Object

Paper
Code

Graphical Generative Adversarial Networks

1 code implementation • NeurIPS 2018 • Chongxuan Li, Max Welling, Jun Zhu, Bo Zhang

We propose Graphical Generative Adversarial Networks (Graphical-GAN) to model structured data.

Paper
Code

Lenna: Language Enhanced Reasoning Detection Assistant

1 code implementation • 5 Dec 2023 • Fei Wei, Xinyu Zhang, Ailing Zhang, Bo Zhang, Xiangxiang Chu

To evaluate the reasoning capability of Lenna, we construct a ReasonDet dataset to measure its performance on reasoning-based detection.

World Knowledge

Paper
Code

NaSGEC: a Multi-Domain Chinese Grammatical Error Correction Dataset from Native Speaker Texts

1 code implementation • 25 May 2023 • Yue Zhang, Bo Zhang, Haochen Jiang, Zhenghua Li, Chen Li, Fei Huang, Min Zhang

We introduce NaSGEC, a new dataset to facilitate research on Chinese grammatical error correction (CGEC) for native speaker texts from multiple domains.

Grammatical Error Correction

Paper
Code

Ternary Weight Networks

5 code implementations • 16 May 2016 • Fengfu Li, Bin Liu, Xiaoxing Wang, Bo Zhang, Junchi Yan

We present a memory and computation efficient ternary weight networks (TWNs) - with weights constrained to +1, 0 and -1.

Model Compression object-detection +1

Paper
Code

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

1 code implementation • ICLR 2021 • Xiangxiang Chu, Xiaoxing Wang, Bo Zhang, Shun Lu, Xiaolin Wei, Junchi Yan

We call this approach DARTS-.

Ranked #20 on Neural Architecture Search on NAS-Bench-201, CIFAR-10

Neural Architecture Search

Paper
Code

Shadow Generation for Composite Image Using Diffusion model

1 code implementation • 22 Mar 2024 • Qingyang Liu, Junqi You, Jianting Wang, Xinhao Tao, Bo Zhang, Li Niu

In the realm of image composition, generating realistic shadow for the inserted foreground remains a formidable challenge.

Image-to-Image Translation

Paper
Code

Adversarial Texture for Fooling Person Detectors in the Physical World

1 code implementation • CVPR 2022 • Zhanhao Hu, Siyuan Huang, Xiaopei Zhu, Fuchun Sun, Bo Zhang, Xiaolin Hu

Experiments showed that these clothes could fool person detectors in the physical world.

Paper
Code

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

1 code implementation • 25 Apr 2022 • Hao Ouyang, Bo Zhang, Pan Zhang, Hao Yang, Jiaolong Yang, Dong Chen, Qifeng Chen, Fang Wen

We propose pose-guided multiplane image (MPI) synthesis which can render an animatable character in real scenes with photorealistic quality.

Image-to-Image Translation Neural Rendering +1

Paper
Code

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection

1 code implementation • 29 Jan 2024 • Sifan Zhou, Liang Li, Xinyu Zhang, Bo Zhang, Shipeng Bai, Miao Sun, Ziyu Zhao, Xiaobo Lu, Xiangxiang Chu

To our knowledge, for the very first time in lidar-based 3D detection tasks, the PTQ INT8 model's accuracy is almost the same as the FP32 model while enjoying $3\times$ inference speedup.

3D Object Detection Autonomous Vehicles +3

Paper
Code

Deep Sketch-guided Cartoon Video Inbetweening

1 code implementation • 10 Aug 2020 • Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander

The key idea of the proposed approach is to estimate the dense cross-domain correspondence between the sketch and cartoon video frames, and employ a blending module with occlusion estimation to synthesize the middle frame guided by the sketch.

Image Generation Occlusion Estimation

Paper
Code

Curriculum-style Local-to-global Adaptation for Cross-domain Remote Sensing Image Segmentation

1 code implementation • 3 Mar 2022 • Bo Zhang, Tao Chen, Bin Wang

Although domain adaptation has been extensively studied in natural image-based segmentation task, the research on cross-domain segmentation for very high resolution (VHR) remote sensing images (RSIs) still remains underexplored.

Domain Adaptation Image Segmentation +2

Paper
Code

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

1 code implementation • 22 Feb 2024 • Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li

Specifically, we propose, for the first time to our best knowledge, post-training approaches for task-agnostic and task-specific expert pruning and skipping of MoE LLMs, tailored to improve deployment efficiency while maintaining model performance across a wide range of tasks.

Paper
Code

Smooth Neighbors on Teacher Graphs for Semi-supervised Learning

1 code implementation • CVPR 2018 • Yucen Luo, Jun Zhu, Mengxi Li, Yong Ren, Bo Zhang

In SNTG, a graph is constructed based on the predictions of the teacher model, i. e., the implicit self-ensemble of models.

Paper
Code

Triple Generative Adversarial Networks

1 code implementation • 20 Dec 2019 • Chongxuan Li, Kun Xu, Jiashuo Liu, Jun Zhu, Bo Zhang

It is formulated as a three-player minimax game consisting of a generator, a classifier and a discriminator, and therefore is referred to as Triple Generative Adversarial Network (Triple-GAN).

Ranked #1 on Semi-Supervised Image Classification on SVHN, 500 Labels

Classification Conditional Image Generation +4

Paper
Code

Microshift: An Efficient Image Compression Algorithm for Hardware

1 code implementation • 20 Apr 2021 • Bo Zhang, Pedro V. Sander, Chi-Ying Tsui, Amine Bermak

In our method, the image is first micro-shifted, then the sub-quantized values are further compressed.

Data Compression Image Compression

Paper
Code

Fast Deep Matting for Portrait Animation on Mobile Phone

1 code implementation • 26 Jul 2017 • Bingke Zhu, Yingying Chen, Jinqiao Wang, Si Liu, Bo Zhang, Ming Tang

Finally, an automatic portrait animation system based on fast deep matting is built on mobile devices, which does not need any interaction and can realize real-time matting with 15 fps.

Image Matting Video Editing

Paper
Code

Multi-Objective Reinforced Evolution in Mobile Neural Architecture Search

4 code implementations • 4 Jan 2019 • Xiangxiang Chu, Bo Zhang, Ruijun Xu, Hailong Ma

In this paper, we present a new multi-objective oriented algorithm called MoreMNAS (Multi-Objective Reinforced Evolution in Mobile Neural Architecture Search) by leveraging good virtues from both EA and RL.

Image Classification Neural Architecture Search +1

Paper
Code

MixPath: A Unified Approach for One-shot Neural Architecture Search

1 code implementation • ICCV 2023 • Xiangxiang Chu, Shun Lu, Xudong Li, Bo Zhang

However, current two-stage neural architecture search methods are mainly limited to single-path search spaces.

Neural Architecture Search

Paper
Code

Improving Seq2Seq Grammatical Error Correction via Decoding Interventions

1 code implementation • 23 Oct 2023 • Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang

In this paper, we propose a unified decoding intervention framework that employs an external critic to assess the appropriateness of the token to be generated incrementally, and then dynamically influence the choice of the next token.

Ranked #1 on Grammatical Error Correction on MuCGEC

Grammatical Error Correction Language Modelling

Paper
Code

Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters

1 code implementation • ECCV 2020 • Haoyu Liang, Zhihao Ouyang, Yuyuan Zeng, Hang Su, Zihao He, Shu-Tao Xia, Jun Zhu, Bo Zhang

Most existing works attempt post-hoc interpretation on a pre-trained model, while neglecting to reduce the entanglement underlying the model.

Object Localization

Paper
Code

A Matrix-in-matrix Neural Network for Image Super Resolution

1 code implementation • 19 Mar 2019 • Hailong Ma, Xiangxiang Chu, Shaohua Wan, Bo Zhang

In recent years, deep learning methods have achieved impressive results with higher peak signal-to-noise ratio in single image super-resolution (SISR) tasks by utilizing deeper layers.

Image Super-Resolution

Paper
Code

Noisy Differentiable Architecture Search

1 code implementation • 7 May 2020 • Xiangxiang Chu, Bo Zhang

However, it largely suffers from the well-known performance collapse issue due to the aggregation of skip connections.

Ranked #12 on Neural Architecture Search on CIFAR-10

Image Classification Neural Architecture Search

Paper
Code

Contrastive Cross-domain Recommendation in Matching

1 code implementation • 2 Dec 2021 • Ruobing Xie, Qi Liu, Liangdong Wang, Shukai Liu, Bo Zhang, Leyu Lin

Cross-domain recommendation (CDR) aims to provide better recommendation results in the target domain with the help of the source domain, which is widely used and explored in real-world systems.

Contrastive Learning Representation Learning +1

Paper
Code

Simultaneously Optimizing Perturbations and Positions for Black-box Adversarial Patch Attacks

1 code implementation • 26 Dec 2022 • Xingxing Wei, Ying Guo, Jie Yu, Bo Zhang

Extensive experiments are conducted on the Face Recognition (FR) task, and results on four representative FR models show that our method can significantly improve the attack success rate and query efficiency.

Face Recognition Position +2

Paper
Code

Beyond Clicks: Modeling Multi-Relational Item Graph for Session-Based Target Behavior Prediction

1 code implementation • 19 Feb 2020 • Wen Wang, Wei zhang, Shukai Liu, Qi Liu, Bo Zhang, Leyu Lin, Hongyuan Zha

Specifically, we build a Multi-Relational Item Graph (MRIG) based on all behavior sequences from all sessions, involving target and auxiliary behavior types.

Representation Learning

Paper
Code

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

1 code implementation • 31 Oct 2023 • Hui Ma, Jian Wang, Hongfei Lin, Bo Zhang, Yijia Zhang, Bo Xu

Emotion recognition in conversations (ERC), the task of recognizing the emotion of each utterance in a conversation, is crucial for building empathetic machines.

Ranked #1 on Emotion Recognition in Conversation on IEMOCAP

Emotion Recognition in Conversation Multimodal Emotion Recognition

Paper
Code

Discriminatively Boosted Image Clustering with Fully Convolutional Auto-Encoders

2 code implementations • 23 Mar 2017 • Fengfu Li, Hong Qiao, Bo Zhang, Xuanyang Xi

Traditional image clustering methods take a two-step approach, feature learning and clustering, sequentially.

Ranked #4 on Image Clustering on Coil-20

Clustering Image Clustering

Paper
Code

Multiplex Behavioral Relation Learning for Recommendation via Memory Augmented Transformer Network

1 code implementation • 8 Oct 2021 • Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Bo Zhang, Liefeng Bo

The overlook of multiplex behavior relations can hardly recognize the multi-modal contextual signals across different types of interactions, which limit the feasibility of current recommendation methods.

Recommendation Systems Relation +1

Paper
Code

Human-centric Image Cropping with Partition-aware and Content-preserving Features

1 code implementation • 21 Jul 2022 • Bo Zhang, Li Niu, Xing Zhao, Liqing Zhang

Image cropping aims to find visually appealing crops in an image, which is an important yet challenging task.

Image Cropping

Paper
Code

Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling

1 code implementation • CVPR 2023 • Zhanhao Hu, Wenda Chu, Xiaopei Zhu, HUI ZHANG, Bo Zhang, Xiaolin Hu

In order to craft natural-looking adversarial clothes that can evade person detectors at multiple viewing angles, we propose adversarial camouflage textures (AdvCaT) that resemble one kind of the typical textures of daily clothes, camouflage textures.

Paper
Code

Max-margin Deep Generative Models

2 code implementations • NeurIPS 2015 • Chongxuan Li, Jun Zhu, Tianlin Shi, Bo Zhang

Deep generative models (DGMs) are effective on learning multilayered representations of complex data and performing inference of input data by exploring the generative ability.

Paper
Code

Foreground Object Search by Distilling Composite Image Feature

1 code implementation • ICCV 2023 • Bo Zhang, Jiacheng Sui, Li Niu

Additionally, previous works did not release their datasets, so we contribute two datasets for FOS task: S-FOSD dataset with synthetic composite images and R-FOSD dataset with real composite images.

Object Retrieval

Paper
Code

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding

1 code implementation • 20 Sep 2023 • Renqiu Xia, Bo Zhang, Haoyang Peng, Hancheng Ye, Xiangchao Yan, Peng Ye, Botian Shi, Yu Qiao, Junchi Yan

Charts are common in literature across different scientific fields, conveying rich information easily accessible to readers.

Ranked #17 on Chart Question Answering on ChartQA (using extra training data)

Chart Question Answering Language Modelling +2

Paper
Code

Pruning from Scratch

1 code implementation • 27 Sep 2019 • Yulong Wang, Xiaolu Zhang, Lingxi Xie, Jun Zhou, Hang Su, Bo Zhang, Xiaolin Hu

Network pruning is an important research field aiming at reducing computational costs of neural networks.

Network Pruning

Paper
Code

A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents

1 code implementation • NAACL 2021 • Qingrong Xia, Bo Zhang, Rui Wang, Zhenghua Li, Yue Zhang, Fei Huang, Luo Si, Min Zhang

Fine-grained opinion mining (OM) has achieved increasing attraction in the natural language processing (NLP) community, which aims to find the opinion structures of {``}Who expressed what opinions towards what{''} in one sentence.

Multi-Task Learning Opinion Mining +1

Paper
Code

Fast Lossless Neural Compression with Integer-Only Discrete Flows

1 code implementation • 17 Jun 2022 • Siyu Wang, Jianfei Chen, Chongxuan Li, Jun Zhu, Bo Zhang

In this work, we propose Integer-only Discrete Flows (IODF), an efficient neural compressor with integer-only arithmetic.

Quantization

Paper
Code

Collaborative Filtering with User-Item Co-Autoregressive Models

1 code implementation • 21 Dec 2016 • Chao Du, Chongxuan Li, Yin Zheng, Jun Zhu, Bo Zhang

Deep neural networks have shown promise in collaborative filtering (CF).

Collaborative Filtering

Paper
Code

Function Space Particle Optimization for Bayesian Neural Networks

1 code implementation • ICLR 2019 • Ziyu Wang, Tongzheng Ren, Jun Zhu, Bo Zhang

While Bayesian neural networks (BNNs) have drawn increasing attention, their posterior inference remains challenging, due to the high-dimensional and over-parameterized nature.

Variational Inference

Paper
Code

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification

1 code implementation • 2 Jul 2022 • Bo Zhang, Jiakang Yuan, Baopu Li, Tao Chen, Jiayuan Fan, Botian Shi

Few-shot fine-grained learning aims to classify a query image into one of a set of support categories with fine-grained differences.

Fine-Grained Image Classification Object +1

Paper
Code

UniDA3D: Unified Domain Adaptive 3D Semantic Segmentation Pipeline

1 code implementation • 20 Dec 2022 • Ben Fei, Siyuan Huang, Jiakang Yuan, Botian Shi, Bo Zhang, Weidong Yang, Min Dou, Yikang Li

Different from previous studies that only focus on a single adaptation task, UniDA3D can tackle several adaptation tasks in 3D segmentation field, by designing a unified source-and-target active sampling strategy, which selects a maximally-informative subset from both source and target domains for effective model adaptation.

3D Semantic Segmentation Domain Generalization +2

Paper
Code

Semi-crowdsourced Clustering with Deep Generative Models

1 code implementation • NeurIPS 2018 • Yucen Luo, Tian Tian, Jiaxin Shi, Jun Zhu, Bo Zhang

We propose a new approach that includes a deep generative model (DGM) to characterize low-level features of the data, and a statistical relational model for noisy pairwise annotations on its subset.

Clustering Variational Inference

Paper
Code

Aspect-specific Context Modeling for Aspect-based Sentiment Analysis

1 code implementation • 17 Jul 2022 • Fang Ma, Chen Zhang, Bo Zhang, Dawei Song

Extensive experimental results on standard and adversarial benchmarks for SC and OE demonstrate the effectiveness and robustness of the proposed method, yielding new state-of-the-art performance on OE and competitive performance on SC.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Code

Max-Margin Deep Generative Models for (Semi-)Supervised Learning

1 code implementation • 22 Nov 2016 • Chongxuan Li, Jun Zhu, Bo Zhang

Deep generative models (DGMs) are effective on learning multilayered representations of complex data and performing inference of input data by exploring the generative ability.

Missing Labels

Paper
Code

Bi-level Score Matching for Learning Energy-based Latent Variable Models

1 code implementation • NeurIPS 2020 • Fan Bao, Chongxuan Li, Kun Xu, Hang Su, Jun Zhu, Bo Zhang

This paper presents a bi-level score matching (BiSM) method to learn EBLVMs with general structures by reformulating SM as a bi-level optimization problem.

Rolling Shutter Correction Stochastic Optimization

Paper
Code

Deriving the stellar labels of LAMOST spectra with Stellar LAbel Machine (SLAM)

1 code implementation • 23 Aug 2019 • Bo Zhang, Chao Liu, Li-Cai Deng

To illustrate this capability, we test the performance of SLAM on stars ranging from Teff$\sim$4000 to $\sim$8000 K trained on LAMOST spectra and stellar labels.

Solar and Stellar Astrophysics Astrophysics of Galaxies Instrumentation and Methods for Astrophysics

Paper
Code

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

1 code implementation • 19 Sep 2021 • Bo Zhang, Tao Chen, Bin Wang, Ruoyao Li

Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source domain with rich labeled data to a new target domain with unlabeled data.

Object object-detection +2

Paper
Code

Learning Point-wise Abstaining Penalty for Point Cloud Anomaly Detection

1 code implementation • 19 Sep 2023 • Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao

We demonstrate that learning different abstaining penalties, apart from point-wise penalty, for different types of (synthesized) outliers can further improve the performance.

Anomaly Detection Autonomous Driving +1

Paper
Code

Understanding and Stabilizing GANs' Training Dynamics with Control Theory

1 code implementation • 29 Sep 2019 • Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang

There are existing efforts that model the training dynamics of GANs in the parameter space but the analysis cannot directly motivate practically effective stabilizing methods.

Ranked #37 on Image Generation on CIFAR-10 (Inception score metric)

Image Generation L2 Regularization

Paper
Code

Textbook Question Answering Under Instructor Guidance With Memory Networks

1 code implementation • CVPR 2018 • Juzheng Li, Hang Su, Jun Zhu, Siyu Wang, Bo Zhang

The machine thus performs as an instructor to extract the essay-level contradictions as the Guidance.

Question Answering

Paper
Code

LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection

1 code implementation • 9 Jan 2024 • Hongcheng Guo, Jian Yang, Jiaheng Liu, Jiaqi Bai, Boyang Wang, Zhoujun Li, Tieqiao Zheng, Bo Zhang, Junran Peng, Qi Tian

Log anomaly detection is a key component in the field of artificial intelligence for IT operations (AIOps).

Anomaly Detection

Paper
Code

A Closer Look at Few-Shot 3D Point Cloud Classification

1 code implementation • 31 Mar 2023 • Chuangguan Ye, Hongyuan Zhu, Bo Zhang, Tao Chen

In recent years, research on few-shot learning (FSL) has been fast-growing in the 2D image domain due to the less requirement for labeled training data and greater generalization for novel classes.

Few-Shot 3D Point Cloud Classification Few-Shot Learning +1

Paper
Code

Variational (Gradient) Estimate of the Score Function in Energy-based Latent Variable Models

1 code implementation • NeurIPS Workshop ICBINB 2020 • Fan Bao, Kun Xu, Chongxuan Li, Lanqing Hong, Jun Zhu, Bo Zhang

The learning and evaluation of energy-based latent variable models (EBLVMs) without any structural assumptions are highly challenging, because the true posteriors and the partition functions in such models are generally intractable.

Paper
Code

HCGrid: A Convolution-based Gridding Framework for RadioAstronomy in Hybrid Computing Environments

1 code implementation • 24 Dec 2020 • Hao Wang, Ce Yu, Bo Zhang, Jian Xiao, Qi Luo

Gridding operation, which is to map non-uniform data samples onto a uniformly distributedgrid, is one of the key steps in radio astronomical data reduction process.

Instrumentation and Methods for Astrophysics

Paper
Code

Estimating the Causal Effect of Early ArXiving on Paper Acceptance

2 code implementations • 24 Jun 2023 • Yanai Elazar, Jiayao Zhang, David Wadden, Bo Zhang, Noah A. Smith

However, since quality is a challenging construct to estimate, we use the negative outcome control method, using paper citation count as a control variable to debias the quality confounding effect.

Causal Inference

Paper
Code

Learning to Generate with Memory

1 code implementation • 24 Feb 2016 • Chongxuan Li, Jun Zhu, Bo Zhang

Memory units have been widely used to enrich the capabilities of deep networks on capturing long-term dependencies in reasoning and prediction tasks, but little investigation exists on deep generative models (DGMs) which are good at inferring high-level invariant representations from unlabeled data.

Density Estimation Image Generation +2

Paper
Code

Multi-objects Generation with Amortized Structural Regularization

1 code implementation • NeurIPS 2019 • Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang

Deep generative models (DGMs) have shown promise in image generation.

Image Generation

Paper
Code

Measuring Uncertainty through Bayesian Learning of Deep Neural Network Structure

1 code implementation • 22 Nov 2019 • Zhijie Deng, Yucen Luo, Jun Zhu, Bo Zhang

Bayesian neural networks (BNNs) augment deep networks with uncertainty quantification by Bayesian treatment of the network weights.

Bayesian Inference Neural Architecture Search +2

Paper
Code

Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation

1 code implementation • 7 Apr 2020 • Yingqiu Zhu, Yu Chen, Danyang Huang, Bo Zhang, Hansheng Wang

In each update step, given the gradient direction, we locally approximate the loss function by a standard quadratic function of the learning rate.

Paper
Code

OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution

1 code implementation • 7 Feb 2021 • Minfang Lu, Shuai Ning, Shuangrong Liu, Fengyang Sun, Bo Zhang, Bo Yang, Lin Wang

Black-box optimization (BBO) algorithms are concerned with finding the best solutions for problems with missing analytical details.

Paper
Code

Deep Bayesian Structure Networks

1 code implementation • 25 Sep 2019 • Zhijie Deng, Yucen Luo, Jun Zhu, Bo Zhang

Bayesian neural networks (BNNs) introduce uncertainty estimation to deep networks by performing Bayesian inference on network weights.

Bayesian Inference Neural Architecture Search +1

Paper
Code

Recognizing Object by Components with Human Prior Knowledge Enhances Adversarial Robustness of Deep Neural Networks

1 code implementation • 4 Dec 2022 • Xiao Li, Ziqi Wang, Bo Zhang, Fuchun Sun, Xiaolin Hu

The first stage of ROCK corresponds to the process of decomposing objects into parts in human vision.

Adversarial Robustness Inductive Bias +2

Paper
Code

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

1 code implementation • 23 Mar 2024 • Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang

Recent Vision Transformer Compression (VTC) works mainly follow a two-stage scheme, where the importance score of each model unit is first evaluated or preset in each submodule, followed by the sparsity score evaluation according to the target sparsity constraint.

Dimensionality Reduction

Paper
Code

A Wasserstein Minimum Velocity Approach to Learning Unnormalized Models

1 code implementation • pproximateinference AABI Symposium 2019 • Ziyu Wang, Shuyu Cheng, Yueru Li, Jun Zhu, Bo Zhang

Score matching provides an effective approach to learning flexible unnormalized models, but its scalability is limited by the need to evaluate a second-order derivative.

Paper
Code

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors

1 code implementation • 15 Sep 2023 • Yancheng Cai, Bo Zhang, Baopu Li, Tao Chen, Hongliang Yan, Jingdong Zhang, Jiahao Xu

Therefore, we focus on cross-domain background feature alignment while minimizing the influence of foreground features on the cross-domain alignment stage.

Pedestrian Detection

Paper
Code

MultiSPANS: A Multi-range Spatial-Temporal Transformer Network for Traffic Forecast via Structural Entropy Optimization

1 code implementation • 6 Nov 2023 • Dongcheng Zou, Senzhang Wang, Xuefeng Li, Hao Peng, Yuandong Wang, Chunyang Liu, Kehua Sheng, Bo Zhang

Based on this, we propose a relative structural entropy-based position encoding and a multi-head attention masking scheme based on multi-layer encoding trees.

Management Position +2

Paper
Code

Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging

1 code implementation • 28 Feb 2024 • Wei zhang, Hongcheng Guo, Anjie Le, Jian Yang, Jiaheng Liu, Zhoujun Li, Tieqiao Zheng, Shi Xu, Runqiang Zang, Liangfan Zheng, Bo Zhang

Log parsing, which entails transforming raw log messages into structured templates, constitutes a critical phase in the automation of log analytics.

Log Parsing

Paper
Code

Permutation-equivariant and Proximity-aware Graph Neural Networks with Stochastic Message Passing

1 code implementation • 5 Sep 2020 • Ziwei Zhang, Chenhao Niu, Peng Cui, Jian Pei, Bo Zhang, Wenwu Zhu

Graph neural networks (GNNs) are emerging machine learning models on graphs.

Graph Mining Graph Reconstruction +2

Paper
Code

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

1 code implementation • NeurIPS 2021 • Fan Bao, Guoqiang Wu, Chongxuan Li, Jun Zhu, Bo Zhang

Our results can explain some mysterious behaviours of the bilevel programming in practice, for instance, overfitting to the validation set.

Hyperparameter Optimization

Paper
Code

Language-Driven Anchors for Zero-Shot Adversarial Robustness

1 code implementation • 30 Jan 2023 • Xiao Li, Wei zhang, Yining Liu, Zhanhao Hu, Bo Zhang, Xiaolin Hu

Previous researches mainly focus on improving adversarial robustness in the fully supervised setting, leaving the challenging domain of zero-shot adversarial robustness an open question.

Adversarial Defense Adversarial Robustness +3

Paper
Code

ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation

1 code implementation • 1 Aug 2023 • Bo Zhang, Jian Wang, Hui Ma, Bo Xu, Hongfei Lin

To overcome this challenge, we propose an innovative multimodal framework, called ZRIGF, which assimilates image-grounded information for dialogue generation in zero-resource situations.

Dialogue Generation Response Generation

Paper
Code

Semantic Cluster Unary Loss for Efficient Deep Hashing

1 code implementation • 15 May 2018 • Shifeng Zhang, Jianmin Li, Bo Zhang

The resultant hashcodes form several compact clusters, which means hashcodes in the same cluster have similar semantic information.

Deep Hashing Information Retrieval

Paper
Code

Message Passing Stein Variational Gradient Descent

no code implementations • ICML 2018 • Jingwei Zhuo, Chang Liu, Jiaxin Shi, Jun Zhu, Ning Chen, Bo Zhang

Stein variational gradient descent (SVGD) is a recently proposed particle-based Bayesian inference method, which has attracted a lot of interest due to its remarkable approximation ability and particle efficiency compared to traditional variational inference and Markov Chain Monte Carlo methods.

Bayesian Inference Variational Inference

Paper
Add Code

Interlinked Convolutional Neural Networks for Face Parsing

no code implementations • 7 Jun 2018 • Yisu Zhou, Xiaolin Hu, Bo Zhang

It amounts to labeling each pixel with appropriate facial parts such as eyes and nose.

Face Parsing

Paper
Add Code

Adversarial adaptive 1-D convolutional neural networks for bearing fault diagnosis under varying working condition

no code implementations • 1 May 2018 • Bo Zhang, Wei Li, Jie Hao, Xiao-Li Li, Meng Zhang

The layers between the source and target feature extractor are partially untied during the training stage to take both training efficiency and domain adaptation into consideration.

Domain Adaptation

Paper
Add Code

SAM: Semantic Attribute Modulation for Language Modeling and Style Variation

no code implementations • 1 Jul 2017 • Wenbo Hu, Lifeng Hua, Lei LI, Hang Su, Tian Wang, Ning Chen, Bo Zhang

This paper presents a Semantic Attribute Modulation (SAM) for language modeling and style variation.

Attribute Language Modelling

Paper
Add Code

PBODL : Parallel Bayesian Online Deep Learning for Click-Through Rate Prediction in Tencent Advertising System

no code implementations • 4 Jul 2017 • Xun Liu, Wei Xue, Lei Xiao, Bo Zhang

Then we extend the model family to a variety of bayesian online models with increasing feature embedding capabilities, such as Sparse-MLP, FM-MLP and FFM-MLP.

Click-Through Rate Prediction

Paper
Add Code

Improving Interpretability of Deep Neural Networks with Semantic Information

no code implementations • CVPR 2017 • Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang

Interpretability of deep neural networks (DNNs) is essential since it enables users to understand the overall strengths and weaknesses of the models, conveys an understanding of how the models will behave in the future, and how to diagnose and correct potential problems.

Action Recognition Temporal Action Localization +1

Paper
Add Code

Big Learning with Bayesian Methods

no code implementations • 24 Nov 2014 • Jun Zhu, Jianfei Chen, Wen-Bo Hu, Bo Zhang

Explosive growth in data and availability of cheap computing resources have sparked increasing interest in Big learning, an emerging subfield that studies scalable machine learning algorithms, systems, and applications with Big Data.

Bayesian Inference BIG-bench Machine Learning +1

Paper
Add Code

Effective Deterministic Initialization for $k$-Means-Like Methods via Local Density Peaks Searching

no code implementations • 21 Nov 2016 • Fengfu Li, Hong Qiao, Bo Zhang

Based on these two components, we search for the local density peaks which are characterized with high local densities and high LDIs to deal with 1) and 2).

Clustering Object Categorization

Paper
Add Code

Fast Sampling for Bayesian Max-Margin Models

no code implementations • 27 Apr 2015 • Wenbo Hu, Jun Zhu, Bo Zhang

Bayesian max-margin models have shown superiority in various practical applications, such as text categorization, collaborative prediction, social network link prediction and crowdsourcing, and they conjoin the flexibility of Bayesian modeling and predictive strengths of max-margin learning.

Link Prediction Text Categorization

Paper
Add Code

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization

no code implementations • 28 Sep 2016 • Shifeng Zhang, Jianmin Li, Jinma Guo, Bo Zhang

Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed.

Clustering Retrieval

Paper
Add Code

Bootstrapping Face Detection with Hard Negative Examples

no code implementations • 7 Aug 2016 • Shaohua Wan, Zhijun Chen, Tao Zhang, Bo Zhang, Kong-kat Wong

Recently significant performance improvement in face detection was made possible by deeply trained convolutional networks.

Face Detection

Paper
Add Code

A New Manifold Distance Measure for Visual Object Categorization

no code implementations • 12 May 2016 • Fengfu Li, Xiayuan Huang, Hong Qiao, Bo Zhang

The proposed distance is more robust to rotations and translations of images than the traditional manifold distance and the CW-SSIM index based distance.

Clustering Object +3

Paper
Add Code

A Novel Biologically Mechanism-Based Visual Cognition Model--Automatic Extraction of Semantics, Formation of Integrated Concepts and Re-selection Features for Ambiguity

no code implementations • 25 Mar 2016 • Peijie Yin, Hong Qiao, Wei Wu, Lu Qi, YinLin Li, Shanlin Zhong, Bo Zhang

In general, the robustness and precision of recognition is one of the key problems for object recognition models.

Object Recognition

Paper
Add Code

Learning Deep Generative Models with Doubly Stochastic MCMC

no code implementations • 15 Jun 2015 • Chao Du, Jun Zhu, Bo Zhang

We present doubly stochastic gradient MCMC, a simple and generic method for (approximate) Bayesian inference of deep generative models (DGMs) in a collapsed continuous parameter space.

Bayesian Inference Density Estimation +1

Paper
Add Code

Fast Parallel SVM using Data Augmentation

no code implementations • 24 Dec 2015 • Hugh Perkins, Minjie Xu, Jun Zhu, Bo Zhang

As one of the most popular classifiers, linear SVMs still have challenges in dealing with very large-scale problems, even though linear or sub-linear algorithms have been developed recently on single machines.

Bayesian Inference Data Augmentation

Paper
Add Code

Discriminative Nonparametric Latent Feature Relational Models with Data Augmentation

no code implementations • 7 Dec 2015 • Bei Chen, Ning Chen, Jun Zhu, Jiaming Song, Bo Zhang

We present a discriminative nonparametric latent feature relational model (LFRM) for link prediction to automatically infer the dimensionality of latent features.

Bayesian Inference Data Augmentation +1

Paper
Add Code

Jointly Modeling Topics and Intents with Global Order Structure

no code implementations • 7 Dec 2015 • Bei Chen, Jun Zhu, Nan Yang, Tian Tian, Ming Zhou, Bo Zhang

Modeling document structure is of great importance for discourse analysis and related applications.

Paper
Add Code

Dropout Training for Support Vector Machines

no code implementations • 16 Apr 2014 • Ning Chen, Jun Zhu, Jianfei Chen, Bo Zhang

To deal with the intractable expectation of the non-smooth hinge loss under corrupting distributions, we develop an iteratively re-weighted least square (IRLS) algorithm by exploring data augmentation techniques.

Data Augmentation

Paper
Add Code

Gibbs Max-margin Topic Models with Data Augmentation

no code implementations • 10 Oct 2013 • Jun Zhu, Ning Chen, Hugh Perkins, Bo Zhang

Gibbs max-margin supervised topic models minimize an expected margin loss, which is an upper bound of the existing margin loss derived from an expected prediction rule.

Data Augmentation General Classification +3

Paper
Add Code

Discriminative Relational Topic Models

no code implementations • 9 Oct 2013 • Ning Chen, Jun Zhu, Fei Xia, Bo Zhang

Many scientific and engineering fields involve analyzing network data.

Bayesian Inference Data Augmentation +1

Paper
Add Code

Improved Bayesian Logistic Supervised Topic Models with Data Augmentation

no code implementations • ACL 2013 • Jun Zhu, Xun Zheng, Bo Zhang

Supervised topic models with a logistic likelihood have two issues that potentially limit their practical use: 1) response variables are usually over-weighted by document word counts; and 2) existing variational inference methods make strict mean-field assumptions.

Bayesian Inference Data Augmentation +2

Paper
Add Code

Deep Structured Generative Models

no code implementations • 10 Jul 2018 • Kun Xu, Haoyu Liang, Jun Zhu, Hang Su, Bo Zhang

Deep generative models have shown promising results in generating realistic images, but it is still non-trivial to generate images with complicated structures.

Paper
Add Code

Learning Implicit Generative Models by Teaching Explicit Ones

no code implementations • ICLR 2019 • Chao Du, Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang

Implicit generative models are difficult to train as no explicit density functions are defined.

Bilevel Optimization Rolling Shutter Correction

Paper
Add Code

A Unified Framework for Community Detection and Network Representation Learning

no code implementations • 21 Nov 2016 • Cunchao Tu, Xiangkai Zeng, Hao Wang, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun, Bo Zhang, Leyu Lin

Network representation learning (NRL) aims to learn low-dimensional vectors for vertices in a network.

Social and Information Networks Physics and Society

Paper
Add Code

An Unified Intelligence-Communication Model for Multi-Agent System Part-I: Overview

no code implementations • 25 Nov 2018 • Bo Zhang, Bin Chen, Jinyu Yang, Wenjing Yang, Jiankang Zhang

Motivated by Shannon's model and recent rehabilitation of self-supervised artificial intelligence having a "World Model", this paper propose an unified intelligence-communication (UIC) model for describing a single agent and any multi-agent system.

Paper
Add Code

The Entropy of Artificial Intelligence and a Case Study of AlphaZero from Shannon's Perspective

no code implementations • 14 Dec 2018 • Bo Zhang, Bin Chen, Jin-lin Peng

Firstly, as there is a finite number of possibilities in the game, is there a quantifiable intelligence measurement for evaluating intelligent systems, e. g. AlphaZero?

Paper
Add Code

COSINE: Compressive Network Embedding on Large-scale Information Networks

no code implementations • 21 Dec 2018 • Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Maosong Sun, Zhichong Fang, Bo Zhang, Leyu Lin

There is recently a surge in approaches that learn low-dimensional embeddings of nodes in networks.

General Classification graph partitioning +3

Paper
Add Code

Supervised Treebank Conversion: Data and Approaches

no code implementations • ACL 2018 • Xinzhou Jiang, Zhenghua Li, Bo Zhang, Min Zhang, Sheng Li, Luo Si

Treebank conversion is a straightforward and effective way to exploit various heterogeneous treebanks for boosting parsing performance.

Dependency Parsing Multi-Task Learning +1

Paper
Add Code

Discriminative Deep Random Walk for Network Classification

no code implementations • ACL 2016 • Juzheng Li, Jun Zhu, Bo Zhang

Anomaly Detection Classification +2

Paper
Add Code

Segment-Level Sequence Modeling using Gated Recursive Semi-Markov Conditional Random Fields

no code implementations • ACL 2016 • Jingwei Zhuo, Yong Cao, Jun Zhu, Bo Zhang, Zaiqing Nie

Chunking Named Entity Recognition (NER)

Paper
Add Code

DeepExposure: Learning to Expose Photos with Asynchronously Reinforced Adversarial Learning

no code implementations • NeurIPS 2018 • Runsheng Yu, Wenyu Liu, Yasen Zhang, Zhi Qu, Deli Zhao, Bo Zhang

Based on these sub-images, a local exposure for each sub-image is automatically learned by virtue of policy network sequentially while the reward of learning is globally designed for striking a balance of overall exposures.

Paper
Add Code

Convolutional Neural Networks with Intra-Layer Recurrent Connections for Scene Labeling

no code implementations • NeurIPS 2015 • Ming Liang, Xiaolin Hu, Bo Zhang

We adopt a deep recurrent convolutional neural network (RCNN) for this task, which is originally proposed for object recognition.

Object Recognition Scene Labeling

Paper
Add Code

Distributed Bayesian Posterior Sampling via Moment Sharing

no code implementations • NeurIPS 2014 • Minjie Xu, Balaji Lakshminarayanan, Yee Whye Teh, Jun Zhu, Bo Zhang

We propose a distributed Markov chain Monte Carlo (MCMC) inference algorithm for large scale Bayesian posterior simulation.

regression

Paper
Add Code

Scalable Inference for Logistic-Normal Topic Models

no code implementations • NeurIPS 2013 • Jianfei Chen, Jun Zhu, Zi Wang, Xun Zheng, Bo Zhang

Logistic-normal topic models can effectively discover correlation structures among latent topics.

Data Augmentation Topic Models

Paper
Add Code

Super-Bit Locality-Sensitive Hashing

no code implementations • NeurIPS 2012 • Jianqiu Ji, Jianmin Li, Shuicheng Yan, Bo Zhang, Qi Tian

Sign-random-projection locality-sensitive hashing (SRP-LSH) is a probabilistic dimension reduction method which provides an unbiased estimate of angular similarity, yet suffers from the large variance of its estimation.

Dimensionality Reduction Retrieval

Paper
Add Code

Partially Observed Maximum Entropy Discrimination Markov Networks

no code implementations • NeurIPS 2008 • Jun Zhu, Eric P. Xing, Bo Zhang

Learning graphical models with hidden variables can offer semantic insights to complex data and lead to salient structured predictors without relying on expensive, sometime unattainable fully annotated training data.

Structured Prediction

Paper
Add Code

Interpret Neural Networks by Identifying Critical Data Routing Paths

no code implementations • CVPR 2018 • Yulong Wang, Hang Su, Bo Zhang, Xiaolin Hu

Interpretability of a deep neural network aims to explain the rationale behind its decisions and enable the users to understand the intelligent agents, which has become an important issue due to its importance in practical applications.

Paper
Add Code

To Relieve Your Headache of Training an MRF, Take AdVIL

no code implementations • ICLR 2020 • Chongxuan Li, Chao Du, Kun Xu, Max Welling, Jun Zhu, Bo Zhang

We propose a black-box algorithm called {\it Adversarial Variational Inference and Learning} (AdVIL) to perform inference and learning on a general Markov random field (MRF).

Variational Inference

Paper
Add Code

Orientational Pyramid Matching for Recognizing Indoor Scenes

no code implementations • CVPR 2014 • Lingxi Xie, Jingdong Wang, Baining Guo, Bo Zhang, Qi Tian

The novelty lies in that OPM uses the 3D orientations to form the pyramid and produce the pooling regions, which is unlike SPM that uses the spatial positions to form the pyramid.

General Classification Scene Classification +1

Paper
Add Code

RIDE: Reversal Invariant Descriptor Enhancement

no code implementations • ICCV 2015 • Lingxi Xie, Jingdong Wang, Weiyao Lin, Bo Zhang, Qi Tian

In many fine-grained object recognition datasets, image orientation (left/right) might vary from sample to sample.

Object Recognition

Paper
Add Code

Pairwise Teacher-Student Network for Semi-Supervised Hashing

no code implementations • 2 Feb 2019 • Shifeng Zhang, Jianmin Li, Bo Zhang

Hashing method maps similar high-dimensional data to binary hashcodes with smaller hamming distance, and it has received broad attention due to its low storage cost and fast retrieval speed.

Retrieval

Paper
Add Code

Extracting and Visualizing Semantic Relationships from Chinese Biomedical Text

no code implementations • PACLIC 2012 • Qingliang Miao, Shu Zhang, Bo Zhang, Hao Yu

Drug Discovery Relation Extraction

Paper
Add Code

Artificial Intelligence in Intelligent Tutoring Robots: A Systematic Review and Design Guidelines

no code implementations • 26 Feb 2019 • Jinyu Yang, Bo Zhang

We first analyse the environment of the ITR and propose a relationship model for describing interactions of ITR with the students, the social milieu and the curriculum.

Paper
Add Code

Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction

no code implementations • 22 Mar 2019 • Dongyang Zhao, Liang Zhang, Bo Zhang, Lizhou Zheng, Yongjun Bao, Weipeng Yan

To tackle this challenge, we propose a deep hierarchical reinforcement learning based recommendation framework, which consists of two components, i. e., high-level agent and low-level agent.

Hierarchical Reinforcement Learning Recommendation Systems +2

Paper
Add Code

Learning Semantic Vector Representations of Source Code via a Siamese Neural Network

no code implementations • 26 Apr 2019 • David Wehr, Halley Fede, Eleanor Pence, Bo Zhang, Guilherme Ferreira, John Walczyk, Joseph Hughes

The abundance of open-source code, coupled with the success of recent advances in deep learning for natural language processing, has given rise to a promising new application of machine learning to source code.

BIG-bench Machine Learning

Paper
Add Code

Curriculum Learning for Deep Generative Models with Clustering

no code implementations • 27 Jun 2019 • Deli Zhao, Jiapeng Zhu, Zhenfang Guo, Bo Zhang

The experiments on cat and human-face data validate that our algorithm is able to learn the optimal generative models (e. g. ProGAN) with respect to specified quality metrics for noisy data.

Clustering Generative Adversarial Network

Paper
Add Code

Multi-Task Deep Learning with Dynamic Programming for Embryo Early Development Stage Classification from Time-Lapse Videos

no code implementations • 22 Aug 2019 • Zihan Liu, Bo Huang, Yuqi Cui, Yifan Xu, Bo Zhang, Lixia Zhu, Yang Wang, Lei Jin, Dongrui Wu

Accurate classification of embryo early development stages can provide embryologists valuable information for assessing the embryo quality, and hence is critical to the success of IVF.

General Classification

Paper
Add Code

A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks

no code implementations • 17 Sep 2019 • Xiaoyu Yu, Yuwei Wang, Jie Miao, Ephrem Wu, Heng Zhang, Yu Meng, Bo Zhang, Biao Min, Dewei Chen, Jianlin Gao

Intensive computation is entering data centers with multiple workloads of deep learning.

Paper
Add Code

Hierarchy Response Learning for Neural Conversation Generation

no code implementations • IJCNLP 2019 • Bo Zhang, Xiao-Ming Zhang

Specifically, a hierarchical response generation (HRG) framework is proposed to capture the conversation intention in a natural and coherent way.

Response Generation

Paper
Add Code

Regularized Adversarial Sampling and Deep Time-aware Attention for Click-Through Rate Prediction

no code implementations • 3 Nov 2019 • Yikai Wang, Liang Zhang, Quanyu Dai, Fuchun Sun, Bo Zhang, Yang He, Weipeng Yan, Yongjun Bao

In deep CTR models, exploiting users' historical data is essential for learning users' behaviors and interests.

Click-Through Rate Prediction

Paper
Add Code

In Vitro Fertilization (IVF) Cumulative Pregnancy Rate Prediction from Basic Patient Characteristics

no code implementations • 10 Nov 2019 • Bo Zhang, Yuqi Cui, Meng Wang, Jingjing Li, Lei Jin, Dongrui Wu

Tens of millions of women suffer from infertility worldwide each year.

Clustering

Paper
Add Code

Automatic quality assessment for 2D fetal sonographic standard plane based on multi-task learning

no code implementations • 11 Dec 2019 • Hong Luo, Han Liu, Kejun Li, Bo Zhang

An essential criterion for FS image quality control is that all the essential anatomical structures in the section should appear full and remarkable with a clear boundary.

Image Quality Assessment Multi-Task Learning +1

Paper
Add Code

Realization of spatial sparseness by deep ReLU nets with massive data

no code implementations • 16 Dec 2019 • Charles K. Chui, Shao-Bo Lin, Bo Zhang, Ding-Xuan Zhou

The great success of deep learning poses urgent challenges for understanding its working mechanism and rationality.

Learning Theory

Paper
Add Code

Latent Variables on Spheres for Autoencoders in High Dimensions

no code implementations • 21 Dec 2019 • Deli Zhao, Jiapeng Zhu, Bo Zhang

Variational Auto-Encoder (VAE) has been widely applied as a fundamental generative model in machine learning.

Vocal Bursts Intensity Prediction

Paper
Add Code

Neural Architecture Search on Acoustic Scene Classification

no code implementations • 30 Dec 2019 • Jixiang Li, Chuming Liang, Bo Zhang, Zhao Wang, Fei Xiang, Xiangxiang Chu

Convolutional neural networks are widely adopted in Acoustic Scene Classification (ASC) tasks, but they generally carry a heavy computational burden.

Acoustic Scene Classification Classification +3

Paper
Add Code

User-Level Privacy-Preserving Federated Learning: Analysis and Performance Optimization

no code implementations • 29 Feb 2020 • Kang Wei, Jun Li, Ming Ding, Chuan Ma, Hang Su, Bo Zhang, H. Vincent Poor

According to our analysis, the UDP framework can realize $(\epsilon_{i}, \delta_{i})$-LDP for the $i$-th MT with adjustable privacy protection levels by varying the variances of the artificial noise processes.

Federated Learning Privacy Preserving

Paper
Add Code

Perceptual Image Super-Resolution with Progressive Adversarial Network

no code implementations • 8 Mar 2020 • Lone Wong, Deli Zhao, Shaohua Wan, Bo Zhang

Progressive growing enhances image resolution gradually, thereby preserving precision of recovered image.

Image Super-Resolution

Paper
Add Code

Syntax-Aware Opinion Role Labeling with Dependency Graph Convolutional Networks

no code implementations • ACL 2020 • Bo Zhang, Yue Zhang, Rui Wang, Zhenghua Li, Min Zhang

The experimental results show that syntactic information is highly valuable for ORL, and our final MTL model effectively boosts the F1 score by 9. 29 over the syntax-agnostic baseline.

Fine-Grained Opinion Analysis Multi-Task Learning

Paper
Add Code

Understanding and Stabilizing GANs' Training Dynamics Using Control Theory

no code implementations • ICML 2020 • Kun Xu, Chongxuan Li, Jun Zhu, Bo Zhang

There are existing efforts that model the training dynamics of GANs in the parameter space but the analysis cannot directly motivate practically effective stabilizing methods.

Paper
Add Code

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation

no code implementations • ICCV 2023 • Xiaoxing Wang, Xiangxiang Chu, Yuda Fan, Zhexi Zhang, Bo Zhang, Xiaokang Yang, Junchi Yan

Albeit being a prevalent architecture searching approach, differentiable architecture search (DARTS) is largely hindered by its substantial memory cost since the entire supernet resides in the memory.

Disentanglement Neural Architecture Search

Paper
Add Code

A Unified Mixture-View Framework for Unsupervised Representation Learning

no code implementations • 26 Nov 2020 • Xiangxiang Chu, Xiaohang Zhan, Bo Zhang

Recent unsupervised contrastive representation learning follows a Single Instance Multi-view (SIM) paradigm where positive pairs are usually constructed with intra-image data augmentation.

Data Augmentation object-detection +2

Paper
Add Code

Sex Differences in Severity and Mortality Among Patients With COVID-19: Evidence from Pooled Literature Analysis and Insights from Integrated Bioinformatic Analysis

no code implementations • 30 Mar 2020 • Xiyi Wei, Yu-Tian Xiao, Jian Wang, Rui Chen, Wei zhang, Yue Yang, Daojun Lv, Chao Qin, Di Gu, Bo Zhang, Weidong Chen, Jianquan Hou, Ninghong Song, Guohua Zeng, Shancheng Ren

Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences.

Paper
Add Code

Exploring the Galactic Anticenter substructure with LAMOST & Gaia DR2

no code implementations • 7 Jan 2021 • Jing Li, Xiang-Xiang Xue, Chao Liu, Bo Zhang, Hans-Walter Rix, Jeffrey L. Carlin, Chengqun Yang, Rene A. Mendez, Jing Zhong, Hao Tian, Lan Zhang, Yan Xu, Yaqian Wu, Gang Zhao, Ruixiang Chang

Their location in [$\alpha$/M] vs. [M/H] space is more metal poor than typical thin disk stars, with [$\alpha$/M] \textbf{lower} than the thick disk.

Astrophysics of Galaxies

Paper
Add Code

Robust Dynamical Decoupling for the Manipulation of a Spin Network via a Single Spin

no code implementations • 11 Jan 2021 • Xiaodong Yang, Yunrui Ge, Bo Zhang, Jun Li

High-fidelity control of quantum systems is crucial for quantum information processing, but is often limited by perturbations from the environment and imperfections in the applied control fields.

Quantum Physics

Paper
Add Code

The Flare and Warp of the Young Stellar Disk traced with LAMOST DR5 OB-type stars

no code implementations • 1 Feb 2021 • Yang Yu, Hai-Feng Wang, Wen-Yuan Cui, Lin-Lin Li, Chao Liu, Bo Zhang, Hao Tian, Zhen-Yan Huo, Jie Ju, Zhi-Cun Liu, Fang Wen, Shuai Feng

We present analysis of the spatial density structure for the outer disk from 8$-$14 \, kpc with the LAMOST DR5 13534 OB-type stars and observe similar flaring on north and south sides of the disk implying that the flaring structure is symmetrical about the Galactic plane, for which the scale height at different Galactocentric distance is from 0. 14 to 0. 5 \, kpc.

Astrophysics of Galaxies

Paper
Add Code

Improving Accuracy and Diversity in Matching of Recommendation with Diversified Preference Network

no code implementations • 7 Feb 2021 • Ruobing Xie, Qi Liu, Shukai Liu, Ziwei Zhang, Peng Cui, Bo Zhang, Leyu Lin

In this paper, we propose a novel Heterogeneous graph neural network framework for diversified recommendation (GraphDR) in matching to improve both recommendation accuracy and diversity.

Graph Attention Recommendation Systems

Paper
Add Code

AutoKWS: Keyword Spotting with Differentiable Architecture Search

no code implementations • 8 Sep 2020 • Bo Zhang, Wenfeng Li, Qingyuan Li, Weiji Zhuang, Xiangxiang Chu, Yujun Wang

Smart audio devices are gated by an always-on lightweight keyword spotting program to reduce power consumption.

Keyword Spotting Neural Architecture Search

Paper
Add Code

A Minimax Probability Machine for Non-Decomposable Performance Measures

no code implementations • 28 Feb 2021 • JunRu Luo, Hong Qiao, Bo Zhang

On the other hand, the minimax probability machine is a popular method for binary classification problems and aims at learning a linear classifier by maximizing the accuracy rate, which makes it unsuitable to deal with imbalanced classification tasks.

Binary Classification Classification +2

Paper
Add Code

Learning with Smooth Hinge Losses

no code implementations • 27 Feb 2021 • JunRu Luo, Hong Qiao, Bo Zhang

Due to the non-smoothness of the Hinge loss in SVM, it is difficult to obtain a faster convergence rate with modern optimization algorithms.

text-classification Text Classification

Paper
Add Code

Extragalactic HI 21-cm absorption line observations with the Five-hundred-meter Aperture Spherical radio Telescope

no code implementations • 11 Mar 2021 • Bo Zhang, Ming Zhu, Zhong-Zu Wu, Qing-Zheng Yu, Peng Jiang, You-Ling Yue, Meng-Lin Huang, Qiao-Li Hao

Our observations successfully confirmed the existence of HI absorption lines in all these systems, including two sources that were marginally detected by ALFALFA.

Astrophysics of Galaxies

Paper
Add Code

MagDR: Mask-guided Detection and Reconstruction for Defending Deepfakes

no code implementations • CVPR 2021 • Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Bo Zhang

This paper presents MagDR, a mask-guided detection and reconstruction pipeline for defending deepfakes from adversarial attacks.

Paper
Add Code

Free-Space Optical Communication Using Non-mode-Selective Photonic Lantern Based Coherent Receiver

no code implementations • 3 Jul 2020 • Bo Zhang, Renzhi Yuan, Jianfeng Sun, Julian Cheng, Mohamed-Slim Alouini

A free-space optical communication system using non-mode-selective photonic lantern (PL) based coherent receiver is studied.

Paper
Add Code

Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable

no code implementations • 15 Apr 2021 • Shuxiao Chen, Bo Zhang

Estimating dynamic treatment regimes (DTRs) from retrospective observational data is challenging as some degree of unmeasured confounding is often expected.

Paper
Add Code

Let's See Clearly: Contaminant Artifact Removal for Moving Cameras

no code implementations • ICCV 2021 • Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander

This new dataset and our novel framework lead to our method that is able to address different contaminants and outperforms competitive restoration approaches both qualitatively and quantitatively.

Video Restoration

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.