Search Results for author: Jian Sun

Found 255 papers, 124 papers with code

S^2SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers

no code implementations • Findings (ACL) 2022 • Binyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Yanyang Li, Bowen Li, Jian Sun, Yongbin Li

The task of converting a natural language question into an executable SQL query, known as text-to-SQL, is an important branch of semantic parsing.

Semantic Parsing Text-To-SQL

Paper
Add Code

Data Quality Matters: Suicide Intention Detection on Social Media Posts Using a RoBERTa-CNN Model

no code implementations • 3 Feb 2024 • Emily Lin, Jian Sun, Hsingyu Chen, Mohammad H. Mahoor

In the meanwhile, RoBERTa-CNN outperforms competitive methods, demonstrating the robustness and ability to capture nuanced linguistic patterns for suicidal intentions.

Depression Detection Sentiment Analysis +2

Paper
Add Code

Floorplanning of VLSI by Mixed-Variable Optimization

no code implementations • 27 Jan 2024 • Jian Sun, Huabin Cheng, Jian Wu, Zhanyang Zhu, Yu Chen

FA-GSS uses the Golden Section strategy to optimize both wirelength and area targets.

Paper
Add Code

Distributed Data-driven Unknown-input Observers

no code implementations • 9 Jan 2024 • Yuzhou Wei, Giorgia Disarò, Wenjie Liu, Jian Sun, Maria Elena Valcher, Gang Wang

Moving to a data-driven approach, it is shown that the input/output/state trajectories of the system are compatible with the equations of a D-DUIO, and this allows, under suitable assumptions, to express the matrices of a possible DUIO in terms of the matrices of pre-collected data.

Paper
Add Code

ImputeFormer: Low Rankness-Induced Transformers for Generalizable Spatiotemporal Imputation

1 code implementation • 4 Dec 2023 • Tong Nie, Guoyang Qin, Wei Ma, Yuewen Mei, Jian Sun

The exploitation of the inherent structures of spatiotemporal data enables our model to learn balanced signal-noise representations, making it versatile for a variety of imputation problems.

Inductive Bias Multivariate Time Series Imputation +1

Paper
Code

A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

no code implementations • 1 Dec 2023 • Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed.

Paper
Add Code

Robust Control of Unknown Switched Linear Systems from Noisy Data

no code implementations • 19 Nov 2023 • Wenjie Liu, Yifei Li, Jian Sun, Gang Wang, Jie Chen

This paper investigates the problem of data-driven stabilization for linear discrete-time switched systems with unknown switching dynamics.

Paper
Add Code

Optimal Transport-Guided Conditional Score-Based Diffusion Models

1 code implementation • 2 Nov 2023 • Xiang Gu, Liwei Yang, Jian Sun, Zongben Xu

Conditional score-based diffusion model (SBDM) is for conditional generation of target data with paired data as condition, and has achieved great success in image translation.

Image-to-Image Translation Super-Resolution +1

Paper
Code

Self-triggered Consensus Control of Multi-agent Systems from Data

no code implementations • 19 Oct 2023 • Yifei Li, Xin Wang, Jian Sun, Gang Wang, Jie Chen

In the presence of external disturbances, a model-based STC scheme is put forth for $\mathcal{H}_{\infty}$-consensus of MASs, serving as a baseline for the data-driven STC.

Paper
Add Code

STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

1 code implementation • NeurIPS 2023 • Weipu Zhang, Gang Wang, Jian Sun, Yetian Yuan, Gao Huang

The performance of these algorithms heavily relies on the sequence modeling and generation capabilities of the world model.

Ranked #5 on Atari Games 100k on Atari 100k

Atari Games 100k Model-based Reinforcement Learning +2

Paper
Code

Identifying factors associated with fast visual field progression in patients with ocular hypertension based on unsupervised machine learning

no code implementations • 26 Sep 2023 • Xiaoqin Huang, Asma Poursoroush, Jian Sun, Michael V. Boland, Chris Johnson, Siamak Yousefi

We characterized the subtypes based on demographic, clinical, ocular, and VF factors at the baseline.

Paper
Add Code

Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images

no code implementations • ICCV 2023 • Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei zhang, Hang Xu

In this paper, we propose a text-guided 3D faces generation method, refer as TG-3DFace, for generating realistic 3D faces using text guidance.

3D Shape Generation Contrastive Learning +2

Paper
Add Code

Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL

no code implementations • 20 Aug 2023 • Yechen Zhang, Jian Sun, Gang Wang, Zhuo Li, Wei Chen

Discrete reinforcement learning (RL) algorithms have demonstrated exceptional performance in solving sequential decision tasks with discrete action spaces, such as Atari games.

Atari Games Continuous Control +1

Paper
Add Code

Data-driven Polytopic Output Synchronization of Heterogeneous Multi-agent Systems from Noisy Data

no code implementations • 14 Jul 2023 • Yifei Li, Wenjie Liu, Jian Sun, Gang Wang, Lihua Xie, Jie Chen

This method utilizes measured data and a noise-matrix polytope to ensure near-optimal output synchronization.

Paper
Add Code

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

1 code implementation • 4 Jul 2023 • Tong Nie, Guoyang Qin, Lijun Sun, Wei Ma, Yu Mei, Jian Sun

Spatiotemporal urban data (STUD) displays complex correlational patterns.

Computational Efficiency Decision Making

Paper
Code

Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning

1 code implementation • 9 May 2023 • Runqing Wang, Gang Wang, Jian Sun, Fang Deng, Jie Chen

The complex relationships between operations and machines are represented precisely and concisely, for which a dual-attention network (DAN) comprising several interconnected operation message attention blocks and machine message attention blocks is proposed.

Decision Making Job Shop Scheduling +2

Paper
Code

A Survey on Out-of-Distribution Detection in NLP

no code implementations • 5 May 2023 • Hao Lang, Yinhe Zheng, Yixuan Li, Jian Sun, Fei Huang, Yongbin Li

Out-of-distribution (OOD) detection is essential for the reliable and safe deployment of machine learning systems in the real world.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Add Code

Learning Robust Data-based LQG Controllers from Noisy Data

no code implementations • 2 May 2023 • Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

In this work, a data-based formulation for computing the steady-state Kalman gain is proposed based on semi-definite programming (SDP) using some noise-free input-state-output data.

Paper
Add Code

MC-ViViT: Multi-branch Classifier-ViViT to detect Mild Cognitive Impairment in older adults using facial videos

no code implementations • 11 Apr 2023 • Jian Sun, Hiroko H. Dodge, Mohammad H. Mahoor

Deep machine learning models including Convolutional Neural Networks (CNN) have been successful in the detection of Mild Cognitive Impairment (MCI) using medical images, questionnaires, and videos.

Paper
Add Code

Keypoint-Guided Optimal Transport

2 code implementations • 23 Mar 2023 • Xiang Gu, Yucheng Yang, Wei Zeng, Jian Sun, Zongben Xu

In this paper, we propose a novel KeyPoint-Guided model by ReLation preservation (KPG-RL) that searches for the optimal matching (i. e., transport plan) guided by the keypoints in OT.

Domain Adaptation Image-to-Image Translation +1

Paper
Code

Towards better traffic volume estimation: Jointly addressing the underdetermination and nonequilibrium problems with correlation-adaptive GNNs

1 code implementation • 10 Mar 2023 • Tong Nie, Guoyang Qin, Yunpeng Wang, Jian Sun

Traffic volume is an indispensable ingredient to provide fine-grained information for traffic management and control.

Graph Attention

Paper
Code

Generalized Semantic Segmentation by Self-Supervised Source Domain Projection and Multi-Level Contrastive Learning

1 code implementation • 3 Mar 2023 • Liwei Yang, Xiang Gu, Jian Sun

SSDP aims to reduce domain gap by projecting data to the source domain, while MLCL is a learning scheme to learn discriminative and generalizable features on the projected data.

Contrastive Learning Domain Generalization +2

Paper
Code

EdgeYOLO: An Edge-Real-Time Object Detector

1 code implementation • 15 Feb 2023 • Shihan Liu, Junlin Zha, Jian Sun, Zhuo Li, Gang Wang

This paper proposes an efficient, low-complexity and anchor-free object detector based on the state-of-the-art YOLO framework, which can be implemented in real time on edge computing platforms.

Data Augmentation Edge-computing +1

388

Paper
Code

Self-triggered Resilient Stabilization of Linear Systems with Quantized Output

no code implementations • 14 Feb 2023 • Wenjie Liu, Masashi Wakaiki, Jian Sun, Gang Wang, Jie Chen

If, in addition, the transmission protocols at the controller-to-actuator (C-A) and sensor-to-controller (S-C) channels can be adapted, the self-triggered control architecture can be considerably simplified, leveraging a delicate observer-based deadbeat controller to eliminate the need for running the controller in parallel at the encoder side.

Paper
Add Code

Dynamic Grained Encoder for Vision Transformers

1 code implementation • NeurIPS 2021 • Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng

Specifically, we propose a Dynamic Grained Encoder for vision transformers, which can adaptively assign a suitable number of queries to each spatial region.

Image Classification Language Modelling +2

Paper
Code

Towards Generalized Open Information Extraction

no code implementations • 29 Nov 2022 • Bowen Yu, Zhenyu Zhang, Jingyang Li, Haiyang Yu, Tingwen Liu, Jian Sun, Yongbin Li, Bin Wang

Open Information Extraction (OpenIE) facilitates the open-domain discovery of textual facts.

Open Information Extraction

Paper
Add Code

Semi-Supervised Lifelong Language Learning

1 code implementation • 23 Nov 2022 • Yingxiu Zhao, Yinhe Zheng, Bowen Yu, Zhiliang Tian, Dongkyu Lee, Jian Sun, Haiyang Yu, Yongbin Li, Nevin L. Zhang

In this paper, we explore a novel setting, semi-supervised lifelong language learning (SSLL), where a model learns sequentially arriving language tasks with both labeled and unlabeled data.

Transfer Learning

961

Paper
Code

CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation

no code implementations • 21 Nov 2022 • Yinpei Dai, Wanwei He, Bowen Li, Yuchuan Wu, Zheng Cao, Zhongqi An, Jian Sun, Yongbin Li

Practical dialog systems need to deal with various knowledge sources, noisy user expressions, and the shortage of annotated data.

Goal-Oriented Dialog Retrieval

Paper
Add Code

Estimating Soft Labels for Out-of-Domain Intent Detection

no code implementations • 10 Nov 2022 • Hao Lang, Yinhe Zheng, Jian Sun, Fei Huang, Luo Si, Yongbin Li

Out-of-Domain (OOD) intent detection is important for practical dialog systems.

Intent Detection Out of Distribution (OOD) Detection

Paper
Add Code

Correlating sparse sensing for large-scale traffic speed estimation: A Laplacian-enhanced low-rank tensor kriging approach

1 code implementation • 21 Oct 2022 • Tong Nie, Guoyang Qin, Yunpeng Wang, Jian Sun

In addition, sensors are prone to error or missing data due to various kinds of reasons, speeds from these sensors can become highly noisy.

Management

Paper
Code

Doc2Bot: Accessing Heterogeneous Documents via Conversational Bots

no code implementations • 20 Oct 2022 • Haomin Fu, Yeqin Zhang, Haiyang Yu, Jian Sun, Fei Huang, Luo Si, Yongbin Li, Cam-Tu Nguyen

This paper introduces Doc2Bot, a novel dataset for building machines that help users seek information via conversations.

dialog state tracking Response Generation

Paper
Add Code

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue

1 code implementation • 14 Oct 2022 • Yingxiu Zhao, Yinhe Zheng, Zhiliang Tian, Chang Gao, Bowen Yu, Haiyang Yu, Yongbin Li, Jian Sun, Nevin L. Zhang

Lifelong learning (LL) is vital for advanced task-oriented dialogue (ToD) systems.

Natural Language Understanding

961

Paper
Code

Stacking Ensemble Learning in Deep Domain Adaptation for Ophthalmic Image Classification

no code implementations • 27 Sep 2022 • Yeganeh Madadi, Vahid Seydi, Jian Sun, Edward Chaum, Siamak Yousefi

We extend Maximum Mean Discrepancy (MMD), Low-rank coding, and Correlation Alignment (CORAL) to compute the adaptation loss in three base models.

Domain Adaptation Ensemble Learning +1

Paper
Add Code

SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation

1 code implementation • 14 Sep 2022 • Wanwei He, Yinpei Dai, Min Yang, Jian Sun, Fei Huang, Luo Si, Yongbin Li

To capture the structured dialog semantics, we pre-train the dialog understanding module via a novel tree-induced semi-supervised contrastive learning objective with the help of extra dialog annotations.

Contrastive Learning dialog state tracking +1

961

Paper
Code

Implicit Full Waveform Inversion with Deep Neural Representation

no code implementations • 8 Sep 2022 • Jian Sun, Kristopher Innanen

Compared to FWI, which is sensitive to the initial model, IFWI benefits from the increased degrees of freedom with deep learning optimization, thus allowing to start from a random initialization, which greatly reduces the risk of non-uniqueness and being trapped in local minima.

Bayesian Inference

Paper
Add Code

A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions

no code implementations • 29 Aug 2022 • Bowen Qin, Binyuan Hui, Lihan Wang, Min Yang, Jinyang Li, Binhua Li, Ruiying Geng, Rongyu Cao, Jian Sun, Luo Si, Fei Huang, Yongbin Li

In recent years, deep neural networks have significantly advanced this task by neural generation models, which automatically learn a mapping function from an input NL question to an output SQL query.

SQL Parsing Text-To-SQL

Paper
Add Code

Data-Driven Control of Distributed Event-Triggered Network Systems

no code implementations • 22 Aug 2022 • Xin Wang, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

The present paper deals with data-driven event-triggered control of a class of unknown discrete-time interconnected systems (a. k. a.

Paper
Add Code

Differentiable Architecture Search with Random Features

no code implementations • CVPR 2023 • Xuanyang Zhang, Yonggang Li, Xiangyu Zhang, Yongtao Wang, Jian Sun

Differentiable architecture search (DARTS) has significantly promoted the development of NAS techniques because of its high search efficiency and effectiveness but suffers from performance collapse.

Ranked #12 on Neural Architecture Search on NAS-Bench-201, CIFAR-10

Neural Architecture Search

Paper
Add Code

Event-triggered Consensus Control of Heterogeneous Multi-agent Systems: Model- and Data-based Analysis

no code implementations • 1 Aug 2022 • Xin Wang, Jian Sun, Gang Wang, Jie Chen

This article deals with model- and data-based consensus control of heterogenous leader-following multi-agent systems (MASs) under an event-triggering transmission scheme.

Paper
Add Code

DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection

1 code implementation • 22 Jul 2022 • Jinrong Yang, Lin Song, Songtao Liu, Weixin Mao, Zeming Li, Xiaoping Li, Hongbin Sun, Jian Sun, Nanning Zheng

Many point-based 3D detectors adopt point-feature sampling strategies to drop some points for efficient inference.

3D Object Detection object-detection

Paper
Code

StreamYOLO: Real-time Object Detection for Streaming Perception

no code implementations • 21 Jul 2022 • Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, Jian Sun

In this paper, we explore the performance of real time models on this metric and endow the models with the capacity of predicting the future, significantly improving the results for streaming perception.

Autonomous Driving Object +2

Paper
Add Code

Data-driven Self-triggered Control via Trajectory Prediction

no code implementations • 18 Jul 2022 • Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity.

Model Predictive Control Trajectory Prediction

Paper
Add Code

Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration

no code implementations • 14 Jul 2022 • Zhenyu Zhang, Bowen Yu, Haiyang Yu, Tingwen Liu, Cheng Fu, Jingyang Li, Chengguang Tang, Jian Sun, Yongbin Li

In this paper, we propose a Layout-aware document-level Information Extraction dataset, LIE, to facilitate the study of extracting both structural and semantic knowledge from visually rich documents (VRDs), so as to generate accurate responses in dialogue systems.

Language Modelling

Paper
Add Code

Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection

2 code implementations • 6 Jul 2022 • HongYu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, Jian Sun

To date, the most powerful semi-supervised object detectors (SS-OD) are based on pseudo-boxes, which need a sequence of post-processing with fine-tuned hyper-parameters.

Ranked #4 on Semi-Supervised Object Detection on COCO 100% labeled data

object-detection Object Detection +2

12,039

Paper
Code

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

1 code implementation • ICCV 2023 • Yingfei Liu, Junjie Yan, Fan Jia, Shuailin Li, Aqi Gao, Tiancai Wang, Xiangyu Zhang, Jian Sun

More specifically, we extend the 3D position embedding (3D PE) in PETR for temporal modeling.

Ranked #2 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU lane - 224x480 - 100x100 at 0.5 metric)

3D Lane Detection 3D Object Detection +6

772

Paper
Code

Unifying Voxel-based Representation with Transformer for 3D Object Detection

1 code implementation • 1 Jun 2022 • Yanwei Li, Yilun Chen, Xiaojuan Qi, Zeming Li, Jian Sun, Jiaya Jia

To this end, the modality-specific space is first designed to represent different inputs in the voxel feature space.

3D Object Detection Object +3

214

Paper
Code

Voxel Field Fusion for 3D Object Detection

1 code implementation • CVPR 2022 • Yanwei Li, Xiaojuan Qi, Yukang Chen, LiWei Wang, Zeming Li, Jian Sun, Jiaya Jia

In this work, we present a conceptually simple yet effective framework for cross-modality 3D object detection, named voxel field fusion.

3D Object Detection Data Augmentation +2

Paper
Code

VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for Analysis-by-Synthesis

1 code implementation • 30 May 2022 • Angtian Wang, Peng Wang, Jian Sun, Adam Kortylewski, Alan Yuille

The Gaussian reconstruction kernels have been proposed by Westover (1990) and studied by the computer graphics community back in the 90s, which gives an alternative representation of object 3D geometry from meshes and point clouds.

Pose Estimation

Paper
Code

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems

no code implementations • 30 May 2022 • Ting-En Lin, Yuchuan Wu, Fei Huang, Luo Si, Jian Sun, Yongbin Li

In this paper, we present Duplex Conversation, a multi-turn, multimodal spoken dialogue system that enables telephone-based agents to interact with customers like a human.

Data Augmentation Spoken Dialogue Systems

Paper
Add Code

NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

no code implementations • 25 May 2022 • Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, Javen Qinfeng Shi, Dong Gong, Dan Zhu, Mengdi Sun, Guannan Chen, Yang Hu, Haowei Li, Baozhu Zou, Zhen Liu, Wenjie Lin, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Mingyan Han, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Juan Marín-Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger, Chunyang Li, Long Bao, Gang He, Ziyao Xu, Li Xu, Gen Zhan, Ming Sun, Xing Wen, Junlin Li, Shuang Feng, Fei Lei, Rui Liu, Junxiang Ruan, Tianhong Dai, Wei Li, Zhan Lu, Hengyan Liu, Peian Huang, Guangyu Ren, Yonglin Luo, Chang Liu, Qiang Tu, Fangya Li, Ruipeng Gang, Chenghua Li, Jinjing Li, Sai Ma, Chenming Liu, Yizhen Cao, Steven Tel, Barthelemy Heyrman, Dominique Ginhac, Chul Lee, Gahyeon Kim, Seonghyun Park, An Gia Vien, Truong Thanh Nhat Mai, Howoon Yoon, Tu Vo, Alexander Holston, Sheir Zaheer, Chan Y. Park

The challenge is composed of two tracks with an emphasis on fidelity and complexity constraints: In Track 1, participants are asked to optimize objective fidelity scores while imposing a low-complexity constraint (i. e. solutions can not exceed a given number of operations).

Image Restoration Vocal Bursts Intensity Prediction

Paper
Add Code

A Survey on Neural Open Information Extraction: Current Status and Future Directions

no code implementations • 24 May 2022 • Shaowen Zhou, Bowen Yu, Aixin Sun, Cheng Long, Jingyang Li, Haiyang Yu, Jian Sun, Yongbin Li

Open Information Extraction (OpenIE) facilitates domain-independent discovery of relational facts from large corpora.

Ranked #1 on Open Information Extraction on CaRB

Natural Language Understanding Open-Domain Question Answering +1

Paper
Add Code

Truncated tensor Schatten p-norm based approach for spatiotemporal traffic data imputation with complicated missing patterns

1 code implementation • 19 May 2022 • Tong Nie, Guoyang Qin, Jian Sun

Rapid advances in sensor, wireless communication, cloud computing and data science have brought unprecedented amount of data to assist transportation engineers and researchers in making better decisions.

Cloud Computing Imputation +1

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

116

Paper
Code

Focal Sparse Convolutional Networks for 3D Object Detection

2 code implementations • CVPR 2022 • Yukang Chen, Yanwei Li, Xiangyu Zhang, Jian Sun, Jiaya Jia

In this paper, we introduce two new modules to enhance the capability of Sparse CNNs, both are based on making feature sparsity learnable with position-wise importance prediction.

3D Object Detection Object +1

359

Paper
Code

BSRT: Improving Burst Super-Resolution with Swin Transformer and Flow-Guided Deformable Alignment

1 code implementation • 18 Apr 2022 • Ziwei Luo, Youwei Li, Shen Cheng, Lei Yu, Qi Wu, Zhihong Wen, Haoqiang Fan, Jian Sun, Shuaicheng Liu

To overcome the challenges in BurstSR, we propose a Burst Super-Resolution Transformer (BSRT), which can significantly improve the capability of extracting inter-frame information and reconstruction.

Ranked #1 on Burst Image Super-Resolution on BurstSR

Burst Image Reconstruction Burst Image Super-Resolution +2

176

Paper
Code

When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search

1 code implementation • 11 Apr 2022 • Guocheng Qian, Xuanyang Zhang, Guohao Li, Chen Zhao, Yukang Chen, Xiangyu Zhang, Bernard Ghanem, Jian Sun

TNAS performs a modified bi-level Breadth-First Search in the proposed trees to discover a high-performance architecture.

Ranked #7 on Neural Architecture Search on NAS-Bench-201, CIFAR-10

Neural Architecture Search

Paper
Code

Simple Baselines for Image Restoration

9 code implementations • 10 Apr 2022 • Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun

Although there have been significant advances in the field of image restoration recently, the system complexity of the state-of-the-art (SOTA) methods is increasing as well, which may hinder the convenient analysis and comparison of methods.

Ranked #1 on Deblurring on MSU BASED

Deblurring Image Deblurring +2

1,993

Paper
Code

Boosting Black-Box Adversarial Attacks with Meta Learning

no code implementations • 28 Mar 2022 • Junjie Fu, Jian Sun, Gang Wang

Extensive experiments demonstrate that our method can not only improve the attack success rates, but also reduces the number of queries compared to other methods.

Adversarial Attack Meta-Learning

Paper
Add Code

FS6D: Few-Shot 6D Pose Estimation of Novel Objects

1 code implementation • CVPR 2022 • Yisheng He, Yao Wang, Haoqiang Fan, Jian Sun, Qifeng Chen

6D object pose estimation networks are limited in their capability to scale to large numbers of object instances due to the close-set assumption and their reliance on high-fidelity object CAD models.

6D Pose Estimation 6D Pose Estimation using RGB +1

Paper
Code

Real-time Object Detection for Streaming Perception

1 code implementation • CVPR 2022 • Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, Jian Sun

In this paper, instead of searching trade-offs between accuracy and speed like previous works, we point out that endowing real-time models with the ability to predict the future is the key to dealing with this problem.

Ranked #1 on Real-Time Object Detection on Argoverse-HD (Full-Stack, Val) (sAP metric, using extra training data)

Autonomous Driving Object +2

297

Paper
Code

Improving Meta-learning for Low-resource Text Classification and Generation via Memory Imitation

no code implementations • ACL 2022 • Yingxiu Zhao, Zhiliang Tian, Huaxiu Yao, Yinhe Zheng, Dongkyu Lee, Yiping Song, Jian Sun, Nevin L. Zhang

Building models of natural language processing (NLP) is challenging in low-resource scenarios where only limited data are available.

Memorization Meta-Learning +2

Paper
Add Code

Rebalanced Siamese Contrastive Mining for Long-Tailed Recognition

2 code implementations • 22 Mar 2022 • Zhisheng Zhong, Jiequan Cui, Zeming Li, Eric Lo, Jian Sun, Jiaya Jia

Given the promising performance of contrastive learning, we propose Rebalanced Siamese Contrastive Mining (ResCom) to tackle imbalanced recognition.

Ranked #5 on Long-tail Learning on CIFAR-10-LT (ρ=10)

Contrastive Learning Long-tail Learning +1

Paper
Code

Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation

1 code implementation • CVPR 2022 • Zhiyuan Liang, Tiancai Wang, Xiangyu Zhang, Jian Sun, Jianbing Shen

The tree energy loss is effective and easy to be incorporated into existing frameworks by combining it with a traditional segmentation loss.

Segmentation Semantic Segmentation

101

Paper
Code

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots

1 code implementation • Findings (ACL) 2022 • Sai Zhang, Yuwei Hu, Yuchuan Wu, Jiaman Wu, Yongbin Li, Jian Sun, Caixia Yuan, Xiaojie Wang

We find some new linguistic phenomena and interactive manners in SSTOD which raise critical challenges of building dialog agents for the task.

Ranked #1 on SSTOD on SSD_NAME

SSTOD

Paper
Code

Progressive End-to-End Object Detection in Crowded Scenes

2 code implementations • CVPR 2022 • Anlin Zheng, Yuang Zhang, Xiangyu Zhang, Xiaojuan Qi, Jian Sun

Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes.

Ranked #1 on Object Detection on CrowdHuman

Object object-detection +1

Paper
Code

S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers

no code implementations • 14 Mar 2022 • Binyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Bowen Li, Jian Sun, Yongbin Li

The task of converting a natural language question into an executable SQL query, known as text-to-SQL, is an important branch of semantic parsing.

Semantic Parsing Text-To-SQL

Paper
Add Code

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

7 code implementations • CVPR 2022 • Xiaohan Ding, Xiangyu Zhang, Yizhuang Zhou, Jungong Han, Guiguang Ding, Jian Sun

We revisit large kernel design in modern convolutional neural networks (CNNs).

Ranked #73 on Image Classification on ImageNet

Image Classification

2,983

Paper
Code

PETR: Position Embedding Transformation for Multi-View 3D Object Detection

1 code implementation • 10 Mar 2022 • Yingfei Liu, Tiancai Wang, Xiangyu Zhang, Jian Sun

Object query can perceive the 3D position-aware features and perform end-to-end object detection.

Ranked #3 on 3D Object Detection on 3D Object Detection on Argoverse2 Camera Only

3D Object Detection Object +3

772

Paper
Code

Towards Self-Supervised Category-Level Object Pose and Size Estimation

no code implementations • 6 Mar 2022 • Yisheng He, Haoqiang Fan, Haibin Huang, Qifeng Chen, Jian Sun

Instead, we propose a label-free method that learns to enforce the geometric consistency between category template mesh and observed object point cloud under a self-supervision manner.

Paper
Add Code

Model-Based and Data-Driven Control of Event- and Self-Triggered Discrete-Time LTI Systems

no code implementations • 16 Feb 2022 • Xin Wang, Julian Berberich, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

To this end, we begin by presenting a dynamic event-triggering scheme (ETS) based on periodic sampling, and a discrete-time looped-functional approach, through which a model-based stability condition is derived.

STS

Paper
Add Code

Relieving Long-tailed Instance Segmentation via Pairwise Class Balance

2 code implementations • CVPR 2022 • Yin-Yin He, Peizhen Zhang, Xiu-Shen Wei, Xiangyu Zhang, Jian Sun

In this paper, we explore to excavate the confusion matrix, which carries the fine-grained misclassification details, to relieve the pairwise biases, generalizing the coarse one.

Instance Segmentation Semantic Segmentation

Paper
Code

Learning to Select the Next Reasonable Mention for Entity Linking

no code implementations • 8 Dec 2021 • Jian Sun, Yu Zhou, Chengqing Zong

To address the problem, we propose a novel model, called DyMen, to dynamically adjust the subsequent linking target based on the previously linked entities via reinforcement learning, enabling the model to select a link target that can fully use previously linked information.

Entity Linking Knowledge Graphs +2

Paper
Add Code

Adversarial Reweighting for Partial Domain Adaptation

1 code implementation • NeurIPS 2021 • Xiang Gu, Xi Yu, Yan Yang, Jian Sun, Zongben Xu

To tackle the challenge of negative domain transfer, we propose a novel Adversarial Reweighting (AR) approach that adversarially learns the weights of source domain data to align the source and target domain distributions, and the transferable deep recognition network is learned on the reweighted source domain data.

Ranked #1 on Partial Domain Adaptation on DomainNet

Partial Domain Adaptation

Paper
Code

Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay

no code implementations • NeurIPS 2021 • Ruosi Wan, Zhanxing Zhu, Xiangyu Zhang, Jian Sun

Specifically, 1) we introduce the assumptions that can lead to equilibrium state in SMD, and prove equilibrium can be reached in a linear rate regime under given assumptions; 2) we propose ``angular update" as a substitute for effective learning rate to depict the state of SMD, and derive the theoretical value of angular update in equilibrium state; 3) we verify our assumptions and theoretical results on various large-scale computer vision tasks including ImageNet and MSCOCO with standard settings.

Paper
Add Code

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

1 code implementation • 29 Nov 2021 • Wanwei He, Yinpei Dai, Yinhe Zheng, Yuchuan Wu, Zheng Cao, Dermot Liu, Peng Jiang, Min Yang, Fei Huang, Luo Si, Jian Sun, Yongbin Li

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems.

Ranked #1 on End-To-End Dialogue Modelling on MULTIWOZ 2.0

End-To-End Dialogue Modelling

105

Paper
Code

XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For Convolutional Neural Networks

1 code implementation • 21 Nov 2021 • Jian Sun, Ali Pourramezan Fard, Mohammad H. Mahoor

To address the computational burdens of the Dynamic Routing mechanism, this paper proposes new Fully Connected (FC) layers by xnorizing the linear projection outside or inside the Dynamic Routing within the CapsFC layer.

Ranked #11 on Image Classification on MNIST (Accuracy metric)

Binarization Image Classification

Paper
Code

Linking-Enhanced Pre-Training for Table Semantic Parsing

no code implementations • 18 Nov 2021 • Bowen Qin, Lihan Wang, Binyuan Hui, Ruiying Geng, Zheng Cao, Min Yang, Jian Sun, Yongbin Li

Recently pre-training models have significantly improved the performance of various NLP tasks by leveraging large-scale text corpora to improve the contextual representation ability of the neural network.

Inductive Bias Language Modelling +2

Paper
Add Code

Distributed stochastic proximal algorithm with random reshuffling for non-smooth finite-sum optimization

no code implementations • 6 Nov 2021 • Xia Jiang, Xianlin Zeng, Jian Sun, Jie Chen, Lihua Xie

We prove that local variable estimates generated by the proposed algorithm achieve consensus and are attracted to a neighborhood of the optimal solution in expectation with an $\mathcal{O}(\frac{1}{T}+\frac{1}{\sqrt{T}})$ convergence rate, where $T$ is the total number of iterations.

Paper
Add Code

Path-Enhanced Multi-Relational Question Answering with Knowledge Graph Embeddings

no code implementations • 29 Oct 2021 • Guanglin Niu, Yang Li, Chengguang Tang, Zhongkai Hu, Shibin Yang, Peng Li, Chengyu Wang, Hao Wang, Jian Sun

The multi-relational Knowledge Base Question Answering (KBQA) system performs multi-hop reasoning over the knowledge graph (KG) to achieve the answer.

Knowledge Base Question Answering Knowledge Graph Embedding +1

Paper
Add Code

Instance-Conditional Knowledge Distillation for Object Detection

1 code implementation • NeurIPS 2021 • Zijian Kang, Peizhen Zhang, Xiangyu Zhang, Jian Sun, Nanning Zheng

Knowledge distillation has shown great success in classification, however, it is still challenging for detection.

Image Classification Knowledge Distillation +3

Paper
Code

Data-Driven Resilient Predictive Control under Denial-of-Service

no code implementations • 25 Oct 2021 • Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Finally, a numerical example is given to validate the effectiveness of the proposed control method.

Model Predictive Control

Paper
Add Code

Data-driven Control of Dynamic Event-triggered Systems with Delays

no code implementations • 25 Oct 2021 • Xin Wang, Jian Sun, Julian Berberich, Gang Wang, Frank Allgöwer, Jie Chen

Data-based representations for time-invariant linear systems with known or unknown system input matrices are first developed, along with a novel class of dynamic triggering schemes for sampled-data systems with time delays.

Paper
Add Code

DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings

1 code implementation • EMNLP 2021 • Che Liu, Rui Wang, Jinghua Liu, Jian Sun, Fei Huang, Luo Si

Learning sentence embeddings from dialogues has drawn increasing attention due to its low annotation cost and high domain adaptability.

Contrastive Learning Semantic Textual Similarity +2

Paper
Code

Partial to Whole Knowledge Distillation: Progressive Distilling Decomposed Knowledge Boosts Student Better

no code implementations • 26 Sep 2021 • Xuanyang Zhang, Xiangyu Zhang, Jian Sun

Knowledge distillation field delicately designs various types of knowledge to shrink the performance gap between compact student and large-scale teacher.

Knowledge Distillation

Paper
Add Code

LGD: Label-guided Self-distillation for Object Detection

1 code implementation • 23 Sep 2021 • Peizhen Zhang, Zijian Kang, Tong Yang, Xiangyu Zhang, Nanning Zheng, Jian Sun

Instead, we generate an instructive knowledge based only on student representations and regular labels.

Instance Segmentation Object +4

Paper
Code

Anchor DETR: Query Design for Transformer-Based Object Detection

2 code implementations • 15 Sep 2021 • Yingming Wang, Xiangyu Zhang, Tong Yang, Jian Sun

Thanks to the query design and the attention variant, the proposed detector that we called Anchor DETR, can achieve better performance and run faster than the DETR with 10$\times$ fewer training epochs.

Object object-detection +1

321

Paper
Code

Fully Convolutional Networks for Panoptic Segmentation with Point-based Supervision

1 code implementation • 17 Aug 2021 • Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Yukang Chen, Lu Qi, LiWei Wang, Zeming Li, Jian Sun, Jiaya Jia

In particular, Panoptic FCN encodes each object instance or stuff category with the proposed kernel generator and produces the prediction by convolving the high-resolution feature directly.

Panoptic Segmentation Segmentation +1

388

Paper
Code

MMChat: Multi-Modal Chat Dataset on Social Media

1 code implementation • LREC 2022 • Yinhe Zheng, Guanyi Chen, Xin Liu, Jian Sun

To better investigate this issue, we manually annotate 100K dialogues from MMChat and further filter the corpus accordingly, which yields MMChat-hf.

Dialogue Generation

Paper
Code

Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views

1 code implementation • ICCV 2021 • Xin Wei, Yifei Gong, Fudong Wang, Xing Sun, Jian Sun

In this way, each 3D shape with arbitrary views is represented by a fixed number of canonical view features, which are further aggregated to generate a rich and robust 3D shape representation for shape recognition.

3D Shape Recognition 3D Shape Representation

Paper
Code

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking

no code implementations • ACL 2021 • Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Xiaodan Zhu

Existing dialog state tracking (DST) models are trained with dialog data in a random order, neglecting rich structural information in a dataset.

dialog state tracking Dialogue State Tracking +1

Paper
Add Code

Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge

1 code implementation • 27 Jul 2021 • Songyang Zhang, Lin Song, Songtao Liu, Zheng Ge, Zeming Li, Xuming He, Jian Sun

In this report, we introduce our real-time 2D object detection system for the realistic autonomous driving scenario.

Autonomous Driving object-detection +1

9,008

Paper
Code

A Unified Hyper-GAN Model for Unpaired Multi-contrast MR Image Translation

1 code implementation • 26 Jul 2021 • Heran Yang, Jian Sun, Liwei Yang, Zongben Xu

Hyper-GAN consists of a pair of hyper-encoder and hyper-decoder to first map from the source contrast to a common feature space, and then further map to the target contrast image.

Translation

Paper
Code

YOLOX: Exceeding YOLO Series in 2021

41 code implementations • 18 Jul 2021 • Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun

In this report, we present some experienced improvements to YOLO series, forming a new high-performance detector -- YOLOX.

Ranked #1 on Real-Time Object Detection on Argoverse-HD (Detection-Only, Val) (using extra training data)

Autonomous Driving Real-Time Object Detection

27,744

Paper
Code

Ill-posed Surface Emissivity Retrieval from Multi-Geometry Hyperspectral Images using a Hybrid Deep Neural Network

no code implementations • 9 Jul 2021 • Fangcao Xu, Jian Sun, Guido Cervone, Mark Salvador

Atmospheric correction errors can significantly alter the spectral signature of the observations, and lead to invalid classifications or target detection.

Retrieval

Paper
Add Code

EBSR: Feature Enhanced Burst Super-Resolution With Deformable Alignment

2 code implementations • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2021 • Ziwei Luo, Lei Yu, Xuan Mo, Youwei Li, Lanpeng Jia, Haoqiang Fan, Jian Sun, Shuaicheng Liu

We propose a novel architecture to handle the problem of multi-frame super-resolution (MFSR).

Ranked #2 on Burst Image Super-Resolution on SyntheticBurst

Burst Image Reconstruction Burst Image Super-Resolution +1

176

Paper
Code

NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results

no code implementations • 7 Jun 2021 • Goutam Bhat, Martin Danelljan, Radu Timofte, Kazutoshi Akita, Wooyeong Cho, Haoqiang Fan, Lanpeng Jia, Daeshik Kim, Bruno Lecouat, Youwei Li, Shuaicheng Liu, Ziluan Liu, Ziwei Luo, Takahiro Maeda, Julien Mairal, Christian Micheloni, Xuan Mo, Takeru Oba, Pavel Ostyakov, Jean Ponce, Sanghyeok Son, Jian Sun, Norimichi Ukita, Rao Muhammad Umer, Youliang Yan, Lei Yu, Magauiya Zhussip, Xueyi Zou

This paper reviews the NTIRE2021 challenge on burst super-resolution.

Super-Resolution

Paper
Add Code

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking

no code implementations • 1 Jun 2021 • Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Xiaodan Zhu

Existing dialog state tracking (DST) models are trained with dialog data in a random order, neglecting rich structural information in a dataset.

Ranked #1 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1 (using extra training data)

dialog state tracking Multi-domain Dialogue State Tracking

Paper
Add Code

ADNet: Attention-guided Deformable Convolutional Network for High Dynamic Range Imaging

8 code implementations • 22 May 2021 • Zhen Liu, Wenjie Lin, Xinpeng Li, Qing Rao, Ting Jiang, Mingyan Han, Haoqiang Fan, Jian Sun, Shuaicheng Liu

In this paper, we present an attention-guided deformable convolutional network for hand-held multi-frame high dynamic range (HDR) imaging, namely ADNet.

Ranked #5 on Face Alignment on WFW (Extra Data)

Face Alignment Vocal Bursts Intensity Prediction

Paper
Code

Generalized Few-Shot Object Detection without Forgetting

1 code implementation • CVPR 2021 • Zhibo Fan, Yuchen Ma, Zeming Li, Jian Sun

Recently few-shot object detection is widely adopted to deal with data-limited situations.

Few-Shot Object Detection Object +2

Paper
Code

Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang, Tianbao Zhou, Shuai Liu, Lei Lei, Chaoyu Feng, Liguang Huang, Zhikun Lei, Feifei Chen

A detailed description of all models developed in the challenge is provided in this paper.

Image Denoising

Paper
Add Code

Relational Learning with Gated and Attentive Neighbor Aggregator for Few-Shot Knowledge Graph Completion

1 code implementation • 27 Apr 2021 • Guanglin Niu, Yang Li, Chengguang Tang, Ruiying Geng, Jian Dai, Qiao Liu, Hao Wang, Jian Sun, Fei Huang, Luo Si

Moreover, modeling and inferring complex relations of one-to-many (1-N), many-to-one (N-1), and many-to-many (N-N) by previous knowledge graph completion approaches requires high model complexity and a large amount of training instances.

Few-Shot Learning Relational Reasoning

Paper
Code

Points as Queries: Weakly Semi-supervised Object Detection by Points

1 code implementation • CVPR 2021 • Liangyu Chen, Tong Yang, Xiangyu Zhang, Wei zhang, Jian Sun

We propose a novel point annotated setting for the weakly semi-supervised object detection task, in which the dataset comprises small fully annotated images and large weakly annotated images by points.

object-detection Object Detection +1

Paper
Code

IQDet: Instance-wise Quality Distribution Sampling for Object Detection

no code implementations • CVPR 2021 • Yuchen Ma, Songtao Liu, Zeming Li, Jian Sun

We propose a dense object detector with an instance-wise sampling strategy, named IQDet.

object-detection Object Detection

Paper
Add Code

Learning to Jointly Deblur, Demosaick and Denoise Raw Images

no code implementations • 13 Apr 2021 • Thomas Eboli, Jian Sun, Jean Ponce

We address the problem of non-blind deblurring and demosaicking of noisy raw images.

Deblurring Demosaicking +1

Paper
Add Code

Distribution Alignment: A Unified Framework for Long-tail Visual Recognition

1 code implementation • CVPR 2021 • Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun

Motivated by our discovery, we propose a unified distribution alignment strategy for long-tail visual recognition.

Ranked #17 on Long-tail Learning on Places-LT

General Classification Image Classification +6

114

Paper
Code

OTA: Optimal Transport Assignment for Object Detection

2 code implementations • CVPR 2021 • Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun

Recent advances in label assignment in object detection mainly seek to independently define positive/negative training samples for each ground-truth (gt) object.

Ranked #62 on Object Detection on COCO test-dev

Object object-detection +1

242

Paper
Code

Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol

no code implementations • 22 Mar 2021 • Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

When both input and output channels are subject to DoS attacks and quantization, the proposed structure is shown able to decouple the encoding schemes for input, output, and estimated output signals.

Quantization

Paper
Add Code

You Only Look One-level Feature

6 code implementations • CVPR 2021 • Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun

From the perspective of optimization, we introduce an alternative way to address the problem instead of adopting the complex feature pyramids - {\em utilizing only one-level feature for detection}.

Ranked #131 on Object Detection on COCO test-dev

object-detection Object Detection

27,744

Paper
Code

Training Networks in Null Space of Feature Covariance for Continual Learning

1 code implementation • CVPR 2021 • Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu

To balance plasticity and stability of network in continual learning, in this paper, we propose a novel network training algorithm called Adam-NSCL, which sequentially optimizes network parameters in the null space of previous tasks.

Continual Learning

Paper
Code

End-to-End Human Object Interaction Detection with HOI Transformer

1 code implementation • CVPR 2021 • Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun

We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner.

Ranked #30 on Human-Object Interaction Detection on HICO-DET (using extra training data)

Human-Object Interaction Detection object-detection +1

137

Paper
Code

Improving Text-to-SQL with Schema Dependency Learning

no code implementations • 7 Mar 2021 • Binyuan Hui, Xiang Shi, Ruiying Geng, Binhua Li, Yongbin Li, Jian Sun, Xiaodan Zhu

In this paper, we present the Schema Dependency guided multi-task Text-to-SQL model (SDSQL) to guide the network to effectively capture the interactions between questions and schemas.

Text-To-SQL

Paper
Add Code

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation

3 code implementations • CVPR 2021 • Yisheng He, Haibin Huang, Haoqiang Fan, Qifeng Chen, Jian Sun

Moreover, at the output representation stage, we designed a simple but effective 3D keypoints selection algorithm considering the texture and geometry information of objects, which simplifies keypoint localization for precise pose estimation.

Ranked #1 on 6D Pose Estimation on LineMOD

6D Pose Estimation Representation Learning

473

Paper
Code

Using Long Short-Term Memory (LSTM) and Internet of Things (IoT) for localized surface temperature forecasting in an urban environment

no code implementations • 4 Feb 2021 • Manzhu Yu, Fangcao Xu, Weiming Hu, Jian Sun, Guido Cervone

Meanwhile, by using IoT observations, the spatial resolution of air temperature predictions is significantly improved.

Paper
Add Code

Neural Architecture Search with Random Labels

1 code implementation • CVPR 2021 • Xuanyang Zhang, Pengfei Hou, Xiangyu Zhang, Jian Sun

In this paper, we investigate a new variant of neural architecture search (NAS) paradigm -- searching with random labels (RLNAS).

Neural Architecture Search

Paper
Code

Momentum^2 Teacher: Momentum Teacher with Momentum Statistics for Self-Supervised Learning

1 code implementation • 19 Jan 2021 • Zeming Li, Songtao Liu, Jian Sun

The teacher's weight is a momentum update of the student, and the teacher's BN statistics is a momentum update of those in history.

Self-Supervised Learning

120

Paper
Code

RepVGG: Making VGG-style ConvNets Great Again

22 code implementations • CVPR 2021 • Xiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, Jian Sun

We present a simple but powerful architecture of convolutional neural network, which has a VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and ReLU, while the training-time model has a multi-branch topology.

Ranked #42 on Semantic Segmentation on Cityscapes val

Image Classification Semantic Segmentation

29,713

Paper
Code

Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing

2 code implementations • 5 Jan 2021 • Binyuan Hui, Ruiying Geng, Qiyu Ren, Binhua Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Pengfei Zhu, Xiaodan Zhu

Semantic parsing has long been a fundamental problem in natural language processing.

Ranked #5 on Dialogue State Tracking on CoSQL

Dialogue State Tracking Inductive Bias +4

961

Paper
Code

Domain-Free Adversarial Splitting for Domain Generalization

no code implementations • 1 Jan 2021 • Xiang Gu, Jiasun Feng, Jian Sun, Zongben Xu

In this framework, we model the domain generalization as a learning problem that enforces the learner to be able to generalize well for any train/val subsets splitting of the training dataset.

Domain Generalization Meta-Learning

Paper
Add Code

Implicit Feature Pyramid Network for Object Detection

no code implementations • 25 Dec 2020 • Tiancai Wang, Xiangyu Zhang, Jian Sun

In this paper, we present an implicit feature pyramid network (i-FPN) for object detection.

Object object-detection +1

Paper
Add Code

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

no code implementations • 13 Dec 2020 • Zhengxiong Luo, Zhicheng Wang, Yuanhao Cai, GuanAn Wang, Yan Huang, Liang Wang, Erjin Zhou, Tieniu Tan, Jian Sun

Instead, we focus on exploiting multi-scale information from layers with different receptive-field sizes and then making full of use this information by improving the fusion method.

Pose Estimation

Paper
Add Code

Rethinking Learnable Tree Filter for Generic Feature Transform

1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Xiangyu Zhang, Hongbin Sun, Jian Sun, Nanning Zheng

The Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation.

Instance Segmentation object-detection +3

Paper
Code

End-to-End Object Detection with Fully Convolutional Network

1 code implementation • CVPR 2021 • JianFeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng

Mainstream object detectors based on the fully convolutional network has achieved impressive performance.

object-detection Object Detection

489

Paper
Code

Fine-Grained Dynamic Head for Object Detection

1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng

To this end, we propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance, which further releases the ability of multi-scale feature representation.

Object object-detection +1

Paper
Code

Fully Convolutional Networks for Panoptic Segmentation

6 code implementations • CVPR 2021 • Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, LiWei Wang, Zeming Li, Jian Sun, Jiaya Jia

In this paper, we present a conceptually simple, strong, and efficient framework for panoptic segmentation, called Panoptic FCN.

Ranked #1 on Panoptic Segmentation on COCO minival (SQ metric)

Panoptic Segmentation Segmentation

388

Paper
Code

Dual Attention Network for Cross-lingual Entity Alignment

no code implementations • COLING 2020 • Jian Sun, Yu Zhou, Chengqing Zong

The hierarchical attention adaptively aggregates the low-hierarchy and the high-hierarchy information, which is beneficial to balance the neighborhood information of counterpart entities and distinguish non-counterpart entities with similar structures.

Entity Alignment Graph Attention +2

Paper
Add Code

Decentralized TD Tracking with Linear Function Approximation and its Finite-Time Analysis

no code implementations • NeurIPS 2020 • Gang Wang, Songtao Lu, Georgios Giannakis, Gerald Tesauro, Jian Sun

The present contribution deals with decentralized policy evaluation in multi-agent Markov decision processes using temporal-difference (TD) methods with linear function approximation for scalability.

Paper
Add Code

UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning

2 code implementations • CVPR 2021 • Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun

By integrating these two components together, our method achieves the best performance for unsupervised optical flow learning on multiple leading benchmarks, including MPI-SIntel, KITTI 2012 and KITTI 2015.

Ranked #1 on Optical Flow Estimation on Sintel Final unsupervised

Optical Flow Estimation

239

Paper
Code

Self-EMD: Self-Supervised Object Detection without ImageNet

no code implementations • 27 Nov 2020 • Songtao Liu, Zeming Li, Jian Sun

Our Faster R-CNN (ResNet50-FPN) baseline achieves 39. 8% mAP on COCO, which is on par with the state of the art self-supervised methods pre-trained on ImageNet.

Object object-detection +2

Paper
Add Code

Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis

no code implementations • ECCV 2020 • Ruixuan Yu, Xin Wei, Federico Tombari, Jian Sun

In this work, we propose a novel deep network for point clouds by incorporating positional information of points as inputs while yielding rotation-invariance.

Paper
Add Code

Joint COCO and Mapillary Workshop at ICCV 2019: COCO Instance Segmentation Challenge Track

no code implementations • 6 Oct 2020 • Zeming Li, Yuchen Ma, Yukang Chen, Xiangyu Zhang, Jian Sun

In this report, we present our object detection/instance segmentation system, MegDetV2, which works in a two-pass fashion, first to detect instances then to obtain segmentation.

Instance Segmentation object-detection +3

Paper
Add Code

EqCo: Equivalent Rules for Self-supervised Contrastive Learning

1 code implementation • 5 Oct 2020 • Benjin Zhu, Junqiang Huang, Zeming Li, Xiangyu Zhang, Jian Sun

In this paper, we propose EqCo (Equivalent Rules for Contrastive Learning) to make self-supervised learning irrelevant to the number of negative samples in the contrastive learning framework.

Contrastive Learning Self-Supervised Learning

Paper
Code

Activate or Not: Learning Customized Activation

4 code implementations • CVPR 2021 • Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun

We present a simple, effective, and general activation function we term ACON which learns to activate the neurons or not.

object-detection Object Detection +1

203

Paper
Code

Multi-Frequency Multi-Scenario Millimeter Wave MIMO Channel Measurements and Modeling for B5G Wireless Communication Systems

no code implementations • 28 Jul 2020 • Jie Huang, Cheng-Xiang Wang, Hengtai Chang, Jian Sun, Xiqi Gao

Millimeter wave (mmWave) bands have been utilized for the fifth generation (5G) communication systems and will no doubt continue to be deployed for beyond 5G (B5G).

Paper
Add Code

Deep Reinforcement Learning for Dynamic Spectrum Sensing and Aggregation in Multi-Channel Wireless Networks

no code implementations • 28 Jul 2020 • Yunzeng Li, Wensheng Zhang, Cheng-Xiang Wang, Jian Sun, Yu Liu

Then, the vacant channels in the selected segment will be aggregated for satisfying the user requirement.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

A Non-Stationary VVLC MIMO Channel Model for Street Corner Scenarios

no code implementations • 28 Jul 2020 • Qingshan Chen, Cheng-Xiang Wang, Jian Sun, Wensheng Zhang, Qiuming Zhu

The study of the underlying VLC channel is the basis for designing the VLC communication system.

Paper
Add Code

A Survey on Complex Question Answering over Knowledge Base: Recent Advances and Challenges

no code implementations • 26 Jul 2020 • Bin Fu, Yunqi Qiu, Chengguang Tang, Yang Li, Haiyang Yu, Jian Sun

Question Answering (QA) over Knowledge Base (KB) aims to automatically answer natural language questions via well-structured relation information between entities stored in knowledge bases.

Information Retrieval Question Answering +2

Paper
Add Code

WeightNet: Revisiting the Design Space of Weight Networks

2 code implementations • ECCV 2020 • Ningning Ma, Xiangyu Zhang, Jiawei Huang, Jian Sun

WeightNet is easy and memory-conserving to train, on the kernel space instead of the feature space.

172

Paper
Code

Funnel Activation for Visual Recognition

6 code implementations • ECCV 2020 • Ningning Ma, Xiangyu Zhang, Jian Sun

We present a conceptually simple but effective funnel activation for image recognition tasks, called Funnel activation (FReLU), that extends ReLU and PReLU to a 2D activation by adding a negligible overhead of spatial condition.

Scene Generation Semantic Segmentation

175

Paper
Code

BorderDet: Border Feature for Dense Object Detection

2 code implementations • ECCV 2020 • Han Qiu, Yuchen Ma, Zeming Li, Songtao Liu, Jian Sun

In this paper, We propose a simple and efficient operator called Border-Align to extract "border features" from the extreme point of the border to enhance the point feature.

Dense Object Detection Object +1

432

Paper
Code

Learning from Manipulable Signals

no code implementations • 17 Jul 2020 • Mehmet Ekmekci, Leandro Gorno, Lucas Maestri, Jian Sun, Dong Wei

The principal learns about the agent's type from a noisy performance measure, which can be manipulated by the agent via a costly and hidden action.

Paper
Add Code

AutoAssign: Differentiable Label Assignment for Dense Object Detection

2 code implementations • 7 Jul 2020 • Benjin Zhu, Jian-Feng Wang, Zhengkai Jiang, Fuhang Zong, Songtao Liu, Zeming Li, Jian Sun

During training, to both satisfy the prior distribution of data and adapt to category characteristics, we present Center Weighting to adjust the category-specific prior distributions.

Dense Object Detection Object +1

27,744

Paper
Code

LabelEnc: A New Intermediate Supervision Method for Object Detection

1 code implementation • ECCV 2020 • Miao Hao, Yitao Liu, Xiangyu Zhang, Jian Sun

In this paper we propose a new intermediate supervision method, named LabelEnc, to boost the training of object detection systems.

Object object-detection +1

Paper
Code

End-to-end Interpretable Learning of Non-blind Image Deblurring

1 code implementation • ECCV 2020 • Thomas Eboli, Jian Sun, Jean Ponce

Non-blind image deblurring is typically formulated as a linear least-squares problem regularized by natural priors on the corresponding sharp picture's gradients, which can be solved, for example, using a half-quadratic splitting method with Richardson fixed-point iterations for its least-squares updates and a proximal operator for the auxiliary variable updates.

Blind Image Deblurring Image Deblurring

Paper
Code

Learning Low-Resource End-To-End Goal-Oriented Dialog for Fast and Reliable System Deployment

no code implementations • ACL 2020 • Yinpei Dai, Hangyu Li, Chengguang Tang, Yongbin Li, Jian Sun, Xiaodan Zhu

Existing end-to-end dialog systems perform less effectively when data is scarce.

Dialog Learning Goal-Oriented Dialog +1

Paper
Add Code

Structured and Localized Image Restoration

no code implementations • 16 Jun 2020 • Thomas Eboli, Alex Nowak-Vila, Jian Sun, Francis Bach, Jean Ponce, Alessandro Rudi

We present a novel approach to image restoration that leverages ideas from localized structured prediction and non-linear multi-task learning.

Image Restoration Multi-Task Learning +1

Paper
Add Code

Online Reinforcement Learning Control by Direct Heuristic Dynamic Programming: from Time-Driven to Event-Driven

no code implementations • 16 Jun 2020 • Qingtao Zhao, Jennie Si, Jian Sun

In this paper time-driven learning refers to the machine learning method that updates parameters in a prediction model continuously as new data arrives.

Reinforcement Learning (RL)

Paper
Add Code

Spherical Motion Dynamics: Learning Dynamics of Neural Network with Normalization, Weight Decay, and SGD

no code implementations • 15 Jun 2020 • Ruosi Wan, Zhanxing Zhu, Xiangyu Zhang, Jian Sun

In this work, we comprehensively reveal the learning dynamics of neural network with normalization, weight decay (WD), and SGD (with momentum), named as Spherical Motion Dynamics (SMD).

Paper
Add Code

Joint Multi-Dimension Pruning via Numerical Gradient Update

no code implementations • 18 May 2020 • Zechun Liu, Xiangyu Zhang, Zhiqiang Shen, Zhe Li, Yichen Wei, Kwang-Ting Cheng, Jian Sun

To tackle these three naturally different dimensions, we proposed a general framework by defining pruning as seeking the best pruning vector (i. e., the numerical value of layer-wise channel number, spacial size, depth) and construct a unique mapping from the pruning vector to the pruned network structures.

Paper
Add Code

Dynamic Memory Induction Networks for Few-Shot Text Classification

no code implementations • ACL 2020 • Ruiying Geng, Binhua Li, Yongbin Li, Jian Sun, Xiaodan Zhu

This paper proposes Dynamic Memory Induction Networks (DMIN) for few-shot text classification.

Few-Shot Learning Few-Shot Text Classification +2

Paper
Add Code

A Survey on Dialog Management: Recent Advances and Challenges

no code implementations • 5 May 2020 • Yinpei Dai, Huihua Yu, Yixuan Jiang, Chengguang Tang, Yongbin Li, Jian Sun

Dialog management (DM) is a crucial component in a task-oriented dialog system.

Management Reinforcement Learning (RL)

Paper
Add Code

Angle-based Search Space Shrinking for Neural Architecture Search

1 code implementation • ECCV 2020 • Yiming Hu, Yuding Liang, Zichao Guo, Ruosi Wan, Xiangyu Zhang, Yichen Wei, Qingyi Gu, Jian Sun

Comprehensive experiments show that ABS can dramatically enhance existing NAS approaches by providing a promising shrunk search space.

Neural Architecture Search

Paper
Code

Dynamic Scale Training for Object Detection

4 code implementations • 26 Apr 2020 • Yukang Chen, Peizhen Zhang, Zeming Li, Yanwei Li, Xiangyu Zhang, Lu Qi, Jian Sun, Jiaya Jia

We propose a Dynamic Scale Training paradigm (abbreviated as DST) to mitigate scale variation challenge in object detection.

Instance Segmentation Model Optimization +4

Paper
Code

Attentive Normalization for Conditional Image Generation

1 code implementation • CVPR 2020 • Yi Wang, Ying-Cong Chen, Xiangyu Zhang, Jian Sun, Jiaya Jia

Traditional convolution-based generative adversarial networks synthesize images based on hierarchical local operations, where long-range dependency relation is implicitly modeled with a Markov chain.

Conditional Image Generation Semantic correspondence +2

Paper
Code

Learning Human-Object Interaction Detection using Interaction Points

1 code implementation • CVPR 2020 • Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, Xiangyu Zhang, Jian Sun

Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them.

Human-Object Interaction Detection Keypoint Detection +2

Paper
Code

Dynamic Region-Aware Convolution

no code implementations • CVPR 2021 • Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun

More gracefully, our DRConv transfers the increasing channel-wise filters to spatial dimension with learnable instructor, which not only improve representation ability of convolution, but also maintains computational cost and the translation-invariance as standard convolution dose.

Ranked #14 on Semantic Segmentation on MCubeS

Face Recognition General Classification +2

Paper
Add Code

Learning Dynamic Routing for Semantic Segmentation

1 code implementation • CVPR 2020 • Yanwei Li, Lin Song, Yukang Chen, Zeming Li, Xiangyu Zhang, Xingang Wang, Jian Sun

To demonstrate the superiority of the dynamic property, we compare with several static architectures, which can be modeled as special cases in the routing space.

Segmentation Semantic Segmentation

378

Paper
Code

Detection in Crowded Scenes: One Proposal, Multiple Predictions

3 code implementations • CVPR 2020 • Xuangeng Chu, Anlin Zheng, Xiangyu Zhang, Jian Sun

We propose a simple yet effective proposal-based object detector, aiming at detecting highly-overlapped instances in crowded scenes.

Ranked #2 on Pedestrian Detection on TJU-Ped-campus

Object Detection Pedestrian Detection

3,076

Paper
Code

High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification

2 code implementations • CVPR 2020 • Guan'an Wang, Shuo Yang, Huanyu Liu, Zhicheng Wang, Yang Yang, Shuliang Wang, Gang Yu, Erjin Zhou, Jian Sun

When aligning two groups of local features from two images, we view it as a graph matching problem and propose a cross-graph embedded-alignment (CGEA) layer to jointly learn and embed topology information to local features, and straightly predict similarity score.

Graph Matching Person Re-Identification +1

498

Paper
Code

PointINS: Point-based Instance Segmentation

no code implementations • 13 Mar 2020 • Lu Qi, Yi Wang, Yukang Chen, Yingcong Chen, Xiangyu Zhang, Jian Sun, Jiaya Jia

In this paper, we explore the mask representation in instance segmentation with Point-of-Interest (PoI) features.

Instance Segmentation Object Detection +3

Paper
Add Code

Learning Delicate Local Representations for Multi-Person Pose Estimation

4 code implementations • ECCV 2020 • Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, Jian Sun

To tackle this problem, we propose an efficient attention mechanism - Pose Refine Machine (PRM) to make a trade-off between local and global representations in output features and further refine the keypoint locations.

Ranked #1 on Keypoint Detection on COCO test-challenge

Keypoint Detection Multi-Person Pose Estimation

4,982

Paper
Code

Gauss-Newton Unrolled Neural Networks and Data-driven Priors for Regularized PSSE with Robustness

no code implementations • 3 Mar 2020 • Qiuling Yang, Alireza Sadeghi, Gang Wang, Georgios B. Giannakis, Jian Sun

Numerical tests using real load data on the IEEE $118$-bus benchmark system showcase the improved estimation and robustness performance of the proposed scheme compared with several state-of-the-art alternatives.

Image Denoising Rolling Shutter Correction

Paper
Add Code

A Big Data Enabled Channel Model for 5G Wireless Communication Systems

no code implementations • 28 Feb 2020 • Jie Huang, Cheng-Xiang Wang, Lu Bai, Jian Sun, Yang Yang, Jie Li, Olav Tirkkonen, Ming-Tuo Zhou

This paper investigates various applications of big data analytics, especially machine learning algorithms in wireless communications and channel modeling.

BIG-bench Machine Learning

Paper
Add Code

Isotropic All-electric Spin analyzer based on a quantum ring with spin-orbit coupling

no code implementations • 4 Feb 2020 • Shenglin Peng, Wenchen Luo, Jian Sun, Ai-Min Guo, Fangping Ouyang, Tapash Chakraborty

Here we propose an isotropic all electrical spin analyzer in a quantum ring with spin-orbit coupling by analytically and numerically modeling how the charge transmission rates depend on the polarization of the incident spin.

Mesoscale and Nanoscale Physics

Paper
Add Code

Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization

1 code implementation • ICLR 2020 • Junjie Yan, Ruosi Wan, Xiangyu Zhang, Wei zhang, Yichen Wei, Jian Sun

Therefore many modified normalization techniques have been proposed, which either fail to restore the performance of BN completely, or have to introduce additional nonlinear operations in inference procedure and increase huge consumption.

182

Paper
Code

Learning Neural Surrogate Model for Warm-Starting Bayesian Optimization

no code implementations • ICLR 2020 • Haotian Zhang, Jian Sun, Zongben Xu

Bayesian optimization is an effective tool to optimize black-box functions and popular for hyper-parameter tuning in machine learning.

Bayesian Optimization

Paper
Add Code

Neural Diffusion Distance for Image Segmentation

no code implementations • NeurIPS 2019 • Jian Sun, Zongben Xu

To compute high-resolution diffusion distance or segmentation mask, we design an up-sampling strategy by feature-attentional interpolation which can be learned when training spec-diff-net.

Image Segmentation Segmentation +2

Paper
Add Code

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

3 code implementations • CVPR 2020 • Yisheng He, Wei Sun, Haibin Huang, Jianran Liu, Haoqiang Fan, Jian Sun

Our method is a natural extension of 2D-keypoint approaches that successfully work on RGB based 6DoF estimation.

Ranked #1 on 6D Pose Estimation using RGBD on YCB-Video (Mean ADD-S metric)

6D Pose Estimation 6D Pose Estimation using RGBD +1

473

Paper
Code

Conductor Galloping Prediction on Imbalanced Datasets: SVM with Smart Sampling

no code implementations • 9 Nov 2019 • Kui Wang, Jian Sun, Chenye Wu, Yang Yu

Conductor galloping is the high-amplitude, low-frequency oscillation of overhead power lines due to wind.

Paper
Add Code

A Statistical Learning Approach to Reactive Power Control in Distribution Systems

no code implementations • 25 Oct 2019 • Qiuling Yang, Alireza Sadeghi, Gang Wang, Georgios B. Giannakis, Jian Sun

Taking a statistical learning viewpoint, the input-output relationship between each grid state and the corresponding optimal reactive power control is parameterized in the present work by a deep neural network, whose unknown weights are learned offline by minimizing the power loss over a number of historical and simulated training pairs.

Computational Efficiency

Paper
Add Code

Learnable Tree Filter for Structure-preserving Feature Transform

1 code implementation • NeurIPS 2019 • Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng

To this end, tree filtering modules are embedded to formulate a unified framework for semantic segmentation.

Semantic Segmentation

140

Paper
Code

VAENAS: Sampling Matters in Neural Architecture Search

no code implementations • 25 Sep 2019 • Shizheng Qin, Yichen Zhu, Pengfei Hou, Xiangyu Zhang, Wenqiang Zhang, Jian Sun

In this paper, we propose a learnable sampling module based on variational auto-encoder (VAE) for neural architecture search (NAS), named as VAENAS, which can be easily embedded into existing weight sharing NAS framework, e. g., one-shot approach and gradient-based approach, and significantly improve the performance of searching results.

Neural Architecture Search

Paper
Add Code

Resizable Neural Networks

no code implementations • 25 Sep 2019 • Yichen Zhu, Xiangyu Zhang, Tong Yang, Jian Sun

We introduce the adaptive resizable networks as dynamic networks, which further improve the performance with less computational cost via data-dependent inference.

Data Augmentation Neural Architecture Search

Paper
Add Code

Content-Aware Unsupervised Deep Homography Estimation

1 code implementation • ECCV 2020 • Jirong Zhang, Chuan Wang, Shuaicheng Liu, Lanpeng Jia, Nianjin Ye, Jue Wang, Ji Zhou, Jian Sun

Homography estimation is a basic image alignment method in many applications.

Ranked #5 on Homography Estimation on S-COCO

Homography Estimation

315

Paper
Code

Disentangled Image Matting

no code implementations • ICCV 2019 • Shaofan Cai, Xiaoshuai Zhang, Haoqiang Fan, Haibin Huang, Jiangyu Liu, Jiaming Liu, Jiaying Liu, Jue Wang, Jian Sun

Most previous image matting methods require a roughly-specificed trimap as input, and estimate fractional alpha values for all pixels that are in the unknown region of the trimap.

Image Matting

Paper
Add Code

HRGE-Net: Hierarchical Relational Graph Embedding Network for Multi-view 3D Shape Recognition

no code implementations • 27 Aug 2019 • Xin Wei, Ruixuan Yu, Jian Sun

We construct a relational graph with multi-view images as nodes, and design relational graph embedding by modeling pairwise and neighboring relations among views.

3D Shape Classification 3D Shape Recognition +2

Paper
Add Code

DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

2 code implementations • CVPR 2019 • Hanchao Li, Pengfei Xiong, Haoqiang Fan, Jian Sun

This paper introduces an extremely efficient CNN architecture named DFANet for semantic segmentation under resource constraints.

Ranked #7 on SMAC+ on Def_Infantry_parallel

Real-Time Semantic Segmentation Segmentation +1

255

Paper
Code

Perceive Where to Focus: Learning Visibility-aware Part-level Features for Partial Person Re-identification

1 code implementation • CVPR 2019 • Yifan Sun, Qin Xu, Ya-Li Li, Chi Zhang, Yikang Li, Shengjin Wang, Jian Sun

The visibility awareness allows VPM to extract region-level features and compare two images with focus on their shared regions (which are visible on both images).

Ranked #14 on Person Re-Identification on Market-1501-C

Person Re-Identification

Paper
Code

Single Path One-Shot Neural Architecture Search with Uniform Sampling

6 code implementations • ECCV 2020 • Zichao Guo, Xiangyu Zhang, Haoyuan Mu, Wen Heng, Zechun Liu, Yichen Wei, Jian Sun

It is easy to train and fast to search.

Ranked #88 on Neural Architecture Search on ImageNet (Accuracy metric)

Neural Architecture Search Quantization

1,361

Paper
Code

ThunderNet: Towards Real-time Generic Object Detection

3 code implementations • 28 Mar 2019 • Zheng Qin, Zeming Li, Zhaoning Zhang, Yiping Bao, Gang Yu, Yuxing Peng, Jian Sun

In this paper, we investigate the effectiveness of two-stage detectors in real-time generic detection and propose a lightweight two-stage detector named ThunderNet.

Ranked #15 on Object Detection on PASCAL VOC 2007

Object object-detection +1

277

Paper
Code

DetNAS: Backbone Search for Object Detection

2 code implementations • NeurIPS 2019 • Yukang Chen, Tong Yang, Xiangyu Zhang, Gaofeng Meng, Xinyu Xiao, Jian Sun

In this work, we present DetNAS to use Neural Architecture Search (NAS) for the design of better backbones for object detection.

General Classification Image Classification +4

1,361

Paper
Code

MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

2 code implementations • ICCV 2019 • Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Tim Kwang-Ting Cheng, Jian Sun

In this paper, we propose a novel meta learning approach for automatic channel pruning of very deep neural networks.

AutoML Meta-Learning

346

Paper
Code

Improving Cross-Domain Chinese Word Segmentation with Word Embeddings

1 code implementation • NAACL 2019 • Yuxiao Ye, Yue Zhang, Weikang Li, Likun Qiu, Jian Sun

Cross-domain Chinese Word Segmentation (CWS) remains a challenge despite recent progress in neural-based CWS.

Chinese Word Segmentation Segmentation +1

Paper
Code

Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

2 code implementations • CVPR 2019 • Xuecai Hu, Haoyuan Mu, Xiangyu Zhang, Zilei Wang, Tieniu Tan, Jian Sun

In this work, we propose a novel method called Meta-SR to firstly solve super-resolution of arbitrary scale factor (including non-integer scale factors) with a single model.

Image Super-Resolution

544

Paper
Code

Induction Networks for Few-Shot Text Classification

5 code implementations • IJCNLP 2019 • Ruiying Geng, Binhua Li, Yongbin Li, Xiaodan Zhu, Ping Jian, Jian Sun

Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries.

Ranked #1 on Few-Shot Text Classification on ODIC 5-way (10-shot)

Few-Shot Text Classification General Classification +5

106

Paper
Code

Rethinking on Multi-Stage Networks for Human Pose Estimation

7 code implementations • 1 Jan 2019 • Wenbo Li, Zhicheng Wang, Binyi Yin, Qixiang Peng, Yuming Du, Tianzi Xiao, Gang Yu, Hongtao Lu, Yichen Wei, Jian Sun

Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods.

Ranked #1 on Pose Estimation on COCO minival

Keypoint Detection

4,982

Paper
Code

HyperAdam: A Learnable Task-Adaptive Adam for Network Training

2 code implementations • 22 Nov 2018 • Shipeng Wang, Jian Sun, Zongben Xu

Deep neural networks are traditionally trained using human-designed stochastic optimization algorithms, such as SGD and Adam.

Stochastic Optimization

Paper
Code

Learning Spectral Transform Network on 3D Surface for Non-rigid Shape Analysis

no code implementations • 21 Oct 2018 • Ruixuan Yu, Jian Sun, Huibin Li

Designing a network on 3D surface for non-rigid shape analysis is a challenging task.

General Classification Metric Learning +1

Paper
Add Code

Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

no code implementations • 12 Sep 2018 • Heran Yang, Jian Sun, Aaron Carass, Can Zhao, Junghoon Lee, Zongben Xu, Jerry Prince

The cycleGAN is becoming an influential method in medical image synthesis.

Image Generation Position

Paper
Add Code

Rendering Portraitures from Monocular Camera and Beyond

no code implementations • ECCV 2018 • Xiangyu Xu, Deqing Sun, Sifei Liu, Wenqi Ren, Yu-Jin Zhang, Ming-Hsuan Yang, Jian Sun

Specifically, we first exploit Convolutional Neural Networks to estimate the relative depth and portrait segmentation maps from a single input image.

Image Matting Portrait Segmentation +1

Paper
Add Code

Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing

no code implementations • ECCV 2018 • Dong Yang, Jian Sun

In this paper, we propose a novel deep learning approach for single image dehazing by learning dark channel and transmission priors.

Image Dehazing Single Image Dehazing

Paper
Add Code

GridFace: Face Rectification via Learning Local Homography Transformations

no code implementations • ECCV 2018 • Erjin Zhou, Zhimin Cao, Jian Sun

In this paper, we propose a method, called GridFace, to reduce facial geometric variations and improve the recognition performance.

Face Recognition Image Generation

Paper
Add Code

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

35 code implementations • ECCV 2018 • Ningning Ma, Xiangyu Zhang, Hai-Tao Zheng, Jian Sun

Datasets, Transforms and Models specific to Computer Vision

Ranked #873 on Image Classification on ImageNet

Image Classification Object Detection

15,422

Paper
Code

Unified Perceptual Parsing for Scene Understanding

18 code implementations • ECCV 2018 • Tete Xiao, Yingcheng Liu, Bolei Zhou, Yuning Jiang, Jian Sun

In this paper, we study a new task called Unified Perceptual Parsing, which requires the machine vision systems to recognize as many visual concepts as possible from a given image.

Ranked #88 on Semantic Segmentation on ADE20K val

Scene Understanding Semantic Segmentation

7,387

Paper
Code

MetaAnchor: Learning to Detect Objects with Customized Anchors

no code implementations • NeurIPS 2018 • Tong Yang, Xiangyu Zhang, Zeming Li, Wenqiang Zhang, Jian Sun

We propose a novel and flexible anchor mechanism named MetaAnchor for object detection frameworks.

Object object-detection +1

Paper
Add Code

Learning Visually-Grounded Semantics from Contrastive Adversarial Samples

1 code implementation • COLING 2018 • Haoyue Shi, Jiayuan Mao, Tete Xiao, Yuning Jiang, Jian Sun

Begin with an insightful adversarial attack on VSE embeddings, we show the limitation of current frameworks and image-text datasets (e. g., MS-COCO) both quantitatively and qualitatively.

Adversarial Attack Image Captioning

Paper
Code

CrowdHuman: A Benchmark for Detecting Human in a Crowd

1 code implementation • 30 Apr 2018 • Shuai Shao, Zijian Zhao, Boxun Li, Tete Xiao, Gang Yu, Xiangyu Zhang, Jian Sun

There are a total of $470K$ human instances from the train and validation subsets, and $~22. 6$ persons per image, with various kinds of occlusions in the dataset.

Ranked #7 on Pedestrian Detection on Caltech (using extra training data)

Human Detection Object Detection +1

Paper
Code

Automated vehicle's behavior decision making using deep reinforcement learning and high-fidelity simulation environment

no code implementations • 17 Apr 2018 • Yingjun Ye, Xiaohui Zhang, Jian Sun

Therefore, a framework of the decision-making training and learning is put forward in this paper.

Decision Making reinforcement-learning +1

Paper
Add Code

DetNet: A Backbone network for Object Detection

2 code implementations • 17 Apr 2018 • Zeming Li, Chao Peng, Gang Yu, Xiangyu Zhang, Yangdong Deng, Jian Sun

Due to the gap between the image classification and object detection, we propose DetNet in this paper, which is a novel backbone network specifically designed for object detection.

Classification General Classification +7

Paper
Code

ExFuse: Enhancing Feature Fusion for Semantic Segmentation

no code implementations • ECCV 2018 • Zhenli Zhang, Xiangyu Zhang, Chao Peng, Dazhi Cheng, Jian Sun

Modern semantic segmentation frameworks usually combine low-level and high-level features from pre-trained backbone convolutional models to boost performance.

Ranked #4 on Semantic Segmentation on PASCAL VOC 2012 val (using extra training data)

Segmentation Semantic Segmentation

Paper
Add Code

AlignedReID: Surpassing Human-Level Performance in Person Re-Identification

15 code implementations • 22 Nov 2017 • Xuan Zhang, Hao Luo, Xing Fan, Weilai Xiang, Yixiao Sun, Qiqi Xiao, Wei Jiang, Chi Zhang, Jian Sun

In this paper, we propose a novel method called AlignedReID that extracts a global feature which is jointly learned with local features.

Ranked #1 on Person Re-Identification on CUHK-SYSU

Person Re-Identification

637

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.