no code implementations • 22 Feb 2023 • Haoran Yin, Jiaojiao Xiong, Yu Zhou, Chi Zhang, Di Zhang, Xizhang Wei, Yanqun Tang
Delay-Doppler waveform design has been considered as a promising solution to achieve reliable communication under high-mobility channels for the space-air-ground-integrated networks (SAGIN).
1 code implementation • 12 Feb 2023 • Chi Zhang, Rui Chen, Xiangyu Zhao, Qilong Han, Li Li
In practical recommendation scenarios, users often interact with items under multi-typed behaviors (e. g., click, add-to-cart, and purchase).
1 code implementation • 10 Feb 2023 • Hong Wang, Yuanzhi Zhou, Chi Zhang, Chen Peng, Mingxia Huang, Yi Liu, Lintao Zhang
This paper introduces XFL, an industrial-grade federated learning project.
no code implementations • 3 Feb 2023 • Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai
One the one hand, the platforms aims at optimizing the users' cumulative watch time (main goal) in long term, which can be effectively optimized by Reinforcement Learning.
1 code implementation • 28 Jan 2023 • Chi Zhang, Wenjie Ruan, Peipei Xu
We then reveal the working principles of applying Lipschitzian optimisation on NNCS verification and illustrate it by verifying an adaptive cruise control model.
no code implementations • 28 Jan 2023 • Jing Zhang, Chi Zhang, Wenjia Wang, Bing-Yi Jing
Due to the inability to interact with the environment, offline reinforcement learning (RL) methods face the challenge of estimating the Out-of-Distribution (OOD) points.
no code implementations • 21 Jan 2023 • Chi Zhang, Eric Z. Chen, Xiao Chen, Yikang Liu, Terrence Chen, Shanhui Sun
We further compared the proposed dMLP with CNNs using large kernels and studied pure MLP-based reconstruction using a stack of 1D dMLPs, as well as its CNN counterpart using only 1D convolutions.
1 code implementation • 27 Nov 2022 • Chi Zhang, Yuanyuan Shi, Yize Chen
Recent advancements in reinforcement learning algorithms have opened doors for researchers to operate and optimize building energy management systems autonomously.
no code implementations • 23 Nov 2022 • Binxin Yang, Xuejin Chen, Chaoqun Wang, Chi Zhang, Zihan Chen, Xiaoyan Sun
With a semantic feature matching loss for effective semantic supervision, our sketch embedding precisely conveys the semantics in the input sketches to the synthesized images.
no code implementations • 18 Oct 2022 • Chi Zhang, Wei Yin, Zhibin Wang, Gang Yu, Bin Fu, Chunhua Shen
In this paper, we address monocular depth estimation with deep neural networks.
no code implementations • 7 Oct 2022 • Zeqi Chen, Zhichao Cui, Chi Zhang, Jiahuan Zhou, Yuehu Liu
However, training two networks with a set of noisy pseudo labels reduces the complementarity of the two networks and results in label noise accumulation.
no code implementations • 5 Oct 2022 • Shiqian Li, Kewen Wu, Chi Zhang, Yixin Zhu
Taken together, the results on the challenging benchmark of PHYRE show that LfI is, if not better, as good as LfD for dynamics prediction.
no code implementations • 18 Sep 2022 • Chi Zhang, Yu Wang, Linzhang Wang
The recent breakthroughs in deep learning methods have sparked a wave of interest in learning-based bug detectors.
no code implementations • 6 Sep 2022 • Shihong Zhang, Chi Zhang, Bosen Wang
To fill the gaps above, we propose three initiatives in this paper: (1) A Multi-Receptive-Field PINN (MRF-PINN) model is established to solve different types of PDEs on various mesh resolutions without manual tuning; (2) The dimensional balance method is used to estimate the loss weights when solving Navier-Stokes equations; (3) The Taylor polynomial is used to pad the virtual nodes near the boundaries for implementing high-order finite difference.
no code implementations • 2 Sep 2022 • Diyi Hu, Chi Zhang, Viktor Prasanna, Bhaskar, Krishnamachari
In Multi-Agent Reinforcement Learning, communication is critical to encourage cooperation among agents.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • 23 Aug 2022 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
1 code implementation • 21 Jul 2022 • Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang
Supervised multi-view stereo (MVS) methods have achieved remarkable progress in terms of reconstruction quality, but suffer from the challenge of collecting large-scale ground-truth depth.
1 code implementation • 21 Jul 2022 • Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang
Recently, Implicit Neural Representations (INRs) parameterized by neural networks have emerged as a powerful and promising tool to represent different kinds of signals due to its continuous, differentiable properties, showing superiorities to classical discretized representations.
no code implementations • 19 Jul 2022 • Nan Song, Chi Zhang, Guosheng Lin
First, instead of learning the decision boundaries between seen classes, as is done in standard close-set classification, we reserve space for unseen classes, such that images located in these areas are recognized as the unseen classes.
no code implementations • 12 Jul 2022 • Fei Hua, Yuwei Jin, Ang Li, Yanhao Chen, Chi Zhang, Ari Hayes, Hang Gao, Eddy Z. Zhang
Evaluations through simulation and on real IBM-Q devices show that our framework can significantly reduce the error rate by up to 6$\times$, with only $\sim$60\% circuit depth compared to state-of-the-art gate scheduling approaches.
no code implementations • 30 Jun 2022 • Yuting Wang, Hangning Zhou, Zhigang Zhang, Chen Feng, Huadong Lin, Chaofei Gao, Yizhi Tang, Zhenting Zhao, Shiyu Zhang, Jie Guo, Xuefeng Wang, Ziyao Xu, Chi Zhang
This technical report presents an effective method for motion prediction in autonomous driving.
Ranked #11 on
Motion Forecasting
on Argoverse CVPR 2020
1 code implementation • 26 Jun 2022 • Xiaochuan Fan, Chi Zhang, Yong Yang, Yue Shang, Xueying Zhang, Zhen He, Yun Xiao, Bo Long, Lingfei Wu
For a platform with billions of products, it is extremely time-costly and labor-expensive to manually pick and organize qualified images.
no code implementations • 18 Jun 2022 • Manjie Xu, Guangyuan Jiang, Chi Zhang, Song-Chun Zhu, Yixin Zhu
Such inefficacy of learning in scientific thinking calls for future research in building humanlike intelligence.
no code implementations • 7 Jun 2022 • Chi Zhang, Lijuan Liu, Xiaoxue Zang, Frederick Liu, Hao Zhang, Xinying Song, Jindong Chen
Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of AlexNet in ImageNet classification [12].
1 code implementation • 1 Jun 2022 • Ravi Mangal, Zifan Wang, Chi Zhang, Klas Leino, Corina Pasareanu, Matt Fredrikson
We present \emph{cascade attack} (CasA), an adversarial attack against cascading ensembles, and show that: (1) there exists an adversarial input for up to 88\% of the samples where the ensemble claims to be certifiably robust and accurate; and (2) the accuracy of a cascading ensemble under our attack is as low as 11\% when it claims to be certifiably robust and accurate on 97\% of the test set.
no code implementations • 28 May 2022 • Chi Zhang, Olga Papaemmanouil, Josiah P. Hanna, Aditya Akella
Thus, the paper attempts to address the question "Is it possible to design a database consisting of various learned components that cooperatively work to improve end-to-end query latency?".
no code implementations • 26 May 2022 • Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang
In this paper, we formulate the problem of short video recommendation as a constrained Markov Decision Process (MDP), where platforms want to optimize the main goal of user watch time in long term, with the constraint of accommodating the auxiliary responses of user interactions such as sharing/downloading videos.
no code implementations • 21 May 2022 • Xueying Zhang, Kai Shen, Chi Zhang, Xiaochuan Fan, Yun Xiao, Zhen He, Bo Long, Lingfei Wu
In this paper, we proposed an automatic Scenario-based Multi-product Advertising Copywriting Generation system (SMPACG) for E-Commerce, which has been deployed on a leading Chinese e-commerce platform.
no code implementations • 20 May 2022 • Guangyuan Jiang, Manjie Xu, Song-Chun Zhu, Wenjuan Han, Chi Zhang, Yixin Zhu
Further, given this evaluation framework, (3) how can we induce a certain personality in a fully controllable fashion?
1 code implementation • Computational and Structural Biotechnology Journal 2022 • Chi Zhang, Hao Jiang, Weihuang Liu, Junyi Li, Shiming Tang, Mario Juhas, Yang Zhang.
Results To solve the out-of-focus issue in microscopy, we developed a Cycle Generative Adversarial Network (CycleGAN) based model and a multi-component weighted loss function.
1 code implementation • 23 Mar 2022 • Ze Yang, Chi Zhang, Ruibo Li, Yi Xu, Guosheng Lin
Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed.
no code implementations • 10 Feb 2022 • Chi Zhang, Christian Berger
In this paper, we study the interaction between pedestrians and vehicles and propose a novel neural network structure called the Pedestrian-Vehicle Interaction (PVI) extractor for learning the pedestrian-vehicle interaction.
no code implementations • CVPR 2022 • Ruibo Li, Chi Zhang, Guosheng Lin, Zhe Wang, Chunhua Shen
In this work, we focus on scene flow learning on point clouds in a self-supervised manner.
no code implementations • 22 Dec 2021 • Yuhang Wu, Tengteng Huang, Haotian Yao, Chi Zhang, Yuanjie Shao, Chuchu Han, Changxin Gao, Nong Sang
First, we present a Domain-Specific Contrastive Learning (DSCL) mechanism to fully explore intradomain information by comparing samples only from the same domain.
Contrastive Learning
Domain Adaptive Person Re-Identification
+2
no code implementations • SIGIR 2021 • Xueying Zhang, Yunjiang Jiang, Yue Shang, Zhaomeng Cheng, Chi Zhang, Xiaochuan Fan, Yun Xiao, Bo Long
We propose a novel domain-specific generative pre-training (DS-GPT) method for text generation and apply it to the product titleand review summarization problems on E-commerce mobile display. First, we adopt a decoder-only transformer architecture, which fitswell for fine-tuning tasks by combining input and output all to-gether.
no code implementations • 25 Nov 2021 • Chi Zhang, Sirui Xie, Baoxiong Jia, Ying Nian Wu, Song-Chun Zhu, Yixin Zhu
Extensive experiments show that by incorporating an algebraic treatment, the ALANS learner outperforms various pure connectionist models in domains requiring systematic generalization.
1 code implementation • NeurIPS 2021 • Tengteng Huang, Yifan Sun, Xun Wang, Haotian Yao, Chi Zhang
Model smoothing is of central importance for obtaining a reliable teacher model in the student-teacher framework, where the teacher generates surrogate supervision signals to train the student.
1 code implementation • 4 Oct 2021 • Zhaoyang Zhu, Haozhe Sun, Chi Zhang
Adam is applied widely to train neural networks.
no code implementations • 3 Oct 2021 • Chi Zhang, Sanmukh Rao Kuppannagari, Viktor K Prasanna
Current implementations exhibit poor performance due to challenges such as irregular memory accesses and thread-level synchronization overheads on CPU.
1 code implementation • 2 Oct 2021 • Chi Zhang, Sanmukh Rao Kuppannagari, Viktor K Prasanna
This leads to large overestimations of the Q values and performance deterioration of the learned policy.
no code implementations • 29 Sep 2021 • Klas Leino, Chi Zhang, Ravi Mangal, Matt Fredrikson, Bryan Parno, Corina Pasareanu
Certifiably robust neural networks employ provable run-time defenses against adversarial examples by checking if the model is locally robust at the input under evaluation.
no code implementations • 21 Sep 2021 • Chi Zhang, Chaolin Song, Abdollah Shafieezadeh
In this context, CLF provides a new direction for quantifying the impact of new training points and can be easily extended with new learning functions to adapt to different reliability problems.
no code implementations • ICCV 2021 • Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen
Inspired by the recent success in Automated Machine Learning literature (AutoML), in this paper, we present Meta Navigator, a framework that attempts to solve the aforementioned limitation in few-shot learning by seeking a higher-level strategy and proffer to automate the selection from various few-shot learning designs.
1 code implementation • 6 Sep 2021 • Zhixuan Zhang, Chi Zhang, Zhenning Niu, Le Wang, Yuehu Liu
In this manuscript, we introduce a semi-automatic scene graph annotation tool for images, the GeneAnnotator.
1 code implementation • 1 Sep 2021 • Wennan Chang, Pengtao Dang, Changlin Wan, Xiaoyu Lu, Yue Fang, Tong Zhao, Yong Zang, Bo Li, Chi Zhang, Sha Cao
Compared with existing spatial regression models, our proposed model assumes the existence a few distinct regression models that are estimated based on observations that exhibit similar response-predictor relationships.
1 code implementation • 1 Sep 2021 • Mingkuan Liu, Chi Zhang, Hua Xing, Chao Feng, Monchu Chen, Judith Bishop, Grace Ngapo
Our A/B testing and pilot results demonstrated the HITL pipeline can improve annotation speed and capacity by at least 80% and quality is comparable to or higher than manual double pass annotation.
no code implementations • 29 Aug 2021 • Chi Zhang, Guosheng Lin, Lvlong Lai, Henghui Ding, Qingyao Wu
First, we present a Class Activation Map Calibration (CAMC) module to improve the learning and prediction of network classifiers, by enforcing network prediction based on important image regions.
1 code implementation • ICCV 2021 • Ziqi Zhou, Xi Qiu, Jiangtao Xie, Jianan Wu, Chi Zhang
From the perspective of class space on base set, existing methods either focus on utilizing all classes under a global view by normal pretraining, or pay more attention to adopt an episodic manner to train meta-tasks within few classes in a local view.
1 code implementation • ICCV 2021 • Weixin Feng, Yuanjiang Wang, Lihua Ma, Ye Yuan, Chi Zhang
The instance discrimination paradigm has become dominant in unsupervised learning.
1 code implementation • ICCV 2021 • Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, Chi Zhang
Few-shot object detection, which aims at detecting novel objects rapidly from extremely few annotated examples of previously unseen classes, has attracted significant research interest in the community.
Ranked #3 on
Few-Shot Object Detection
on MS-COCO (1-shot)
no code implementations • 19 Aug 2021 • Weide Liu, Chi Zhang, Henghui Ding, Tzu-Yi Hung, Guosheng Lin
In this work, we argue that every support pixel's information is desired to be transferred to all query pixels and propose a Correspondence Matching Network (CMNet) with an Optimal Transport Matching module to mine out the correspondence between the query and support images.
no code implementations • 9 Aug 2021 • Chi Zhang, Xiaoning Ma, Yu Liu, Le Wang, Yuanqi SU, Yuehu Liu
Fundamental machine learning theory shows that different samples contribute unequally both in learning and testing processes.
3 code implementations • ICCV 2021 • Yongxing Dai, Jun Liu, Yifan Sun, Zekun Tong, Chi Zhang, Ling-Yu Duan
To ensure these two properties to better characterize appropriate intermediate domains, we enforce the bridge losses on intermediate domains' prediction space and feature space, and enforce a diversity loss on the two domain factors.
Domain Adaptive Person Re-Identification
Person Re-Identification
no code implementations • 5 Aug 2021 • Xin Sun, Henghui Ding, Chi Zhang, Guosheng Lin, Keck-Voon Ling
In this work, we aim to address the challenging task of open set recognition (OSR).
no code implementations • 8 Jun 2021 • Changlin Wan, Muhan Zhang, Wei Hao, Sha Cao, Pan Li, Chi Zhang
SNALS captures the joint interactions of a hyperedge by its local environment, which is retrieved by collecting the spectrum information of their connections.
no code implementations • 26 May 2021 • Chi Zhang, Christian Berger, Marco Dozza
In this paper, we use the recently released large-scale Waymo Open Dataset in urban traffic scenarios, which includes 374 urban training scenes and 76 urban testing scenes to analyze the performance of our proposed algorithm in comparison to the state-of-the-art (SOTA) models.
no code implementations • 7 May 2021 • Shuang Wang, Dong Zhao, Yi Li, Chi Zhang, Yuwei Guo, Qi Zang, Biao Hou, Licheng Jiao
Feature alignment between domains is one of the mainstream methods for Unsupervised Domain Adaptation (UDA) semantic segmentation.
1 code implementation • CVPR 2021 • Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu
First, we adopt a simple but effective decoupled learning strategy of representations and classifiers that only the classifiers are updated in each incremental session, which avoids knowledge forgetting in the representations.
Ranked #5 on
Few-Shot Class-Incremental Learning
on mini-Imagenet
no code implementations • 3 Apr 2021 • Zhuyu Yao, Jiangbo Ai, Boxun Li, Chi Zhang
By taking advantage of both dense detection and sparse set detection, Efficient DETR leverages dense prior to initialize the object containers and brings the gap of the 1-decoder structure and 6-decoder structure.
no code implementations • CVPR 2021 • Chi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu
To fill in this gap, we propose a neuro-symbolic Probabilistic Abduction and Execution (PrAE) learner; central to the PrAE learner is the process of probabilistic abduction and execution on a probabilistic scene representation, akin to the mental manipulation of objects.
no code implementations • CVPR 2021 • Chi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu
Causal induction, i. e., identifying unobservable mechanisms that lead to the observable relations among variables, has played a pivotal role in modern scientific discovery, especially in scenarios with only sparse and limited data.
1 code implementation • 26 Mar 2021 • Xu Xie, Chi Zhang, Yixin Zhu, Ying Nian Wu, Song-Chun Zhu
Predicting agents' future trajectories plays a crucial role in modern AI systems, yet it is challenging due to intricate interactions exhibited in multi-agent systems, especially when it comes to collision avoidance.
1 code implementation • CVPR 2021 • Yifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei
%We argue that such flexibility is also important for deep metric learning, because different visual concepts indeed correspond to different semantic scales.
Ranked #2 on
Metric Learning
on DyML-Animal
no code implementations • 11 Mar 2021 • Chi Zhang, Zihang Lin, Liheng Xu, Zongliang Li, Wei Tang, Yuehu Liu, Gaofeng Meng, Le Wang, Li Li
The key procedure of haze image translation through adversarial training lies in the disentanglement between the feature only involved in haze synthesis, i. e. style feature, and the feature representing the invariant semantic content, i. e. content feature.
3 code implementations • CVPR 2021 • Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang
We present Few-Shot object detection via Contrastive proposals Encoding (FSCE), a simple yet effective approach to learning contrastive-aware object proposal encodings that facilitate the classification of detected objects.
Ranked #10 on
Few-Shot Object Detection
on MS-COCO (30-shot)
1 code implementation • CVPR 2021 • Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun
We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner.
Ranked #20 on
Human-Object Interaction Detection
on HICO-DET
(using extra training data)
no code implementations • 25 Feb 2021 • Chi Zhang, Jinghan Jia, Burhaneddin Yaman, Steen Moeller, Sijia Liu, Mingyi Hong, Mehmet Akçakaya
Although deep learning (DL) has received much attention in accelerated MRI, recent studies suggest small perturbations may lead to instabilities in DL-based reconstructions, leading to concern for their clinical application.
no code implementations • 4 Feb 2021 • Chi Zhang, Jason M. Bartell, Jonathan C. Karsch, Isaiah Gray, Gregory D. Fuchs
In addition, we study the near-field and time-resolved characteristics of our signal and find that our instrument possesses a spatial resolution on the scale of 100 nm and a temporal resolution below 100 ps.
Mesoscale and Nanoscale Physics Materials Science
no code implementations • 27 Jan 2021 • Xiang-Rong Sheng, Liqin Zhao, Guorui Zhou, Xinyao Ding, Binding Dai, Qiang Luo, Siran Yang, Jingshan Lv, Chi Zhang, Hongbo Deng, Xiaoqiang Zhu
Concretely, STAR has the star topology, which consists of the shared centered parameters and domain-specific parameters.
1 code implementation • 12 Jan 2021 • Wenyu Ouyang, Kathryn Lawson, Dapeng Feng, Lei Ye, Chi Zhang, Chaopeng Shen
However, dammed basins must be present in the training dataset.
no code implementations • 7 Jan 2021 • Shuwei Shen, Mengjuan Xu, Fan Zhang, Pengfei Shao, Honghong Liu, Liang Xu, Chi Zhang, Peng Liu, Zhihong Zhang, Peng Yao, Ronald X. Xu
At the network search stage, the DCNNs are fine-tuned with the full training set in order to select the model with the highest BACC.
no code implementations • 5 Jan 2021 • Chi Zhang, Guankai Li, Guosheng Lin, Qingyao Wu, Rui Yao
Image co-segmentation is an active computer vision task that aims to segment the common objects from a set of images.
no code implementations • 1 Jan 2021 • Chi Zhang, Sirui Xie, Baoxiong Jia, Yixin Zhu, Ying Nian Wu, Song-Chun Zhu
We further show that the algebraic representation learned can be decoded by isomorphism and used to generate an answer.
no code implementations • 1 Jan 2021 • Benyi Hu, Chi Zhang, Yuehu Liu, Le Wang, Li Liu
Long-tailed visual class recognition poses significant challenges to traditional machine learning and emerging deep networks due to its inherent class imbalance.
no code implementations • 1 Jan 2021 • Pengtao Dang, Wennan Chang, Haiqi Zhu, Changlin Wan, Tong Zhao, Tingbo Guo, Paul Salama, Sha Cao, Chi Zhang
In this work, we first organize the general MLLRR problem into three subproblems based on different low rank properties , and we argue that most of existing efforts focus on only one category, which leaves the other two unsolved.
no code implementations • 1 Jan 2021 • Chi Zhang, Sanmukh Rao Kuppannagari, Viktor Prasanna
The goal of Offline Reinforcement Learning (RL) is to address this problem by learning effective policies using previously collected datasets.
no code implementations • 28 Dec 2020 • Xiaoyu Chen, Chi Zhang, Guosheng Lin, Jing Han
Moreover, when we use our network to handle the long-tail problem in a fully supervised point cloud segmentation dataset, it can also effectively boost the performance of the few-shot classes.
no code implementations • 21 Dec 2020 • Rui Chen, Liang Li, Kaiping Xue, Chi Zhang, Miao Pan, Yuguang Fang
To address these challenges, in this paper, we attempt to take FL into the design of future wireless networks and develop a novel joint design of wireless transmission and weight quantization for energy efficient FL over mobile devices.
no code implementations • 3 Dec 2020 • Filippo Maria Gambetta, Chi Zhang, Markus Hennrich, Igor Lesanovsky, Weibin Li
Conical intersections between electronic potential energy surfaces are paradigmatic for the study of non-adiabatic processes in the excited states of large molecules.
Atomic Physics Quantum Physics
1 code implementation • NeurIPS 2020 • Armand Comas, Chi Zhang, Zlatan Feric, Octavia Camps, Rose Yu
Missing data poses significant challenges while learning representations of video sequences.
no code implementations • 16 Nov 2020 • Zhen Yang, Chi Zhang, Huiming Guo, Zhaoxiang Zhang
In this paper, we propose a manual-label free 3D detection algorithm that leverages the CARLA simulator to generate a large amount of self-labeled training samples and introduces a novel Domain Adaptive VoxelNet (DA-VoxelNet) that can cross the distribution gap from the synthetic data to the real scenario.
no code implementations • 6 Sep 2020 • Heng-Li Liu, Quan-Lin Li, Chi Zhang
In this paper, we discuss an interesting but challenging bilateral stochastically matching problem: A more general matched queue with matching batch pair (m, n) and two types (i. e., types A and B) of impatient customers, where the arrivals of A- and B-customers are both Poisson processes, m A-customers and n B-customers are matched as a group which leaves the system immediately, and the customers' impatient behavior is to guarantee the stability of the system.
no code implementations • 24 Aug 2020 • Chi Zhang, Philip Odonkor, Shuai Zheng, Hamed Khorasgani, Susumu Serita, Chetan Gupta
In this paper, we propose a novel Deep Reinforcement Learning approach to solve the dynamic dispatching problem in mining.
no code implementations • 22 Aug 2020 • Jialun Liu, Jingwei Zhang, Yi Yang, Wenhui Li, Chi Zhang, Yifan Sun
With slight modifications, MBJ is applicable for two fundamental visual recognition tasks, \emph{i. e.}, deep image classification and deep metric learning (on long-tailed data).
Ranked #34 on
Long-tail Learning
on CIFAR-100-LT (ρ=100)
no code implementations • 12 Aug 2020 • Xin Sun, Chi Zhang, Guosheng Lin, Keck-Voon Ling
A typical challenge that hinders their real-world applications is that unknown samples may be fed into the system during the testing phase, but traditional deep neural networks will wrongly recognize these unknown samples as one of the known classes.
1 code implementation • 31 Jul 2020 • Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang
Low rank representation of binary matrix is powerful in disentangling sparse individual-attribute associations, and has received wide applications.
1 code implementation • NeurIPS 2020 • Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang
Boolean tensor has been broadly utilized in representing high dimensional logical data collected on spatial, temporal and/or other relational domains.
no code implementations • 21 Jul 2020 • Chi Zhang, Ryan Marcus, Anat Kleiman, Olga Papaemmanouil
In this extended abstract, we propose a new technique for query scheduling with the explicit goal of reducing disk reads and thus implicitly increasing query performance.
no code implementations • 19 Jul 2020 • Wennan Chang, Changlin Wan, Yong Zang, Chi Zhang, Sha Cao
Identifying relationships between molecular variations and their clinical presentations has been challenged by the heterogeneous causes of a disease.
no code implementations • 4 Jul 2020 • Yun Li, Zechun Liu, Weiqun Wu, Haotian Yao, Xiangyu Zhang, Chi Zhang, Baoqun Yin
In this paper, a simple yet effective network pruning framework is proposed to simultaneously address the problems of pruning indicator, pruning ratio, and efficiency constraint.
1 code implementation • 23 Jun 2020 • Armand Comas-Massagué, Chi Zhang, Zlatan Feric, Octavia Camps, Rose Yu
Missing data poses significant challenges while learning representations of video sequences.
no code implementations • 8 Jun 2020 • Chi Zhang, Sanmukh Rao Kuppannagari, Viktor K. Prasanna
Furthermore, we propose to generate \emph{diverse} model rollouts by non-uniform sampling of the environment states such that the entropy of the model rollouts is maximized.
Model-based Reinforcement Learning
reinforcement-learning
+1
no code implementations • CVPR 2020 • Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen
We adopt the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance.
no code implementations • 23 May 2020 • Wennan Chang, Xinyu Zhou, Yong Zang, Chi Zhang, Sha Cao
Existing robust mixture regression methods suffer from outliers as they either conduct parameter estimation in the presence of outliers, or rely on prior knowledge of the level of outlier contamination.
2 code implementations • 25 Apr 2020 • Wenhe Zhang, Chi Zhang, Yixin Zhu, Song-Chun Zhu
To endow such a crucial cognitive ability to machine intelligence, we propose a dataset, Machine Number Sense (MNS), consisting of visual arithmetic problems automatically generated using a grammar model--And-Or Graph (AOG).
no code implementations • 20 Apr 2020 • Yixin Zhu, Tao Gao, Lifeng Fan, Siyuan Huang, Mark Edmonds, Hangxin Liu, Feng Gao, Chi Zhang, Siyuan Qi, Ying Nian Wu, Joshua B. Tenenbaum, Song-Chun Zhu
We demonstrate the power of this perspective to develop cognitive AI systems with humanlike common sense by showing how to observe and apply FPICU with little training data to solve a wide range of challenging tasks, including tool use, planning, utility inference, and social learning.
no code implementations • 26 Mar 2020 • Kai Qiao, Chi Zhang, Jian Chen, Linyuan Wang, Li Tong, Bin Yan
Except for deep network structure, the task or corresponding big dataset is also important for deep network models, but neglected by previous studies.
no code implementations • CVPR 2020 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
In this paper, we propose a cross-reference network (CRNet) for few-shot segmentation.
1 code implementation • CVPR 2020 • Xin Sun, Zhenning Yang, Chi Zhang, Guohao Peng, Keck-Voon Ling
A typical challenge is that unknown samples may be fed into the system during the testing phase and traditional deep neural networks will wrongly recognize the unknown sample as one of the known classes.
3 code implementations • 15 Mar 2020 • Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen
We employ the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance.
no code implementations • 13 Mar 2020 • Kai Qiao, Jian Chen, Linyuan Wang, Chi Zhang, Li Tong, Bin Yan
In this study, we proposed a new GAN-based Bayesian visual reconstruction method (GAN-BVRM) that includes a classifier to decode categories from fMRI data, a pre-trained conditional generator to generate natural images of specified categories, and a set of encoding models and evaluator to evaluate generated images.
1 code implementation • arXiv 2020 • Guangming Wang, Chi Zhang, Hesheng Wang, Jingchuan Wang, Yong Wang, Xinlei Wang
In the occluded region, as depth and camera motion can provide more reliable motion estimation, they can be used to instruct unsupervised learning of optical flow.
no code implementations • 29 Feb 2020 • Xing Fan, Hao Luo, Chi Zhang, Wei Jiang
Another challenge of RGB-infrared ReID is that the intra-person (images from the same person) discrepancy is often larger than the inter-person (images from different persons) discrepancy, so a dual-subspace pairing strategy is proposed to alleviate this problem.
10 code implementations • CVPR 2020 • Yifan Sun, Changmao Cheng, Yuhan Zhang, Chi Zhang, Liang Zheng, Zhongdao Wang, Yichen Wei
This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity $s_p$ and minimize the between-class similarity $s_n$.
Ranked #1 on
Face Verification
on IJB-C
(training dataset metric)
no code implementations • 12 Feb 2020 • Chi Zhang, Yong Sheng Soh, Ling Feng, Tianyi Zhou, Qianxiao Li
While current machine learning models have impressive performance over a wide range of applications, their large size and complexity render them unsuitable for tasks such as remote monitoring on edge devices with limited storage and computational power.
no code implementations • 6 Feb 2020 • Chi Zhang, Zeyu Wang, Abdollah Shafieezadeh
The proposed VoI analysis framework is applied for an optimal decision-making problem involving load testing of a truss bridge.
1 code implementation • NeurIPS 2019 • Chi Zhang, Baoxiong Jia, Feng Gao, Yixin Zhu, Hongjing Lu, Song-Chun Zhu
"Thinking in pictures," [1] i. e., spatial-temporal reasoning, effortless and instantaneous for humans, is believed to be a significant ability to perform logical induction and a crucial factor in the intellectual history of technology development.
1 code implementation • ECCV 2020 • Jiahao Li, Changhao Zhang, Ziyao Xu, Hangning Zhou, Chi Zhang
In this paper, we propose a novel learning-based pipeline for partially overlapping 3D point cloud registration.
no code implementations • 11 Oct 2019 • Chi Zhang, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K. Prasanna
Safety-aware exploration is another challenge in real systems since certain actions at particular states may result in catastrophic outcomes.
Model-based Reinforcement Learning
reinforcement-learning
+4
no code implementations • ICCV 2019 • Ruihang Chu, Yifan Sun, Yadong Li, Zheng Liu, Chi Zhang, Yichen Wei
This paper considers vehicle re-identification (re-ID) problem.
no code implementations • ICCV 2019 • Chi Zhang, Guosheng Lin, Fayao Liu, Jiushuang Guo, Qingyao Wu, Rui Yao
One-shot image segmentation aims to undertake the segmentation task of a novel class with only one training image available.
Ranked #62 on
Few-Shot Semantic Segmentation
on PASCAL-5i (5-Shot)
no code implementations • 25 Sep 2019 • Hamed Khorasgani, Chi Zhang, Chetan Gupta, Susumu Serita
Our method can learn complex policies to achieve long-term goals and at the same time it can be easily adjusted to address short-term requirements without retraining.
1 code implementation • 19 Sep 2019 • Zhaobing Kang, Wei Zou, Zheng Zhu, Chi Zhang, Hongxuan Ma
This paper presents a generic 6DOF camera pose estimation method, which can be used for both the pinhole camera and the fish-eye camera.
no code implementations • ICCV 2019 • Chuchu Han, Jiacheng Ye, Yunshan Zhong, Xin Tan, Chi Zhang, Changxin Gao, Nong Sang
The state-of-the-art methods train the detector individually, and the detected bounding boxes may be sub-optimal for the following re-ID task.
no code implementations • 12 Sep 2019 • Chi Zhang, Bryan Wilkinson, Ashwinkumar Ganesan, Tim Oates
Another way to remove that limitation, an optional classification layer, trained on manually annotated DoS attack tweets, to filter out non-attack tweets can be used to increase precision at the expense of recall.
no code implementations • 9 Sep 2019 • Changlin Wan, Wennan Chang, Tong Zhao, Mengya Li, Sha Cao, Chi Zhang
Boolean matrix factorization (BMF) aims to find an approximation of a binary matrix as the Boolean product of two low rank Boolean matrices, which could generate vast amount of information for the patterns of relationships between the features and samples.
1 code implementation • 21 Aug 2019 • Yuan Dong, Dawei Li, Chi Zhang, Chuhan Wu, Hong Wang, Ming Xin, Jianlin Cheng, Jian Lin
A significant novelty of the proposed RGAN is that it combines the supervised and regressional convolutional neural network (CNN) with the traditional unsupervised GAN, thus overcoming the common technical barrier in the traditional GANs, which cannot generate data associated with given continuous quantitative labels.
Computational Physics Materials Science Applied Physics
1 code implementation • 27 Jul 2019 • Kai Qiao, Chi Zhang, Jian Chen, Linyuan Wang, Li Tong, Bin Yan
Recently, visual encoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation.
no code implementations • 14 Jun 2019 • Chi Zhang, Qianxiao Li
Moreover, we show that the more local updating can reduce the overall communication, even for an infinity number of steps where each node is free to update its local model to near-optimality before exchanging information.
no code implementations • 16 May 2019 • Chi Zhang, Yuehu Liu, Ying Wu, Qilin Zhang, Le Wang
In the pipeline, the estimated shape is refined by the shape prior from the given depth map under the estimated pose.
no code implementations • 15 May 2019 • Zongliang Li, Chi Zhang, Gaofeng Meng, Yuehu Liu
Fog and haze are weathers with low visibility which are adversarial to the driving safety of intelligent vehicles equipped with optical sensors like cameras and LiDARs.
1 code implementation • CVPR 2019 • Chenyou Fan, Xiaofan Zhang, Shu Zhang, Wensheng Wang, Chi Zhang, Heng Huang
In this paper, we propose a novel end-to-end trainable Video Question Answering (VideoQA) framework with three major components: 1) a new heterogeneous memory which can effectively learn global context information from appearance and motion features; 2) a redesigned question memory which helps understand the complex semantics of question and highlights queried subjects; and 3) a new multimodal fusion layer which performs multi-step reasoning by attending to relevant visual and textual hints with self-updated attention.
Ranked #21 on
Visual Question Answering (VQA)
on MSVD-QA
no code implementations • CVPR 2019 • Jian Wang, Yunshan Zhong, Yachun Li, Chi Zhang, Yichen Wei
The estimation of 3D human body pose and shape from a single image has been extensively studied in recent years.
no code implementations • 1 Apr 2019 • Florian Knoll, Kerstin Hammernik, Chi Zhang, Steen Moeller, Thomas Pock, Daniel K. Sodickson, Mehmet Akcakaya
Both linear and non-linear methods are covered, followed by a discussion of recent efforts to further improve parallel imaging using machine learning, and specifically using artificial neural networks.
1 code implementation • CVPR 2019 • Yifan Sun, Qin Xu, Ya-Li Li, Chi Zhang, Yikang Li, Shengjin Wang, Jian Sun
The visibility awareness allows VPM to extract region-level features and compare two images with focus on their shared regions (which are visible on both images).
Ranked #14 on
Person Re-Identification
on Market-1501-C
no code implementations • 19 Mar 2019 • Kai Qiao, Jian Chen, Linyuan Wang, Chi Zhang, Lei Zeng, Li Tong, Bin Yan
Despite the hierarchically similar representations of deep network and human vision, visual information flows from primary visual cortices to high visual cortices and vice versa based on the bottom-up and top-down manners, respectively.
Neurons and Cognition
no code implementations • 17 Mar 2019 • Hao Luo, Xing Fan, Chi Zhang, Wei Jiang
Competition (or confrontation) is observed between the STN module and the ReID module, and two-stage training is applied to acquire a strong STNReID for partial ReID.
no code implementations • CVPR 2019 • Chi Zhang, Feng Gao, Baoxiong Jia, Yixin Zhu, Song-Chun Zhu
In this work, we propose a new dataset, built in the context of Raven's Progressive Matrices (RPM) and aimed at lifting machine intelligence by associating vision with structural, relational, and analogical reasoning in a hierarchical representation.
1 code implementation • CVPR 2019 • Chi Zhang, Guosheng Lin, Fayao Liu, Rui Yao, Chunhua Shen
Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets.
Ranked #65 on
Few-Shot Semantic Segmentation
on PASCAL-5i (5-Shot)
no code implementations • 23 Feb 2019 • Chi Zhang, Kai Qiao, Linyuan Wang, Li Tong, Guoen Hu, Ruyuan Zhang, Bin Yan
In this framework, we employ the transfer learning technique to incorporate a pre-trained DNN (i. e., AlexNet) and train a nonlinear mapping from visual features to brain activity.
no code implementations • 23 Jan 2019 • Guanghan Ning, Ping Liu, Xiaochuan Fan, Chi Zhang
Both the tasks of multi-person human pose estimation and pose tracking in videos are quite challenging.
no code implementations • 7 Jan 2019 • Jiahao Ding, Xiaoqi Qin, Wenjun Xu, Yanmin Gong, Chi Zhang, Miao Pan
Due to massive amounts of data distributed across multiple locations, distributed machine learning has attracted a lot of research interests.
no code implementations • 22 Dec 2018 • Chi Zhang, Xiaohan Duan, Linyuan Wang, Yongli Li, Bin Yan, Guoen Hu, Ruyuan Zhang, Li Tong
Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images.
no code implementations • 19 Dec 2018 • Zheng Chen, Xinli Yu, Chi Zhang, Jin Zhang, Cui Lin, Bo Song, Jianliang Gao, Xiaohua Hu, Wei-Shih Yang, Erjia Yan
Botnet, a group of coordinated bots, is becoming the main platform of malicious Internet activities like DDOS, click fraud, web scraping, spam/rumor distribution, etc.
no code implementations • 18 Dec 2018 • Karan Aggarwal, Onur Atan, Ahmed Farahat, Chi Zhang, Kosta Ristovski, Chetan Gupta
Classically, this problem has been posed in two different ways which are typically solved independently: (1) Remaining useful life (RUL) estimation as a long-term prediction task to estimate how much time is left in the useful life of the equipment and (2) Failure prediction (FP) as a short-term prediction task to assess the probability of a failure within a pre-specified time window.
no code implementations • 13 Dec 2018 • Chi Zhang, Yixin Zhu, Song-Chun Zhu
An unprecedented booming has been witnessed in the research area of artistic style transfer ever since Gatys et al. introduced the neural method.
no code implementations • 16 Oct 2018 • Xing Fan, Hao Luo, Xuan Zhang, Lingxiao He, Chi Zhang, Wei Jiang
Holistic person re-identification (ReID) has received extensive study in the past few years and achieves impressive progress.
no code implementations • 28 Sep 2018 • Yuan Dong, Chuhan Wu, Chi Zhang, Yingda Liu, Jianlin Cheng, Jian Lin
Moreover, given ubiquitous existence of topologies in materials, this work will stimulate widespread interests in applying deep learning algorithms to topological design of materials crossing atomic, nano-, meso-, and macro- scales.
Materials Science Computational Physics
no code implementations • 27 Sep 2018 • Shagan Sah, Chi Zhang, Thang Nguyen, Dheeraj Kumar Peri, Ameya Shringi, Raymond Ptucha
We leverage a sequence-to-sequence model to generate synthetic captions that have the same meaning for having a robust image generation.
no code implementations • 27 Sep 2018 • Shagan Sah, Dheeraj Peri, Ameya Shringi, Chi Zhang, Miguel Dominguez, Andreas Savakis, Ray Ptucha
Along with MMVR, we propose two improvements to the text conditioned image generation.
no code implementations • 26 Sep 2018 • Chi Zhang, Shagan Sah, Thang Nguyen, Dheeraj Peri, Alexander Loui, Carl Salvaggio, Raymond Ptucha
This paper introduces a sentence to vector encoding framework suitable for advanced natural language processing.
1 code implementation • 26 Sep 2018 • Chi Zhang, Thang Nguyen, Shagan Sah, Raymond Ptucha, Alexander Loui, Carl Salvaggio
Gradient control plays an important role in feed-forward networks applied to various computer vision tasks.
no code implementations • 26 Sep 2018 • Chi Zhang, Alexander Loui
In this study, we develop an unsupervised coarse-to-fine video analysis framework and prototype system to extract a salient object in a video sequence.
no code implementations • 12 Jul 2018 • Xingyu Liao, Lingxiao He, Zhouwang Yang, Chi Zhang
Video-based person re-identification (ReID) is a challenging problem, where some video tracks of people across non-overlapping cameras are available for matching.
no code implementations • 4 Jul 2018 • Nevrez Imamoglu, Wataru Shimoda, Chi Zhang, Yuming Fang, Asako Kanezaki, Keiji Yanai, Yoshifumi Nishida
Bottom-up and top-down visual cues are two types of information that helps the visual saliency models.
1 code implementation • CVPR 2018 • Maoke Yang, Kun Yu, Chi Zhang, Zhiwei Li, Kuiyuan Yang
To this end, we propose Densely connected Atrous Spatial Pyramid Pooling (DenseASPP), which connects a set of atrous convolutional layers in a dense way, such that it generates multi-scale features that not only cover a larger scale range, but also cover that scale range densely, without significantly increasing the model size.
Ranked #5 on
Semantic Segmentation
on SkyScapes-Dense
no code implementations • 21 May 2018 • Yi Xu, Shenghuo Zhu, Sen yang, Chi Zhang, Rong Jin, Tianbao Yang
Learning with a {\it convex loss} function has been a dominating paradigm for many years.
no code implementations • 24 Jan 2018 • Kui Zhao, Yuechuan Li, Chi Zhang, Cheng Yang, Huan Xu
By leveraging the mixture layer, the proposed method can adaptively update states according to the similarities between encoded inputs and prototype vectors, leading to a stronger capacity in assimilating sequences with multiple patterns.
no code implementations • 16 Jan 2018 • Chi Zhang, Kai Qiao, Linyuan Wang, Li Tong, Ying Zeng, Bin Yan
Without semantic prior information, we present a novel method to reconstruct nature images from fMRI signals of human visual cortex based on the computation model of convolutional neural network (CNN).
no code implementations • 2 Jan 2018 • Kai Qiao, Chi Zhang, Linyuan Wang, Bin Yan, Jian Chen, Lei Zeng, Li Tong
We firstly employed the CapsNet to train the nonlinear mapping from image stimuli to high-level capsule features, and from high-level capsule features to image stimuli again in an end-to-end manner.
no code implementations • 27 Dec 2017 • Zhimeng Zhang, Jia-Nan Wu, Xuan Zhang, Chi Zhang
Although many methods perform well in single camera tracking, multi-camera tracking remains a challenging problem with less attention.
no code implementations • 4 Dec 2017 • Qizheng He, Jia-Nan Wu, Gang Yu, Chi Zhang
Another contribution is that we show with a deep learning based appearance model, it is easy to associate detections of the same object efficiently and also with high accuracy.
6 code implementations • 22 Nov 2017 • Xuan Zhang, Hao Luo, Xing Fan, Weilai Xiang, Yixiao Sun, Qiqi Xiao, Wei Jiang, Chi Zhang, Jian Sun
In this paper, we propose a novel method called AlignedReID that extracts a global feature which is jointly learned with local features.
Ranked #1 on
Person Re-Identification
on CUHK-SYSU
1 code implementation • 2 Oct 2017 • Qiqi Xiao, Hao Luo, Chi Zhang
Person re-identification (ReID) is an important task in computer vision.
no code implementations • 24 Sep 2017 • Siyi Li, Tianbo Liu, Chi Zhang, Dit-yan Yeung, Shaojie Shen
While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process.
no code implementations • 3 Jul 2017 • Chi Zhang, Rui Yao, Jinpeng Cai
According to the results from our experiments, our CNN model is able to accurately estimate different people's gaze under various lighting conditions by different devices.
1 code implementation • 1 Mar 2017 • Nevrez Imamoglu, Chi Zhang, Wataru Shimoda, Yuming Fang, Boxin Shi
As prior knowledge of objects or object features helps us make relations for similar objects on attentional tasks, pre-trained deep convolutional neural networks (CNNs) can be used to detect salient objects on images regardless of the object class is in the network knowledge or not.
no code implementations • 24 Nov 2016 • Zheqian Chen, Chi Zhang, Zhou Zhao, Deng Cai
The challenges in this task are the lexical gaps between questions for the word ambiguity and word mismatch problem.
no code implementations • 18 Nov 2016 • Qiqi Xiao, Kelei Cao, Haonan Chen, Fangyue Peng, Chi Zhang
Building on the idea that identity classification, attribute recognition and re- identification share the same mid-level semantic representations, they can be trained sequentially by fine-tuning one based on another.
no code implementations • CVPR 2016 • Chi Zhang, Zhiwei Li, Rui Cai, Hongyang Chao, Yong Rui
In this paper, we propose an RGB-D camera localization approach which takes an effective geometry constraint, i. e. silhouette consistency, into consideration.
no code implementations • 22 Mar 2016 • Zhen Dong, Su Jia, Chi Zhang, Mingtao Pei
To sufficiently discover the useful information contained in face videos, we present a novel network architecture called input aggregated network which is able to learn fixed-length representations for variable-length face videos.
no code implementations • ICCV 2015 • Yanhua Cheng, Rui Cai, Chi Zhang, Zhiwei Li, Xin Zhao, Kaiqi Huang, Yong Rui
The reasons are in two-fold: (1) existing similarity measures are sensitive to object pose and scale changes, as well as intra-class variations; and (2) effectively fusing RGB and depth cues is still an open problem.
no code implementations • ICCV 2015 • Chi Zhang, Zhiwei Li, Yanhua Cheng, Rui Cai, Hongyang Chao, Yong Rui
We present a novel global stereo model designed for view interpolation.
no code implementations • 30 Nov 2015 • Cong Yao, Jia-Nan Wu, Xinyu Zhou, Chi Zhang, Shuchang Zhou, Zhimin Cao, Qi Yin
Different from focused texts present in natural images, which are captured with user's intention and intervention, incidental texts usually exhibit much more diversity, variability and complexity, thus posing significant difficulties and challenges for scene text detection and recognition algorithms.