20 code implementations • CVPR 2019 • Xiang Li, Wenhai Wang, Xiaolin Hu, Jian Yang
A building block called Selective Kernel (SK) unit is designed, in which multiple branches with different kernel sizes are fused using softmax attention that is guided by the information in these branches.
Ranked #98 on Image Classification on CIFAR-100 (using extra training data)
7 code implementations • NeurIPS 2020 • Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
Specifically, we merge the quality estimation into the class prediction vector to form a joint representation of localization quality and classification, and use a vector to represent arbitrary distribution of box locations.
Ranked #93 on Object Detection on COCO test-dev
5 code implementations • CVPR 2021 • Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
Such a property makes the distribution statistics of a bounding box highly correlated to its real localization quality.
Ranked #26 on Object Detection on COCO-O
2 code implementations • 28 Sep 2023 • Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, Xiaodong Deng, Yang Fan, Wenbin Ge, Yu Han, Fei Huang, Binyuan Hui, Luo Ji, Mei Li, Junyang Lin, Runji Lin, Dayiheng Liu, Gao Liu, Chengqiang Lu, Keming Lu, Jianxin Ma, Rui Men, Xingzhang Ren, Xuancheng Ren, Chuanqi Tan, Sinan Tan, Jianhong Tu, Peng Wang, Shijie Wang, Wei Wang, Shengguang Wu, Benfeng Xu, Jin Xu, An Yang, Hao Yang, Jian Yang, Shusheng Yang, Yang Yao, Bowen Yu, Hongyi Yuan, Zheng Yuan, Jianwei Zhang, Xingxuan Zhang, Yichang Zhang, Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu
Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans.
Ranked #3 on Multi-Label Text Classification on CC3M-TagMask
4 code implementations • CVPR 2019 • Jian Li, Yabiao Wang, Changan Wang, Ying Tai, Jianjun Qian, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang
In this paper, we propose a novel face detection network with three novel contributions that address three key aspects of face detection, including better feature learning, progressive loss design and anchor assign based data augmentation, respectively.
Ranked #1 on Face Detection on FDDB
9 code implementations • 7 Jun 2018 • Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang
To address these problems, we propose a novel Progressive Scale Expansion Network (PSENet), designed as a segmentation-based detector with multiple predictions for each text instance.
Ranked #12 on Scene Text Detection on ICDAR 2017 MLT
3 code implementations • 23 May 2019 • Xiang Li, Xiaolin Hu, Jian Yang
The Convolutional Neural Networks (CNNs) generate the feature representation of complex objects by collecting hierarchical and different parts of semantic sub-features.
Ranked #739 on Image Classification on ImageNet
3 code implementations • 18 Feb 2019 • Max Allan, Alex Shvets, Thomas Kurmann, Zichen Zhang, Rahul Duggal, Yun-Hsuan Su, Nicola Rieke, Iro Laina, Niveditha Kalavakonda, Sebastian Bodenstedt, Luis Herrera, Wenqi Li, Vladimir Iglovikov, Huoling Luo, Jian Yang, Danail Stoyanov, Lena Maier-Hein, Stefanie Speidel, Mahdi Azizian
In mainstream computer vision and machine learning, public datasets such as ImageNet, COCO and KITTI have helped drive enormous improvements by enabling researchers to understand the strengths and limitations of different algorithms via performance comparison.
1 code implementation • 17 May 2020 • Fanzhen Liu, Shan Xue, Jia Wu, Chuan Zhou, Wenbin Hu, Cecile Paris, Surya Nepal, Jian Yang, Philip S. Yu
As communities represent similar opinions, similar functions, similar purposes, etc., community detection is an important and extremely useful tool in both scientific inquiry and data analytics.
1 code implementation • ICCV 2023 • YuXuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang, Xiang Li
To the best of our knowledge, this is the first time that large and selective kernel mechanisms have been explored in the field of remote sensing object detection.
Ranked #1 on Semantic Segmentation on UAVid
1 code implementation • 18 Mar 2024 • YuXuan Li, Xiang Li, Yimain Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang
While a considerable amount of research has been dedicated to remote sensing classification, object detection and semantic segmentation, most of these studies have overlooked the valuable prior knowledge embedded within remote sensing scenarios.
1 code implementation • 14 Jun 2021 • Xiaoxiao Ma, Jia Wu, Shan Xue, Jian Yang, Chuan Zhou, Quan Z. Sheng, Hui Xiong, Leman Akoglu
In this survey, we aim to provide a systematic and comprehensive review of the contemporary deep learning techniques for graph anomaly detection.
4 code implementations • CVPR 2018 • Yu Chen, Ying Tai, Xiaoming Liu, Chunhua Shen, Jian Yang
We present a novel deep end-to-end trainable Face Super-Resolution Network (FSRNet), which makes full use of the geometry prior, i. e., facial landmark heatmaps and parsing maps, to super-resolve very low-resolution (LR) face images without well-aligned requirement.
1 code implementation • 15 Dec 2023 • Senmao Li, Taihang Hu, Fahad Shahbaz Khan, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang
This finding inspired us to omit the encoder at certain adjacent time-steps and reuse cyclically the encoder features in the previous time-steps for the decoder.
1 code implementation • 20 May 2022 • Xiang Li, Wenhai Wang, Lingfeng Yang, Jian Yang
Masked AutoEncoder (MAE) has recently led the trends of visual self-supervision area by an elegant asymmetric encoder-decoder design, which significantly optimizes both the pre-training efficiency and fine-tuning accuracy.
Ranked #37 on Object Detection on COCO minival
1 code implementation • CVPR 2017 • Ying Tai, Jian Yang, Xiaoming Liu
Specifically, residual learning is adopted, both in global and local manners, to mitigate the difficulty of training very deep networks; recursive learning is used to control the model parameters while increasing the depth.
Ranked #10 on Video Super-Resolution on MSU Video Upscalers: Quality Enhancement (VMAF metric)
1 code implementation • 11 Mar 2024 • YuXuan Li, Xiang Li, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
Ranked #1 on 2D Object Detection on SARDet-100K (using extra training data)
2 code implementations • ICCV 2017 • Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu
We apply MemNet to three image restoration tasks, i. e., image denosing, super-resolution and JPEG deblocking.
2 code implementations • ICCV 2021 • Kun Wang, Zhenyu Zhang, Zhiqiang Yan, Xiang Li, Baobei Xu, Jun Li, Jian Yang
Monocular depth estimation aims at predicting depth from a single image or video.
1 code implementation • CVPR 2021 • Zhenyu Zhang, Yanhao Ge, Renwang Chen, Ying Tai, Yan Yan, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang
Non-parametric face modeling aims to reconstruct 3D face only from images without shape assumptions.
1 code implementation • 7 Sep 2020 • Zezhou Sun, Banghe Wu, Cheng-Zhong Xu, Sanjay E. Sarma, Jian Yang, Hui Kong
We propose an integrated approach to active exploration by exploiting the Cartographer method as the base SLAM module for submap creation and performing efficient frontier detection in the geometrically co-aligned submaps induced by graph optimization.
1 code implementation • 29 Nov 2022 • Zheng Li, Xiang Li, Lingfeng Yang, Borui Zhao, RenJie Song, Lei Luo, Jun Li, Jian Yang
In this paper, we propose a simple curriculum-based technique, termed Curriculum Temperature for Knowledge Distillation (CTKD), which controls the task difficulty level during the student's learning career through a dynamic and learnable temperature.
1 code implementation • CVPR 2021 • Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang
Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection.
1 code implementation • 14 May 2019 • Sheng Wan, Chen Gong, Ping Zhong, Bo Du, Lefei Zhang, Jian Yang
To alleviate this shortcoming, we consider employing the recently proposed Graph Convolutional Network (GCN) for hyperspectral image classification, as it can conduct the convolution on arbitrarily structured non-Euclidean data and is applicable to the irregular image regions represented by graph topological information.
1 code implementation • 18 Dec 2023 • Bing Wang, Changyu Ren, Jian Yang, Xinnian Liang, Jiaqi Bai, Linzheng Chai, Zhao Yan, Qian-Wen Zhang, Di Yin, Xing Sun, Zhoujun Li
Our framework comprises a core decomposer agent for Text-to-SQL generation with few-shot chain-of-thought reasoning, accompanied by two auxiliary agents that utilize external tools or models to acquire smaller sub-databases and refine erroneous SQL queries.
3 code implementations • CVPR 2021 • Zongyan Han, ZhenYong Fu, Shuo Chen, Jian Yang
To tackle this issue, we propose to integrate the generation model with the embedding model, yielding a hybrid GZSL framework.
1 code implementation • CVPR 2022 • Lingfeng Yang, Xiang Li, RenJie Song, Borui Zhao, Juntian Tao, Shihao Zhou, Jiajun Liang, Jian Yang
Therefore, it is helpful to leverage additional information, e. g., the locations and dates for data shooting, which can be easily accessible but rarely exploited.
1 code implementation • 10 Dec 2023 • Zhengxue Wang, Zhiqiang Yan, Jian Yang
Recent image guided DSR approaches mainly focus on spatial domain to rebuild depth structure.
1 code implementation • 5 Mar 2024 • Zheng Li, Xiang Li, Xinyi Fu, Xin Zhang, Weiqiang Wang, Shuo Chen, Jian Yang
To our best knowledge, we are the first to (1) perform unsupervised domain-specific prompt-driven knowledge distillation for CLIP, and (2) establish a practical pre-storing mechanism of text features as shared class vectors between teacher and student.
Ranked #1 on Prompt Engineering on Oxford-IIIT Pet Dataset
1 code implementation • 26 Mar 2024 • Gan Pei, Jiangning Zhang, Menghan Hu, Zhenyu Zhang, Chengjie Wang, Yunsheng Wu, Guangtao Zhai, Jian Yang, Chunhua Shen, DaCheng Tao
In addition to the advancements in deepfake generation, corresponding detection technologies need to continuously evolve to regulate the potential misuse of deepfakes, such as for privacy invasion and phishing attacks.
1 code implementation • 20 Aug 2020 • Hui Lv, Chuanwei Zhou, Chunyan Xu, Zhen Cui, Jian Yang
In addition, in order to fully utilize the spatial context information, the immediate semantics are directly derived from the segment representations.
Anomaly Detection In Surveillance Videos Video Anomaly Detection
2 code implementations • NAACL 2022 • Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou
Standard automatic metrics, e. g. BLEU, are not reliable for document-level MT evaluation.
4 code implementations • CVPR 2018 • Jifeng Wang, Xiang Li, Le Hui, Jian Yang
Specifically, a shadow image is fed into the first generator which produces a shadow detection mask.
Ranked #3 on RGB Salient Object Detection on ISTD
1 code implementation • 25 Jul 2022 • Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang
In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA-Depth) by learning the scale invariance of the scene depth.
1 code implementation • 2 Apr 2024 • Rui Xie, Ying Tai, Kai Zhang, Zhenyu Zhang, Jun Zhou, Jian Yang
Blind super-resolution methods based on stable diffusion showcase formidable generative capabilities in reconstructing clear high-resolution images with intricate details from low-resolution inputs.
1 code implementation • ICCV 2021 • Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang
In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors.
Ranked #3 on 3D Place Recognition on Oxford RobotCar Dataset
1 code implementation • 16 Dec 2022 • Yimian Dai, Xiang Li, Fei Zhou, Yulei Qian, Yaohong Chen, Jian Yang
Finally, we present a new research benchmark for infrared small target detection, consisting of the SIRST-V2 dataset of real-world, high-resolution single-frame targets, the normalized contrast evaluation metric, and the DeepInfrared toolkit for detection.
1 code implementation • 28 Mar 2023 • Senmao Li, Joost Van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang
A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images.
Ranked #7 on Text-based Image Editing on PIE-Bench
1 code implementation • CVPR 2022 • Jinchao Yang, Fei Guo, Shuo Chen, Jun Li, Jian Yang
Given a source product, a target product, and an art style image, our method produces a neural warping field that warps the source shape to imitate the geometric style of the target and a neural texture transformation network that transfers the artistic style to the warped source product.
2 code implementations • ICCV 2017 • Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang
In contrast, human vision is able to predict poses by exploiting geometric constraints of joint inter-connectivity.
Ranked #15 on Pose Estimation on MPII Human Pose
1 code implementation • 22 May 2023 • Zheng Li, YuXuan Li, Penghai Zhao, RenJie Song, Xiang Li, Jian Yang
Diffusion models have recently achieved astonishing performance in generating high-fidelity photo-realistic images.
2 code implementations • 6 Apr 2019 • Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer
This paper introduces a new database of voice recordings with the goal of supporting research on vulnerabilities and protection of voice-controlled systems (VCSs).
2 code implementations • 18 Mar 2020 • Yuan Gong, Jian Yang, Christian Poellabauer
With the rapidly growing number of security-sensitive systems that use voice as the primary input, it becomes increasingly important to address these systems' potential vulnerability to replay attacks.
1 code implementation • CVPR 2023 • Senmao Li, Joost Van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang
In the second step, based on the well-trained multi-class 3D-aware GAN architecture, that preserves view-consistency, we construct a 3D-aware I2I translation system.
1 code implementation • CVPR 2023 • Ziyue Zhu, Qiang Meng, Xiao Wang, Ke Wang, Liujiang Yan, Jian Yang
For the loss design, we propose the COMLoss to dynamically predict object-level difficulties and emphasize objects of different difficulties based on training stages.
1 code implementation • NeurIPS 2021 • Le Hui, Lingpeng Wang, Mingmei Cheng, Jin Xie, Jian Yang
The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified.
1 code implementation • 24 Jun 2019 • Yalong Liu, Jie Li, Ying Wang, Miaomiao Wang, Xianjun Li, Zhicheng Jiao, Jian Yang, Xingbo Gao
In this paper, we construct an efficient two-stage PWML semantic segmentation network based on the characteristics of the lesion, called refined segmentation R-CNN (RS RCNN).
1 code implementation • 26 Apr 2023 • Bing Wang, Xinnian Liang, Jian Yang, Hui Huang, Shuangzhi Wu, Peihao Wu, Lu Lu, Zejun Ma, Zhoujun Li
Large Language Models (LLMs) are constrained by their inability to process lengthy inputs, resulting in the loss of critical historical information.
1 code implementation • 7 Jan 2021 • Le Hui, Mingmei Cheng, Jin Xie, Jian Yang
In this paper, we develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition, which can obtain good performance and reduce computation memory and inference time.
1 code implementation • 25 Jul 2022 • Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang
Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area.
1 code implementation • 23 Feb 2022 • Yaqi Shen, Le Hui, Haobo Jiang, Jin Xie, Jian Yang
In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration.
1 code implementation • CVPR 2023 • Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann
Embedded with the inlier/outlier label, the posterior feature distribution is label-dependent and discriminative.
1 code implementation • 12 Jan 2024 • Minxing Luo, Wentao Cheng, Jian Yang
Our method tracks handle points more precisely by utilizing the feature map of the rotated images, thus ensuring precise optimization and high image fidelity.
1 code implementation • 18 Aug 2019 • Jinshan Pan, Yang Liu, Deqing Sun, Jimmy Ren, Ming-Ming Cheng, Jian Yang, Jinhui Tang
We present a simple and effective image super-resolution algorithm that imposes an image formation constraint on the deep neural networks via pixel substitution.
1 code implementation • 31 Jan 2018 • Xi Cheng, Xiang Li, Ying Tai, Jian Yang
Single image super resolution is a very important computer vision task, with a wide range of applications.
Ranked #34 on Image Super-Resolution on BSD100 - 4x upscaling
1 code implementation • ECCV 2020 • Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang
Starting from the low-resolution point clouds, with the bilateral interpolation and max-pooling operations, the deconvolution network can progressively output high-resolution local and global feature maps.
1 code implementation • 6 Feb 2018 • Wenhai Wang, Xiang Li, Jian Yang, Tong Lu
Basing on the analysis by revealing the equivalence of modern networks, we find that both ResNet and DenseNet are essentially derived from the same "dense topology", yet they only differ in the form of connection -- addition (dubbed "inner link") vs. concatenation (dubbed "outer link").
1 code implementation • ICCV 2021 • Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang
Specifically, in our clustering network, we first jointly learn a soft point-superpoint association map from the coordinate and feature spaces of point clouds, where each point is assigned to the superpoint with a learned weight.
1 code implementation • 18 Jan 2024 • Xianfu Cheng, Weixiao Zhou, Xiang Li, Xiaoming Chen, Jian Yang, Tongliang Li, Zhoujun Li
In this work, we propose the VIsion Permutable extractor for fast and efficient scene Text Recognition (VIPTR), which achieves an impressive balance between high performance and rapid inference speeds in the domain of STR.
1 code implementation • 8 Feb 2024 • Senmao Li, Joost Van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang
However, these models struggle to effectively suppress the generation of undesired content, which is explicitly requested to be omitted from the generated image in the prompt.
1 code implementation • 16 Apr 2021 • Mingmei Cheng, Le Hui, Jin Xie, Jian Yang
In order to reduce the number of annotated labels, we propose a semi-supervised semantic point cloud segmentation network, named SSPC-Net, where we train the semantic segmentation network by inferring the labels of unlabeled points from the few annotated 3D points.
1 code implementation • CVPR 2022 • Ziqiang Xu, Chunyan Xu, Zhen Cui, Xiangwei Zheng, Jian Yang
The classic active contour model raises a great promising solution to polygon-based object extraction with the progress of deep learning recently.
1 code implementation • 14 Mar 2022 • Lingfeng Yang, Xiang Li, Borui Zhao, RenJie Song, Jian Yang
In semantic segmentation, RM also surpasses the baseline and CutMix by 1. 9 and 1. 1 mIoU points under UperNet on ADE20K, respectively.
1 code implementation • NeurIPS 2023 • Lingfeng Yang, Yueze Wang, Xiang Li, Xinlong Wang, Jian Yang
Previous works have suggested that incorporating visual prompts, such as colorful boxes or circles, can improve the ability of models to recognize objects of interest.
1 code implementation • 19 Apr 2022 • Guangwei Gao, Zixiang Xu, Juncheng Li, Jian Yang, Tieyong Zeng, Guo-Jun Qi
Then, we design an efficient Feature Refinement Module (FRM) to enhance the encoded features.
1 code implementation • 26 Sep 2019 • Xi Cheng, Zhen-Yong Fu, Jian Yang
The prevalence of digital sensors, such as digital cameras and mobile phones, simplifies the acquisition of photos.
1 code implementation • 12 Mar 2023 • Qi Zhao, Bai Yan, Taiwei Hu, Xianglong Chen, Qiqi Duan, Jian Yang, Yuhui Shi
In response, this paper proposes AutoOptLib, the first platform for accessible automated design of metaheuristic optimizers.
1 code implementation • CVPR 2023 • Yaqi Shen, Le Hui, Jin Xie, Jian Yang
In our superpoint generation module, we utilize the bidirectional flow information at the previous iteration to obtain the matching points of points and superpoint centers for soft point-to-superpoint association construction, in which the superpoints are generated for pairwise point clouds.
1 code implementation • 22 Jul 2023 • Yuwei Yin, Yazheng Yang, Jian Yang, Qi Liu
To tackle these issues, we propose FinPT and FinBench: the former is a novel approach for financial risk prediction that conduct Profile Tuning on large pretrained foundation models, and the latter is a set of high-quality datasets on financial risks such as default, fraud, and churn.
1 code implementation • 18 Oct 2022 • Fanzhen Liu, Xiaoxiao Ma, Jia Wu, Jian Yang, Shan Xue, Amin Beheshti, Chuan Zhou, Hao Peng, Quan Z. Sheng, Charu C. Aggarwal
To bridge the gaps, this paper devises a novel Data Augmentation-based Graph Anomaly Detection (DAGAD) framework for attributed graphs, equipped with three specially designed modules: 1) an information fusion module employing graph neural network encoders to learn representations, 2) a graph data augmentation module that fertilizes the training set with generated samples, and 3) an imbalance-tailored learning module to discriminate the distributions of the minority (anomalous) and majority (normal) classes.
1 code implementation • 2 Dec 2022 • Tao Zhou, Yi Zhou, Chen Gong, Jian Yang, Yu Zhang
In this paper, we propose a novel Feature Aggregation and Propagation Network (FAP-Net) for camouflaged object detection.
1 code implementation • IJCNLP 2019 • Ze Yang, Wei Wu, Jian Yang, Can Xu, Zhoujun Li
Since the paired data now is no longer enough to train a neural generation model, we consider leveraging the large scale of unpaired data that are much easier to obtain, and propose response generation with both paired and unpaired data.
1 code implementation • ICCV 2021 • Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang
Based on the reward function, for each state, we then construct a fused score function to evaluate the sampled transformations, where we weight the current and future rewards of the transformations.
2 code implementations • 23 Oct 2020 • Yali Peng, Yue Cao, Shigang Liu, Jian Yang, WangMeng Zuo
To cope with this issue, this paper presents a multi-level wavelet residual network (MWRN) architecture as well as a progressive training (PTMWRN) scheme to improve image denoising performance.
1 code implementation • 17 Dec 2023 • Xiaoqi An, Lin Zhao, Chen Gong, Nannan Wang, Di Wang, Jian Yang
In this paper, we address the following question: "Only sparse human keypoint locations are detected for human pose estimation, is it really necessary to describe the whole image in a dense, high-resolution manner?"
1 code implementation • 14 Jan 2021 • Qizhou Wang, Bo Han, Tongliang Liu, Gang Niu, Jian Yang, Chen Gong
The drastic increase of data quantity often brings the severe decrease of data quality, such as incorrect label annotations, which poses a great challenge for robustly training Deep Neural Networks (DNNs).
1 code implementation • 23 Mar 2023 • Xiang Li, Ge Wu, Lingfeng Yang, Wenhai Wang, RenJie Song, Jian Yang
The various types of elements, deposited in the training history, are a large amount of wealth for improving learning deep models.
1 code implementation • 6 Jul 2022 • Wenjie Li, Juncheng Li, Guangwei Gao, Jiantao Zhou, Jian Yang, Guo-Jun Qi
Recently, Transformer-based methods have shown impressive performance in single image super-resolution (SISR) tasks due to the ability of global feature extraction.
1 code implementation • 25 Mar 2019 • Lei Zhang, Shan-Shan Wang, Guang-Bin Huang, WangMeng Zuo, Jian Yang, David Zhang
The merits of the proposed MCTL are four-fold: 1) the concept of manifold criterion (MC) is first proposed as a measure validating the distribution matching across domains, and domain adaptation is achieved if the MC is satisfied; 2) the proposed MC can well guide the generation of the intermediate domain sharing similar distribution with the target domain, by minimizing the local domain discrepancy; 3) a global generative discrepancy metric (GGDM) is presented, such that both the global and local discrepancy can be effectively and positively reduced; 4) a simplified version of MCTL called MCTL-S is presented under a perfect domain generation assumption for more generic learning scenario.
2 code implementations • 3 Feb 2018 • Zixiang Ding, Rui Xia, Jianfei Yu, Xiang Li, Jian Yang
Deep neural networks have recently been shown to achieve highly competitive performance in many computer vision tasks due to their abilities of exploring in a much larger hypothesis space.
1 code implementation • 2 Sep 2021 • Guangwei Gao, Guoan Xu, Juncheng Li, Yi Yu, Huimin Lu, Jian Yang
Specifically, FBSNet employs a symmetrical encoder-decoder structure with two branches, semantic information branch and spatial detail branch.
1 code implementation • 21 Dec 2022 • Xing Su, Jian Yang, Jia Wu, Yuchen Zhang
In this paper, we construct a dual-layer graph (i. e., the news layer and the user layer) to extract multiple relations of news and users in social networks to derive rich information for detecting fake news.
1 code implementation • 27 Jun 2023 • Jiaqi Bai, Zhao Yan, Jian Yang, Xinnian Liang, Hongcheng Guo, Zhoujun Li
We propose Knowledgeable Prefix Tuning (KnowPrefix-Tuning), a two-stage tuning framework, bypassing the retrieval process in a knowledge-grounded conversation system by injecting prior knowledge into the lightweight knowledge prefix.
1 code implementation • 9 Jan 2024 • Hongcheng Guo, Jian Yang, Jiaheng Liu, Jiaqi Bai, Boyang Wang, Zhoujun Li, Tieqiao Zheng, Bo Zhang, Junran Peng, Qi Tian
Log anomaly detection is a key component in the field of artificial intelligence for IT operations (AIOps).
1 code implementation • 14 Jul 2021 • Yong Li, Lingjie Lao, Zhen Cui, Shiguang Shan, Jian Yang
To mitigate this issue, we propose the GraphJigsaw that constructs jigsaw puzzles at various stages in the classification network and solves the puzzles with the graph convolutional network (GCN) in a progressive manner.
1 code implementation • 21 Feb 2023 • Guoan Xu, Juncheng Li, Guangwei Gao, Huimin Lu, Jian Yang, Dong Yue
In the past decade, convolutional neural networks (CNNs) have shown prominence for semantic segmentation.
2 code implementations • ICCV 2023 • Renke Wang, Guimin Que, Shuo Chen, Xiang Li, Jun Li, Jian Yang
Our focus lies primarily on birds, a popular subject in 3D reconstruction, for which no existing single-view 3D transfer methods have been developed. The method we propose seeks to generate a 3D mesh shape and texture of a bird from two single-view images.
1 code implementation • 16 Nov 2021 • Di Chen, Andreas Doering, Shanshan Zhang, Jian Yang, Juergen Gall, Bernt Schiele
Video-based person re-identification (re-ID) is an important technique in visual surveillance systems which aims to match video snippets of people captured by different cameras.
Representation Learning Video-Based Person Re-Identification
1 code implementation • CVPR 2022 • Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele
The main challenge for this task is to achieve high-quality manual annotations on a large amount of training data.
2 code implementations • 12 Aug 2023 • Tongliang Li, Zixiang Wang, Linzheng Chai, Jian Yang, Jiaqi Bai, Yuwei Yin, Jiaheng Liu, Hongcheng Guo, Liqun Yang, Hebboul Zine el-abidine, Zhoujun Li
Cross-lingual open information extraction aims to extract structured information from raw text across multiple languages.
1 code implementation • 3 May 2021 • Haobo Jiang, Jin Xie, Jian Yang
Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.
1 code implementation • 22 Mar 2022 • Haobo Jiang, Jin Xie, Jian Yang
Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.
1 code implementation • 13 Oct 2022 • Jian Yang, Shaohan Huang, Shuming Ma, Yuwei Yin, Li Dong, Dongdong Zhang, Hongcheng Guo, Zhoujun Li, Furu Wei
Specifically, the target sequence is first translated into the source language and then tagged by a source NER model.
1 code implementation • 12 Sep 2023 • Haibo Chen, Lei Zhao, Jun Li, Jian Yang
To address this issue, we imitate the drawing process of humans and propose a Two-Stage Statistics-Aware Transformation (TSSAT) module, which first builds the global style foundation by aligning the global statistics of content and style features and then further enriches local style details by swapping the local statistics (instead of local features) in a patch-wise manner, significantly improving the stylization effects.
1 code implementation • 22 Oct 2019 • Yalong Liu, Jie Li, Miaomiao Wang, Zhicheng Jiao, Jian Yang, Xianjun Li
In this paper, a novel spatiotemporal transformation deep learning method called Trident Segmentation CNN (TS-CNN) is proposed to segment PWML in MR images.
1 code implementation • 11 Jul 2022 • Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei
Nonetheless, multilingual training is plagued by language interference degeneration in shared parameters because of the negative interference among different translation directions, especially on high-resource languages.
1 code implementation • CVPR 2023 • Yaqing Ding, Jian Yang, Viktor Larsson, Carl Olsson, Kalle Åström
One of the classical multi-view geometry problems is the so called P3P problem, where the absolute pose of a calibrated camera is determined from three 2D-to-3D correspondences.
4 code implementations • 28 Sep 2022 • Yang shen, Xuhao Sun, Xiu-Shen Wei, Qing-Yuan Jiang, Jian Yang
In this paper, we propose Suppression-Enhancing Mask based attention and Interactive Channel transformatiON (SEMICON) to learn binary hash codes for dealing with large-scale fine-grained image retrieval tasks.
1 code implementation • 28 Feb 2024 • Wei zhang, Hongcheng Guo, Anjie Le, Jian Yang, Jiaheng Liu, Zhoujun Li, Tieqiao Zheng, Shi Xu, Runqiang Zang, Liangfan Zheng, Bo Zhang
Log parsing, which entails transforming raw log messages into structured templates, constitutes a critical phase in the automation of log analytics.
4 code implementations • CVPR 2019 • Xiang Li, Shuo Chen, Xiaolin Hu, Jian Yang
Theoretically, we find that Dropout would shift the variance of a specific neural unit when we transfer the state of that network from train to test.
1 code implementation • 29 Jul 2022 • Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li
Transformer structure, stacked by a sequence of encoder and decoder network layers, achieves significant development in neural machine translation.
1 code implementation • 13 Dec 2023 • Yanling Tian, Di Chen, Yunan Liu, Jian Yang, Shanshan Zhang
To the best of our knowledge, this is the first work that investigates how to support full-task pre-training using sub-task data.
1 code implementation • NeurIPS 2021 • Xiu-Shen Wei, Yang shen, Xuhao Sun, Han-Jia Ye, Jian Yang
Specifically, based on the captured visual representations by attention, we develop an encoder-decoder structure network of a reconstruction task to unsupervisedly distill high-level attribute-specific vectors from the appearance-specific visual representations without attribute annotations.
1 code implementation • 20 Dec 2022 • Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li
Inspired by the idea of Generative Adversarial Networks (GANs), we propose a GAN-style model for encoder-decoder pre-training by introducing an auxiliary discriminator, unifying the ability of language understanding and generation in a single model.
1 code implementation • 20 Feb 2023 • Zitai Qiu, Jia Wu, Jian Yang, Xing Su, Charu C. Aggarwal
This model addresses the heterogeneity of social media, and, with this graph, the information in social media can be used to capture structural information based on the properties of hyperbolic space.
1 code implementation • IJCAI 2022 • Yu-Yan Xu, Yang shen, Xiu-Shen Wei, Jian Yang
The task of webly-supervised fne-grained recognition is to boost recognition accuracy of classifying subordinate categories (e. g., different bird species)by utilizing freely available but noisy web data. As the label noises signifcantly hurt the network training, it is desirable to distinguish and eliminate noisy images.
1 code implementation • ACM Transactions on Multimedia Computing, Communications, and Applications 2024 • Mingyu Li, Tao Zhou, Zhuo Huang, Jian Yang, Jie Yang, Chen Gong
Nowadays, class-mismatch problem has drawn intensive attention in Semi-Supervised Learning (SSL), where the classes of labeled data are assumed to be only a subset of the classes of unlabeled data.
no code implementations • 1 Nov 2017 • Yu Chen, Chunhua Shen, Hao Chen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang
In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity.
no code implementations • 25 May 2018 • Huihui Fang, Jian Yang, Jianjun Zhu, Danni Ai, Yong Huang, Yurong Jiang, Hong Song, Yongtian Wang
The vascular branch was described using a vascular centerline extraction method with multi-probability fusion-based topology optimization.
no code implementations • 16 Apr 2018 • Jiatao Jiang, Chunyan Xu, Zhen Cui, Tong Zhang, Wenming Zheng, Jian Yang
As an analogy to a standard convolution kernel on image, Gaussian models implicitly coordinate those unordered vertices/nodes and edges in a local receptive field after projecting to the gradient space of Gaussian parameters.
no code implementations • 17 Mar 2018 • Eric C. Chi, Brian R. Gaines, Will Wei Sun, Hua Zhou, Jian Yang
Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data.
no code implementations • 27 Feb 2018 • Chaolong Li, Zhen Cui, Wenming Zheng, Chunyan Xu, Jian Yang
To encode dynamic graphs, the constructed multi-scale local graph convolution filters, consisting of matrices of local receptive fields and signal mappings, are recursively performed on structured graph data of temporal and spatial domain.
Ranked #1 on Skeleton Based Action Recognition on Florence 3D
no code implementations • 9 Feb 2018 • Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li
In distinguishment stage, a metric is exhaustively learned to try its best to distinguish both the adversarial pairs and the original training pairs.
no code implementations • 6 Dec 2017 • Le Hui, Xiang Li, Jiaxin Chen, Hongliang He, Chen Gong, Jian Yang
Unsupervised Image-to-Image Translation achieves spectacularly advanced developments nowadays.
no code implementations • 17 Nov 2017 • Chaolong Li, Zhen Cui, Wenming Zheng, Chunyan Xu, Rongrong Ji, Jian Yang
The motion analysis of human skeletons is crucial for human action recognition, which is one of the most active topics in computer vision.
no code implementations • 18 Jul 2017 • Zhen Cui, You yi Cai, Wen ming Zheng, Jian Yang
Visual object tracking is a challenging computer vision task with numerous real-world applications.
no code implementations • 12 Jul 2017 • Zheng Zhang, Yong Xu, Ling Shao, Jian Yang
In particular, the elaborate BDLRR is formulated as a joint optimization problem of shrinking the unfavorable representation from off-block-diagonal elements and strengthening the compact block-diagonal representation under the semi-supervised framework of low-rank representation.
no code implementations • NeurIPS 2016 • Xiang Li, Tao Qin, Jian Yang, Tie-Yan Liu
Based on the 2-Component shared embedding, we design a new RNN algorithm and evaluate it using the language modeling task on several benchmark datasets.
no code implementations • 15 Sep 2016 • Xiang Lyu, Will Wei Sun, Zhaoran Wang, Han Liu, Jian Yang, Guang Cheng
We consider the estimation and inference of graphical models that characterize the dependency structure of high-dimensional tensor-valued data.
no code implementations • 23 Feb 2016 • Zheng Zhang, Yong Xu, Jian Yang, Xuelong. Li, David Zhang
The main purpose of this article is to provide a comprehensive study and an updated review on sparse representation and to supply a guidance for researchers.
no code implementations • 22 Oct 2015 • Michael N. Katehakis, Jian Yang, Tingting Zhou
Inventory control with unknown demand distribution is considered, with emphasis placed on the case involving discrete nonperishable items.
no code implementations • 24 Aug 2015 • Jingen Ni, Jian Yang, Jie Chen, Cédric Richard, José Carlos M. Bermudez
Some system identification problems impose nonnegativity constraints on the parameters to estimate due to inherent physical characteristics of the unknown system.
no code implementations • 5 Jan 2015 • Jun Li, Heyou Chang, Jian Yang
Luckily, a simplified neural network module (SNNM) has been proposed to directly learn the discriminative dictionaries for avoiding the expensive inference.
no code implementations • 2 Sep 2014 • Jian Yang, Liqiu Meng
Map matching of GPS trajectories from a sequence of noisy observations serves the purpose of recovering the original routes in a road network.
no code implementations • 2 Sep 2014 • Jian Yang, Liqiu Meng
Map matching of the GPS trajectory serves the purpose of recovering the original route on a road network from a sequence of noisy GPS observations.
no code implementations • 20 Dec 2013 • Jun Li, Wei Luo, Jian Yang, Xiao-Tong Yuan
It is well known that direct training of deep neural networks will generally lead to poor results.
no code implementations • 6 May 2014 • Jian Yang, Jianjun Qian, Lei Luo, Fanlong Zhang, Yicheng Gao
Compared with the current regression methods, the proposed Nuclear Norm based Matrix Regression (NMR) model is more robust for alleviating the effect of illumination, and more intuitive and powerful for removing the structural noise caused by occlusion.
no code implementations • 7 Jul 2018 • Wenting Zhao, Chunyan Xu, Zhen Cui, Tong Zhang, Jiatao Jiang, Zhen-Yu Zhang, Jian Yang
In this paper, we aim to give a comprehensive analysis of when work matters by transforming different classical network structures to graph CNN, particularly in the basic graph recognition problem.
Ranked #3 on Graph Classification on IMDb-B
no code implementations • ECCV 2018 • Di Chen, Shanshan Zhang, Wanli Ouyang, Jian Yang, Ying Tai
In this work, we tackle the problem of person search, which is a challenging task consisted of pedestrian detection and person re-identification~(re-ID).
no code implementations • 8 Oct 2018 • Xi Cheng, Xiang Li, Jian Yang
Single image super resolution is of great importance as a low-level computer vision task.
no code implementations • 1 Nov 2018 • Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Wei Liu, Jian Yang
In a Co-LSTM unit, each sub-memory unit stores individual motion information, while this Co-LSTM unit selectively integrates and stores inter-related motion information between multiple interacting persons from multiple sub-memory units via the cell gate and co-memory cell, respectively.
Ranked #1 on Human Interaction Recognition on UT
no code implementations • 20 Feb 2012 • Meng Yang, Lei Zhang, Jian Yang, David Zhang
Recently the sparse representation based classification (SRC) has been proposed for robust face recognition (FR).
no code implementations • 11 Nov 2018 • Jiatao Jiang, Zhen Cui, Chunyan Xu, Jian Yang
In this work, we propose a Gaussian-induced convolution (GIC) framework to conduct local convolution filtering on irregular graphs.
Ranked #3 on Graph Classification on PTC
no code implementations • NeurIPS 2016 • Mohammad Saberian, Jose Costa Pereira, Can Xu, Jian Yang, Nuno Nvasconcelos
We argue that the intermediate mapping, e. g. boosting predictor, is preserving the discriminant aspects of the data and by controlling the dimension of this mapping it is possible to achieve discriminant low dimensional representations for the data.
no code implementations • CVPR 2018 • Shanshan Zhang, Jian Yang, Bernt Schiele
In this paper, we aim to propose a simple and compact method based on the FasterRCNN architecture for occluded pedestrian detection.
no code implementations • ECCV 2018 • Zhen-Yu Zhang, Zhen Cui, Chunyan Xu, Zequn Jie, Xiang Li, Jian Yang
In this paper, we propose a novel joint Task-Recursive Learning (TRL) framework for the closing-loop semantic segmentation and monocular depth estimation tasks.
Ranked #76 on Semantic Segmentation on NYU Depth v2
no code implementations • 11 Sep 2018 • Zhen Cui, Chunyan Xu, Wenming Zheng, Jian Yang
Visual relationship detection can bridge the gap between computer vision and natural language for scene understanding of images.
no code implementations • 20 Feb 2019 • Chen Gong, Hengmin Zhang, Jian Yang, DaCheng Tao
To address label insufficiency, we use a graph to bridge the data points so that the label information can be propagated from the scarce labeled examples to unlabeled examples along the graph edges.
no code implementations • 31 Mar 2019 • Botao Hao, Boxiang Wang, Pengyuan Wang, Jingfei Zhang, Jian Yang, Will Wei Sun
Tensors are becoming prevalent in modern applications such as medical imaging and digital marketing.
no code implementations • 5 Apr 2019 • Chen Gong, Tongliang Liu, Yuanyan Tang, Jian Yang, Jie Yang, DaCheng Tao
As a result, the intrinsic constraints among different candidate labels are deployed, and the disambiguated labels generated by RegISL are more discriminative and accurate than those output by existing instance-based algorithms.
no code implementations • 8 Apr 2019 • Chen Gong, DaCheng Tao, Xiaojun Chang, Jian Yang
More importantly, HyDEnT conducts propagation under the guidance of an ensemble of teachers.
no code implementations • 17 Apr 2019 • Zhen-Yu Zhang, Stéphane Lathuilière, Andrea Pilzer, Nicu Sebe, Elisa Ricci, Jian Yang
Our proposal is evaluated on the wellestablished KITTI dataset, where we show that our online method is competitive withstate of the art algorithms trained in a batch setting.
no code implementations • CVPR 2019 • Zhen-Yu Zhang, Zhen Cui, Chunyan Xu, Yan Yan, Nicu Sebe, Jian Yang
In this paper, we propose a novel Pattern-Affinitive Propagation (PAP) framework to jointly predict depth, surface normal and semantic segmentation.
Ranked #51 on Semantic Segmentation on NYU Depth v2
no code implementations • 16 Mar 2012 • Vijay Bharadwaj, Peiji Chen, Wenjing Ma, Chandrashekhar Nagarajan, John Tomlin, Sergei Vassilvitskii, Erik Vee, Jian Yang
Motivated by the problem of optimizing allocation in guaranteed display advertising, we develop an efficient, lightweight method of generating a compact {\em allocation plan} that can be used to guide ad server decisions.
Data Structures and Algorithms
no code implementations • 10 Jun 2019 • Chun-Mei Feng, Yong Xu, Zuoyong Li, Jian Yang
It performs Sparse Representation Fusion based on the Diverse Subset of training samples (SRFDS), which reduces the impact of randomness of the sample set and enhances the robustness of classification results.
no code implementations • ICCV 2019 • Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim
Recognizing objects from subcategories with very subtle differences remains a challenging task due to the large intra-class and small inter-class variation.
Ranked #18 on Fine-Grained Image Classification on NABirds (using extra training data)
Fine-Grained Image Classification Fine-Grained Visual Categorization
no code implementations • 26 Sep 2019 • Sheng Wan, Chen Gong, Ping Zhong, Shirui Pan, Guangyu Li, Jian Yang
In hyperspectral image (HSI) classification, spatial context has demonstrated its significance in achieving promising performance.
no code implementations • 8 Nov 2019 • Shanxin Yuan, Radu Timofte, Gregory Slabaugh, Ales Leonardis, Bolun Zheng, Xin Ye, Xiang Tian, Yaowu Chen, Xi Cheng, Zhen-Yong Fu, Jian Yang, Ming Hong, Wenying Lin, Wenjin Yang, Yanyun Qu, Hong-Kyu Shin, Joon-Yeon Kim, Sung-Jea Ko, Hang Dong, Yu Guo, Jie Wang, Xuan Ding, Zongyan Han, Sourya Dipta Das, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan
A new dataset, called LCDMoire was created for this challenge, and consists of 10, 200 synthetically generated image pairs (moire and clean ground truth).
1 code implementation • NeurIPS 2019 • Shuo Chen, Lei Luo, Jian Yang, Chen Gong, Jun Li, Heng Huang
To address this issue, we first reveal that the traditional linear distance metric is equivalent to the cumulative arc length between the data pair's nearest points on the learned straight measurer lines.
no code implementations • 28 Nov 2019 • Xueya Zhang, Tong Zhang, Wenting Zhao, Zhen Cui, Jian Yang
Graph convolutional networks (GCNs) have shown the powerful ability in text structure representation and effectively facilitate the task of text classification.
no code implementations • 9 Dec 2019 • Ying Wang, Zezhou Sun, Cheng-Zhong Xu, Sanjay Sarma, Jian Yang, Hui Kong
In this paper, a global descriptor for a LiDAR point cloud, called LiDAR Iris, is proposed for fast and accurate loop-closure detection.
no code implementations • ICLR 2020 • Chunyan Xu, Zhen Cui, Xiaobin Hong, Tong Zhang, Jian Yang, Wei Liu
In this work, we address semi-supervised classification of graph data, where the categories of those unlabeled nodes are inferred from labeled nodes as well as graph structures.
no code implementations • 22 Feb 2020 • Yao Yao, Chen Gong, Jiehui Deng, Jian Yang
Partial Label Learning (PLL) aims to train a classifier when each training instance is associated with a set of candidate labels, among which only one is correct but is not accessible during the training phase.
no code implementations • 4 Mar 2020 • Jun Chen, Yong liu, Hao Zhang, Shengnan Hou, Jian Yang
Meanwhile, we propose a M-bit Inputs and N-bit Weights Network (MINW-Net) trained by AQE, a quantized neural network with 1-3 bits weights and activations.
no code implementations • 6 May 2020 • Shanxin Yuan, Radu Timofte, Ales Leonardis, Gregory Slabaugh, Xiaotong Luo, Jiangtao Zhang, Yanyun Qu, Ming Hong, Yuan Xie, Cuihua Li, Dejia Xu, Yihao Chu, Qingyan Sun, Shuai Liu, Ziyao Zong, Nan Nan, Chenghua Li, Sangmin Kim, Hyungjoon Nam, Jisu Kim, Jechang Jeong, Manri Cheon, Sung-Jun Yoon, Byungyeon Kang, Junwoo Lee, Bolun Zheng, Xiaohong Liu, Linhui Dai, Jun Chen, Xi Cheng, Zhen-Yong Fu, Jian Yang, Chul Lee, An Gia Vien, Hyunkook Park, Sabari Nathan, M. Parisa Beham, S Mohamed Mansoor Roomi, Florian Lemarchand, Maxime Pelcat, Erwan Nogues, Densen Puthussery, Hrishikesh P. S, Jiji C. V, Ashish Sinha, Xuan Zhao
Track 1 targeted the single image demoireing problem, which seeks to remove moire patterns from a single image.
no code implementations • CVPR 2020 • Zongyan Han, Zhen-Yong Fu, Jian Yang
Zero-shot object recognition or zero-shot learning aims to transfer the object recognition ability among the semantically related categories, such as fine-grained animal or bird species.
no code implementations • ACL 2020 • Jian Yang, Shuming Ma, Dong-dong Zhang, Zhoujun Li, Ming Zhou
Although neural machine translation (NMT) has achieved significant progress in recent years, most previous NMT models only depend on the source text to generate translation.
no code implementations • 30 Jul 2020 • Mingmei Cheng, Le Hui, Jin Xie, Jian Yang, Hui Kong
In this paper, we propose a cascaded non-local neural network for point cloud segmentation.
no code implementations • ECCV 2020 • Xueya Zhang, Tong Zhang, Xiaobin Hong, Zhen Cui, Jian Yang
Spectral graph filtering is introduced to encode graph signals, which are then embedded as probability distributions in a Wasserstein space, called graph Wasserstein metric learning.
no code implementations • 11 Aug 2020 • Yang Yang, Zhen-Qiang Sun, Hui Xiong, Jian Yang
Open set classification (OSC) tackles the problem of determining whether the data are in-class or out-of-class during inference, when only provided with a set of in-class examples at training time.
no code implementations • 19 Aug 2020 • Yun Wang, Tong Zhang, Zhen Cui, Chunyan Xu, Jian Yang
For label diffusion of instance-awareness in graph convolution, rather than using the statistical label correlation alone, an image-dependent label correlation matrix (LCM), fusing both the statistical LCM and an individual one of each image instance, is constructed for graph inference on labels to inject adaptive information of label-awareness into the learned features of the model.
1 code implementation • 31 Aug 2020 • Yang Yang, Zhen-Qiang Sun, HengShu Zhu, Yanjie Fu, Hui Xiong, Jian Yang
To this end, we propose a Class-Incremental Learning without Forgetting (CILF) framework, which aims to learn adaptive embedding for processing novel class detection and model update in a unified framework.
no code implementations • 22 Aug 2020 • Min Fu, Jiwei Guan, Xi Zheng, Jie zhou, Jianchao Lu, Tianyi Zhang, Shoujie Zhuo, Lijun Zhan, Jian Yang
Existing solution recommendation methods for online customer service are unable to determine the best solutions at runtime, leading to poor satisfaction of end customers.
no code implementations • 3 Sep 2020 • Yuan Fang, Chunyan Xu, Zhen Cui, Yuan Zong, Jian Yang
In this paper, we propose a spatial transformer point convolution (STPC) method to achieve anisotropic convolution filtering on point clouds.
no code implementations • ECCV 2020 • Xi Cheng, Zhen-Yong Fu, Jian Yang
In the past few years, we have witnessed the great progress of image super-resolution (SR) thanks to the power of deep learning.
no code implementations • 15 Sep 2020 • Sheng Wan, Shirui Pan, Jian Yang, Chen Gong
Graph-based Semi-Supervised Learning (SSL) aims to transfer the labels of a handful of labeled data to the remaining massive unlabeled data via a graph.
no code implementations • 19 Sep 2020 • Sheng Wan, Chen Gong, Shirui Pan, Jie Yang, Jian Yang
Nowadays, deep learning methods, especially the Graph Convolutional Network (GCN), have shown impressive performance in hyperspectral image (HSI) classification.
no code implementations • 24 Sep 2020 • Qianliang Wu, Tong Zhang, Zhen Cui, Jian Yang
In this paper, we aim to mine the cue of user preferences in resource-limited recommendation tasks, for which purpose we specifically build a large used car transaction dataset possessing resource-limitation characteristics.
no code implementations • 1 Jan 2021 • Wenting Zhao, Yuan Fang, Zhen Cui, Tong Zhang, Jian Yang, Wei Liu
In this paper, we propose a simple yet effective graph deformer network (GDN) to fulfill anisotropic convolution filtering on graphs, analogous to the standard convolution operation on images.
no code implementations • 27 Nov 2020 • Zhuo Huang, Ying Tai, Chengjie Wang, Jian Yang, Chen Gong
Semi-Supervised Learning (SSL) with mismatched classes deals with the problem that the classes-of-interests in the limited labeled data is only a subset of the classes in massive unlabeled data.
no code implementations • CVPR 2021 • Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova
Smartphones, tablets and camera systems used, e. g., in cars and UAVs, are typically equipped with IMUs (inertial measurement units) that can measure the gravity vector accurately.
no code implementations • 31 Dec 2020 • Shuming Ma, Jian Yang, Haoyang Huang, Zewen Chi, Li Dong, Dongdong Zhang, Hany Hassan Awadalla, Alexandre Muzio, Akiko Eriguchi, Saksham Singhal, Xia Song, Arul Menezes, Furu Wei
Multilingual machine translation enables a single model to translate between different languages.
no code implementations • 4 Nov 2020 • Jian Yang, Juan Yang
Therefore, to solve the problems above, we build a new model based on gating mechanism, combined with convolutional neural networks (CNN) and self-attention mechanism.
Aspect-Based Sentiment Analysis Aspect Category Sentiment Analysis +1
no code implementations • 10 Mar 2021 • Xuran Xu, Tong Zhang, Chunyan Xu, Zhen Cui, Jian Yang
We further extend graph convolution into tensor space and propose a tensor graph convolution network to extract more discriminating features from spatial-temporal graph data.
Ranked #1 on Traffic Prediction on SZ-Taxi
no code implementations • 24 Mar 2021 • Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue
In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones.
no code implementations • 25 Mar 2021 • Guangwei Gao, Yi Yu, Jian Yang, Guo-Jun Qi, Meng Yang
(i) To learn more robust and discriminative features, we desire to adaptively fuse the contextual features from different layers.
no code implementations • 25 Mar 2021 • Guangwei Gao, Lei Tang, Fei Wu, Huimin Lu, Jian Yang
In this work, we treat the mask occlusion as image noise and construct a joint and collaborative learning network, called JDSR-GAN, for the masked face super-resolution task.
no code implementations • 12 May 2021 • Chun-Mei Feng, Zhanyuan Yang, Huazhu Fu, Yong Xu, Jian Yang, Ling Shao
In this paper, we propose the Dual-Octave Network (DONet), which is capable of learning multi-scale spatial-frequency features from both the real and imaginary components of MR data, for fast parallel MR image reconstruction.
no code implementations • 26 May 2021 • Xing Su, Shan Xue, Fanzhen Liu, Jia Wu, Jian Yang, Chuan Zhou, Wenbin Hu, Cecile Paris, Surya Nepal, Di Jin, Quan Z. Sheng, Philip S. Yu
A community reveals the features and connections of its members that are different from those in other communities in a network.
no code implementations • 27 May 2021 • Jian Yang, Yuhui Shi
Swarm intelligence optimization algorithms can be adopted in swarm robotics for target searching tasks in a 2-D or 3-D space by treating the target signal strength as fitness values.
1 code implementation • 27 May 2021 • Jian Yang, Yuhui Shi
Rather than converge to a single global optimum, the proposed method can guide the search procedure to converge to multiple "salient" solutions.
no code implementations • 10 Jun 2021 • Yang Guo, Tarique Anwar, Jian Yang, Jia Wu
As the process should be socially and economically profitable, the task of vehicle dispatching is highly challenging, specially due to the time-varying travel demands and traffic conditions.
no code implementations • NAACL 2021 • Jian Yang, Shuming Ma, Dongdong Zhang, Juncheng Wan, Zhoujun Li, Ming Zhou
Most current neural machine translation models adopt a monotonic decoding order of either left-to-right or right-to-left.
no code implementations • 29 Jul 2021 • Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang
However, blurry guidance in the image and unclear structure in the depth still impede the performance of the image guided frameworks.
Ranked #2 on Depth Completion on KITTI Depth Completion
no code implementations • 5 Aug 2021 • Haobo Jiang, Jin Xie, Jianjun Qian, Jian Yang
By modeling the point cloud registration process as a Markov decision process (MDP), we develop a latent dynamic model of point clouds, consisting of a transformation network and evaluation network.
no code implementations • 11 Aug 2021 • Yong Li, Yufei Sun, Zhen Cui, Shiguang Shan, Jian Yang
To mitigate racial bias and meantime preserve robust FR, we abstract face identity-related representation as a signal denoising problem and propose a progressive cross transformer (PCT) method for fair face recognition.
no code implementations • ACL 2021 • Jian Yang, Yuwei Yin, Shuming Ma, Haoyang Huang, Dongdong Zhang, Zhoujun Li, Furu Wei
Although multilingual neural machine translation (MNMT) enables multiple language translations, the training process is based on independent multilingual objectives.
no code implementations • 1 Oct 2021 • Jian Yang, Xinyu Hu, Gang Xiao, Yulong Shen
Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.
no code implementations • 1 Oct 2021 • Zheng Li, Xiang Li, Lingfeng Yang, Jian Yang, Zhigeng Pan
Knowledge distillation usually transfers the knowledge from a pre-trained cumbersome teacher network to a compact student network, which follows the classical teacher-teaching-student paradigm.
no code implementations • ICCV 2021 • Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang
Then, a Wasserstein coupled dictionary, containing multiple pairs of counterpart graph keys with each key corresponding to one modality, is constructed for further feature learning.
no code implementations • ICCV 2021 • Jingshan Xu, Chuanwei Zhou, Zhen Cui, Chunyan Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang
In this paper, we propose a progressive segmentation inference (PSI) framework to tackle with scribble-supervised semantic segmentation.
no code implementations • 22 Oct 2021 • Yang Yang, Hongchen Wei, HengShu Zhu, dianhai yu, Hui Xiong, Jian Yang
In detail, considering that the heterogeneous gap between modalities always leads to the supervision difficulty of using the global embedding directly, CPRC turns to transform both the raw image and corresponding generated sentence into the shared semantic space, and measure the generated sentence from two aspects: 1) Prediction consistency.
no code implementations • WMT (EMNLP) 2021 • Jian Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Li Dong, Shaohan Huang, Alexandre Muzio, Saksham Singhal, Hany Hassan Awadalla, Xia Song, Furu Wei
This report describes Microsoft's machine translation systems for the WMT21 shared task on large-scale multilingual machine translation.
no code implementations • 6 Nov 2021 • Jiahui Fan, Beibei Wang, Miloš Hašan, Jian Yang, Ling-Qi Yan
Bidirectional reflectance distribution functions (BRDFs) are pervasively used in computer graphics to produce realistic physically-based appearance.
no code implementations • 11 Nov 2021 • Xiu-Shen Wei, Yi-Zhe Song, Oisin Mac Aodha, Jianxin Wu, Yuxin Peng, Jinhui Tang, Jian Yang, Serge Belongie
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications.
no code implementations • 17 Nov 2021 • Xi Cheng, Jun Li, Qiang Dai, ZhenYong Fu, Jian Yang
In our SF-SIM, we propose a noise estimator which can effectively suppress the noise in the image and enable our method to work under the low light and short exposure environment, without the need for stacking multiple frames for non-local denoising.
no code implementations • NeurIPS 2021 • Yunan Liu, Shanshan Zhang, Yang Li, Jian Yang
In this setting, we embed an additional pair of “latent-latent” to reduce the domain gap between the source and different latent domains, allowing the model to adapt well on multiple target domains simultaneously.
no code implementations • NeurIPS 2021 • Zhuo Huang, Chao Xue, Bo Han, Jian Yang, Chen Gong
Universal Semi-Supervised Learning (UniSSL) aims to solve the open-set problem where both the class distribution (i. e., class set) and feature distribution (i. e., feature domain) are different between labeled dataset and unlabeled dataset.
no code implementations • 25 Sep 2019 • Minghong Yao, Liansheng Zhuang, Houqiang Li, Jian Yang, Shafei Wang
Results show that our model can outperform the dominant models consistently in these tasks.