no code implementations • ECCV 2020 • Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li
Depth completion is a widely studied problem of predicting a dense depth map from a sparse set of measurements and a single RGB image.
1 code implementation • ECCV 2020 • Matthew Korban, Xin Li
We propose a Dynamic Directed Graph Convolutional Network (DDGCN) to model spatial and temporal features of human actions from their skeletal representations.
1 code implementation • Findings (EMNLP) 2021 • Wenxuan Zhang, Yang Deng, Xin Li, Lidong Bing, Wai Lam
This motivates us to investigate the task of ABSA on QA forums (ABSA-QA), aiming to jointly detect the discussed aspects and their sentiment polarities for a given QA pair.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+1
1 code implementation • 1 Dec 2023 • Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen yang, Chaoqun Liu, Hang Zhang, Lidong Bing
Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages.
no code implementations • 30 Nov 2023 • Xuan-Bac Nguyen, Xin Li, Samee U. Khan, Khoa Luu
In this work, we first present a simple yet effective Brainformer approach, a novel Transformer-based framework, to analyze the patterns of fMRI in the human perception system from the machine learning perspective.
no code implementations • 28 Nov 2023 • Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, YuAn Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang
Since the learning difficulty for various shapes can differ, a curriculum learning strategy is employed to efficiently embed various surfaces, enhancing the whole embedding process.
2 code implementations • 28 Nov 2023 • Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing
Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned.
1 code implementation • 28 Nov 2023 • Zhihe Lu, Jiawang Bai, Xin Li, Zeyu Xiao, Xinchao Wang
However, performance advancements are limited when relying solely on intricate algorithmic designs for a single model, even one exhibiting strong performance, e. g., CLIP-ViT-B/16.
Ranked #1 on
Prompt Engineering
on ImageNet
no code implementations • 26 Nov 2023 • Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu
In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield.
1 code implementation • 16 Nov 2023 • Sen yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam
Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs.
1 code implementation • 9 Nov 2023 • Licheng Wen, Xuemeng Yang, Daocheng Fu, XiaoFeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao
This has been a significant bottleneck, particularly in the development of common sense reasoning and nuanced scene understanding necessary for safe and reliable autonomous driving.
1 code implementation • 6 Nov 2023 • Siqi Li, Di Miao, Qiming Wu, Chuan Hong, Danny D'Agostino, Xin Li, Yilin Ning, Yuqing Shang, Huazhu Fu, Marcus Eng Hock Ong, Hamed Haddadi, Nan Liu
Our goal was to bridge the gap by presenting the first comprehensive comparison of FL frameworks from both engineering and statistical domains.
no code implementations • 2 Nov 2023 • Weixi Wang, Xichen Zhong, Xin Li, Sizhe Li, Xun Ma
Overhead line inspection greatly benefits from defect recognition using visible light imagery.
no code implementations • 25 Oct 2023 • Chen Liu, Hongyu Zang, Xin Li, Yong Heng, Yifei Wang, Zhen Fang, Yisen Wang, Mingzhong Wang
Image-based Reinforcement Learning is a practical yet challenging task.
1 code implementation • 25 Oct 2023 • Guanzheng Chen, Xin Li, Zaiqiao Meng, Shangsong Liang, Lidong Bing
We generalise the PE scaling approaches to model the continuous dynamics by ordinary differential equations over the length scaling factor, thereby overcoming the constraints of current PE scaling methods designed for specific lengths.
no code implementations • 24 Oct 2023 • Ye Yuan, Xin Li, Yong Heng, Leiji Zhang, Mingzhong Wang
Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations.
1 code implementation • 23 Oct 2023 • Sen yang, Xin Li, Lidong Bing, Wai Lam
However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal dependencies between knowledge.
no code implementations • 20 Oct 2023 • Guangqi Xie, Xin Li, Xiaohan Pan, Zhibo Chen
Remote medical diagnosis has emerged as a critical and indispensable technique in practical medical systems, where medical data are required to be efficiently compressed and transmitted for diagnosis by either professional doctors or intelligent diagnosis devices.
no code implementations • 19 Oct 2023 • Aritra Dutta, El Houcine Bergou, Soumia Boucherouite, Nicklas Werge, Melih Kandemir, Xin Li
Additionally, our analyses allow us to measure the density of the $\epsilon$-stationary points in the final iterates of SGD, and we recover the classical $O(\frac{1}{\sqrt{T}})$ asymptotic rate under various existing assumptions on the objective function and the bounds on the stochastic gradient.
no code implementations • 29 Sep 2023 • Xin Li, Yiting Lu, Zhibo Chen
Based on this, we propose to improve the perception-oriented transferability of BIQA by performing feature frequency decomposition and selecting the frequency components that contained the most transferable perception knowledge for alignment.
Blind Image Quality Assessment
Unsupervised Domain Adaptation
1 code implementation • ICCV 2023 • Ao Luo, Fan Yang, Xin Li, Lang Nie, Chunyu Lin, Haoqiang Fan, Shuaicheng Liu
Moreover, for reliable motion analysis, we provide a new Gaussian-Guided Attention Module (GGAM) which not only inherits properties from Gaussian distribution to instinctively revolve around the neighbor fields of each point but also is empowered to put the emphasis on contextually related regions during matching.
1 code implementation • 28 Sep 2023 • Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao
Recent advancements in autonomous driving have relied on data-driven approaches, which are widely adopted but face challenges including dataset bias, overfitting, and uninterpretability.
no code implementations • 27 Sep 2023 • Peng Zhang, Xin Li, Liang He, Xin Lin
This paper undertakes a comprehensive examination, assessment, and synthesis of the research landscape in this domain, remaining attuned to the latest developments in 3D MOT while suggesting prospective avenues for future investigation.
1 code implementation • NeurIPS 2023 • Xin Li, Dongze Lian, Zhihe Lu, Jiawang Bai, Zhibo Chen, Xinchao Wang
To mitigate that, we propose an effective adapter-style tuning strategy, dubbed GraphAdapter, which performs the textual adapter by explicitly modeling the dual-modality structure knowledge (i. e., the correlation of different semantics/classes in textual and visual modalities) with a dual knowledge graph.
1 code implementation • ICCV 2023 • Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai, Xinge Zhu, Yuexin Ma, Yikang Li, Yu Qiao, Yuenan Hou
Besides, we construct the OpenPCSeg codebase, which is the largest and most comprehensive outdoor LiDAR segmentation codebase.
no code implementations • 1 Sep 2023 • Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang
In this paper, we present VideoGen, a text-to-video generation approach, which can generate a high-definition video with high frame fidelity and strong temporal consistency using reference-guided latent diffusion.
no code implementations • 31 Aug 2023 • El Houcine Bergou, Soumia Boucherouite, Aritra Dutta, Xin Li, Anna Ma
In this paper, we analyze the convergence of RK for noisy linear systems when the coefficient matrix, $A$, is corrupted with both additive and multiplicative noise, along with the noisy vector, $b$.
no code implementations • 27 Aug 2023 • Xin Yang, Yi Lin, Zhiwei Wang, Xin Li, Kwang-Ting Cheng
A method for measuring the synthesis complexity is proposed to automatically determine the synthesis order in our sequential GAN.
1 code implementation • 23 Aug 2023 • Yu-Xiang Zeng, Jun-Wei Hsieh, Xin Li, Ming-Ching Chang
Detecting small scene text instances in the wild is particularly challenging, where the influence of irregular positions and nonideal lighting often leads to detection errors.
Ranked #1 on
Scene Text Detection
on SCUT-CTW1500
1 code implementation • ICCV 2023 • Xin Li, Yuqing Huang, Zhenyu He, YaoWei Wang, Huchuan Lu, Ming-Hsuan Yang
Existing visual tracking methods typically take an image patch as the reference of the target to perform tracking.
1 code implementation • 18 Aug 2023 • Xin Li, Yulin Ren, Xin Jin, Cuiling Lan, Xingrui Wang, Wenjun Zeng, Xinchao Wang, Zhibo Chen
Image restoration (IR) has been an indispensable and challenging task in the low-level vision field, which strives to improve the subjective quality of images distorted by various forms of degradation.
1 code implementation • 1 Aug 2023 • Xuan-Bac Nguyen, Xudong Liu, Xin Li, Khoa Luu
The goal is to predict brain responses across the entire visual brain, as it is the region where the most reliable responses to images have been observed.
1 code implementation • ICCV 2023 • Weiyue Zhao, Xin Li, Zhan Peng, Xianrui Luo, Xinyi Ye, Hao Lu, Zhiguo Cao
Video stabilization refers to the problem of transforming a shaky video into a visually pleasing one.
no code implementations • 20 Jul 2023 • Can Jiang, Xin Li, Jia-Rui Lin, Ming Liu, Zhiliang Ma
Therefore, this paper introducess a model and method to adaptive control the resource flows to optimize the work and cash flows of construction projects.
1 code implementation • ICCV 2023 • Xinyi Ye, Weiyue Zhao, Tianqi Liu, Zihao Huang, Zhiguo Cao, Xin Li
Learning-based multi-view stereo (MVS) methods deal with predicting accurate depth maps to achieve an accurate and complete 3D representation.
1 code implementation • 14 Jul 2023 • Daocheng Fu, Xin Li, Licheng Wen, Min Dou, Pinlong Cai, Botian Shi, Yu Qiao
In this paper, we explore the potential of using a large language model (LLM) to understand the driving environment in a human-like manner and analyze its ability to reason, interpret, and memorize when facing complex scenarios.
no code implementations • 7 Jul 2023 • Chuanbo Hu, Bin Liu, Xin Li, Yanfang Ye
By integrating prior knowledge and the proposed prompts, ChatGPT can effectively identify and label drug trafficking activities on social networks, even in the presence of deceptive language and euphemisms used by drug dealers to evade detection.
no code implementations • 26 Jun 2023 • Xinquan Yang, Jinheng Xie, Xuguang Li, Xuechen Li, Xin Li, Linlin Shen, Yongqiang Deng
When deep neural network has been proposed to assist the dentist in designing the location of dental implant, most of them are targeting simple cases where only one missing tooth is available.
no code implementations • 25 Jun 2023 • Thang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren
We argue that this external or contextual information should already be embedded within the known classes.
1 code implementation • 19 Jun 2023 • Zhiwei Wang, Junlin Xian, Kangyi Liu, Xin Li, Qiang Li, Xin Yang
Mammogram image is important for breast cancer screening, and typically obtained in a dual-view form, i. e., cranio-caudal (CC) and mediolateral oblique (MLO), to provide complementary information.
no code implementations • 13 Jun 2023 • Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, Jiaojiao Xu, Bo Liu, Xuemei Wang, Yao Zhang, Qiong Yan, Muhan Lv, Xiaomei Chen, Shuhua Zhang, Yihua Wang, Yang Liu, Li Yin, Yanni Liu, Yanqing Huang, Yunfang Liu, Kun Wang, Meiqin Su, Li Bian, Ping An, Xin Zhang, Linxue Qian, Shao Li, Xiaolong Qi
Validation analysis revealed that the AUCs of DLRP were 0. 91 for GEV (95% CI 0. 90 to 0. 93, p < 0. 05) and 0. 88 for HRV (95% CI 0. 86 to 0. 89, p < 0. 01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM.
no code implementations • 11 Jun 2023 • Minglei Yin, Bin Liu, Neil Zhenqiang Gong, Xin Li
Our proposed method can simultaneously (1) secure VARS from adversarial attacks characterized by local perturbations by image reconstruction based on global vision transformers; and (2) accurately detect adversarial examples using a novel contrastive learning approach.
1 code implementation • ICCV 2023 • Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li
Extensive experiments on Waymo Open Dataset show our DetZero outperforms all state-of-the-art onboard and offboard 3D detection methods.
no code implementations • 7 Jun 2023 • Weiyue Zhao, Hao Lu, Xinyi Ye, Zhiguo Cao, Xin Li
We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems.
no code implementations • 5 Jun 2023 • Weiyue Zhao, Hao Lu, Zhiguo Cao, Xin Li
This approach offers a new perspective to alleviate the problem of repeated patterns and emphasizes the importance of choosing coordinate representations for feature correspondences.
1 code implementation • 5 Jun 2023 • Hang Zhang, Xin Li, Lidong Bing
We present Video-LLaMA a multi-modal framework that empowers Large Language Models (LLMs) with the capability of understanding both visual and auditory content in the video.
2 code implementations • 2 Jun 2023 • Wenhui Zhu, Peijie Qiu, Xin Li, Natasha Lepore, Oana M. Dumitrascu, Yalin Wang
Over the past few decades, convolutional neural networks (CNNs) have been at the forefront of the detection and tracking of various retinal diseases (RD).
1 code implementation • 31 May 2023 • Jia Guo, Liying Cheng, Wenxuan Zhang, Stanley Kok, Xin Li, Lidong Bing
In this work, we for the first time propose a challenging argument quadruplet extraction task (AQE), which can provide an all-in-one extraction of four argumentative components, i. e., claims, evidence, evidence types, and stances.
no code implementations • 25 May 2023 • Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Fan Yang, Xin Li, Zhicheng Jiao
This paper proposes a cross-supervised learning framework based on dual classifiers (DC-Net), including an evidential classifier and a vanilla classifier.
no code implementations • 25 May 2023 • Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Xin Li, Fan Yang, Zhicheng Jiao
To address these issues, we propose a self-aware and cross-sample prototypical learning method (SCP-Net) to enhance the diversity of prediction in consistency learning by utilizing a broader range of semantic information derived from multiple inputs.
1 code implementation • 23 May 2023 • Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Chunyan Miao
In cross-lingual named entity recognition (NER), self-training is commonly used to bridge the linguistic gap by training on pseudo-labeled target-language data.
1 code implementation • 23 May 2023 • Weiwen Xu, Xin Li, Wai Lam, Lidong Bing
mPMR aims to guide multilingual pre-trained language models (mPLMs) to perform natural language understanding (NLU) including both sequence classification and span extraction in multiple languages.
no code implementations • 17 May 2023 • Xinquan Yang, Xuguang Li, Xuechen Li, WenTing Chen, Linlin Shen, Xin Li, Yongqiang Deng
In this paper, we develop a two-stream implant position regression framework (TSIPR), which consists of an implant region detector (IRD) and a multi-scale patch embedding regression network (MSPENet), to address this issue.
no code implementations • 11 May 2023 • Dongyang Li, Ruixue Ding, Qiang Zhang, Zheng Li, Boli Chen, Pengjun Xie, Yao Xu, Xin Li, Ning Guo, Fei Huang, Xiaofeng He
With a fast developing pace of geographic applications, automatable and intelligent models are essential to be designed to handle the large volume of information.
no code implementations • 4 May 2023 • Chuanbo Hu, Shan Jia, Fan Zhang, Changjiang Xiao, Mindi Ruan, Jacob Thrasher, Xin Li
Experimental results on the re-annotated Place Pulse 2. 0 dataset demonstrate promising detection performance of the proposed method, with an accuracy of 79. 9%.
no code implementations • 4 May 2023 • Zhou'an_Zhu, Xin Li, Jicai Pan, Yufei Xiao, Yanan Chang, Feiyi Zheng, Shangfei Wang
We also propose three labels (i. e., expression of experience, emotional reaction, and cognitive reaction) to describe the degree of empathy between counselors and their clients.
no code implementations • 27 Apr 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu
This paper introduces a novel approach to Social Group Activity Recognition (SoGAR) using Self-supervised Transformers network that can effectively utilize unlabeled video data.
no code implementations • 20 Apr 2023 • Mindi Ruan, Xiangxu Yu, Na Zhang, Chuanbo Hu, Shuo Wang, Xin Li
How can we teach a computer to recognize 10, 000 different actions?
no code implementations • 14 Apr 2023 • Thanh-Dat Truong, Chi Nhan Duong, Pierce Helton, Ashley Dowling, Xin Li, Khoa Luu
They are insufficient to model both global and local structures of a given image, especially in small regions of tail classes.
no code implementations • 13 Apr 2023 • Hongchen Tan, BaoCai Yin, Kun Wei, Xiuping Liu, Xin Li
The ALR-GAN includes an Adaptive Layout Refinement (ALR) module and a Layout Visual Refinement (LVR) loss.
1 code implementation • CVPR 2023 • Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu
By incorporating these components into an end-to-end deep network, the proposed $\mu$-BERT significantly outperforms all previous work in various micro-expression tasks.
Ranked #1 on
Micro Expression Recognition
on SMIC
Micro Expression Recognition
Micro-Expression Recognition
+1
no code implementations • 4 Apr 2023 • Yongxin Zhu, Zhen Liu, Yukang Liang, Xin Li, Hao liu, Changcun Bao, Linli Xu
Different to conventional STVQA models which take the linguistic semantics and visual semantics in scene text as two separate features, in this paper, we propose a paradigm of "Locate Then Generate" (LTG), which explicitly unifies this two semantics with the spatial bounding box as a bridge connecting them.
1 code implementation • ICCV 2023 • Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu
The robustness of 3D perception systems under natural corruptions from environments and sensors is pivotal for safety-critical applications.
no code implementations • 30 Mar 2023 • Renhong Zhang, Tianheng Cheng, Shusheng Yang, Haoyi Jiang, Shuai Zhang, Jiancheng Lyu, Xin Li, Xiaowen Ying, Dashan Gao, Wenyu Liu, Xinggang Wang
To address those issues, we present MobileInst, a lightweight and mobile-friendly framework for video instance segmentation on mobile devices.
no code implementations • 16 Mar 2023 • Shangfei Wang, Jiaqiang Wu, Feiyi Zheng, Xin Li, XueWei Li, Suwen Wang, Yi Wu, Yanan Chang, Xiangyu Miao
In this paper, 1. better features are extracted with the SOTA pretrained models.
no code implementations • 16 Mar 2023 • Hao liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun
Recently, Table Structure Recognition (TSR) task, aiming at identifying table structure into machine readable formats, has received increasing interest in the community.
2 code implementations • CVPR 2023 • Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen
In this paper, we are the first to propose a novel training strategy for image restoration from the causality perspective, to improve the generalization ability of DNNs for unknown degradations.
1 code implementation • CVPR 2023 • Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao
We propose a simple yet effective label rectification strategy, which uses off-the-shelf panoptic segmentation labels to remove the traces of dynamic objects in completion labels, greatly improving the performance of deep models especially for those moving objects.
Ranked #1 on
3D Semantic Scene Completion
on SemanticKITTI
1 code implementation • ICCV 2023 • Lin Zhang, Xin Li, Dongliang He, Errui Ding, Zhaoxiang Zhang
To this end, we construct a large-scale, multi-reference super-resolution dataset, named LMR.
1 code implementation • CVPR 2023 • Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He
Notably, LoGoNet ranks 1st on Waymo 3D object detection leaderboard and obtains 81. 02 mAPH (L2) detection performance.
no code implementations • 7 Mar 2023 • Xin Li, Bin Liu, Shuo Wang
At the intersection of computational neuroscience (CN) and data mining (DM), we advocate a holistic view toward their rich connections.
no code implementations • 7 Mar 2023 • Xin Li, Shuo Wang
It has been hypothesized that the ventral stream processing for object recognition is based on a mechanism called cortically local subspace untangling.
1 code implementation • 6 Mar 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu
In this paper, we propose a new, simple, and effective Self-supervised Spatio-temporal Transformers (SPARTAN) approach to Group Activity Recognition (GAR) using unlabeled video data.
1 code implementation • CVPR 2023 • Hai Wu, Chenglu Wen, Shaoshuai Shi, Xin Li, Cheng Wang
Finally, we develop a semi-supervised pipeline VirConv-S based on a pseudo-label framework.
no code implementations • 18 Feb 2023 • Na Zhang, Xudong Liu, Xin Li, Guo-Jun Qi
Semantic face image manipulation has received increasing attention in recent years.
no code implementations • 2 Feb 2023 • Syeda Nyma Ferdous, Xin Li, Kamalakanta Sahoo, Richard Bergman
This study proposes a robust model for biomass sustainability prediction by analyzing sustainability indicators using machine learning models.
no code implementations • 30 Jan 2023 • Xin Li, Mingqiang Wei, Songcan Chen
From the perspective of how-and-what-to-learn, PointSmile is designed to imitate human curriculum learning, i. e., starting with an easy curriculum and gradually increasing the difficulty of that curriculum.
1 code implementation • 17 Jan 2023 • Xin Li, Deng Pan, Chengyin Li, Yao Qiang, Dongxiao Zhu
There are increasing demands for understanding deep neural networks' (DNNs) behavior spurred by growing security and/or transparency concerns.
no code implementations • 12 Jan 2023 • Jonathan P. Mailoa, Xin Li, Jiezhong Qiu, Shengyu Zhang
Recently, machine learning methods have been used to propose molecules with desired properties, which is especially useful for exploring large chemical spaces efficiently.
1 code implementation • 11 Jan 2023 • Ruixue Ding, Boli Chen, Pengjun Xie, Fei Huang, Xin Li, Qiang Zhang, Yao Xu
Single-modal PTMs can barely make use of the important GC and therefore have limited performance.
1 code implementation • ICCV 2023 • Qiming Xia, Jinhao Deng, Chenglu Wen, Hai Wu, Shaoshuai Shi, Xin Li, Cheng Wang
Combining CoIn with an iterative training strategy, we propose a CoIn++ pipeline, which requires only 2% annotations in the KITTI dataset to achieve performance comparable to the fully supervised methods.
no code implementations • CVPR 2023 • Zhenxuan Fang, Fangfang Wu, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi
To address these issues, we propose to represent the field of motion blur kernels in a latent space by normalizing flows, and design CNNs to predict the latent codes instead of motion kernels.
1 code implementation • ICCV 2023 • Yunlong Liu, Tao Huang, Weisheng Dong, Fangfang Wu, Xin Li, Guangming Shi
Deep learning-based LLIE methods focus on learning a mapping function between low-light images and normal-light images that outperforms conventional LLIE methods.
no code implementations • 1 Jan 2023 • Fuwang Dong, Wei Wang, Xin Li, Fan Liu, Sheng Chen, Lajos Hanzo
The dual-functional radar and communication (DFRC) technique constitutes a promising next-generation wireless solution, due to its benefits in terms of power consumption, physical hardware, and spectrum exploitation.
no code implementations • CVPR 2023 • Zhou Yang, Weisheng Dong, Xin Li, Mengluan Huang, Yulin Sun, Guangming Shi
During training, we enforce the quantization of features from clean and corrupted images in the same discrete embedding space so that an invariant quality-independent feature representation can be learned to improve the recognition robustness of low-quality images.
1 code implementation • 29 Dec 2022 • Li Liu, Penggang Chen, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang
Aligning users across networks using graph representation learning has been found effective where the alignment is accomplished in a low-dimensional embedding space.
no code implementations • 28 Dec 2022 • Riashat Islam, Hongyu Zang, Manan Tomar, Aniket Didolkar, Md Mofijul Islam, Samin Yeasar Arnob, Tariq Iqbal, Xin Li, Anirudh Goyal, Nicolas Heess, Alex Lamb
Several self-supervised representation learning methods have been proposed for reinforcement learning (RL) with rich observations.
no code implementations • 13 Dec 2022 • Peiyao Zhao, Yuangang Pan, Xin Li, Xu Chen, Ivor W. Tsang, Lejian Liao
Inspired by the impressive success of contrastive learning (CL), a variety of graph augmentation strategies have been employed to learn node representations in a self-supervised manner.
1 code implementation • 9 Dec 2022 • Weiwen Xu, Xin Li, Wenxuan Zhang, Meng Zhou, Wai Lam, Luo Si, Lidong Bing
We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data.
1 code implementation • 6 Dec 2022 • Xin Li, Cuiling Lan, Guoqiang Wei, Zhibo Chen
In this way, our message broadcasting encourages the group tokens to learn more informative and diverse information for effective domain alignment.
Ranked #1 on
Unsupervised Domain Adaptation
on VisDA2017
no code implementations • 3 Dec 2022 • Tianwei Lin, Honglin Lin, Fu Li, Dongliang He, Wenhao Wu, Meiling Wang, Xin Li, Yong liu
Then, in \textbf{AdaCM}, we adopt a CNN encoder to adaptively predict all parameters for the ColorMLP conditioned on each input content and style image pair.
no code implementations • 23 Nov 2022 • Xin Li, Xiangrui Li, Deng Pan, Yao Qiang, Dongxiao Zhu
Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i. e., last hidden layer) and a linear classifier (i. e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e. g., cross-entropy).
no code implementations • 22 Nov 2022 • Hai Wu, Chenglu Wen, Wei Li, Xin Li, Ruigang Yang, Cheng Wang
However, it is difficult to apply such networks to 3D object detection in autonomous driving due to its large computation cost and slow reasoning speed.
1 code implementation • 17 Nov 2022 • Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Luo Si, Chunyan Miao
We propose ConNER as a novel consistency training framework for cross-lingual NER, which comprises of: (1) translation-based consistency training on unlabeled target-language data, and (2) dropoutbased consistency training on labeled source-language data.
2 code implementations • 16 Nov 2022 • Yu-Hsiang Wang, Jun-Wei Hsieh, Ping-Yang Chen, Ming-Ching Chang, Hung Hin So, Xin Li
Second, we develop a Similarity Matching Cascade (SMC) module with a novel GATE function for robust object matching across consecutive video frames, further enhancing MOT performance.
Ranked #1 on
Multi-Object Tracking
on MOT20
(using extra training data)
1 code implementation • 16 Nov 2022 • Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si, Lidong Bing
Due to the huge amount of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in the low resource scenarios.
2 code implementations • 15 Nov 2022 • Yu Wang, Xin Li, Shengzhao Wen, Fukui Yang, Wanping Zhang, Gang Zhang, Haocheng Feng, Junyu Han, Errui Ding
In this paper, we focus on the compression of DETR with knowledge distillation.
no code implementations • ICCV 2023 • Jiepeng Wang, Congyi Zhang, Peng Wang, Xin Li, Peter J. Cobb, Christian Theobalt, Wenping Wang
In this work, we aim to develop a portable, high-throughput, and accurate reconstruction system for efficient digitization of fragments excavated in archaeological sites.
no code implementations • 8 Nov 2022 • Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang
While previous state-of-the-art RefSR methods mainly focus on improving the efficacy and robustness of reference feature transfer, it is generally overlooked that a well reconstructed SR image should enable better SR reconstruction for its similar LR images when it is referred to as.
1 code implementation • 2 Nov 2022 • Hongyu Zang, Xin Li, Jie Yu, Chen Liu, Riashat Islam, Remi Tachet des Combes, Romain Laroche
Our method, Behavior Prior Representation (BPR), learns state representations with an easy-to-integrate objective based on behavior cloning of the dataset: we first learn a state representation by mimicking actions from the dataset, and then train a policy on top of the fixed representation, using any off-the-shelf Offline RL algorithm.
no code implementations • 1 Nov 2022 • Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet des Combes
Goal-conditioned reinforcement learning (RL) is a promising direction for training agents that are capable of solving multiple tasks and reach a diverse set of objectives.
1 code implementation • 31 Oct 2022 • Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford
We find that contemporary representation learning techniques can fail on datasets where the noise is a complex and time dependent process, which is prevalent in practical applications.
1 code implementation • 27 Oct 2022 • Na Zhang, Shan Jia, Siwei Lyu, Xin Li
Our technical contributions include: 1) We propose a fusion-based few-shot learning (FSL) method to learn discriminative features that can generalize to unseen morphing attack types from predefined presentation attacks; 2) The proposed FSL based on the fusion of the PRNU model and Noiseprint network is extended from binary MAD to multiclass morphing attack fingerprinting (MAF).
no code implementations • 27 Oct 2022 • Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang
In this paper, we propose to use intermediate bottleneck features (IBFs) to replace PPGs.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 25 Oct 2022 • James Lee Hu, MohammadReza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen
This provides an opportunity for the defenders (i. e., malware detectors) to detect the adversarial variants by utilizing more than one view of a malware file (e. g., source code view in addition to the binary view).
no code implementations • 23 Oct 2022 • Changjun Hu, Quan Shi, Xin Li, Xiaoxian Guo
The test platform can test the performance of DP system and determine the operational time window.
no code implementations • 23 Oct 2022 • Qing Wu, Xin Li, Hongjiang Wei, Jingyi Yu, Yuyao Zhang
NeRF-based SVCT methods represent the desired CT image as a continuous function of spatial coordinates and train a Multi-Layer Perceptron (MLP) to learn the function by minimizing loss on the SV sinogram.
no code implementations • 22 Oct 2022 • Mengbing Liu, Xin Li, Boyu Ning, Chongwen Huang, Sumei Sun, Chau Yuen
Reconfigurable Intelligent Surface (RIS) is considered as an energy-efficient solution for future wireless communication networks due to its fast and low-cost configuration.
no code implementations • 18 Oct 2022 • Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He
To address these problems, we construct the homogeneous structure between the point cloud and images to avoid projective information loss by transforming the camera features into the LiDAR 3D space.
1 code implementation • 18 Oct 2022 • Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam
Unlike most prior work that only evaluates the ability to measure semantic similarity, we present a thorough evaluation of existing multilingual sentence embeddings and our improved versions, which include a collection of five transfer tasks in different downstream applications.
no code implementations • 17 Oct 2022 • Lianting Hu, Huiying Liang, Jiajie Tang, Xin Li, Li Huang, Long Lu
Background: Medical images are more difficult to acquire and annotate than natural images, which results in data augmentation technologies often being used in medical image segmentation tasks.
1 code implementation • 17 Oct 2022 • Weiwen Xu, Xin Li, Yang Deng, Wai Lam, Lidong Bing
Specifically, a novel Peer Data Augmentation (PeerDA) approach is proposed which employs span pairs with the PR relation as the augmentation data for training.
no code implementations • 7 Oct 2022 • Xin Li
In this paper, we revisit the problem of computational modeling of simple and complex cells for an over-parameterized and direct-fit model of visual perception.
no code implementations • 4 Oct 2022 • Honghu Pan, Yongyong Chen, Yunqi He, Xin Li, Zhenyu He
To this end, we propose Flow2Flow, a unified framework that could jointly achieve training sample expansion and cross-modality image generation for V2I person ReID.
no code implementations • 19 Sep 2022 • Syeda Nyma Ferdous, Xin Li, Siwei Lyu
Learning a robust and discriminative feature representation is a crucial challenge for object ReID.
no code implementations • 9 Sep 2022 • Xin Li, Yao Qiang, Chengyin Li, Sijia Liu, Dongxiao Zhu
We hypothesize that adversarial training can eliminate shortcut features whereas saliency guided training can filter out non-relevant features; both are nuisance features accounting for the performance degradation on OOD test sets.
no code implementations • 7 Sep 2022 • Xin Li, Xuli Tang, Qikai Cheng
We extracted ninety-one paper features from three dimensions as the input of the model, including twenty-one features in the paper dimension, thirty-five in the reference dimension, and thirty-five in the citing paper dimension.
no code implementations • 30 Aug 2022 • Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos
Based on Chen & Tao (2021), the symplectic mapping is represented by a generating function.
no code implementations • 24 Aug 2022 • Xiaoshuai Fan, Xin Li, Zhibo Chen
Our proposed transcoding architecture shows significant superiority in the compression of JPEG images thanks to the collaboration of learned lossy transform coding and residual entropy coding.
no code implementations • 24 Aug 2022 • Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen
In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.
Hierarchical Reinforcement Learning
reinforcement-learning
+3
3 code implementations • 23 Aug 2022 • Ren Yang, Radu Timofte, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu Li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei LI, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Bingchen Li, Xin Li, Mingxi Li, Ding Liu, Wenbin Zou, Peijie Dong, Tian Ye, Yunchen Zhang, Ming Tan, Xin Niu, Mustafa Ayazoglu, Marcos Conde, Ui-Jin Choi, Zhuang Jia, Tianyu Xu, Yijian Zhang, Mao Ye, Dengyan Luo, Xiaofeng Pan, Liuhan Peng
The homepage of this challenge is at https://github. com/RenYang-home/AIM22_CompressSR.
3 code implementations • 21 Aug 2022 • Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen
Compressed Image Super-resolution has achieved great attention in recent years, where images are degraded with compression artifacts and low-resolution artifacts.
Ranked #1 on
Compressed Image Super-resolution
on DIV2K-q40-x4
no code implementations • 29 Jul 2022 • Yiting Lu, Xin Li, Jianzhao Liu, Zhibo Chen
Specifically, we find a more compact and reliable space i. e., feature style space for perception-oriented UDA based on an interesting/amazing observation, that the feature style (i. e., the mean and variance) of the deep layer in DNNs is exactly associated with the quality score in NR-IQA.
no code implementations • 27 Jul 2022 • Daizong Liu, Wei Hu, Xin Li
Instead, we propose point cloud attacks from a new perspective -- the graph spectral domain attack, aiming to perturb graph transform coefficients in the spectral domain that corresponds to varying certain geometric structure.
no code implementations • 17 Jul 2022 • Zongze Chen, Wenxia Yang, Xin Li
Following its canonical writing order, we first represent a Chinese character as a series of stroke images with a fixed writing order, and then our SAE model is trained to reconstruct this stroke image sequence.
2 code implementations • 17 Jul 2022 • Yili Wang, Xin Li, Kun Xu, Dongliang He, Qi Zhang, Fu Li, Errui Ding
The neural color operator mimics the behavior of traditional color operators and learns pixelwise color transformation while its strength is controlled by a scalar.
no code implementations • 17 Jul 2022 • Jianzhao Liu, Xin Li, Shukun An, Zhibo Chen
Thanks to the development of unsupervised domain adaptation (UDA), some works attempt to transfer the knowledge from a label-sufficient source domain to a label-free target domain under domain shift with UDA.
Blind Image Quality Assessment
Unsupervised Domain Adaptation
no code implementations • 13 Jul 2022 • Yiting Lu, Jun Fu, Xin Li, Wei Zhou, Sen Liu, Xinxin Zhang, Congfu Jia, Ying Liu, Zhibo Chen
Therefore, we propose a Progressive Reinforcement learning based Instance Discarding module (termed as PRID) to progressively remove quality-irrelevant/negative instances for CCTA VIQA.
2 code implementations • 22 Jun 2022 • Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, ZiRui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu
We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.
Ranked #1 on
Text-to-Image Generation
on LAION COCO
no code implementations • 9 Jun 2022 • Xin Li, Daqi Zhu, Bing Sun, Qi Chen, Wenyang Gan, Zhigang Li
At last, a robust sliding mode controller with continuous model predictive control strategy for the multi-AUV system is developed to achieve leader-follower formation tracking under the presence of bounded flow disturbances, and simulations are implemented to confirm the effectiveness of the proposed method.
no code implementations • 5 Jun 2022 • Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang
Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae.
1 code implementation • 14 May 2022 • Chuanbo Hu, Shan Jia, Fan Zhang, Xin Li
However, due to the large diversity of geographic context and acquisition conditions, the captured SVI always contains various distracting objects (e. g., pedestrians and vehicles), which will distract human visual attention from efficiently finding the destination in the last few meters.
2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang
The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.
1 code implementation • 9 May 2022 • Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen
In this paper, we design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.
no code implementations • 5 May 2022 • Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren
To deal with the unpredictable definition of relations, we propose a novel contrastive learning task named Relational Consistency Modeling (RCM), which harnesses the fact that existing relations should be consistent in differently augmented positive views.
no code implementations • 22 Apr 2022 • Xin Li, Hsinchun Chen, Zan Huang, Hua Su, Jesse D. Martinez
In this paper, we propose a comprehensive framework for constructing and analyzing large-scale gene functional networks based on the gene/protein interactions extracted from biomedical literature repositories using text mining tools.
no code implementations • 22 Apr 2022 • Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang
Predicting gene functions is a challenge for biologists in the post genomic era.
no code implementations • 18 Apr 2022 • Hao liu, Xinghua Jiang, Xin Li, Antai Guo, Deqiang Jiang, Bo Ren
The self-supervised Masked Image Modeling (MIM) schema, following "mask-and-reconstruct" pipeline of recovering contents from masked image, has recently captured the increasing interest in the multimedia community, owing to the excellent ability of learning visual representation from unlabeled data.
1 code implementation • 17 Apr 2022 • Hongchen Tan, Xiuping Liu, BaoCai Yin, Xin Li
This paper presents a new Text-to-Image generation model, named Distribution Regularization Generative Adversarial Network (DR-GAN), to generate images from text descriptions from improved distribution learning.
no code implementations • 4 Apr 2022 • Qiuhong Shen, Xin Li, Fanyang Meng, Yongsheng Liang
These deep trackers usually do not perform online update or update single sub-branch of the tracking model, for which they cannot adapt to the appearance variation of objects.
1 code implementation • CVPR 2022 • Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang
As unlimited self-supervision signals can be obtained by tracking a video along a cycle in time, we investigate evolving a Siamese tracker by tracking videos forward-backward.
1 code implementation • CVPR 2022 • Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu
Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results.
Ranked #1 on
Action Recognition
on Jester (Gesture Recognition)
no code implementations • 4 Mar 2022 • Yanwu Yang, Xin Li, Bernard J. Jansen, Daniel Zeng
Originality: This is one of the first research works to explore collective group decisions and resulting phenomena in the complex context of search engine advertising via developing and validating a simulation framework that supports assessments of various advertising strategies and estimations of the impact of mechanisms on the search market.
1 code implementation • 2 Mar 2022 • Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing, Wai Lam
More specifically, we provide a new taxonomy for ABSA which organizes existing studies from the axes of concerned sentiment elements, with an emphasis on recent advances of compound ABSA tasks.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
no code implementations • CVPR 2022 • Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding
Deep learning based single image super-resolution models have been widely studied and superb results are achieved in upscaling low-resolution images with fixed scale factor and downscaling degradation kernel.
1 code implementation • 25 Feb 2022 • Shan Jia, Xin Li, Siwei Lyu
Then we take Deepfakes model attribution as a multiclass classification task and propose a spatial and temporal attention based method to explore the differences among Deepfakes in the new dataset.
1 code implementation • 15 Feb 2022 • Meng Zhou, Xin Li, Yue Jiang, Lidong Bing
Prompting shows promising results in few-shot scenarios.
no code implementations • 15 Feb 2022 • Soo Min Kwon, Xin Li, Anand D. Sarwate
We study the low-rank phase retrieval problem, where the objective is to recover a sequence of signals (typically images) given the magnitude of linear measurements of those signals.
1 code implementation • 8 Feb 2022 • Ao Luo, Fan Yang, Kunming Luo, Xin Li, Haoqiang Fan, Shuaicheng Liu
Our key idea is to decouple the context reasoning from the matching procedure, and exploit scene information to effectively assist motion estimation by learning to reason over the adaptive graph.
no code implementations • 6 Feb 2022 • Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li
Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.
no code implementations • 3 Feb 2022 • Peiying Zhang, Xue Pang, Yongjing Ni, Haipeng Yao, Xin Li
Virtual network embedding (VNE) is an crucial part of network virtualization (NV), which aims to map the virtual networks (VNs) to a shared substrate network (SN).
no code implementations • 24 Jan 2022 • Xingjiao Wu, Luwei Xiao, Xiangcheng Du, Yingbin Zheng, Xin Li, Tianlong Ma, Liang He
Our framework is an unsupervised document layout analysis framework.
no code implementations • 18 Jan 2022 • Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos
Most recently, machine learning has been used to study the dynamics of integrable Hamiltonian systems and the chaotic 3-body problem.
no code implementations • 11 Jan 2022 • Matthew Korban, Xin Li
This paper investigates different approaches to build and use digital human avatars toward interactive Virtual Co-presence (VCP) environments.
1 code implementation • CVPR 2022 • Ao Luo, Fan Yang, Xin Li, Shuaicheng Liu
Optical flow is a fundamental method used for quantitative motion estimation on the image plane.
no code implementations • CVPR 2022 • Shuai Liu, Xin Li, Huchuan Lu, You He
Multi-object tracking in unmanned aerial vehicle (UAV) videos is an important vision task and can be applied in a wide range of applications.
2 code implementations • 31 Dec 2021 • Hongyu Zang, Xin Li, Mingzhong Wang
This work explores how to learn robust and generalizable state representation from image-based observations with deep reinforcement learning methods.
2 code implementations • 22 Dec 2021 • Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández, Qinlong Wang, Yang Yang
Based on the MVP dataset, this paper reports methods and results in the Multi-View Partial Point Cloud Challenge 2021 on Completion and Registration.