no code implementations • 27 Jan 2018 • Wei Li, Zheng Yang, Xu Sun
Traditional Chinese Medicine (TCM) is an influential form of medical treatment in China and surrounding areas.
no code implementations • 1 May 2018 • Bo Zhang, Wei Li, Jie Hao, Xiao-Li Li, Meng Zhang
The layers between the source and target feature extractor are partially untied during the training stage to take both training efficiency and domain adaptation into consideration.
no code implementations • 29 Apr 2018 • Kai Yue, Lei Yang, Ruirui Li, Wei Hu, Fan Zhang, Wei Li
For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions.
no code implementations • 26 Apr 2018 • Honggang Zhou, Yunchun Li, Hailong Yang, Wei Li, Jie Jia
However, learning and inference in BN models are NP-hard, so the number of stochastic variables in a BN is highly constrained.
no code implementations • CVPR 2018 • Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
Most existing person re-identification (re-id) methods require supervised model learning from a separate large set of pairwise labelled training data for every single camera pair.
Ranked #22 on Unsupervised Domain Adaptation on Market to Duke
no code implementations • 5 Mar 2018 • Zhiyuan Zhang, Wei Li, Qi Su
In this paper, we propose to build an end-to-end neural model to automatically translate between ancient and contemporary Chinese.
no code implementations • 27 Jan 2018 • Wei Li, Yunfang Wu, Xueqiang Lv
Using low dimensional vector space to represent words has been very effective in many NLP tasks.
no code implementations • 11 Nov 2017 • Xiangyu Zhu, Yingying Jiang, Shuli Yang, Xiaobing Wang, Wei Li, Pei Fu, Hua Wang, Zhenbo Luo
Scene text detection is a challenging problem in computer vision.
no code implementations • 6 Nov 2017 • Wei Li, Zheng Yang
Traditional Chinese Medicine (TCM) has accumulated a large body of valuable resources over its long history of development.
no code implementations • 26 Oct 2017 • Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw
We develop streaming keyword spotting systems using a recurrent neural network transducer (RNN-T) model: an all-neural, end-to-end trained, sequence-to-sequence model which jointly learns acoustic and language model components.
no code implementations • ICCV 2017 • Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
Recognising semantic pedestrian attributes in surveillance images is a challenging task for computer vision, particularly when the imaging quality is poor with complex background clutter and uncontrolled viewing conditions, and the number of labelled training data is small.
no code implementations • 17 Sep 2017 • Wei Li, Yunfang Wu
In this paper, we focus on the problem of answer triggering addressed by Yang et al. (2015), which is a critical component for a real-world question answering system.
no code implementations • 9 Aug 2017 • Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Luc van Gool
Our new WebVision database and relevant studies in this work would benefit the advance of learning state-of-the-art visual models with minimum supervision based on web data.
no code implementations • 25 Jul 2017 • Fan Zhang, Chen Hu, Qiang Yin, Wei Li, Heng-Chao Li, Wen Hong
However, current deep-learning-based ATR solutions are limited in that each learning process handles only one SAR image, that is, it learns the static scattering information while missing the space-varying information.
no code implementations • 12 May 2017 • Wei Li, Xiatian Zhu, Shaogang Gong
Existing person re-identification (re-id) methods rely mostly on either localised or global feature representation alone.
Ranked #103 on Person Re-Identification on Market-1501
no code implementations • 16 May 2017 • Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Jesse Berent, Abhinav Gupta, Rahul Sukthankar, Luc van Gool
The 2017 WebVision challenge consists of two tracks, the image classification task on WebVision test set, and the transfer learning task on PASCAL VOC 2012 dataset.
no code implementations • CVPR 2017 • Wei Li, Farnaz Abtahi, Zhigang Zhu
Action Unit (AU) detection becomes essential for facial analysis.
no code implementations • 12 Mar 2017 • Saifeng Liu, Huaixiu Zheng, Yesu Feng, Wei Li
A novel deep learning architecture (XmasNet) based on convolutional neural networks was developed for the classification of prostate cancer lesions, using the 3D multiparametric MRI data provided by the PROSTATEx challenge.
no code implementations • 9 Feb 2017 • Wei Li, Farnaz Abtahi, Zhigang Zhu, Lijun Yin
For the enhancing layers, we designed an attention map based on facial landmark features and applied it to a pretrained neural network to conduct enhanced learning (The E-Net).
no code implementations • 28 Dec 2016 • Asad Khan, Luo Jiang, Wei Li, Ligang Liu
Our algorithm is not restricted to one-to-one image color transfer and can use more than one target image to transfer color to different regions of the source image.
no code implementations • 1 Nov 2016 • Wei Li, Brian Kan Wing Mak
In many natural language processing (NLP) tasks, a document is commonly modeled as a bag of words using the term frequency-inverse document frequency (TF-IDF) vector.
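The bag-of-words TF-IDF representation mentioned above can be illustrated with a minimal sketch. It assumes whitespace-tokenized, non-empty documents and the plain `log(N / df)` weighting; the function name `tfidf_vectors` and the toy documents are hypothetical, and library implementations usually add smoothing and normalization that are omitted here.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, vectors) with one TF-IDF vector per tokenized document.

    Assumes every document is a non-empty list of tokens.
    """
    vocab = sorted({term for doc in docs for term in doc})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Unsmoothed inverse document frequency (an assumption of this sketch).
    idf = {term: math.log(n_docs / df[term]) for term in vocab}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        # Term frequency (relative count) times inverse document frequency.
        vectors.append([counts[term] / total * idf[term] for term in vocab])
    return vocab, vectors


docs = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
]
vocab, vectors = tfidf_vectors(docs)
for term, weight in zip(vocab, vectors[0]):
    print(f"{term}: {weight:.3f}")
```

Terms that occur in every document receive zero weight under this weighting, which is why word order and common function words contribute nothing to the resulting vector.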
no code implementations • 14 Oct 2016 • Wei Li, Zhigang Zhu
We have found that features trained for one task can be used for other related tasks.
no code implementations • 1 Oct 2016 • Wei Li, Johannes Lederer
Feature selection is a standard approach to understanding and modeling high-dimensional classification data, but the corresponding statistical methods hinge on tuning parameters that are difficult to calibrate.
no code implementations • 15 Mar 2016 • Wei Li, Melvin Gauci, Roderich Gross
We present two case studies with swarms of simulated robots and prove that the underlying behaviors cannot be inferred by a metric-based system identification method.
no code implementations • 4 Aug 2016 • Wei Li, Christina Tsangouri, Farnaz Abtahi, Zhigang Zhu
In order to increase the expression recognition accuracy, we also fine-tune the CNN model and thus obtain a better CNN facial expression recognition model.
Facial Expression Recognition Facial Expression Recognition (FER)
no code implementations • 18 Jul 2016 • Wei Li, Matthias Breier, Dorit Merhof
Aiming at improving the performance of existing detection algorithms developed for different applications, we propose a region regression-based multi-stage class-agnostic detection pipeline, whereby the existing algorithms are employed for providing the initial detection proposals.
no code implementations • 10 Jul 2016 • Wei Li, Farnaz Abtahi, Christina Tsangouri, Zhigang Zhu
To evaluate the dataset, we compared the performance of two deep learning models trained on both GaMo and CIFE.
no code implementations • 8 Nov 2015 • Wei Li, Mingquan Qiu, Zhencai Zhu, Bo Wu, Gongbo Zhou
Bearing fault diagnosis has been a challenge in the monitoring activities of rotating machinery, and it's receiving more and more attention.
no code implementations • 28 Feb 2015 • Chongyang Zhang, Weiyao Lin, Wei Li, Bing Zhou, Jun Xie, Jijia Li
Image deblurring techniques play important roles in many image processing applications.
no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou
We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.
no code implementations • 10 Apr 2014 • Wei Hu, Wei Li, Fan Zhang, Qian Du
Decolorization is the process of converting a color image or video to its grayscale version, and it has received great attention in recent years.
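As a point of reference for what decolorization means, the sketch below applies the simplest common baseline: a fixed weighted sum of the red, green, and blue channels using the Rec. 601 luma weights. This is not the method of the listed paper, which is not described here; the function name `to_grayscale` and the tiny test image are hypothetical.

```python
# Rec. 601 luma weights for the red, green, and blue channels.
LUMA_WEIGHTS = (0.299, 0.587, 0.114)


def to_grayscale(pixels):
    """Convert rows of (R, G, B) tuples (0-255) to rows of integer gray levels."""
    wr, wg, wb = LUMA_WEIGHTS
    return [
        [round(wr * r + wg * g + wb * b) for (r, g, b) in row]
        for row in pixels
    ]


image = [
    [(255, 255, 255), (0, 0, 0)],
    [(255, 0, 0), (0, 0, 255)],
]
gray = to_grayscale(image)
print(gray)
```

A fixed weighting like this can map visibly different colors to the same gray level, which is the kind of contrast loss that dedicated decolorization methods try to avoid.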
no code implementations • 24 Feb 2014 • Huanguo Zhang, Sha Lv, Wei Li, Xun Qu
Instead of projecting an image to its nearest image, we try to project it to its nearest line spanned by two different face images.
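The idea of projecting onto a line rather than onto a single nearest image can be sketched as follows. This assumes images are flattened into plain feature vectors and that "the line spanned by two different face images" means the straight line through the two vectors; the paper may define it differently. The function names and the toy gallery are hypothetical.

```python
from itertools import combinations


def project_onto_line(x, a, b):
    """Project vector x onto the line through a and b.

    Returns (projection, distance from x to the projection).
    """
    direction = [bi - ai for ai, bi in zip(a, b)]
    denom = sum(d * d for d in direction)
    if denom == 0:
        raise ValueError("a and b must be different vectors")
    # Position of the projection along the line, with a at t=0 and b at t=1.
    t = sum((xi - ai) * d for xi, ai, d in zip(x, a, direction)) / denom
    projection = [ai + t * d for ai, d in zip(a, direction)]
    distance = sum((xi - pi) ** 2 for xi, pi in zip(x, projection)) ** 0.5
    return projection, distance


def nearest_line(x, gallery):
    """Return (distance, i, j) for the pair of gallery vectors whose line is closest to x."""
    best = None
    for i, j in combinations(range(len(gallery)), 2):
        _, dist = project_onto_line(x, gallery[i], gallery[j])
        if best is None or dist < best[0]:
            best = (dist, i, j)
    return best


gallery = [[0.0, 0.0], [2.0, 0.0], [0.0, 3.0]]
query = [1.0, 0.1]
print(nearest_line(query, gallery))
```

Searching all pairs costs quadratic time in the gallery size, so a practical system would likely restrict the pairs, for example to images of the same person.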
no code implementations • 16 Jan 2014 • Wei Li, Pascal Poupart, Peter van Beek
Previous studies have demonstrated that encoding a Bayesian network into a SAT formula and then performing weighted model counting using a backtracking search algorithm can be an effective method for exact inference.
no code implementations • 26 Jul 2018 • Yuzhe Ma, Ran Chen, Wei Li, Fanhua Shang, Wenjian Yu, Minsik Cho, Bei Yu
To address this issue, various approximation techniques have been investigated, which seek a lightweight network with little performance degradation in exchange for smaller model size or faster inference.
no code implementations • 29 Jul 2018 • Wei Li, Brian Mak
This paper further adds a distance constraint to the training objective function of NV so that the two embeddings of a parallel document are required to be as close as possible.
Cross-Lingual Document Classification Document Classification +5
no code implementations • 14 Aug 2018 • Zhiyuan Zhang, Wei Li, Jingjing Xu, Xu Sun
We define the primal meaning of an expression to be a frequently used sense of that expression from which its other frequent senses can be deduced.
no code implementations • 16 Aug 2018 • Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang, Xu Sun
In the experiments, we take a real-world sememe knowledge base HowNet and the corresponding descriptions of the words in Baidu Wiki for training and evaluation.
no code implementations • 23 Apr 2018 • Wei Li
Blockchain stores information into a chain of blocks, whose integrity is usually guaranteed by Proof of Work (PoW).
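How Proof of Work protects the integrity of a chain of blocks can be shown with a toy sketch. It assumes SHA-256 over a JSON payload and a difficulty expressed as a number of leading zero hex digits; the field names, the helper names, and the difficulty value are hypothetical and far simpler than any production blockchain.

```python
import hashlib
import json


def block_digest(index, data, prev_hash, nonce):
    """Hash the block fields with SHA-256 and return the hex digest."""
    payload = json.dumps(
        {"index": index, "data": data, "prev_hash": prev_hash, "nonce": nonce},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def mine_block(index, data, prev_hash, difficulty=3):
    """Search for a nonce whose digest starts with `difficulty` zero hex digits."""
    prefix = "0" * difficulty
    nonce = 0
    while True:
        digest = block_digest(index, data, prev_hash, nonce)
        if digest.startswith(prefix):
            return {"index": index, "data": data, "prev_hash": prev_hash,
                    "nonce": nonce, "hash": digest}
        nonce += 1


def is_valid_chain(chain, difficulty=3):
    """Check each block's proof of work and its link to the previous block."""
    prefix = "0" * difficulty
    for i, block in enumerate(chain):
        digest = block_digest(block["index"], block["data"],
                              block["prev_hash"], block["nonce"])
        if digest != block["hash"] or not digest.startswith(prefix):
            return False
        if i > 0 and block["prev_hash"] != chain[i - 1]["hash"]:
            return False
    return True


chain = []
prev_hash = "0" * 64  # placeholder hash for the first block
for i, data in enumerate(["genesis", "alice pays bob", "bob pays carol"]):
    block = mine_block(i, data, prev_hash)
    chain.append(block)
    prev_hash = block["hash"]
print(is_valid_chain(chain))
```

Changing the data of any block invalidates its stored hash, and repairing it would require redoing the nonce search for that block and every block after it, which is what makes tampering expensive.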
Cryptography and Security Distributed, Parallel, and Cluster Computing
no code implementations • 9 Oct 2018 • Wei Li, Zehuan Yuan, Xiangzhong Fang, Changhu Wang
Attention mechanisms have been widely used in Visual Question Answering (VQA) solutions due to their capacity to model deep cross-domain interactions.
no code implementations • 25 Nov 2018 • Keren Ye, Mingda Zhang, Wei Li, Danfeng Qin, Adriana Kovashka, Jesse Berent
To alleviate the cost of obtaining accurate bounding boxes for training today's state-of-the-art object detection models, recent weakly supervised detection work has proposed techniques to learn from image-level labels.
no code implementations • EMNLP 2018 • Wei Li, Xinyan Xiao, Yajuan Lyu, Yuanzhuo Wang
Information selection is the most important component in the document summarization task.
Ranked #32 on Abstractive Text Summarization on CNN / Daily Mail
no code implementations • EMNLP 2018 • Wei Li, Xinyan Xiao, Yajuan Lyu, Yuanzhuo Wang
Recent neural sequence-to-sequence models have shown significant progress on short text summarization.
Ranked #43 on Abstractive Text Summarization on CNN / Daily Mail
no code implementations • EACL 2017 • Wei Li, Brian Mak
In many natural language processing (NLP) tasks, a document is commonly modeled as a bag of words using the term frequency-inverse document frequency (TF-IDF) vector.
no code implementations • COLING 2016 • Wei Li, Lei He, Hai Zhuge
This paper studies the abstractive multi-document summarization for event-oriented news texts through event information extraction and abstract representation.
no code implementations • COLING 2016 • Lei He, Wei Li, Hai Zhuge
This paper investigates differential topic models (dTM) for summarizing the differences among document groups.
no code implementations • NeurIPS 2018 • Xundong Wu, Xiangwen Liu, Wei Li, Qing Wu
In this study, we model such local nonlinearity of dendritic trees with our dendritic neural network (DENN) structure and apply this structure to typical machine learning tasks.
no code implementations • NeurIPS 2017 • Roderich Gross, Yue Gu, Wei Li, Melvin Gauci
In this paper we examine how these algorithms relate to the Turing test, and derive what - from a Turing perspective - can be considered their defining features.
no code implementations • 27 Dec 2018 • Husheng Zhou, Wei Li, Yuankun Zhu, Yuqun Zhang, Bei Yu, Lingming Zhang, Cong Liu
Furthermore, DeepBillboard is sufficiently robust and resilient for generating physical-world adversarial billboard tests for real-world driving under various weather conditions.
no code implementations • 23 Jan 2019 • Xin Wu, Danfeng Hong, Jiaojiao Tian, Jocelyn Chanussot, Wei Li, Ran Tao
To this end, we propose a novel object detection framework, called optical remote sensing imagery detector (ORSIm detector), integrating diverse channel features extraction, feature learning, fast image pyramid matching, and boosting strategy.
no code implementations • CVPR 2013 • Wei Li, Xiaogang Wang
In this paper, we propose a new approach for matching images observed in different camera views with complex cross-view transforms and apply it to person reidentification.
no code implementations • CVPR 2014 • Wei Li, Rui Zhao, Tong Xiao, Xiaogang Wang
In this paper, we propose a novel filter pairing neural network (FPNN) to jointly handle misalignment, photometric and geometric transforms, occlusions and background clutter.
no code implementations • 9 May 2019 • Wen-Shuai Hu, Heng-Chao Li, Lei Pan, Wei Li, Ran Tao, Qian Du
Particularly, long short-term memory (LSTM), as a special deep learning structure, has shown great ability in modeling long-term dependencies in the time dimension of video or the spectral dimension of HSIs.
no code implementations • ACL 2019 • Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel
Simultaneous machine translation begins to translate each source sentence before the source speaker is finished speaking, with applications to live and streaming scenarios.
no code implementations • 3 Jul 2019 • Wei Li, Zehuan Yuan, Dashan Guo, Lei Huang, Xiangzhong Fang, Changhu Wang
To perform action detection, we design a 3D convolution network with skip connections for tube classification and regression.
no code implementations • 7 Sep 2019 • Deli Chen, Yankai Lin, Wei Li, Peng Li, Jie Zhou, Xu Sun
Graph Neural Networks (GNNs) have achieved promising performance on a wide range of graph-based tasks.
Ranked #52 on Node Classification on Cora
no code implementations • 18 Sep 2019 • Wei Li, Shuheng Li, Shuming Ma, Yancheng He, Deli Chen, Xu Sun
A graph is a natural structure for describing the complicated relations between tokens.
no code implementations • 1 Oct 2019 • Wei Zhang, Wei Li, Lei Han
Very short-term convective storm forecasting, termed nowcasting, has long been an important issue and has attracted substantial interest.
no code implementations • COLING 2016 • Wei Li, Yunfang Wu
In this paper we focus on the problem of dialog act (DA) labelling.
no code implementations • 21 Oct 2019 • Kai Bai, Wei Li, Mathieu Desbrun, Xiaopei Liu
We propose a novel dictionary-based neural network which learns both a fast evaluation of sparse patch encoding and a dictionary of corresponding coarse and fine patches from a sequence of example simulations computed with any numerical solver.
Graphics
no code implementations • 7 Feb 2020 • Wei Li, Amin Kiaghadi, Clint N. Dawson
Accurate and efficient models for rainfall runoff (RR) simulations are crucial for flood risk management.
no code implementations • 28 Mar 2020 • Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao
Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i.e., word error rate (WER), and latency, i.e., the time the hypothesis is finalized after the user stops speaking.
no code implementations • 13 Apr 2020 • Tao Zhang, Wei Li
On ImageNet, accuracy is improved by 1.25%.
no code implementations • 10 Jul 2015 • An Chang, Joshua Cooper, Wei Li
In this paper, we study the analytic connectivity of a $k$-uniform hypergraph $H$, denoted by $\alpha(H)$.
Combinatorics 05C65 (Primary), 05C40, 05B05, 26D15 (Secondary)
no code implementations • 19 May 2020 • Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma
Accent conversion (AC) transforms a non-native speaker's accent into a native accent while maintaining the speaker's voice timbre.
no code implementations • CVPR 2020 • Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, Hongdong Li, Ruigang Yang
Second, we propose a new framework for real-world DSR, which consists of four modules: 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively re-exploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss.
no code implementations • 16 Jul 2020 • Xin Wu, Wei Li, Danfeng Hong, Jiaojiao Tian, Ran Tao, Qian Du
In addition, the generalization ability of Ms-AFt in dense remote sensing scenes is further verified on stereo aerial imagery of a large camping site.
no code implementations • 16 Jul 2020 • Feixiang Lu, Zongdai Liu, Xibin Song, Dingfu Zhou, Wei Li, Hui Miao, Miao Liao, Liangjun Zhang, Bin Zhou, Ruigang Yang, Dinesh Manocha
We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image for autonomous driving.
no code implementations • 14 Aug 2020 • Wensheng Cheng, Hao Luo, Wen Yang, Lei Yu, Wei Li
We then propose a structure-aware network for lane marker extraction in DVS images.
no code implementations • 19 Aug 2020 • Wei Li, Brian Mak
LASER, one of the current state-of-the-art multilingual document embedding models, is based on the bidirectional LSTM neural machine translation model.
no code implementations • 19 Aug 2020 • Xiangtong Wang, Binbin Liang, Menglong Yang, Wei Li
Current computer vision tasks based on deep learning require a huge amount of data with annotations for model training or testing, especially in some dense estimation tasks, such as optical flow segmentation and depth estimation.
no code implementations • 19 Aug 2020 • Ying Qu, Razieh Kaviani Baghbaderani, Wei Li, Lianru Gao, Hairong Qi
Transfer learning-based methods address this problem by pre-training in the source domain and fine-tuning on the target domain.
General Classification Hyperspectral Image Classification +1
no code implementations • 30 Aug 2020 • Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He
The 2nd-pass model plays a key role in the quality improvement of the end-to-end model to surpass the conventional model.
no code implementations • ECCV 2020 • Niamul Quader, Juwei Lu, Peng Dai, Wei Li
State-of-the-art approaches to video-based action and gesture recognition often employ two key concepts: First, they employ multistream processing; second, they use an ensemble of convolutional networks.
Ranked #1 on Action Classification on Jester test
no code implementations • 26 Jun 2020 • Wei Li, Ruihan Bao, Keiko Harimoto, Deli Chen, Jingjing Xu, Qi Su
Further analysis shows that the introduction of the graph enables our model to predict the movement of stocks that are not directly associated with news as well as the whole market, which is not available in most previous methods.
no code implementations • 30 Sep 2020 • Mingchi Zhang, Xuemin Chen, Wei Li
However, the negative pressure waves or guided stress waves may not be easily detected with environmental interference, e.g., the oil and gas pipelines in offshore environment.
no code implementations • 1 Jan 2021 • Wei Li, Ruxuan Li, Yuzhe Ma, Siu On Chan, Bei Yu
To characterize the power of GNNs for the graph coloring problem, we first formalize the discrimination power of GNNs as the capability to assign nodes different colors.
no code implementations • 1 Jan 2021 • Ali Ghobadzadeh, Deepak Sridhar, Juwei Lu, Wei Li
In this paper, we probe this direction by deriving a relationship between the estimation of unknown parameters of the probability density function (pdf) of input data and classification accuracy.
no code implementations • 8 Apr 2020 • Yifu Sun, Xulong Zhang, Yi Yu, Xi Chen, Wei Li
Singing voice detection (SVD), to recognize vocal parts in the song, is an essential task in music information retrieval (MIR).
no code implementations • 18 Dec 2020 • Xingxing Zuo, Nathaniel Merrill, Wei Li, Yong Liu, Marc Pollefeys, Guoquan Huang
In this work, we present a lightweight, tightly-coupled deep depth network and visual-inertial odometry (VIO) system, which can provide accurate state estimates and dense depth maps of the immediate surroundings.
no code implementations • 29 Dec 2020 • Wei Tao, Wei Li, Zhisong Pan, Qing Tao
In order to remove this factor, we first develop gradient descent averaging (GDA), which is a general projection-based dual averaging algorithm in the strongly convex setting.
no code implementations • 13 Jan 2021 • Wei Li, Denis Mike Becker
In the context of trade liberalisation and market harmonisation in the European markets, accurate price forecasting becomes difficult for electricity market participants to obtain because electricity forecasting requires the consideration of features from ever-growing coupling markets.
no code implementations • 14 Jan 2021 • Jing-Rong Wang, Wei Li, Chang-Jin Zhang
The physical essences of the quantum critical points are determined by analyzing the susceptibility exponents for all of the source terms in particle-hole and particle-particle channels.
Strongly Correlated Electrons Materials Science
no code implementations • 27 Jan 2021 • Yuan Da Liao, Han Li, Zheng Yan, Hao-Tian Wei, Wei Li, Yang Qi, Zi Yang Meng
Quantum Ising model on a triangular lattice hosts a finite temperature Berezinskii-Kosterlitz-Thouless (BKT) phase with emergent U(1) symmetry, and it will transit into an up-up-down (UUD) phase with $C_3$ symmetry breaking upon an infinitesimal external field along the longitudinal direction, but the overall phase diagram spanned by the axes of external field and temperature remains opaque due to the lack of systematic investigations with controlled methodologies.
Strongly Correlated Electrons Statistical Mechanics
no code implementations • 16 Feb 2021 • Jianing Zhang, Wei Li, Honggang Gou, Lu Fang, Ruigang Yang
In this paper, we propose LEAD, i.e., LiDAR Extender for Autonomous Driving, to extend the MEMS LiDAR by coupled image w.r.t. both FoV and range.
no code implementations • 17 Feb 2021 • Tao Liu, Xin-Yang Liu, Yuan Gao, Hai Jin, Jun He, Xian-Lei Sheng, Wentao Jin, Ziyu Chen, Wei Li
Strong fluctuations in the low-$T$ quantum critical regime can give rise to a large thermal entropy change and thus significant cooling effect when approaching the QCP.
Strongly Correlated Electrons
no code implementations • 25 Feb 2021 • Christina Kaiser, Oskar J. Sandberg, Nasim Zarrabi, Wei Li, Paul Meredith, Ardalan Armin
A simple model is presented that explains absorption line-shapes of disordered systems, and we also provide a strategy to determine the excitonic disorder energy.
Optics Disordered Systems and Neural Networks
no code implementations • 11 Mar 2021 • David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw
We study the problem of word-level confidence estimation in subword-based end-to-end (E2E) models for automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 26 Feb 2021 • Chenxi Zhou, Bin Yang, Wenliang Fan, Wei Li
(3) The detection of neural disease was demonstrated to benefit from the thermodynamic model, implying the immense potential of thermodynamics in auxiliary diagnosis.
no code implementations • 1 Mar 2021 • Xiuqing Li, Wei Li, Xinlin Yi, Qihang Huang, Yuhang Wang, Chenzhe Ye
With the path-specific parameters obtained by the proposed channel tracking, the proposed PTRM can match not only the time dispersion, as conventional PTRM does, but also the doubly-spread channel, since the path-specific delay and Doppler scaling factor help to match the channel in both the time and frequency domains.
no code implementations • 9 Apr 2021 • Léni K. Le Goff, Edgar Buchanan, Emma Hart, Agoston E. Eiben, Wei Li, Matteo De Carlo, Alan F. Winfield, Matthew F. Hale, Robert Woolley, Mike Angus, Jon Timmis, Andy M. Tyrrell
This causes a potential mismatch between the structure of an inherited controller and the new body.
no code implementations • 1 Mar 2021 • Qihang Huang, Wei Li, Weicheng Zhan, Yuhang Wang, Rongrong Guo
A model based on the underwater acoustic channel's correlation can be used as the state-space model in the Kalman filter to improve underwater acoustic channel tracking compared with tracking without a model.
no code implementations • 4 Sep 2020 • Zhixing Lin, Shuqian Sun, Jose Azana, Wei Li, Ninghua Zhu, Ming Li
This concept represents a novel one-dimensional realization of artificial neural networks, enabling an efficient application of optical deep learning methods to the analysis and processing of serial data signals, while offering a new overall perspective for the temporal signal processing.
no code implementations • 15 Apr 2021 • Lei Zhang, Wei Bai, Wei Li, Shiming Xia, Qibin Zheng
To achieve these results, we pose discovering attack paths as a Reinforcement Learning (RL) problem and train an agent to discover multi-domain cyberspace attack paths.
no code implementations • 29 Apr 2021 • Dengcheng Yan, Youwen Zhang, Wei Li, Yiwen Zhang
Signed network embedding is an approach to learn low-dimensional representations of nodes in signed networks with both positive and negative links, which facilitates downstream tasks such as link prediction with general data mining frameworks.
no code implementations • ACL 2021 • Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu, Haifeng Wang
Abstractive summarization for long-document or multi-document remains challenging for the Seq2Seq architecture, as Seq2Seq is not good at analyzing long-distance relations in text.
no code implementations • ACL 2021 • Zixuan Li, Xiaolong Jin, Saiping Guan, Wei Li, Jiafeng Guo, Yuanzhuo Wang, Xueqi Cheng
Specifically, at the clue searching stage, CluSTeR learns a beam search policy via reinforcement learning (RL) to induce multiple clues from historical facts.
no code implementations • 7 Jun 2021 • Zhipeng Cai, Zuobin Xiong, Honghui Xu, Peng Wang, Wei Li, Yi Pan
Generative Adversarial Networks (GAN) have promoted a variety of applications in computer vision, natural language processing, etc.
no code implementations • NeurIPS 2021 • Zhaowen Li, Zhiyang Chen, Fan Yang, Wei Li, Yousong Zhu, Chaoyang Zhao, Rui Deng, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang
More importantly, the masked tokens together with the remaining tokens are further recovered by a global image decoder, which preserves the spatial information of the image and is more friendly to the downstream dense prediction tasks.
no code implementations • 26 Jun 2021 • Jingxuan Li, Rui Huang, Wei Li, Kai Yao, Weiguo Tan
We integrate this ranking scheme with two frequency models and a GPT-2 styled language model, along with the acceptance model to yield 27.80% and 37.64% increase in TOP1 and TOP5 accuracy, respectively.
no code implementations • 6 Jul 2021 • Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia
We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker.
no code implementations • 8 Jul 2021 • Wei Li, Dezhao Luo, Bo Fang, Yu Zhou, Weiping Wang
As a result, we can leverage the spatial information (the size of objects) and temporal information (the direction and magnitude of motions) as our learning targets.
no code implementations • 18 Jul 2021 • Xiaoheng Sun, Yiliang Jiang, Wei Li
Phonation mode is an essential characteristic of singing style as well as an important expression of performance.
no code implementations • 19 Jul 2021 • Wei Li, Florentina Paraschiv, Georgios Sermpinis
The rapid development of artificial intelligence methods contributes to their wide applications for forecasting various financial risks in recent years.
no code implementations • 19 Aug 2021 • Mingren Shen, Guanzhao Li, Dongxia Wu, YuHan Liu, Jacob Greaves, Wei Hao, Nathaniel J. Krakauer, Leah Krudy, Jacob Perez, Varun Sreenivasan, Bryan Sanchez, Oigimer Torres, Wei Li, Kevin Field, Dane Morgan
Electron microscopy is widely used to explore defects in crystal structures, but human detection of defects is often time-consuming, error-prone, and unreliable, and is not scalable to large numbers of images or real-time analysis.
no code implementations • 20 Aug 2021 • Zhiyuan Zhang, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu Sun
Besides the security concerns of potential adversarial examples, adversarial training can also improve the generalization ability of neural networks, train robust neural networks, and provide interpretability for neural networks.
no code implementations • 3 Sep 2021 • Xiaoheng Sun, Qiqi He, Yongwei Gao, Wei Li
Recently, some single-step systems without onset detection have shown their effectiveness in automatic musical tempo estimation.
no code implementations • SEMEVAL 2021 • Wei Li, Harish Tayyar Madabushi, Mark Lee
This paper describes our submission to SemEval 2021 Task 2.
no code implementations • 15 Sep 2021 • Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He
Previous works on the Recurrent Neural Network-Transducer (RNN-T) models have shown that, under some conditions, it is possible to simplify its prediction network with little or no loss in recognition accuracy (arXiv:2003.07705 [eess.AS], [2], arXiv:2012.06749 [cs.CL]).
no code implementations • ICCV 2021 • Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales
The topical domain generalization (DG) problem asks trained models to perform well on an unseen target domain with different data statistics from the source training domains.
no code implementations • 7 Oct 2021 • Gaojian Wang, Qian Jiang, Xin Jin, Wei Li, Xiaohui Cui
Moreover, we make a key observation that subtle forgery artifacts can be further exposed in the patch-wise phase and amplitude spectrum and exhibit different clues.
no code implementations • 29 Sep 2021 • Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Wenping Wang
In this paper, we study a new problem named Referring Self-supervised Learning (RSL) on 3D scene understanding: Given the 3D synthetic models with labels and the unlabeled 3D real scene scans, our goal is to distinguish the identical semantic objects on an unseen scene according to the referring synthetic 3D models.
no code implementations • 16 Oct 2021 • Wei Li
The action space is restricted to the UI elements plus a few global actions.
no code implementations • 25 Oct 2021 • Xin Wu, Wei Li, Danfeng Hong, Ran Tao, Qian Du
Owing to effective and flexible data acquisition, unmanned aerial vehicles (UAVs) have recently become a hotspot across the fields of computer vision (CV) and remote sensing (RS).
no code implementations • CCL 2020 • Xiaodong Yan, Xiaoqing Xie, Yu Zou, Wei Li
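Seq2seq neural network models have achieved good results in text summarization research for Chinese and English, but text summarization for low-resource languages is still at an exploratory stage, especially for Tibetan. In addition, no large-scale annotated corpus currently exists for summary extraction. This paper proposes a unified model for generating Tibetan news summaries. The TextRank algorithm is used to address the shortage of annotated Tibetan training data. Then, a two-layer bidirectional GRU neural network extracts the sentences that represent the original news, reducing redundant information. Finally, an attention-based Seq2Seq model generates the abstractive summary. We also add a pointer network to handle out-of-vocabulary words. Experimental results show that the ROUGE-1 score improves by 2% over the traditional model. Keywords: text summarization; Tibetan; TextRank; pointer network; Bi-GRU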
no code implementations • 25 Sep 2019 • Wei Li, Shaogang Gong, Xiatian Zhu
We address this limitation by additionally exploiting feature self-calibration operations, resulting in a heterogeneous search space.
no code implementations • 25 Nov 2021 • Wenxuan Ma, Jinming Zhang, Shuang Li, Chi Harold Liu, Yulin Wang, Wei Li
Unsupervised Domain Adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain.
no code implementations • 5 Dec 2021 • Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li
On the one hand, audio encoder and visual encoder separately encode audio data and visual data into two different latent spaces.
no code implementations • 31 Dec 2021 • Jianmin Shen, Feiyi Liu, Shiyang Chen, Dian Xu, Xiangna Chen, Shengfeng Deng, Wei Li, Gabor Papp, Chunbin Yang
With the DANN, only a small, automatically chosen fraction of input configurations (2D images) needs to be labeled in order to capture the critical point.
no code implementations • 31 Dec 2021 • Dawei Wang, Lingping Gao, Ziquan Lan, Wei Li, Jiaping Ren, Jiahui Zhang, Peng Zhang, Pei Zhou, Shengao Wang, Jia Pan, Dinesh Manocha, Ruigang Yang
Recently, there have been many advances in the autonomous driving community, attracting a lot of attention from academia and industry.
no code implementations • 3 Jan 2022 • Wei Li, Ksenia Abrashitova, Gerwin Osnabrugge, Lyubov V. Amitonova
A multimode fiber represents the ultimate limit in miniaturization of imaging endoscopes.
no code implementations • 12 Jan 2022 • Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li
Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have recently been integrated to take advantage of both methods for better exploration and exploitation. The evolutionary part in these hybrid methods maintains a population of policy networks. However, existing methods focus on optimizing the parameters of the policy network, which are usually high-dimensional and tricky for EAs. In this paper, we shift the target of evolution from the high-dimensional parameter space to the low-dimensional action space. We propose Evolutionary Action Selection-Twin Delayed Deep Deterministic Policy Gradient (EAS-TD3), a novel hybrid method of EA and DRL. In EAS, we focus on optimizing the action chosen by the policy network and attempt to obtain high-quality actions to promote policy learning through an evolutionary algorithm.
no code implementations • 18 Jan 2022 • Ying Wang, Yuexing Peng, Xinran Liu, Wei Li, George C. Alexandropoulos, Junchuan Yu, Daqing Ge, Wei Xiang
Extracting roads from high-resolution remote sensing images (HRSIs) is vital in a wide variety of applications, such as autonomous driving, path planning, and road navigation.
no code implementations • 20 Jan 2022 • Zengfu Hou, Wei Li
Multi-temporal hyperspectral images can be used to detect changed information, which has gradually attracted researchers' attention.
no code implementations • 1 Mar 2022 • Kaiqi Fu, Shaojun Gao, Kai Wang, Wei Li, Xiaohai Tian, Zejun Ma
Moreover, we utilize multi-source information (e.g., MFCC and deep features) to further improve the scoring system performance.
no code implementations • 10 Mar 2022 • Wei Li, Wenhao Wu, Moye Chen, Jiachen Liu, Xinyan Xiao, Hua Wu
In this survey, we provide a systematic overview of the research progress on the faithfulness problem of NLG, including problem analysis, evaluation metrics and optimization methods.
no code implementations • CVPR 2022 • Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang
Furthermore, our method can also exploit single-centric-object datasets such as ImageNet and outperforms BYOL by 2.5% with the same pre-training epochs in linear probing, and surpasses current self-supervised object detection methods on the COCO dataset, demonstrating its universality and potential.
no code implementations • 14 Mar 2022 • Siao Liu, Zhaoyu Chen, Wei Li, Jiwei Zhu, Jiafeng Wang, Wenqiang Zhang, Zhongxue Gan
Recently, adversarial attacks have been applied in visual object tracking to deceive deep trackers by injecting imperceptible perturbations into video frames.
no code implementations • 20 Mar 2022 • Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Wenping Wang
Promising performance has been achieved for visual perception on the point cloud.
no code implementations • 31 Mar 2022 • Weicheng Kuo, Fred Bertsch, Wei Li, AJ Piergiovanni, Mohammad Saffar, Anelia Angelova
We propose FindIt, a simple and versatile framework that unifies a variety of visual grounding and localization tasks including referring expression comprehension, text-based localization, and object detection.
no code implementations • CVPR 2022 • Jiarui Cai, Mingze Xu, Wei Li, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto
We propose an online tracking algorithm that performs the object detection and data association under a common framework, capable of linking objects after a long time span.
no code implementations • 1 Apr 2022 • Chenzhong Gao, Wei Li, Ran Tao, Qian Du
Considering the characteristics and differences of multi-source remote sensing images, a feature-based registration algorithm named Multi-scale Histogram of Local Main Orientation (MS-HLMO) is proposed.
no code implementations • 6 Apr 2022 • Shimin Chen, Wei Li, Chen Chen, Jianyang Gu, Jiaming Chu, Xunqiang Tao, Yandong Guo
SEAL consists of two kinds of annotations, SEAL Tubes and SEAL Clips.
no code implementations • 6 Apr 2022 • Shimin Chen, Chen Chen, Wei Li, Xunqiang Tao, Yandong Guo
In this paper, we propose a unified network for TAD, termed Faster-TAD, by re-purposing a Faster-RCNN like architecture.
no code implementations • 9 Apr 2022 • Heng-Chao Li, Wen-Shuai Hu, Wei Li, Jun Li, Qian Du, Antonio Plaza
The problem of effectively exploiting the information from multiple data sources has become a relevant but challenging research topic in remote sensing.
no code implementations • 24 Apr 2022 • Yanxiong Li, Wucheng Wang, Hao Chen, Wenchang Cao, Wei Li, Qianhua He
Although few-shot learning has attracted much attention from the fields of image and audio classification, few efforts have been made on few-shot speaker identification.
no code implementations • 2 May 2022 • AJ Piergiovanni, Wei Li, Weicheng Kuo, Mohammad Saffar, Fred Bertsch, Anelia Angelova
We present Answer-Me, a task-aware multi-task framework which unifies a variety of question answering tasks, such as visual question answering, visual entailment, and visual reasoning.
no code implementations • 18 May 2022 • Wei Li, Bin Yang, Junsheng Qiao
In this paper, the depiction of $(O, G)$-granular variable precision fuzzy rough sets ($(O, G)$-GVPFRSs for short) is first given based on overlap and grouping functions.
no code implementations • 18 May 2022 • Na Liu, Wei Li, Yinjian Wang, Rao Tao, Qian Du, Jocelyn Chanussot
The ability of capturing fine spectral discriminative information enables hyperspectral images (HSIs) to observe, detect and identify objects with subtle spectral discrepancy.
no code implementations • 13 May 2022 • Gongao Qi, Bin Yang, Wei Li
In order to further generalize the FRS theory to more complicated data environments, we firstly propose four types of fuzzy neighborhood operators based on fuzzy covering by overlap functions and their implicators in this paper.
no code implementations • 25 May 2022 • Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, Javen Qinfeng Shi, Dong Gong, Dan Zhu, Mengdi Sun, Guannan Chen, Yang Hu, Haowei Li, Baozhu Zou, Zhen Liu, Wenjie Lin, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Mingyan Han, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Juan Marín-Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger, Chunyang Li, Long Bao, Gang He, Ziyao Xu, Li Xu, Gen Zhan, Ming Sun, Xing Wen, Junlin Li, Shuang Feng, Fei Lei, Rui Liu, Junxiang Ruan, Tianhong Dai, Wei Li, Zhan Lu, Hengyan Liu, Peian Huang, Guangyu Ren, Yonglin Luo, Chang Liu, Qiang Tu, Fangya Li, Ruipeng Gang, Chenghua Li, Jinjing Li, Sai Ma, Chenming Liu, Yizhen Cao, Steven Tel, Barthelemy Heyrman, Dominique Ginhac, Chul Lee, Gahyeon Kim, Seonghyun Park, An Gia Vien, Truong Thanh Nhat Mai, Howoon Yoon, Tu Vo, Alexander Holston, Sheir Zaheer, Chan Y. Park
The challenge is composed of two tracks with an emphasis on fidelity and complexity constraints: In Track 1, participants are asked to optimize objective fidelity scores while imposing a low-complexity constraint (i.e., solutions cannot exceed a given number of operations).
no code implementations • 30 May 2022 • Ye Zheng, Xiang Wang, Yu Qi, Wei Li, Liwei Wu
Since the MVTec AD dataset was proposed, a steady stream of new methods has pushed precision on it to saturation.
no code implementations • 28 May 2022 • Shuang Li, Ke Li, Wei Li
Constraint violation has been a building block to design evolutionary multi-objective optimization algorithms for solving constrained multi-objective optimization problems.
no code implementations • CVPR 2022 • Wei Li, Haiwei Zhang, Qijie Bai, Guoqing Zhao, Ning Jiang, Xiaojie Yuan
However, the application value of SG on downstream tasks is severely limited by the predicate classification bias, which is caused by long-tailed data and presented as semantic bias of predicted relation predicates.
no code implementations • 5 Jun 2022 • Ao Wang, Wei Li, Xin Wu, Zhanchao Huang, Ran Tao
To this end, a multi-patch attention network (MPANet) based on the axial-attention encoder and the multi-scale patch branch (MSPB) structure is proposed.
no code implementations • 20 Jun 2022 • Wei Li, Shuai Xiao, Tianhong Dai, Shanxin Yuan, Tao Wang, Cheng Li, Fenglong Song
To further leverage these two paradigms, we propose a selective and joint HDR and denoising (SJ-HD$^2$R) imaging framework, utilizing scenario-specific priors to conduct the path selection with an accuracy of more than 93.3$\%$.
no code implementations • 13 May 2022 • Wei Li, Bin Yang, Junsheng Qiao
In this paper, we mainly construct three types of $L$-fuzzy $\beta$-covering-based rough set models and study the axiom sets, matrix representations and interdependency of these three pairs of $L$-fuzzy $\beta$-covering-based rough approximation operators.
no code implementations • 4 Jul 2022 • Jing Wang, Jiangyun Li, Wei Li, Lingfei Xuan, Tianxiang Zhang, Wenxuan Wang
Contextual information is critical for various computer vision tasks, and previous works commonly design plug-and-play modules and structural losses to effectively extract and aggregate the global context.
no code implementations • 15 Aug 2022 • Wei Li, Ruxuan Li, Yuzhe Ma, Siu On Chan, David Pan, Bei Yu
Graph coloring, a classical and critical NP-hard problem, is the problem of assigning colors to nodes so that connected nodes receive different colors as far as possible.
no code implementations • 4 Sep 2022 • Yuxiang Zhang, Wei Li, Weidong Sun, Ran Tao, Qian Du
Currently, cross-scene hyperspectral image (HSI) classification has drawn increasing attention.
no code implementations • 6 Sep 2022 • Yuxiang Zhang, Mengmeng Zhang, Wei Li, Shuai Wang, Ran Tao
Text information including extensive prior knowledge about land cover classes has been ignored in hyperspectral image classification (HSI) tasks.
no code implementations • 16 Sep 2022 • Zhanchao Huang, Wei Li, Xiang-Gen Xia, Hao Wang, Feiran Jie, Ran Tao
Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions, and a dynamic receptive field (DRF) mechanism is developed to maintain high accuracy by customizing the convolution kernel and its perception range dynamically when reducing the network complexity.
no code implementations • 19 Sep 2022 • Dichucheng Li, Yulun Wu, Qinyu Li, Jiahao Zhao, Yi Yu, Fan Xia, Wei Li
Because each Guzheng playing technique is applied to a note, a dedicated onset detector is trained to divide an audio into several notes and its predictions are fused with frame-wise IPT predictions.
no code implementations • 23 Sep 2022 • Kang Xu, Yan Ma, Wei Li
Our key insight is that dynamic systems with different parameters provide different levels of difficulty for the policy, and the difficulty of behaving well in a system is constantly changing due to the evolution of the policy.
no code implementations • 24 Sep 2022 • Kang Xu, Yan Ma, Bingsheng Wei, Wei Li
While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations.
no code implementations • CCL 2022 • Tian Huang, Yanqiu Shao, Wei Li
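Semantic dependency graphs are a deep analysis method for handling semantics in NLP, capable of analyzing the semantic relations between the words in a sentence. Addressing the characteristics of ancient Chinese, this paper formulates an annotation specification for ancient Chinese semantic dependency graphs and, using the Twenty-Four Histories as the corpus source, completes an annotated ancient Chinese semantic dependency graph bank of 3,000 sentences, with an annotation-consistency kappa value of 78.83%. Through comparison with a modern Chinese semantic dependency graph bank, we compile statistics on the basic properties of the graph bank and analyze the semantic characteristics and regularities of ancient Chinese. The statistics show that the semantic distribution of ancient Chinese broadly follows Zipf's law, and that its description of semantic events has strong features of historical narrative and formal style: for example, it centers on biographies of individuals, describes peripheral roles such as time and place in detail, uses calm and objective narrative language, and lacks modifiers describing modality, mood, degree, temporal state, and the like.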
no code implementations • CCL 2022 • Kuai Yu, Yanqiu Shao, Wei Li
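Supervised machine translation based on deep learning has achieved good results, but training requires a large amount of high-quality aligned corpora. For translation between ancient and modern Chinese, high-quality parallel corpora are scarce, whereas coarsely aligned document-level and paragraph-level corpora are relatively easy to obtain, so corpus alignment is both valuable and necessary to study. In sentence alignment research on bilingual parallel corpora, traditional methods build a composite criterion from grammatical information such as length, vocabulary, and co-occurring characters in the bilingual text to measure the similarity between two sentences. Although such methods achieve good results on one-to-one sentence alignment, their ability to match sentence semantics is limited, and they perform poorly on some many-to-many alignment patterns. In this paper, we propose using pre-trained language models, which are developing rapidly and have strong semantic representation ability, to take bilingual semantic information into account. However, a pre-trained language model used alone can only consider relatively local information, so we propose a reinforcement learning training objective based on a dynamic programming algorithm to integrate paragraph-level global information, and we train without supervision. Experimental results show that models trained with our method outperform the previous best-performing baseline model, with especially large gains on the many-to-many alignment patterns that traditional models struggle to handle.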
no code implementations • COLING 2022 • Kun Zhang, Yunqi Qiu, Yuanzhuo Wang, Long Bai, Wei Li, Xuhui Jiang, HuaWei Shen, Xueqi Cheng
Complex question generation over knowledge bases (KB) aims to generate natural language questions involving multiple KB relations or functional constraints.
no code implementations • 5 Oct 2022 • Yanbing Liu, Wei Li, Kun Cheng, Xun Liu, Wei Yang
In order to comprehensively investigate the influence caused by the misalignment, we propose a method for estimating the performance of a 4f-ONN in response to various misalignments in the context of the image classification task. The misalignment in numerical simulation is estimated by manipulating the optical intensity distributions in the fourth focus plane in the 4f system.
no code implementations • 18 Oct 2022 • Zixuan Li, Zhongni Hou, Saiping Guan, Xiaolong Jin, Weihua Peng, Long Bai, Yajuan Lyu, Wei Li, Jiafeng Guo, Xueqi Cheng
This is actually a matching task between a query and candidate entities based on their historical structures, which reflect behavioral trends of the entities at different timestamps.
no code implementations • 18 Oct 2022 • Runnan Chen, Xinge Zhu, Nenglun Chen, Wei Li, Yuexin Ma, Ruigang Yang, Wenping Wang
To this end, we propose a novel framework to learn the geometric primitives shared in seen and unseen categories' objects and employ a fine-grained alignment between language and the learned geometric primitives.
no code implementations • 19 Oct 2022 • Longyuan Zhang, Ziyue Hou, Ji Wang, Ziang Liu, Wei Li
Multiple predictive path points are dynamically generated by a deep Markov model optimized using an RL approach for the robot to track.
no code implementations • 22 Oct 2022 • Wenhao Wu, Wei Li, Jiachen Liu, Xinyan Xiao, Sujian Li, Yajuan Lyu
Though model robustness has been extensively studied in language understanding, the robustness of Seq2Seq generation remains understudied.
no code implementations • 25 Oct 2022 • Fei Ma, Feiyi Liu, Wei Li
In this paper, we introduce an approach of GNNs combined with a HaarPooling operation to analyze the events, called HaarPooling Message Passing neural network (HMPNet).
no code implementations • 28 Oct 2022 • Wei Li, Xue Xu, Xinyan Xiao, Jiachen Liu, Hu Yang, Guohao Li, Zhanpeng Wang, Zhifan Feng, Qiaoqiao She, Yajuan Lyu, Hua Wu
Diffusion generative models have recently greatly improved the power of text-conditioned image generation.
no code implementations • 1 Nov 2022 • Wenhao Wu, Wei Li, Jiachen Liu, Xinyan Xiao, Ziqiang Cao, Sujian Li, Hua Wu
We first measure a model's factual robustness by its success rate in defending against adversarial attacks when generating factual information.
no code implementations • 2 Nov 2022 • Wei Li, Wolfgang Karl Härdle, Stefan Lessmann
In addition, we delicately examine the explainability of the CBR system in the decision-making process of bankruptcy prediction.
no code implementations • 22 Nov 2022 • Hai Wu, Chenglu Wen, Wei Li, Xin Li, Ruigang Yang, Cheng Wang
However, it is difficult to apply such networks to 3D object detection in autonomous driving due to their large computation cost and slow inference speed.
no code implementations • 25 Nov 2022 • Tianpeng Bao, Jiadong Chen, Wei Li, Xiang Wang, Jingjing Fei, Liwei Wu, Rui Zhao, Ye Zheng
However, existing datasets for unsupervised anomaly detection are biased towards manufacturing inspection and do not consider maintenance inspection, which is usually conducted in uncontrolled outdoor environments with varying camera viewpoints, messy backgrounds, and degradation of object surfaces after long-term operation.
no code implementations • 15 Dec 2022 • Junbo Qiao, Shaohui Lin, Yunlun Zhang, Wei Li, Jie Hu, Gaoqi He, Changbo Wang, Lizhuang Ma
Real-world image super-resolution (RISR) has received increased focus for improving the quality of SR images under unknown complex degradation.
no code implementations • 10 Jan 2023 • Xuming Zhang, Jian Yan, Jia Tian, Wei Li, Xingfa Gu, Qingjiu Tian
This framework comprises two main parts: (i) a leakage-free balanced sampling strategy, and (ii) a modified end-to-end fully convolutional network (FCN) architecture that optimizes the trade-off between accuracy and efficiency.
no code implementations • 19 Feb 2023 • Wei Li, Weiyan Liu, Shitong Shao, Shiyi Huang
The results show that AIIR-MIX can dynamically assign each agent a real-time intrinsic reward in accordance with their actual contribution.
no code implementations • 20 Feb 2023 • Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee
A typical fluency scoring system generally relies on an automatic speech recognition (ASR) system to obtain time stamps in input speech for either the subsequent calculation of fluency-related features or directly modeling speech fluency with an end-to-end approach.
no code implementations • 21 Feb 2023 • Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee
Recent studies on pronunciation scoring have explored the effect of introducing phone embeddings as reference pronunciation, but mostly in an implicit manner, i.e., addition or concatenation of reference phone embedding and actual pronunciation of the target phone as the phone-level pronunciation quality representation.
no code implementations • 24 Feb 2023 • Zili Lu, Yuexing Peng, Wei Li, Junchuan Yu, Daqing Ge, Wei Xiang
An object-level contrastive learning (OCL) strategy is employed in the object classification sub-network featuring a siamese network to realize the global features extraction, and a sub-object-level contrastive learning (SOCL) paradigm is designed in the semantic segmentation sub-network to efficiently extract salient features from boundaries of landslides.
no code implementations • 28 Feb 2023 • Zhaowen Li, Yousong Zhu, Zhiyang Chen, Wei Li, Chaoyang Zhao, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang
However, its high random mask ratio would result in two serious problems: 1) the data are not efficiently exploited, which brings inefficient pre-training (e.g., 1600 epochs for MAE vs. 300 epochs for the supervised), and 2) the high uncertainty and inconsistency of the pre-trained model, i.e., the prediction of the same patch may be inconsistent under different mask rounds.
no code implementations • CVPR 2023 • Rui Zhao, Wei Li, Zhipeng Hu, Lincheng Li, Zhengxia Zou, Zhenwei Shi, Changjie Fan
In our method, leveraging the power of large-scale pre-trained multi-modal CLIP and neural rendering, T2P searches both continuous facial parameters and discrete facial parameters in a unified framework.
no code implementations • 15 Mar 2023 • Guoqiang Jin, Fan Yang, Mingshan Sun, Ruyi Zhao, Yakun Liu, Wei Li, Tianpeng Bao, Liwei Wu, Xingyu Zeng, Rui Zhao
To this end, we propose SeqCo-DETR, a novel Sequence Consistency-based self-supervised method for object DEtection with TRansformers.
no code implementations • 29 Mar 2023 • Weicheng Kuo, AJ Piergiovanni, Dahun Kim, Xiyang Luo, Ben Caine, Wei Li, Abhijit Ogale, Luowei Zhou, Andrew Dai, Zhifeng Chen, Claire Cui, Anelia Angelova
We propose a novel paradigm of training with a decoder-only model for multimodal tasks, which is surprisingly effective in jointly learning these disparate vision-language tasks.
Ranked #1 on Video Captioning on MSVD
no code implementations • IEEE Transactions on Geoscience and Remote Sensing 2019 • Ji He, Lina Zhao, HongWei Yang, Mengmeng Zhang, Wei Li
Moreover, several attentions are learned by different heads, and each head of the MHSA layer encodes the semantic context-aware representation to obtain discriminative features.
no code implementations • 8 May 2023 • Wei Li, Xiangxu Meng, Chuhao Chen, Jianing Chen
In this paper, we carefully examine the opposing properties of CI and CD, and raise a practical question that has not been effectively answered, e.g., "How to effectively mix the CI and CD properties of time series to achieve better predictive performance?"
no code implementations • 19 May 2023 • Kaiqi Fu, Shaojun Gao, Shuju Shi, Xiaohai Tian, Wei Li, Zejun Ma
Specifically, we first pre-train the model using a reconstruction loss function, by masking phones and their durations jointly on a large amount of unlabeled speech and text prompts.
no code implementations • 23 May 2023 • Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Liwei Wu, Yuxi Wang, Zhaoxiang Zhang
To this end, we propose T2S-DA, which we interpret as a form of pulling Target to Source for Domain Adaptation, encouraging the model to learn similar cross-domain features.
no code implementations • 25 May 2023 • Wenhao Cheng, Junbo Yin, Wei Li, Ruigang Yang, Jianbing Shen
In this work, we propose a new multi-modal visual grounding task, termed LiDAR Grounding.
no code implementations • 28 May 2023 • Kang Xu, Chenjia Bai, Shuang Qiu, Haoran He, Bin Zhao, Zhen Wang, Wei Li, Xuelong Li
Leveraging learned strategies in unfamiliar scenarios is fundamental to human intelligence.
no code implementations • 7 Jun 2023 • Xi Zhu, Xiya Cao, Zhiwei Dong, Caifa Zhou, Qiangbo Liu, Wei Li, Yongliang Wang
We also provide a new scene-level BEV map evaluation setting along with the corresponding baseline for a more comprehensive comparison.
no code implementations • 2 Aug 2023 • Yiming Zhou, Yuexing Peng, Wei Li, Junchuan Yu, Daqing Ge, Wei Xiang
To extract accurate semantic features, a hyper-pixel-wise contrastive learning augmented segmentation network (HPCL-Net) is proposed, which augments the local salient feature extraction from boundaries of landslides through HPCL-Net and fuses heterogeneous information in the semantic space from high-resolution remote sensing images and digital elevation model data.
no code implementations • ICCV 2023 • Siao Liu, Zhaoyu Chen, Yang Liu, Yuzheng Wang, Dingkang Yang, Zhile Zhao, Ziqing Zhou, Xie Yi, Wei Li, Wenqiang Zhang, Zhongxue Gan
In particular, CG2A develops a Gradient Agreement Solver to adaptively balance the varying gradient magnitudes, and introduces a Soft Gradient Surgery strategy to alleviate the gradient conflicts.
no code implementations • 7 Aug 2023 • Cheng Wang, Wei Li
Image deraining is a challenging task that involves restoring degraded images affected by rain streaks.
no code implementations • 11 Aug 2023 • Liang Chen, Yifei Yin, Hao Shi, Qingqing Sheng, Wei Li
The training image pairs are generated by the sub-sampler from real-world SAR images to estimate the noise distribution.
no code implementations • 16 Aug 2023 • Ji Zhang, Xiao Wu, Zhi-Qi Cheng, Qi He, Wei Li
Anomaly segmentation plays a pivotal role in identifying atypical objects in images, crucial for hazard detection in autonomous driving systems.
no code implementations • 23 Sep 2023 • Monan Zhou, Shangda Wu, YuAn Wang, Wei Li
WikiMT++ is an expanded and refined version of WikiMusicText (WikiMT), featuring 1010 curated lead sheets in ABC notation.
no code implementations • 21 Sep 2023 • Yidong Liu, FuKai Shang, Fang Wang, Rui Xu, Jun Wang, Wei Li, Yao Li, Conghui He
With the advancement of deep learning technologies, general-purpose large models such as GPT-4 have demonstrated exceptional capabilities across various domains.
no code implementations • 25 Sep 2023 • Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang
Q-Former-based LLMs can generalise well to out-of-domain datasets, where 12% relative WER reductions over the Whisper baseline ASR model were achieved on the Eval2000 test set without using any in-domain training data from Switchboard.
no code implementations • 26 Sep 2023 • Hailing Wang, Wei Li, Yuanyuan Xi, Jie Hu, Hanting Chen, Longyu Li, Yunhe Wang
By matching similar patches between frames, objects with large motion ranges in dynamic scenes can be aligned, which can effectively alleviate the generation of artifacts.
no code implementations • 29 Sep 2023 • Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Tongliang Liu, Wenping Wang
In this paper, we propose Model2Scene, a novel paradigm that learns free 3D scene representation from Computer-Aided Design (CAD) models and languages.
no code implementations • 7 Oct 2023 • Monan Zhou, Shangda Wu, Shaohua Ji, Zijin Li, Wei Li
Unlike previous studies that focused on the effect of piano performance techniques on sound quality, this study evaluates the inherent sound quality of different pianos.
no code implementations • 9 Oct 2023 • Xin Liu, Wei Li, Dazhi Zhan, Yu Pan, Xin Ma, Yu Ding, Zhisong Pan
Federated learning (FL) is a widely employed distributed paradigm for collaboratively training machine learning models from multiple clients without sharing local data.
no code implementations • 6 Nov 2023 • Chenzhong Gao, Wei Li
This paper aims to provide an effective invariant feature extraction and matching algorithm for multi-modal images, for application in multi-source data analysis.