1 code implementation • COLING 2022 • Zhongjian Miao, Xiang Li, Liyan Kang, Wen Zhang, Chulun Zhou, Yidong Chen, Bin Wang, Min Zhang, Jinsong Su
Most existing methods on robust neural machine translation (NMT) construct adversarial examples by injecting noise into authentic examples and indiscriminately exploit two types of examples.
no code implementations • CCL 2021 • Xiang Li, Chengwei Liu, Xiaoxu Zhu
“反问是现代汉语中一种常用的修辞手法, 根据是否含有反问标记可分为显式反问句与隐式反问句。其中隐式反问句表达的情感更为丰富, 表现形式也十分复杂, 对隐式反问句的识别更具挑战性。本文首先扩充了汉语反问句语料库, 语料库规模达到10000余句, 接着针对隐式反问句的特点, 提出了一种融合情感分析的隐式反问句识别模型。模型考虑了句子的语义信息, 上下文信息, 并借助情感分析任务辅助识别隐式反问句。实验结果表明, 本文提出的模型在隐式反问句识别任务上取得了良好的性能。”
no code implementations • NAACL (AutoSimTrans) 2022 • Mengge Liu, Xiang Li, Bao Chen, Yanzhi Tian, Tianwei Lan, Silin Li, Yuhang Guo, Jian Luan, Bin Wang
This system paper describes the BIT-Xiaomi simultaneous translation system for Autosimtrans 2022 simultaneous translation challenge.
no code implementations • ECCV 2020 • Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi, Jianfeng Lu
Specifically, a phase estimation network is introduced for the input single image, and the gait cycle reconstruction network exploits the estimated phase to mitigate the dependence of an encoded feature on the phase of that single image.
1 code implementation • ACL 2022 • Bin Liang, Chenwei Lou, Xiang Li, Min Yang, Lin Gui, Yulan He, Wenjie Pei, Ruifeng Xu
Then, the descriptions of the objects are served as a bridge to determine the importance of the association between the objects of image modality and the contextual words of text modality, so as to build a cross-modal graph for each multi-modal instance.
1 code implementation • ACL 2022 • Bin Liang, Qinglin Zhu, Xiang Li, Min Yang, Lin Gui, Yulan He, Ruifeng Xu
In this paper, we propose a joint contrastive learning (JointCL) framework, which consists of stance contrastive learning and target-aware prototypical graph contrastive learning.
no code implementations • IWSLT (ACL) 2022 • Bao Guo, Mengge Liu, Wen Zhang, Hexuan Chen, Chang Mu, Xiang Li, Jianwei Cui, Bin Wang, Yuhang Guo
Our system is built based on the Transformer model with novel techniques borrowed from our recent research work.
no code implementations • 5 Feb 2023 • Chengcheng Han, Yuhe Wang, Yingnan Fu, Xiang Li, Minghui Qiu, Ming Gao, Aoying Zhou
Few-shot learning has been used to tackle the problem of label scarcity in text classification, of which meta-learning based methods have shown to be effective, such as the prototypical networks (PROTO).
no code implementations • 31 Jan 2023 • Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang
Specifically, our framework consists of a cross-modality flow-guided upsampling network (CFUNet) and a flow-enhanced pyramid edge attention network (PEANet).
no code implementations • 31 Jan 2023 • Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang
Second, instead of the coarse concatenation guidance, we propose a recurrent structure attention block, which iteratively utilizes the latest depth estimation and the image features to jointly select clear patterns and boundaries, aiming at providing refined guidance for accurate depth recovery.
no code implementations • 29 Jan 2023 • Xiang Li, Shuwei Chen, Jian Dong, Jin Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
Click-through rate (CTR) prediction is crucial in recommendation and online advertising systems.
no code implementations • 29 Jan 2023 • Xiang Li, Tiandi Ye, Caihua Shan, Dongsheng Li, Ming Gao
In this paper, to comprehensively enhance the performance of generative graph SSL against other GCL models on both unsupervised and supervised learning tasks, we propose the SeeGera model, which is based on the family of self-supervised variational graph auto-encoder (VGAE).
1 code implementation • 14 Jan 2023 • Zhaohui Zheng, Yuming Chen, Qibin Hou, Xiang Li, Ming-Ming Cheng
In this paper, we study the spatial disequilibrium problem of modern object detectors and propose to quantify this ``spatial bias'' by measuring the detection performance over zones.
no code implementations • 5 Jan 2023 • Xiang Li, Jinshan Pan, Jinhui Tang, Jiangxin Dong
We develop a hybrid dynamic-Transformer block(HDTB) that integrates the MHDLSA and SparseGSA for both local and global feature exploration.
1 code implementation • 28 Dec 2022 • Jianxiang Yu, Xiang Li
We take node embeddings in the coarse view as anchors, and construct positive and negative samples from the fine-grained view.
no code implementations • 27 Dec 2022 • Xiang Li, Rabih Younes
We make use of an auto-encoder-based structure to extract pose features from WiFi frames.
no code implementations • 19 Dec 2022 • Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic
Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces.
1 code implementation • 16 Dec 2022 • Yimian Dai, Xiang Li, Fei Zhou, Yulei Qian, Yaohong Chen, Jian Yang
Finally, we present a new research benchmark for infrared small target detection, consisting of the SIRST-V2 dataset of real-world, high-resolution single-frame targets, the normalized contrast evaluation metric, and the DeepInfrared toolkit for detection.
no code implementations • 7 Dec 2022 • Xiang Li, Junbo Yin, Botian Shi, Yikang Li, Ruigang Yang, Jianbing Shen
In this paper, we present a more artful framework, LiDAR-guided Weakly Supervised Instance Segmentation (LWSIS), which leverages the off-the-shelf 3D data, i. e., Point Cloud, together with the 3D boxes, as natural weak supervisions for training the 2D image instance segmentation models.
1 code implementation • 29 Nov 2022 • Zheng Li, Xiang Li, Lingfeng Yang, Borui Zhao, RenJie Song, Lei Luo, Jun Li, Jian Yang
In this paper, we propose a simple curriculum-based technique, termed Curriculum Temperature for Knowledge Distillation (CTKD), which controls the task difficulty level during the student's learning career through a dynamic and learnable temperature.
no code implementations • 26 Nov 2022 • Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj
In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios.
1 code implementation • IEEE Transactions on Multimedia 2020 • Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura
Therefore, previous works pre-train their models on rich-labeled photo retrieval data (i. e., source domain) and then fine-tune them on the limited-labeled sketch-to-photo retrieval data (i. e., target domain).
Ranked #1 on
Image Retrieval
on PKU-Reid
no code implementations • 23 Nov 2022 • Ryan Burgert, Kanchana Ranasinghe, Xiang Li, Michael S. Ryoo
Recent diffusion-based generative models combined with vision-language models are capable of creating realistic images from natural language prompts.
no code implementations • 20 Nov 2022 • Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang
Unsupervised depth completion aims to recover dense depth from the sparse one without using the ground-truth annotation.
no code implementations • 15 Nov 2022 • Jia Li, Xiang Li, Xiaowei Jia, Michael Steinbach, Vipin Kumar
Causal graphs are usually considered in a 2D plane, but it has rarely been noticed that within multiple relatively independent timelines, which is comparatively common in causality machine learning, the individual-level differences may lead to Causal Representation Bias (CRB).
no code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He
While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints.
no code implementations • 5 Nov 2022 • Homgmin Cai, Wenxiong Liao, Zhengliang Liu, Xiaoke Huang, Yiyang Zhang, Siqi Ding, Sheng Li, Quanzheng Li, Tianming Liu, Xiang Li
In this framework, we apply distant-supervision on cross-domain knowledge graph adaptation.
no code implementations • 31 Oct 2022 • Xiang Li, Junchi Yang, Niao He
Adaptive gradient methods have shown their ability to adjust the stepsizes on the fly in a parameter-agnostic manner, and empirically achieve faster convergence for solving minimization problems.
no code implementations • 27 Oct 2022 • Xiang Li, Yucheng Zhou
Researching bragging behavior on social media arouses interest of computational (socio) linguists.
no code implementations • 17 Oct 2022 • Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye
The decoder aggregate both local image features in pixels and geometric features in vertices.
no code implementations • 17 Oct 2022 • Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye
To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.
1 code implementation • 16 Oct 2022 • Jianing Wang, Wenkang Huang, Qiuhui Shi, Hongbin Wang, Minghui Qiu, Xiang Li, Ming Gao
In this paper, to address these problems, we introduce a seminal knowledge prompting paradigm and further propose a knowledge-prompting-based PLM framework KP-PLM.
1 code implementation • 15 Oct 2022 • Shaoming Xu, Ankush Khandelwal, Xiang Li, Xiaowei Jia, Licheng Liu, Jared Willard, Rahul Ghosh, Kelly Cutler, Michael Steinbach, Christopher Duffy, John Nieber, Vipin Kumar
To address this issue, we further propose a new strategy which augments a training segment with an initial value of the target variable from the timestep right before the starting of the training segment.
no code implementations • 12 Oct 2022 • Somya Sharma, Rahul Ghosh, Arvind Renganathan, Xiang Li, Snigdhansu Chatterjee, John Nieber, Christopher Duffy, Vipin Kumar
We propose uncertainty based learning method that offers 6\% improvement in $R^2$ for streamflow prediction (forward modeling) from inverse model inferred basin characteristic estimates, 17\% reduction in uncertainty (40\% in presence of noise) and 4\% higher coverage rate for basin characteristics.
1 code implementation • 7 Oct 2022 • Nuo Chen, Qiushi Sun, Renyu Zhu, Xiang Li, Xuesong Lu, Ming Gao
To interpret these models, some probing methods have been applied.
1 code implementation • COLING 2022 • Yequan Wang, Xiang Li, Aixin Sun, Xuying Meng, Huaming Liao, Jiafeng Guo
CofeNet is able to extract complicated quotations with components of variable lengths and complicated structures.
1 code implementation • COLING 2022 • Fangyu Lei, Shizhu He, Xiang Li, Jun Zhao, Kang Liu
In the real-world question answering scenarios, hybrid form combining both tabular and textual contents has attracted more and more attention, among which numerical reasoning problem is one of the most typical and challenging problems.
1 code implementation • 9 Sep 2022 • Yeon Seonwoo, Guoyin Wang, Changmin Seo, Sajal Choudhary, Jiwei Li, Xiang Li, Puyang Xu, Sunghyun Park, Alice Oh
This results in inadequate supervision signals for capturing a semantic similarity of similar sentences.
no code implementations • 29 Aug 2022 • Hang Chen, Xinyu Yang, Xiang Li
To learn it applicably, we propose a general clause-level encoding model named EA-GAT comprising E-GAT and Activation Sort.
no code implementations • 10 Aug 2022 • Xiang Li, Changhe Song, Xianhao Wei, Zhiyong Wu, Jia Jia, Helen Meng
This paper aims to introduce a chunk-wise multi-scale cross-speaker style model to capture both the global genre and the local prosody in audiobook speeches.
1 code implementation • 19 Jul 2022 • Ningyi Liao, Dingheng Mo, Siqiang Luo, Xiang Li, Pengcheng Yin
Recent advances in data processing have stimulated the demand for learning graphs of very large scales.
no code implementations • 12 Jul 2022 • Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu
We propose a robust context fusion network to tackle VIS in an online fashion, which predicts instance segmentation frame-by-frame with a few preceding frames.
1 code implementation • 12 Jul 2022 • Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang
Specifically, we propose the Inverse NMS Clustering (INC) and Rank Matching (RM) to instantiate the dense supervision, without the widely used, conventional sparse pseudo labels.
1 code implementation • 4 Jul 2022 • Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Yan Lu, Bhiksha Raj
We leverage the cycle consistency to discriminate the semantic consensus, thus advancing the primary task.
Ranked #4 on
Referring Expression Segmentation
on Refer-YouTube-VOS (2021 public validation)
(using extra training data)
Referring Expression Segmentation
Referring Video Object Segmentation
+2
no code implementations • 28 Jun 2022 • Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang
Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent.
no code implementations • 27 Jun 2022 • Ryan Burgert, Jinghuan Shang, Xiang Li, Michael Ryoo
Unpaired image translation algorithms can be used for sim2real tasks, but many fail to generate temporally consistent results.
no code implementations • 24 Jun 2022 • Hao Wu, Yongqiang Cheng, Xixi Chen, Zheng Yang, Xiang Li, Hongqiang Wang
These advantages benefit from the geometry of the Toeplitz Hermitian positive definite (HPD) manifold $\mathcal{M}_{\mathcal{T}H_{++}}$, but the sophisticated geometry also results in some challenges for geometric detectors, such as the implementation of the enhanced detector to improve the SCR (signal-to-clutter ratio) and the analysis of the detection performance.
no code implementations • 18 Jun 2022 • Chonghan Chen, Qi Jiang, Chih-Hao Wang, Noel Chen, Haohan Wang, Xiang Li, Bhiksha Raj
With our proposed QCM, the downstream fusion module receives visual features that are more discriminative and focused on the desired object described in the expression, leading to more accurate predictions.
2 code implementations • 10 Jun 2022 • Xiang Li, Jinghuan Shang, Srijan Das, Michael S. Ryoo
We investigate whether self-supervised learning (SSL) can improve online reinforcement learning (RL) from pixels.
no code implementations • 1 Jun 2022 • Junchi Yang, Xiang Li, Niao He
Adaptive algorithms like AdaGrad and AMSGrad are successful in nonconvex optimization owing to their parameter-agnostic ability -- requiring no a priori knowledge about problem-specific parameters nor tuning of learning rates.
no code implementations • 30 May 2022 • Di Jin, Luzhi Wang, Yizhen Zheng, Xiang Li, Fei Jiang, Wei Lin, Shirui Pan
As most of the existing graph neural networks yield effective graph representations of a single graph, little effort has been made for jointly learning two graph representations and calculating their similarity score.
1 code implementation • 20 May 2022 • Xiang Li, Wenhai Wang, Lingfeng Yang, Jian Yang
Masked AutoEncoder (MAE) has recently led the trends of visual self-supervision area by an elegant asymmetric encoder-decoder design, which significantly optimizes both the pre-training efficiency and fine-tuning accuracy.
Ranked #26 on
Object Detection
on COCO minival
no code implementations • 19 May 2022 • Xiang Li, Xiaojiang Zhou, Yao Xiao, Peihao Huang, Dayao Chen, Sheng Chen, Yunsen Xian
Industrial search and recommendation systems mostly follow the classic multi-stage information retrieval paradigm: matching, pre-ranking, ranking, and re-ranking stages.
1 code implementation • 15 May 2022 • Xiang Li, Renyu Zhu, Yao Cheng, Caihua Shan, Siqiang Luo, Dongsheng Li, Weining Qian
Further, for other homophilous nodes excluded in the neighborhood, they are ignored for information aggregation.
1 code implementation • Findings (NAACL) 2022 • Ziqian Zeng, Weimin Ni, Tianqing Fang, Xiang Li, Xinran Zhao, Yangqiu Song
In this paper, we propose to query a masked language model with cloze style prompts to obtain supervision signals.
1 code implementation • ACL 2022 • Renyu Zhu, Lei Yuan, Xiang Li, Ming Gao, Wenyuan Cai
In this paper, we consider human behaviors and propose the PGNN-EK model that consists of two main components.
1 code implementation • ACL 2022 • Zhiyong Wu, Wei Bi, Xiang Li, Lingpeng Kong, Ben Kao
We propose knowledge internalization (KI), which aims to complement the lexical knowledge into neural dialog models.
1 code implementation • 12 Apr 2022 • Wenjing Zhu, Xiang Li
Speech Emotion Recognition (SER) is a fundamental task to predict the emotion label from speech data.
1 code implementation • 31 Mar 2022 • Wenlin Dai, Changhe Song, Xiang Li, Zhiyong Wu, Huashan Pan, Xiulin Li, Helen Meng
Inspired by Flat-LAttice Transformer (FLAT), we propose an end-to-end Chinese text normalization model, which accepts Chinese characters as direct input and integrates expert knowledge contained in rules into the neural network, both contribute to the superior performance of proposed model for the text normalization task.
1 code implementation • 30 Mar 2022 • Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang
Specifically, for pseudo labeling, existing works only focus on the classification score yet fail to guarantee the localization precision of pseudo boxes; For consistency training, the widely adopted random-resize training only considers the label-level consistency but misses the feature-level one, which also plays an important role in ensuring the scale invariance.
1 code implementation • 29 Mar 2022 • Zhifang Fan, Dan Ou, Yulong Gu, Bairan Fu, Xiang Li, Wentian Bao, Xin-yu Dai, Xiaoyi Zeng, Tao Zhuang, Qingwen Liu
In this paper, we propose a new perspective for context-aware users' behavior modeling by including the whole page-wisely exposed products and the corresponding feedback as contextualized page-wise feedback sequence.
no code implementations • 20 Mar 2022 • Yang Lou, Ruizi Wu, Junli Li, Lin Wang, Xiang Li, Guanrong Chen
Extensive experimental studies on both synthetic and real-world networks, both directed and undirected, demonstrate that 1) the proposed LFR-CNN performs better than other two state-of-the-art prediction methods, with significantly lower prediction errors; 2) LFR-CNN is insensitive to the variation of the network size, which significantly extends its applicability; 3) although LFR-CNN needs more time to perform feature learning, it can achieve accurate prediction faster than attack simulations; 4) LFR-CNN not only can accurately predict network robustness, but also provides a good indicator for connectivity robustness, better than the classical spectral measures.
no code implementations • 18 Mar 2022 • Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang
To deal with the PDC task, we train a deep network that takes both depth and image as inputs for the dense panoramic depth recovery.
no code implementations • 16 Mar 2022 • Xiang Li, Yazhou Zhang, Prayag Tiwari, Dawei Song, Bin Hu, Meihong Yang, Zhigang Zhao, Neeraj Kumar, Pekka Marttinen
Hence, in this paper, we review from the perspective of researchers who try to take the first step on this topic.
1 code implementation • 14 Mar 2022 • Lingfeng Yang, Xiang Li, Borui Zhao, RenJie Song, Jian Yang
In semantic segmentation, RM also surpasses the baseline and CutMix by 1. 9 and 1. 1 mIoU points under UperNet on ADE20K, respectively.
1 code implementation • CVPR 2022 • Lingfeng Yang, Xiang Li, RenJie Song, Borui Zhao, Juntian Tao, Shihao Zhou, Jiajun Liang, Jian Yang
Therefore, it is helpful to leverage additional information, e. g., the locations and dates for data shooting, which can be easily accessible but rarely exploited.
no code implementations • 2 Mar 2022 • Yunxiao Shan, Shu Li, Fuxiang Li, Yuxin Cui, Shuai Li, Ming Zhou, Xiang Li
It is proved that the algorithm can effectively reduce the computational complexity of the original DPC from $O(n^2K)$ to $O(n(n^{1-1/K}+k))$.
no code implementations • 28 Feb 2022 • Jingwei Zhuo, Bin Liu, Xiang Li, Han Zhu, Xiaoqiang Zhu
Motivated by the observation that model-free methods like behavioral retargeting (BR) and item-based collaborative filtering (ItemCF) hit different parts of the user-item relevance compared to neural sequential recommendation models, we propose a novel model-agnostic training approach called WSLRec, which adopts a three-stage framework: pre-training, top-$k$ mining, and fine-tuning.
1 code implementation • 14 Feb 2022 • Qiyang Zhang, Xiang Li, Xiangying Che, Xiao Ma, Ao Zhou, Mengwei Xu, Shangguang Wang, Yun Ma, Xuanzhe Liu
Deploying deep learning (DL) on mobile devices has been a notable trend in recent years.
no code implementations • 6 Feb 2022 • Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li
Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.
no code implementations • 24 Jan 2022 • Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
The cybersecurity breaches expose surveillance video streams to forgery attacks, under which authentic streams are falsified to hide unauthorized activities.
1 code implementation • 10 Jan 2022 • Lianghao Xia, Chao Huang, Yong Xu, Huance Xu, Xiang Li, WeiGuo Zhang
As the deep learning techniques have expanded to real-world recommendation tasks, many deep neural network based Collaborative Filtering (CF) models have been developed to project user-item interactions into latent feature space, based on various neural architectures, such as multi-layer perceptron, auto-encoder and graph neural networks.
no code implementations • 31 Dec 2021 • Xiang Li, Dong Li, Ruoming Jin, Gagan Agrawal, Rajiv Ramnath
Though other methods (particularly those based on Laplacian Smoothing) have reported better accuracy, a fundamental limitation of all the work is a lack of scalability.
no code implementations • 29 Dec 2021 • Xiang Li, Wenhao Yang, Jiadong Liang, Zhihua Zhang, Michael I. Jordan
We study synchronous Q-learning with Polyak-Ruppert averaging (a. k. a., averaged Q-learning) in a $\gamma$-discounted MDP.
no code implementations • 9 Dec 2021 • Gang Li, Xiang Li, Yujie Wang, Shanshan Zhang, Yichao Wu, Ding Liang
Based on the two observations, we propose Rank Mimicking (RM) and Prediction-guided Feature Imitation (PFI) for distilling one-stage detectors, respectively.
no code implementations • 3 Dec 2021 • Xiang Li, Jinglu Wang, Xiao Li, Yan Lu
Based on this representation, we introduce a cropping-free temporal fusion approach to model the temporal consistency between video frames.
no code implementations • 2 Dec 2021 • Alina Walch, Xiang Li, Jonathan Chambers, Nahid Mohajeri, Selin Yilmaz, Martin Patel, Jean-Louis Scartezzini
Shallow ground-source heat pumps (GSHPs) are a promising technology for contributing to the decarbonisation of the energy sector.
no code implementations • NeurIPS 2021 • Caihua Shan, Yifei Shen, Yao Zhang, Xiang Li, Dongsheng Li
To address these issues, we propose a RL-enhanced GNN explainer, RG-Explainer, which consists of three main components: starting point selection, iterative graph generation and stopping criteria learning.
no code implementations • 10 Nov 2021 • Yi Lin, Jianchao Su, Xiang Wang, Xiang Li, Jingen Liu, Kwang-Ting Cheng, Xin Yang
We have evaluated our approach using the 20 CTPA test dataset from the PE challenge, achieving a sensitivity of 78. 9%, 80. 7% and 80. 7% at 2 false positives per volume at 0mm, 2mm and 5mm localization error, which is superior to the state-of-the-art methods.
1 code implementation • 8 Nov 2021 • Xiang Li, Shihao Ji
Extensive experiments on VGGFace, Traffic Sign and ImageNet show that GDPA achieves higher attack success rates than state-of-the-art patch attacks, while adversarially trained model with GDPA demonstrates superior robustness to adversarial patch attacks than competing methods.
no code implementations • 2 Nov 2021 • Liao Qu, Shuaiqi Huang, Yunsong Jia, Xiang Li
By increasing the weight of extreme temperature samples and reducing the possibility of misjudging extreme temperature as normal, the proposed loss function can enhance the prediction results in extreme situations.
1 code implementation • 20 Oct 2021 • Guanjie Huang, Hongjian He, Xiang Li, Xingchen Li, Ziang Liu
Currently, it is hard to compare and evaluate different style transfer algorithms due to chaotic definitions of style and the absence of agreed objective validation methods in the study of style transfer.
no code implementations • 20 Oct 2021 • Xiang Li, Jinglu Wang, Xiao Li, Yan Lu
Instance segmentation is a challenging task aiming at classifying and segmenting all object instances of specific classes.
1 code implementation • 12 Oct 2021 • Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, Michael S. Ryoo
Reinforcement Learning (RL) can be considered as a sequence modeling task: given a sequence of past state-action-reward experiences, an agent predicts a sequence of next actions.
no code implementations • 5 Oct 2021 • Yusui Chen, Wenhao Hu, Xiang Li
Fully convolutional networks are robust in performing semantic segmentation, with many applications from signal processing to computer vision.
no code implementations • 1 Oct 2021 • Zheng Li, Xiang Li, Lingfeng Yang, Jian Yang, Zhigeng Pan
Knowledge distillation usually transfers the knowledge from a pre-trained cumbersome teacher network to a compact student network, which follows the classical teacher-teaching-student paradigm.
1 code implementation • 29 Sep 2021 • Liang Zongwei, Junan Yang, Keju Huang, Hui Liu, Lin Cui, Lingzhi Qu, Xiang Li
The interpretability of the current temporal KG forecasting models is manifested in providing the reasoning paths.
no code implementations • 14 Sep 2021 • Rahul Ghosh, Arvind Renganathan, Kshitij Tayal, Xiang Li, Ankush Khandelwal, Xiaowei Jia, Chris Duffy, John Neiber, Vipin Kumar
Furthermore, we show that KGSSL is relatively more robust to distortion than baseline methods, and outperforms the baseline model by 35\% when plugging in KGSSL inferred characteristics.
no code implementations • 13 Sep 2021 • Chaofei Wang, Shiji Song, Qisen Yang, Xiang Li, Gao Huang
As a data augmentation method, FOT can be conveniently applied to any existing few shot learning algorithm and greatly improve its performance on FG-FSL tasks.
no code implementations • 12 Sep 2021 • Bing Luo, Xiang Li, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas
Federated learning (FL) is a distributed learning paradigm that enables a large number of mobile devices to collaboratively learn a model under the coordination of a central server without sharing their raw data.
no code implementations • 3 Sep 2021 • Xiang Li, Jiadong Liang, Xiangyu Chang, Zhihua Zhang
Both the methods are communication efficient and applicable to online data.
2 code implementations • ICCV 2021 • Kun Wang, Zhenyu Zhang, Zhiqiang Yan, Xiang Li, Baobei Xu, Jun Li, Jian Yang
Monocular depth estimation aims at predicting depth from a single image or video.
no code implementations • SEMEVAL 2021 • Qinglin Zhu, Zijie Lin, Yice Zhang, Jingyi Sun, Xiang Li, Qihui Lin, Yixue Dang, Ruifeng Xu
This paper presents the winning system that participated in SemEval-2021 Task 5: Toxic Spans Detection.
no code implementations • 29 Jul 2021 • Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang
However, blurry guidance in the image and unclear structure in the depth still impede the performance of the image guided frameworks.
Ranked #2 on
Depth Completion
on KITTI Depth Completion
no code implementations • 7 Jul 2021 • Xiang Li, Lingjing Wang, Yi Fang
To achieve this, we treat the shape segmentation as a point labeling problem in the metric space.
no code implementations • DCASE workshop 2021 • Weiqiang Yuan ∗, Qichen Han∗, Dong Liu, Xiang Li, Zhen Yang
Our solution focuses on solving two problems in automated audio captioning: data insufficiency and word selection indeterminacy.
Ranked #1 on
Audio captioning
on Clotho
(using extra training data)
no code implementations • 28 Jun 2021 • Wenchao Zhang, Chong Fu, Xiangshi Chang, Tengfei Zhao, Xiang Li, Chiu-Wing Sham
Without losing generality, we can also build a more lighter head network for other multi-stage detectors by assembling our method.
11 code implementations • 25 Jun 2021 • Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
We hope this work will facilitate state-of-the-art Transformer researches in computer vision.
Ranked #57 on
Object Detection
on COCO minival
no code implementations • 20 Jun 2021 • Chaolei Tan, Zihang Lin, Jian-Fang Hu, Xiang Li, Wei-Shi Zheng
We propose an effective two-stage approach to tackle the problem of language-based Human-centric Spatio-Temporal Video Grounding (HC-STVG) task.
1 code implementation • NeurIPS 2021 • Chenjie Cao, Yuxin Hong, Xiang Li, Chengrong Wang, Chengming Xu, xiangyang xue, Yanwei Fu
To address these limitations, we propose a novel model -- image Local Autoregressive Transformer (iLAT), to better facilitate the locally guided image synthesis.
no code implementations • 31 May 2021 • Binbin Xie, Jinsong Su, Yubin Ge, Xiang Li, Jianwei Cui, Junfeng Yao, Bin Wang
However, such a decoder only exploits the preorder traversal based preceding actions, which are insufficient to ensure correct action predictions.
no code implementations • ACL 2021 • Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li, Ben Kao
A neural multimodal machine translation (MMT) system is one that aims to perform better translation by extending conventional text-only translation models with multimodal information.
1 code implementation • 19 May 2021 • Cong Xu, Xiang Li, Min Yang
Neural networks are susceptible to artificially designed adversarial perturbations.
Ranked #1 on
Adversarial Attack
on CIFAR-10
1 code implementation • 2 May 2021 • Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen
By systematically comparing with existing scene text representations, we show that our kernel representation can not only describe arbitrarily-shaped text but also well distinguish adjacent text.
no code implementations • 12 Apr 2021 • Cong Li, Min Shi, Bo Qu, Xiang Li
In this paper, we propose a deep attributed network representation learning via attribute enhanced neighborhood (DANRL-ANE) model to improve the robustness and effectiveness of node representations.
no code implementations • 8 Apr 2021 • Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng
This paper introduces a multi-scale speech style modeling method for end-to-end expressive speech synthesis.
no code implementations • 21 Mar 2021 • Varun Buch, Aoxiao Zhong, Xiang Li, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Dufan Wu, Hui Ren, Jiahui Guan, Andrew Liteplo, Sayon Dutta, Ittai Dayan, Quanzheng Li
Predictive risk scores for COVID-19 severe outcomes ("CO-RISK" score) were derived from model output and evaluated on the testing dataset, as well as compared to human performance.
1 code implementation • 5 Mar 2021 • Shunyu Jiang, Fuli Feng, Weijian Chen, Xiang Li, Xiangnan He
Graph classification is a highly impactful task that plays a crucial role in a myriad of real-world applications such as molecular property prediction and protein function prediction. Aiming to handle the new classes with limited labeled graphs, few-shot graph classification has become a bridge of existing graph classification solutions and practical usage. This work explores the potential of metric-based meta-learning for solving few-shot graph classification. We highlight the importance of considering structural characteristics in the solution and propose a novel framework which explicitly considers global structure and local structure of the input graph.
no code implementations • 1 Mar 2021 • Xiao Guo, Xiang Li, Xiangyu Chang, Shusen Wang, Zhihua Zhang
The low communication and computation power of such devices, and the possible privacy breaches of users' sensitive data make the computation of SVD challenging.
9 code implementations • ICCV 2021 • Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
Unlike the recently-proposed Transformer model (e. g., ViT) that is specially designed for image classification, we propose Pyramid Vision Transformer~(PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks.
Ranked #5 on
Semantic Segmentation
on SynPASS
no code implementations • 24 Feb 2021 • Xiang Li, Yuzheng Chen, Rakesh Patibanda, Florian 'Floyd' Mueller
With the popularity of online access in virtual reality (VR) devices, it will become important to investigate exclusive and interactive CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) designs for VR devices.
Human-Computer Interaction
no code implementations • 12 Feb 2021 • Gang Wang, Ziyi Guo, Xiang Li, Dawei Yin, Shuai Ma
Collaborative filtering has been largely used to advance modern recommender systems to predict user preference.
1 code implementation • 1 Feb 2021 • Meimei Shang, Fei Gao, Xiang Li, Jingjie Zhu, Lingna Dai
In this paper, we propose a novel method to learn face sketch synthesis models by using unpaired data.
no code implementations • 5 Jan 2021 • Xiang Li, Zhihua Zhang
In this work, we study a novel class of projection-based algorithms for linearly constrained problems (LCPs) which have a lot of applications in statistics, optimization, and machine learning.
no code implementations • 4 Jan 2021 • Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
Traditional video forensics approaches can detect and localize forgery traces in each video frame using computationally-expensive spatial-temporal analysis, while falling short in real-time verification of live video feeds.
Time Series
Video Forensics
Cryptography and Security
1 code implementation • 1 Jan 2021 • Xiulong Yang, Hui Ye, Yang Ye, Xiang Li, Shihao Ji
We show that our Generative MMC (GMMC) can be trained discriminatively, generatively, or jointly for image classification and generation.
no code implementations • 1 Jan 2021 • Shib Sankar Dasgupta, Xiang Li, Michael Boratko, Dongxu Zhang, Andrew McCallum
In Patel et al. (2020), the authors demonstrate that only the transitive reduction is required, and further extend box embeddings to capture joint hierarchies by augmenting the graph with new nodes.
no code implementations • 18 Dec 2020 • Xiang Li, Danhao Ding, Ben Kao, Yizhou Sun, Nikos Mamoulis
A heterogeneous information network (HIN) has as vertices objects of different types and as edges the relations between objects, which are also of various types.
no code implementations • 15 Dec 2020 • Bing Luo, Xiang Li, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas
In this paper, we analyze how to design adaptive FL that optimally chooses these essential control variables to minimize the total cost while ensuring convergence.
no code implementations • 15 Dec 2020 • Wenjie Qin, Xiang Li, Yuhui Sun, Deyi Xiong, Jianwei Cui, Bin Wang
In this paper, we propose a robust neural machine translation (NMT) framework.
1 code implementation • 6 Dec 2020 • Jia-Qi Yang, Xiang Li, Shuguang Han, Tao Zhuang, De-Chuan Zhan, Xiaoyi Zeng, Bin Tong
To strike a balance in this trade-off, we propose Elapsed-Time Sampling Delayed Feedback Model (ES-DFM), which models the relationship between the observed conversion distribution and the true conversion distribution.
no code implementations • 2 Dec 2020 • Ankush Khandelwal, Shaoming Xu, Xiang Li, Xiaowei Jia, Michael Stienbach, Christopher Duffy, John Nieber, Vipin Kumar
The goal of this work is to incorporate our understanding of physical processes and constraints in hydrology into machine learning algorithms, and thus bridge the performance gap while reducing the need for large amounts of data compared to traditional data-driven approaches.
1 code implementation • NeurIPS 2020 • Tao Zhuang, Zhixuan Zhang, Yuheng Huang, Xiaoyi Zeng, Kai Shuang, Xiang Li
Experimentally, we show that structured pruning using polarization regularizer achieves much better results than using L1 regularizer.
no code implementations • Joint Conference on Lexical and Computational Semantics 2020 • Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula
We transform one of the largest available MRC dataset (RACE) to an NLI form, and compare the performances of a state-of-the-art model (RoBERTa) on both these forms.
no code implementations • 26 Nov 2020 • Aoxiao Zhong, Xiang Li, Dufan Wu, Hui Ren, Kyungsang Kim, YoungGon Kim, Varun Buch, Nir Neumark, Bernardo Bizzo, Won Young Tak, Soo Young Park, Yu Rim Lee, Min Kyu Kang, Jung Gil Park, Byung Seok Kim, Woo Jin Chung, Ning Guo, Ittai Dayan, Mannudeep K. Kalra, Quanzheng Li
These results demonstrate our deep metric learning based image retrieval model is highly efficient in the CXR retrieval, diagnosis and prognosis, and thus has great clinical value for the treatment and management of COVID-19 patients.
5 code implementations • CVPR 2021 • Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
Such a property makes the distribution statistics of a bounding box highly correlated to its real localization quality.
Ranked #54 on
Object Detection
on COCO test-dev
1 code implementation • 16 Nov 2020 • Yufeng Wang, Dan Li, Xiang Li, Min Yang
Further, this classifier is incorporated into the generative adversarial framework to help the generator to yield higher quality imputation results.
no code implementations • 16 Nov 2020 • Xiang Li, Xinyu Fu, Zheng Lu, Ruibin Bai, Uwe Aickelin, Peiming Ge, Gong Liu
Internet hospital is a rising business thanks to recent advances in mobile web technology and high demand of health care services.
no code implementations • 31 Oct 2020 • Wenhao Yang, Xiang Li, Guangzeng Xie, Zhihua Zhang
Regularized MDPs serve as a smooth version of original MDPs.
no code implementations • 22 Oct 2020 • Lingjing Wang, Yu Hao, Xiang Li, Yi Fang
In this paper, we propose a meta-learning based 3D registration model, named 3D Meta-Registration, that is capable of rapidly adapting and well generalizing to new 3D registration tasks for unseen 3D point clouds.
no code implementations • 21 Oct 2020 • Hao Huang, Lingjing Wang, Xiang Li, Yi Fang
In this paper, we propose a novel meta-learning based 3D point signature model, named 3Dmetapointsignature (MEPS) network, that is capable of learning robust point signatures in 3D shapes.
no code implementations • 4 Oct 2020 • Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula
We transform the one of the largest available MRC dataset (RACE) to an NLI form, and compare the performances of a state-of-the-art model (RoBERTa) on both these forms.
no code implementations • 29 Sep 2020 • Lingjing Wang, Xiang Li, Yi Fang
Point cloud registration is the process of aligning a pair of point sets via searching for a geometric transformation.
no code implementations • 18 Sep 2020 • Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula
In recent years, the Natural Language Inference (NLI) task has garnered significant attention, with new datasets and models achieving near human-level performance on it.
no code implementations • 11 Sep 2020 • Xiang Li, Lingjing Wang, Yi Fang
To bridge the performance gaps between partial point set registration with full point set registration, we proposed to incorporate a shape completion network to benefit the registration process.
no code implementations • 15 Aug 2020 • Xiang Li, Yuan Tian, Fuyao Zhang, Shuxue Quan, Yi Xu
Ordinary object detection approaches process information from the images only, and they are oblivious to the camera pose with regard to the environment and the scale of the environment.
no code implementations • 14 Aug 2020 • Bin Gu, Zhiyuan Dang, Xiang Li, Heng Huang
In this paper, we focus on nonlinear learning with kernels, and propose a federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data.
no code implementations • 13 Aug 2020 • Hao Huang, Jianchun Chen, Xiang Li, Lingjing Wang, Yi Fang
Recent works introduce convolutional neural networks (CNNs) to extract high-level feature maps and find correspondences through feature matching.
no code implementations • 25 Jul 2020 • Lingjing Wang, Xiang Li, Yi Fang
More specifically, for a given group we first define an optimizable Group Latent Descriptor (GLD) to characterize the gruopwise relationship among a group of point sets.
1 code implementation • IEEE International Joint Conference on Neural Network 2020 • Yijun Su, Jia-Dong Zhang, Xiang Li, Daren Zha, Ji Xiang, Wei Tang, and Neng Gao
Recent studies mainly utilize social information, categorical information and/or geographical information to supplement the highly sparse check-in data.
no code implementations • 11 Jul 2020 • Dong Xu, Xiao Huang, Joseph Mango, Xiang Li, Zhenlong Li
We propose a multi-exit evacuation simulation based on Deep Reinforcement Learning (DRL), referred to as the MultiExit-DRL, which involves in a Deep Neural Network (DNN) framework to facilitate state-to-action mapping.
no code implementations • WS 2020 • Yuhui Sun, Mengxue Guo, Xiang Li, Jianwei Cui, Bin Wang
This paper describes the Xiaomi{'}s submissions to the IWSLT20 shared open domain translation task for Chinese{\textless}-{\textgreater}Japanese language pair.
no code implementations • 24 Jun 2020 • Shuaihang Yuan, Xiang Li, Yi Fang
In this paper, we aim at handling the problem of 3D tracking, which provides the tracking of the consecutive frames of 3D shapes.
no code implementations • 24 Jun 2020 • Shuaihang Yuan, Xiang Li, Anthony Tzes, Yi Fang
To approach this problem, we propose a self-supervised approach that leverages the power of the deep neural network to learn a continuous flow function of 3D point clouds that can predict temporally consistent future motions and naturally bring out the correspondences among consecutive point clouds at the same time.
3 code implementations • 19 Jun 2020 • Yu Zheng, Chen Gao, Xiang Li, Xiangnan He, Depeng Jin, Yong Li
We further demonstrate that the learned embeddings successfully capture the desired causes, and show that DICE guarantees the robustness and interpretability of recommendation.
no code implementations • 17 Jun 2020 • Lingjing Wang, Yi Shi, Xiang Li, Yi Fang
Global registration of point clouds aims to find an optimal alignment of a sequence of 2D or 3D point sets.
no code implementations • 14 Jun 2020 • Jingyu Deng, Xiang Li, Yi Fang
In this paper, we introduce a few-shot learning-based method for object detection on remote sensing images where only a few annotated samples are provided for the unseen object categories.
no code implementations • 11 Jun 2020 • Lingjing Wang, Xiang Li, Yi Fang
Moreover, for a pair of source and target point sets, existing deep learning mechanisms require explicitly designed encoders to extract both deep spatial features from unstructured point clouds and their spatial correlation representation, which is further fed to a decoder to regress the desired geometric transformation for point set alignment.
no code implementations • 10 Jun 2020 • Xiang Li, Lingjing Wang, Yi Fang
Recent studies have shown the benefits of using additional elevation data (e. g., DSM) for enhancing the performance of the semantic segmentation of aerial images.
no code implementations • WS 2020 • Junxuan Chen, Xiang Li, Jiarui Zhang, Chulun Zhou, Jianwei Cui, Bin Wang, Jinsong Su
Finally, we combine the discourse structure information with the word embedding before it is fed into the encoder.
7 code implementations • NeurIPS 2020 • Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
Specifically, we merge the quality estimation into the class prediction vector to form a joint representation of localization quality and classification, and use a vector to represent arbitrary distribution of box locations.
Ranked #101 on
Object Detection
on COCO test-dev
1 code implementation • 8 Jun 2020 • Xiang Li, Ben Kao, Caihua Shan, Dawei Yin, Martin Ester
We study the problem of applying spectral clustering to cluster multi-scale data, which is data whose clusters are of various sizes and densities.
1 code implementation • IEEE International Conference on Communications 2020 • Yijun Su, Xiang Li, Baoping Liu, Daren Zha, Ji Xiang, Wei Tang and Neng Gao.
With the popularity of location-based social networks (LBSNs), Point-of-Interest (POI) recommendation has become an essential location-based service to help people explore novel locations.
no code implementations • 4 Jun 2020 • Xiang Li, Mingyang Wang, Yi Fang
Previous researches have extensively studied the problem of height estimation from aerial images based on stereo or multi-view image matching.
no code implementations • CVPR 2020 • Xiang Li, Yasushi Makihara, Chi Xu, Yasushi Yagi, Mingwu Ren
Existing gait recognition approaches typically focus on learning identity features that are invariant to covariates (e. g., the carrying status, clothing, walking speed, and viewing angle) and seldom involve learning features from the covariate aspect, which may lead to failure modes when variations due to the covariate overwhelm those due to the identity.
no code implementations • 20 May 2020 • Xin Hu, Zhijun Liu, Xiaofei Yu, Yulong Zhao, WenHua Chen, Biao Hu, Xuekun Du, Xiang Li, Mohamed Helaoui, Weidong Wang, Fadhel M. Ghannouchi
We design a pre-designed filter using the convolutional layer to extract the basis functions required for the PA forward or reverse modeling.
1 code implementation • 8 May 2020 • Xiang Li, Lin Zhang, Yau Pun Chen, Yu-Wing Tai, Chi-Keung Tang
Deep learning has revolutionized object detection thanks to large-scale datasets, but their object categories are still arguably very limited.
1 code implementation • 3 May 2020 • Xiang Li, Songcan Chen
In aligning, we characterize the global and local structures of multiple labels to be high-rank and low-rank, respectively.
no code implementations • 20 Apr 2020 • Congcong Wen, Xiang Li, Xiaojing Yao, Ling Peng, Tianhe Chi
To achieve point cloud classification, previous studies proposed point cloud deep learning models that can directly process raw point clouds based on PointNet-like architectures.
1 code implementation • 9 Mar 2020 • Xiang Li, Peng Wang
Firstly, both communication parties establish a word vector table by training a deep learning model according to specified hyperparameters.
no code implementations • 7 Mar 2020 • Xiang Li, Chao Wang, Jiwei Tan, Xiaoyi Zeng, Dan Ou, Bo Zheng
Finally, we achieve the multimodal item representations by combining both modality-specific and modality-invariant representations.
no code implementations • 19 Feb 2020 • Xiang Li, Shusen Wang, Kun Chen, Zhihua Zhang
As a practical surrogate of OPT, sign-fixing, which uses a diagonal matrix with $\pm 1$ entries as weights, has better computation complexity and stability in experiments.
1 code implementation • AKBC 2020 • Dhruvesh Patel, Shib Sankar Dasgupta, Michael Boratko, Xiang Li, Luke Vilnis, Andrew McCallum
Box Embeddings [Vilnis et al., 2018, Li et al., 2019] represent concepts with hyperrectangles in $n$-dimensional space and are shown to be capable of modeling tree-like structures efficiently by training on a large subset of the transitive closure of the WordNet hypernym graph.
1 code implementation • ACL 2020 • Qi Zhu, Zheng Zhang, Yan Fang, Xiang Li, Ryuichi Takanobu, Jinchao Li, Baolin Peng, Jianfeng Gao, Xiaoyan Zhu, Minlie Huang
We present ConvLab-2, an open-source toolkit that enables researchers to build task-oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation, and diagnose the weakness of systems.