no code implementations • EMNLP 2021 • Tianfu Zhang, Heyan Huang, Chong Feng, Longbing Cao
Multi-head self-attention recently attracts enormous interest owing to its specialized functions, significant parallelizable computation, and flexible extensibility.
no code implementations • 25 Oct 2024 • Hui Chen, Xuhui Fan, Hengyu Liu, Longbing Cao
Additionally, by adding joint noise to the marked temporal data space, BMTPP effectively captures and explicitly reveals the interdependence between timestamps and event types.
no code implementations • 8 Oct 2024 • Hui Chen, Hengyu Liu, Yaqiong Li, Xuhui Fan, Zhilin Zhao, Feng Zhou, Christopher John Quinn, Longbing Cao
Temporal point processes (TPPs) are effective for modeling event occurrences over time, but they struggle with sparse and uncertain events in federated systems, where privacy is a major concern.
no code implementations • 24 Aug 2024 • Jiwei Guan, Tianyu Ding, Longbing Cao, Lei Pan, Chen Wang, Xi Zheng
In this paper, we study the adversarial vulnerability of recent VLP transformers and design a novel Joint Multimodal Transformer Feature Attack (JMTFA) that concurrently introduces adversarial perturbations in both visual and textual modalities under white-box settings.
no code implementations • 6 Aug 2024 • Manqing Dong, Hao Huang, Longbing Cao
An emerging topic in large language models (LLMs) is their application to time series forecasting, characterizing mainstream and patternable characteristics of time series.
no code implementations • 27 Jul 2024 • Jiaxing Miao, Liang Hu, Qi Zhang, Longbing Cao
BGML incorporates a multi-granular hierarchical progressive learning mechanism rooted in feature graph grain learning to mitigate potential conflict between memorization and forgetting in graph memory learning.
no code implementations • 18 Jul 2024 • Hui He, Qi Zhang, Kun Yi, Xiaojun Xue, Shoujin Wang, Liang Hu, Longbing Cao
The non-stationary nature of real-world Multivariate Time Series (MTS) data presents forecasting models with a formidable challenge of the time-variant distribution of time series, referred to as distribution shift.
1 code implementation • 24 May 2024 • Zhangkai Wu, Xuhui Fan, Jin Li, Zhilin Zhao, Hui Chen, Longbing Cao
Specifically, ParamReL proposes a \emph{self-}encoder to learn latent semantics directly from parameters, rather than from observations.
no code implementations • 24 Apr 2024 • Hui Chen, Hengyu Liu, Zhangkai Wu, Xuhui Fan, Longbing Cao
While deep neural networks (DNNs) based personalized federated learning (PFL) is demanding for addressing data heterogeneity and shows promising performance, existing methods for federated learning (FL) suffer from efficient systematic uncertainty quantification.
no code implementations • 13 Feb 2024 • Jin Li, Shoujin Wang, Qi Zhang, Longbing Cao, Fang Chen, Xiuzhen Zhang, Dietmar Jannach, Charu C. Aggarwal
However, emerging vulnerabilities in RS have catalyzed a paradigm shift towards Trustworthy RS (TRS).
no code implementations • 3 Feb 2024 • Jianing He, Qi Zhang, Weiping Ding, Duoqian Miao, Jun Zhao, Liang Hu, Longbing Cao
DE$^3$-BERT implements a hybrid exiting strategy that supplements classic entropy-based local information with distance-based global information to enhance the estimation of prediction correctness for more reliable early exiting decisions.
no code implementations • 7 Jan 2024 • Zhangkai Wu, Longbing Cao, Qi Zhang, Junxian Zhou, Hui Chen
Due to their unsupervised training and uncertainty estimation, deep Variational Autoencoders (VAEs) have become powerful tools for reconstruction-based Time Series Anomaly Detection (TSAD).
1 code implementation • 18 Dec 2023 • An Lao, Qi Zhang, Chongyang Shi, Longbing Cao, Kun Yi, Liang Hu, Duoqian Miao
Multimodal content, such as mixing text with images, presents significant challenges to rumor detection in social media.
1 code implementation • 14 Nov 2023 • Zhilin Zhao, Longbing Cao, Yixuan Zhang, Kun-Yu Lin, Wei-Shi Zheng
This paper introduces OOD knowledge distillation, a pioneering learning framework applicable whether or not training ID data is available, given a standard network.
1 code implementation • NeurIPS 2023 • Kun Yi, Qi Zhang, Wei Fan, Hui He, Liang Hu, Pengyang Wang, Ning An, Longbing Cao, Zhendong Niu
Multivariate time series (MTS) forecasting has shown great importance in numerous industries.
Graph Neural Network Multivariate Time Series Forecasting +1
2 code implementations • NeurIPS 2023 • Kun Yi, Qi Zhang, Wei Fan, Shoujin Wang, Pengyang Wang, Hui He, Defu Lian, Ning An, Longbing Cao, Zhendong Niu
FreTS mainly involves two stages, (i) Domain Conversion, that transforms time-domain signals into complex numbers of frequency domain; (ii) Frequency Learning, that performs our redesigned MLPs for the learning of real and imaginary part of frequency components.
no code implementations • 27 Sep 2023 • Hui Chen, Hengyu Liu, Longbing Cao, Tiancheng Zhang
BPFL aims to quantify the uncertainty and heterogeneity within and across clients towards uncertainty representations by addressing the statistical heterogeneity of client data.
no code implementations • 23 Sep 2023 • Zhangkai Wu, Longbing Cao
Then, a self-supervised contrastive classifier differentiates the disentangled representations from the coupled representations, where a contrastive loss regularizes this contrastive classification together with the TC loss for eliminating entangled factors and strengthening disentangled representations.
no code implementations • 9 May 2023 • Jia Xu, Longbing Cao
Our variational neural network WPVC-VLSTM models variational sequential dependence degrees and structures across multivariate time series by variational long short-term memory networks and regular vine copula.
no code implementations • 26 Apr 2023 • Longbing Cao, Hui Chen, Xuhui Fan, Joao Gama, Yew-Soon Ong, Vipin Kumar
This survey presents a critical overview of BFL, including its basic concepts, its relations to Bayesian learning in the context of FL, and a taxonomy of BFL from both Bayesian and federated perspectives.
1 code implementation • 4 Feb 2023 • Kun Yi, Qi Zhang, Longbing Cao, Shoujin Wang, Guodong Long, Liang Hu, Hui He, Zhendong Niu, Wei Fan, Hui Xiong
Despite the growing attention and the proliferation of research in this emerging field, there is currently a lack of a systematic review and in-depth analysis of deep learning-based time series models with FT.
1 code implementation • 27 Jan 2023 • Hui He, Qi Zhang, Shoujin Wang, Kun Yi, Zhendong Niu, Longbing Cao
To bridge such significant gap, we formulate the fairness modeling problem as learning informative representations attending to both advantaged and disadvantaged variables.
1 code implementation • 1 Jan 2023 • Zhangkai Wu, Longbing Cao, Lei Qi
VAEs still suffer from uncertain tradeoff learning. We propose a novel evolutionary variational autoencoder (eVAE) building on the variational information bottleneck (VIB) theory and integrative evolutionary neural learning.
1 code implementation • 5 Nov 2022 • Rui Yu, Yifeng Li, Wenpeng Lu, Longbing Cao
In natural language processing (NLP), the context of a word or sentence plays an essential role.
no code implementations • 6 Oct 2022 • Kun Yi, Qi Zhang, Liang Hu, Hui He, Ning An, Longbing Cao, Zhendong Niu
The key problem in multivariate time series (MTS) analysis and forecasting aims to disclose the underlying couplings between variables that drive the co-movements.
no code implementations • 1 Sep 2022 • Hui He, Qi Zhang, Kun Yi, Kaize Shi, Zhendong Niu, Longbing Cao
Most existing MTS forecasting models greatly suffer from distribution drift and degrade the forecasting performance over time.
no code implementations • 29 Jun 2022 • Qi Zhang, Liang Hu, Chongyang Shi, Ke Liu, Longbing Cao
Case-based Reasoning (CBR) on high-dimensional and heterogeneous data is a trending yet challenging and computationally expensive task in the real world.
1 code implementation • 19 Jun 2022 • Zhilin Zhao, Longbing Cao, Kun-Yu Lin
We thus improve the discriminability of a pretrained network by finetuning it with out-of-distribution samples drawn from the cross-class vicinity distribution, where each out-of-distribution input corresponds to a complementary label.
1 code implementation • 19 Jun 2022 • Zhilin Zhao, Longbing Cao
To distinguish in- and out-of-distribution samples, Dual Representation Learning (DRL) makes out-of-distribution samples harder to have high-confidence predictions by exploring both strongly and weakly label-related information from in-distribution samples.
no code implementations • 19 Jun 2022 • Zhilin Zhao, Longbing Cao, Kun-Yu Lin
To tackle this issue, several state-of-the-art methods include adding extra OOD samples to training and assign them with manually-defined labels.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
1 code implementation • 19 Jun 2022 • Zhilin Zhao, Longbing Cao, Chang-Dong Wang
We observe that both in- and out-of-distribution samples can almost invariably be ruled out from belonging to certain classes, aside from those corresponding to unreliable ground-truth labels.
no code implementations • 4 Apr 2022 • Wei Wang, Longbing Cao
A DPP-based NSP representation and actionable NSP discovery method EINSP introduces novel and significant contributions for NSA and sequence analysis: (1) it represents NSPs by a determinantal point process (DPP) based graph; (2) it quantifies actionable NSPs in terms of their statistical significance, diversity, and strength of explicit/implicit element/pattern relations; and (3) it models and measures both explicit and implicit element/pattern relations in the DPP-based NSP graph to represent direct and indirect couplings between NSP items, elements and patterns.
1 code implementation • 12 Feb 2022 • Zhilin Zhao, Longbing Cao, Yuanyu Wan
MOOE learns static offline experts from offline intervals and maintains a dynamic online expert for the current online interval.
no code implementations • 3 Dec 2021 • Longbing Cao, Chengzhang Zhu
The learned universal representation of each customer is all-round, representative and benchmarkable to support both enterprise-wide and domain-specific learning goals and tasks in enterprise data science.
no code implementations • 23 Aug 2021 • Qing Liu, Longbing Cao
Different from existing COVID-19 models, SUDR characterizes the undocumented infections during unknown transmission processes.
1 code implementation • 23 Aug 2021 • Zhilin Zhao, Longbing Cao, Kun-Yu Lin
According to the Shannon entropy, an energy-based implicit generator is inferred from a discriminator without extra training costs.
no code implementations • 19 Aug 2021 • Longbing Cao, Chengzhang Zhu
CRN represents multiple coupled dynamic sequences of a customer's historical and current states, responses to decision-makers' actions, decision rewards to actions, and learns long-term multi-sequence interactions between parties (customer and decision-maker).
no code implementations • 18 Aug 2021 • Siyuan Ren, Bin Guo, Longbing Cao, Ke Li, Jiaqi Liu, Zhiwen Yu
To address these issues, we propose DeepExpress - a deep-learning based express delivery sequence prediction model, which extends the classic seq2seq framework to learning complex coupling between sequence and features.
no code implementations • 20 Jul 2021 • Longbing Cao
The landscapes and challenges of financial businesses and data are firstly outlined, followed by a comprehensive categorization and a dense overview of the decades of AI research in finance.
no code implementations • 23 Jun 2021 • Jinjin Guo, Longbing Cao, Zhiguo Gong
The abundant sequential documents such as online archival, social media and news feeds are streamingly updated, where each chunk of documents is incorporated with smoothly evolving yet dependent topics.
1 code implementation • 13 May 2021 • Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet A. Orgun, Longbing Cao, Francesco Ricci, Philip S. Yu
Recent years have witnessed the fast development of the emerging topic of Graph Learning based Recommender Systems (GLRS).
no code implementations • 16 Apr 2021 • Longbing Cao, Qing Liu
The SARS-CoV-2 virus and COVID-19 disease have posed unprecedented and overwhelming demand, challenges and opportunities to domain, model and data driven modeling.
no code implementations • 21 Mar 2021 • Guansong Pang, Longbing Cao, Ling Chen
Most of existing outlier detection methods assume that the outlier factors (i. e., outlierness scoring measures) of data entities (e. g., feature values and data objects) are Independent and Identically Distributed (IID).
2 code implementations • 15 Sep 2020 • Guansong Pang, Anton Van Den Hengel, Chunhua Shen, Longbing Cao
We consider the problem of anomaly detection with a small set of partially labeled anomaly examples and a large-scale unlabeled dataset.
no code implementations • 21 Jul 2020 • Chengzhang Zhu, Longbing Cao, Jianping Yin
This work introduces a shallow but powerful UNsupervised heTerogeneous couplIng lEarning (UNTIE) approach for representing coupled categorical data by untying the interactions between couplings and revealing heterogeneous distributions embedded in each type of couplings.
no code implementations • 10 Jul 2020 • Longbing Cao, Qiang Yang, Philip S. Yu
Financial technology (FinTech) has been playing an increasingly critical role in driving modern economies, society, technology, and many other areas.
no code implementations • 6 Jul 2020 • Guansong Pang, Chunhua Shen, Longbing Cao, Anton Van Den Hengel
This paper surveys the research of deep anomaly detection with a comprehensive taxonomy, covering advancements in three high-level categories and 11 fine-grained categories of the methods.
no code implementations • 1 Jul 2020 • Longbing Cao
While recommendation plays an increasingly critical role in our living, study, work, and entertainment, the recommendations we receive are often for irrelevant, duplicate, or uninteresting products and services.
no code implementations • 1 Jul 2020 • Longbing Cao
This paper provides a comprehensive survey and tutorial of the fundamental aspects of data science: the evolution from data analysis to data science, the data science concepts, a big picture of the era of data science, the major challenges and directions in data innovation, the nature of data analytics, new industrialization and service opportunities in the data economy, the profession and competency of data education, and the future of data science.
Computers and Society
no code implementations • 1 Jul 2020 • Longbing Cao
Complex applications such as big data analytics involve different forms of coupling relationships that reflect interactions between factors related to technical, business (domain-specific) and environmental (including socio-cultural and economic) aspects.
no code implementations • 30 May 2020 • Shoujin Wang, Longbing Cao, Liang Hu, Shlomo Berkovsky, Xiaoshui Huang, Lin Xiao, Wenpeng Lu
Most existing TBRSs recommend next item by only modeling the intra-transaction dependency within the current transaction while ignoring inter-transaction dependency with recent transactions that may also affect the next item.
no code implementations • 22 Apr 2020 • Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet Orgun, Longbing Cao, Nan Wang, Francesco Ricci, Philip S. Yu
Recent years have witnessed the fast development of the emerging topic of Graph Learning based Recommender Systems (GLRS).
no code implementations • 28 Dec 2019 • Shoujin Wang, Liang Hu, Yan Wang, Longbing Cao, Quan Z. Sheng, Mehmet Orgun
The emerging topic of sequential recommender systems has attracted increasing attention in recent years. Different from the conventional recommender systems including collaborative filtering and content-based filtering, SRSs try to understand and model the sequential user behaviors, the interactions between users and items, and the evolution of users preferences and item popularity over time.
no code implementations • 1 Jul 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Zheng, Gang Pan
Alternatively, derivative-based methods treat the optimization process as a blackbox and show robustness and stability in learning continuous control tasks, but not data efficient in learning.
1 code implementation • 25 May 2019 • Peng Xu, Zhaohong Deng, Kup-Sze Choi, Longbing Cao, Shitong Wang
More specifically, it exploits the agreement and disagreement among views by sharing a common clustering results along the sample dimension and keeping the clustering results of each view specific along the feature dimension.
no code implementations • 17 May 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan
However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems.
1 code implementation • 13 Feb 2019 • Shoujin Wang, Longbing Cao, Yan Wang, Quan Z. Sheng, Mehmet Orgun, Defu Lian
In recent years, session-based recommender systems (SBRSs) have emerged as a new paradigm of RSs.
no code implementations • NeurIPS 2018 • Trong Dinh Thac Do, Longbing Cao
A conjugate Gamma-Poisson model for Dynamic Matrix Factorization incorporated with metadata influence (mGDMF for short) is proposed to effectively and efficiently model massive, sparse and dynamic data in recommendations.
3 code implementations • 13 Jun 2018 • Guansong Pang, Longbing Cao, Ling Chen, Huan Liu
However, existing unsupervised representation learning methods mainly focus on preserving the data regularity information and learning the representations independently of subsequent outlier detection methods, which can result in suboptimal and unstable performance of detecting irregularities (i. e., outliers).
no code implementations • 18 Feb 2015 • Fangfang Li, Guandong Xu, Longbing Cao
In this paper, we propose an innovative and effective clustering framework based on self-adaptive labeling (CSAL) which integrates clustering and classification on unlabeled data.
no code implementations • 8 Apr 2014 • Fangfang Li, Guandong Xu, Longbing Cao
The essence of the challenges cold start and sparsity in Recommender Systems (RS) is that the extant techniques, such as Collaborative Filtering (CF) and Matrix Factorization (MF), mainly rely on the user-item rating matrix, which sometimes is not informative enough for predicting recommendations.
no code implementations • 6 Oct 2013 • Xuhui Fan, Richard Yi Da Xu, Longbing Cao, Yin Song
In this work, we propose an informative relational model (InfRM) framework to adequately involve rich information and its granularity in a network, including metadata information about each entity and various forms of link data.
no code implementations • 13 Jun 2013 • Xuhui Fan, Longbing Cao, Richard Yi Da Xu
Directional and pairwise measurements are often used to model inter-relationships in a social network setting.
no code implementations • 13 Jun 2013 • Xuhui Fan, Longbing Cao
Graph Shift (GS) algorithms are recently focused as a promising approach for discovering dense subgraphs in noisy data.
no code implementations • 13 Jun 2013 • Xuhui Fan, Yiling Zeng, Longbing Cao
However, several problems remains unsolved in this pioneering work, including the power-law data applicability, mechanism to merge centers to avoid the over-fitting problem, clustering order problem, e. t. c.. To address these issues, the Pitman-Yor Process based k-means (namely \emph{pyp-means}) is proposed in this paper.
no code implementations • 12 Jun 2013 • Xuhui Fan, Longbing Cao, Richard Yi Da Xu
To this end, we introduce a \emph{Copula Mixed-Membership Stochastic Blockmodel (cMMSB)} where an individual Copula function is employed to jointly model the membership pairs of those nodes within the subgroup of interest.
no code implementations • 24 May 2013 • Yin Song, Longbing Cao, Xuhui Fan, Wei Cao, Jian Zhang
These sequence-level latent parameters for each sequence are modeled as latent Dirichlet random variables and parameterized by a set of deterministic database-level hyper-parameters.