Search Results for author: Lei Cao

Found 18 papers, 5 papers with code

Tensor-based Graph Learning with Consistency and Specificity for Multi-view Clustering

1 code implementation • 27 Mar 2024 • Long Shi, Lei Cao, Yunshan Ye, Yu Zhao, Badong Chen

In the context of multi-view clustering, graph learning is recognized as a crucial technique, which generally involves constructing an adaptive neighbor graph based on probabilistic neighbors, and then learning a consensus graph for clustering.

Clustering Graph Learning +1

Nonlinear subspace clustering by functional link neural networks

no code implementations • 3 Feb 2024 • Long Shi, Lei Cao, Zhongpu Chen, Badong Chen, Yu Zhao

Additionally, we introduce a convex combination subspace clustering scheme, which combines a linear subspace clustering method with the functional link neural network subspace clustering approach.

Clustering Computational Efficiency

Enhanced Latent Multi-view Subspace Clustering

1 code implementation • 22 Dec 2023 • Long Shi, Lei Cao, Jun Wang, Badong Chen

Specifically, we stack the data matrices from various views into the block-diagonal locations of the augmented matrix to exploit the complementary information.

Clustering Multi-view Subspace Clustering

Extract-Transform-Load for Video Streams

1 code implementation • 7 Oct 2023 • Ferdinand Kossmann, Ziniu Wu, Eugenie Lai, Nesime Tatbul, Lei Cao, Tim Kraska, Samuel Madden

We find that no current system sufficiently fulfills both needs and therefore propose Skyscraper, a system tailored to V-ETL.

Self-Driving Cars

SEED: Domain-Specific Data Curation With Large Language Models

no code implementations • 1 Oct 2023 • Zui Chen, Lei Cao, Sam Madden, Tim Kraska, Zeyuan Shang, Ju Fan, Nan Tang, Zihui Gu, Chunwei Liu, Michael Cafarella

As a result, data scientists often have to develop domain-specific solutions tailored to both the dataset and the task, e.g., writing domain-specific code or training machine learning models on a sufficient number of annotated examples.

Code Generation Imputation +1

VerifAI: Verified Generative AI

no code implementations • 6 Jul 2023 • Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy

We propose that verifying the outputs of generative AI from a data management perspective is an emerging issue for generative AI.

Decision Making Knowledge Graphs +2

RoTaR: Efficient Row-Based Table Representation Learning via Teacher-Student Training

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

In addition to the row-based architecture, we introduce several techniques to improve the performance of the RoTaR model: cell-aware position embedding, a teacher-student training paradigm, and selective backward.

Position Representation Learning

Lingua Manga: A Generic Large Language Model Centric System for Data Curation

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

Data curation is a wide-ranging area which contains many critical but time-consuming data processing tasks.

Language Modelling Large Language Model

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation

1 code implementation • 15 Jun 2023 • Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du

PLMs can perform well in schema alignment but struggle to achieve complex reasoning, while LLMs are superior in complex reasoning tasks but cannot achieve precise schema alignment.

RITA: Group Attention is All You Need for Timeseries Analytics

no code implementations • 2 Jun 2023 • Jiaming Liang, Lei Cao, Samuel Madden, Zachary Ives, Guoliang Li

Timeseries analytics is of great importance in many real-world applications.

Interpretable Outlier Summarization

no code implementations • 11 Mar 2023 • Yu Wang, Lei Cao, Yizhou Yan, Samuel Madden

Moreover, to effectively handle high-dimensional, highly complex data sets that are hard to summarize with simple rules, we propose a localized STAIR approach, called L-STAIR.

Anomaly Detection Outlier Detection

Scalable Motif Counting for Large-scale Temporal Graphs

1 code implementation • 20 Apr 2022 • Zhongqiang Gao, Chuanqi Cheng, Yanwei Yu, Lei Cao, Chao Huang, Junyu Dong

We first categorize the temporal motifs based on their distinct properties, and then design customized algorithms that offer efficient strategies to exactly count the motif instances of each category.

Anomaly Detection Representation Learning

Parallel Fourier Ptychography reconstruction

no code implementations • 4 Mar 2022 • Guocheng Zhou, Shaohui Zhang, Yao Hu, Lei Cao, Yong Huang, Qun Hao

Fourier ptychography has attracted wide attention for its large space-bandwidth product and its capacity for quantitative phase measurement.

Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media

no code implementations • 16 Dec 2020 • Lei Cao, Huijun Zhang, Ling Feng

Social media is the most popular platform for self-expression, emotion release, and personal interaction, and individuals may exhibit a number of symptoms of suicidal ideation there.

Latent Suicide Risk Detection on Microblog via Suicide-Oriented Word Embeddings and Layered Attention

no code implementations • IJCNLP 2019 • Lei Cao, Huijun Zhang, Ling Feng, Zihan Wei, Xin Wang, Ningyun Li, Xiaohao He

Although detection of suicidal ideation on social media has made great progress in recent years, people's implicitly and anti-real contrarily expressed posts remain an obstacle, preventing detectors from achieving more satisfactory performance.

Word Embeddings

Context-Aware Object Detection With Convolutional Neural Networks

no code implementations • 25 Sep 2019 • Yizhou Yan, Lei Cao, Samuel Madden, Elke Rundensteiner

Although the state-of-the-art object detection methods are successful in detecting and classifying objects by leveraging deep convolutional neural networks (CNNs), these methods overlook the semantic context which implies the probabilities that different classes of objects occur jointly.

Object object-detection +1

Unknown-Aware Deep Neural Network

no code implementations • 25 Sep 2019 • Lei Cao, Yizhou Yan, Samuel Madden, Elke Rundensteiner

Unfortunately, although the strong generalization ability of existing CNNs ensures their accuracy when classifying known objects, it also causes them to often assign an unknown to a target class with high confidence.

Image Classification
