Search Results for author: Lei Cao

Found 18 papers, 5 papers with code

Tensor-based Graph Learning with Consistency and Specificity for Multi-view Clustering

1 code implementation • 27 Mar 2024 • Long Shi, Lei Cao, Yunshan Ye, Yu Zhao, Badong Chen

In the context of multi-view clustering, graph learning is recognized as a crucial technique, which generally involves constructing an adaptive neighbor graph based on probabilistic neighbors, and then learning a consensus graph to for clustering.

Clustering Graph Learning +1

Paper
Code

Nonlinear subspace clustering by functional link neural networks

no code implementations • 3 Feb 2024 • Long Shi, Lei Cao, Zhongpu Chen, Badong Chen, Yu Zhao

Additionally, we introduce a convex combination subspace clustering scheme, which combining a linear subspace clustering method with the functional link neural network subspace clustering approach.

Clustering Computational Efficiency

Paper
Add Code

Enhanced Latent Multi-view Subspace Clustering

1 code implementation • 22 Dec 2023 • Long Shi, Lei Cao, Jun Wang, Badong Chen

Specifically, we stack the data matrices from various views into the block-diagonal locations of the augmented matrix to exploit the complementary information.

Clustering Multi-view Subspace Clustering

Paper
Code

Extract-Transform-Load for Video Streams

1 code implementation • 7 Oct 2023 • Ferdinand Kossmann, Ziniu Wu, Eugenie Lai, Nesime Tatbul, Lei Cao, Tim Kraska, Samuel Madden

We find that no current system sufficiently fulfills both needs and therefore propose Skyscraper, a system tailored to V-ETL.

Self-Driving Cars

Paper
Code

SEED: Domain-Specific Data Curation With Large Language Models

no code implementations • 1 Oct 2023 • Zui Chen, Lei Cao, Sam Madden, Tim Kraska, Zeyuan Shang, Ju Fan, Nan Tang, Zihui Gu, Chunwei Liu, Michael Cafarella

As a result, data scientists often have to develop domain-specific solutions tailored to both the dataset and the task, e. g. writing domain-specific code or training machine learning models on a sufficient number of annotated examples.

Code Generation Imputation +1

Paper
Add Code

VerifAI: Verified Generative AI

no code implementations • 6 Jul 2023 • Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy

We propose that verifying the outputs of generative AI from a data management perspective is an emerging issue for generative AI.

Decision Making Knowledge Graphs +2

Paper
Add Code

RoTaR: Efficient Row-Based Table Representation Learning via Teacher-Student Training

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

In addition to the row-based architecture, we introduce several techniques: cell-aware position embedding, teacher-student training paradigm, and selective backward to improve the performance of RoTaR model.

Position Representation Learning

Paper
Add Code

Lingua Manga: A Generic Large Language Model Centric System for Data Curation

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

Data curation is a wide-ranging area which contains many critical but time-consuming data processing tasks.

Language Modelling Large Language Model

Paper
Add Code

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation

1 code implementation • 15 Jun 2023 • Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du

PLMs can perform well in schema alignment but struggle to achieve complex reasoning, while LLMs is superior in complex reasoning tasks but cannot achieve precise schema alignment.

Paper
Code

RITA: Group Attention is All You Need for Timeseries Analytics

no code implementations • 2 Jun 2023 • Jiaming Liang, Lei Cao, Samuel Madden, Zachary Ives, Guoliang Li

Timeseries analytics is of great importance in many real-world applications.

Paper
Add Code

Interpretable Outlier Summarization

no code implementations • 11 Mar 2023 • Yu Wang, Lei Cao, Yizhou Yan, Samuel Madden

Moreover, to effectively handle high dimensional, highly complex data sets which are hard to summarize with simple rules, we propose a localized STAIR approach, called L-STAIR.

Anomaly Detection Outlier Detection

Paper
Add Code

Scalable Motif Counting for Large-scale Temporal Graphs

1 code implementation • 20 Apr 2022 • Zhongqiang Gao, Chuanqi Cheng, Yanwei Yu, Lei Cao, Chao Huang, Junyu Dong

We first categorize the temporal motifs based on their distinct properties, and then design customized algorithms that offer efficient strategies to exactly count the motif instances of each category.

Anomaly Detection Representation Learning

Paper
Code

Parallel Fourier Ptychography reconstruction

no code implementations • 4 Mar 2022 • Guocheng Zhou, Shaohui Zhang, Yao Hu, Lei Cao, Yong Huang, Qun Hao

Fourier ptychography has attracted a wide range of focus for its ability of large space-bandwidth-produce, and quantative phase measurement.

Paper
Add Code

Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media

no code implementations • 16 Dec 2020 • Lei Cao, Huijun Zhang, Ling Feng

As the most popular platform for self-expression, emotion release, and personal interaction, individuals may exhibit a number of symptoms of suicidal ideation on social media.

Paper
Add Code

Latent Suicide Risk Detection on Microblog via Suicide-Oriented Word Embeddings and Layered Attention

no code implementations • IJCNLP 2019 • Lei Cao, Huijun Zhang, Ling Feng, Zihan Wei, Xin Wang, Ningyun Li, Xiaohao He

Despite detection of suicidal ideation on social media has made great progress in recent years, people's implicitly and anti-real contrarily expressed posts still remain as an obstacle, constraining the detectors to acquire higher satisfactory performance.

Word Embeddings

Paper
Add Code

Context-Aware Object Detection With Convolutional Neural Networks

no code implementations • 25 Sep 2019 • Yizhou Yan, Lei Cao, Samuel Madden, Elke Rundensteiner

Although the state-of-the-art object detection methods are successful in detecting and classifying objects by leveraging deep convolutional neural networks (CNNs), these methods overlook the semantic context which implies the probabilities that different classes of objects occur jointly.

Object object-detection +1

Paper
Add Code

Unknown-Aware Deep Neural Network

no code implementations • 25 Sep 2019 • Lei Cao, Yizhou Yan, Samuel Madden, Elke Rundensteiner

Unfortunately, although the strong generalization ability of existing CNNs ensures their accuracy when classifying known objects, it also causes them to often assign an unknown to a target class with high confidence.

Image Classification

Paper
Add Code

Outlier Detection from Image Data

no code implementations • ICLR 2019 • Lei Cao, Yizhou Yan, Samuel Madden, Elke Rundensteiner

Modern applications from Autonomous Vehicles to Video Surveillance generate massive amounts of image data.

Autonomous Vehicles General Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.