Search Results for author: Xu Chu

Found 25 papers, 12 papers with code

Distance Metric Learning with Joint Representation Diversification

1 code implementation ICML 2020 Xu Chu, Yang Lin, Xiting Wang, Xin Gao, Qi Tong, Hailong Yu, Yasha Wang

Distance metric learning (DML) is to learn a representation space equipped with a metric, such that examples from the same class are closer than examples from different classes with respect to the metric.

Metric Learning

LoRA Dropout as a Sparsity Regularizer for Overfitting Control

no code implementations15 Apr 2024 Yang Lin, Xinyu Ma, Xu Chu, Yujie Jin, Zhibang Yang, Yasha Wang, Hong Mei

We then demonstrate the theoretical mechanism of our LoRA Dropout mechanism from the perspective of sparsity regularization by providing a generalization error bound under this framework.

Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation

no code implementations5 Apr 2024 Xinyu Ma, Xu Chu, Zhibang Yang, Yang Lin, Xin Gao, Junfeng Zhao

With the increasingly powerful performances and enormous scales of Pretrained Language Models (PLMs), promoting parameter efficiency in fine-tuning has become a crucial need for effective and efficient adaptation to various downstream tasks.

Exploring the Potential of Large Language Models in Graph Generation

no code implementations21 Mar 2024 Yang Yao, Xin Wang, Zeyang Zhang, Yijian Qin, Ziwei Zhang, Xu Chu, Yuekui Yang, Wenwu Zhu, Hong Mei

In this paper, we propose LLM4GraphGen to explore the ability of LLMs for graph generation with systematical task designs and extensive experiments.

Drug Discovery Graph Generation +1

Learnable Prompt as Pseudo-Imputation: Reassessing the Necessity of Traditional EHR Data Imputation in Downstream Clinical Prediction

no code implementations30 Jan 2024 Weibin Liao, Yinghao Zhu, Zixiang Wang, Xu Chu, Yasha Wang, Liantao Ma

PAI no longer introduces any imputed data but constructs a learnable prompt to model the implicit preferences of the downstream model for missing values, resulting in a significant performance improvement for all EHR analysis models.

Imputation

Infinite-Horizon Graph Filters: Leveraging Power Series to Enhance Sparse Information Aggregation

1 code implementation18 Jan 2024 Ruizhe Zhang, Xinke Jiang, Yuchen Fang, Jiayuan Luo, Yongxin Xu, Yichen Zhu, Xu Chu, Junfeng Zhao, Yasha Wang

Graph Neural Networks (GNNs) have shown considerable effectiveness in a variety of graph learning tasks, particularly those based on the message-passing approach in recent years.

Graph Learning Node Classification

Imputation with Inter-Series Information from Prototypes for Irregular Sampled Time Series

1 code implementation14 Jan 2024 Zhihao Yu, Xu Chu, Liantao Ma, Yasha Wang, Wenwu Zhu

To bridge this gap, we propose PRIME, a Prototype Recurrent Imputation ModEl, which integrates both intra-series and inter-series information for imputing missing values in irregularly sampled time series.

Imputation Memorization +1

Think and Retrieval: A Hypothesis Knowledge Graph Enhanced Medical Large Language Models

no code implementations26 Dec 2023 Xinke Jiang, Ruizhe Zhang, Yongxin Xu, Rihong Qiu, Yue Fang, Zhiyuan Wang, Jinyi Tang, Hongxin Ding, Xu Chu, Junfeng Zhao, Yasha Wang

We explore how the rise of Large Language Models (LLMs) significantly impacts task performance in the field of Natural Language Processing.

Knowledge Graphs Retrieval

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data

1 code implementation20 Aug 2023 Peng Li, Zhiyi Chen, Xu Chu, Kexin Rong

Data preprocessing is a crucial step in the machine learning process that transforms raw data into a more usable format for downstream ML models.

AutoML

Improving Generalization of Meta-Learning With Inverted Regularization at Inner-Level

no code implementations CVPR 2023 Lianzhe Wang, Shiji Zhou, Shanghang Zhang, Xu Chu, Heng Chang, Wenwu Zhu

Despite the broad interest in meta-learning, the generalization problem remains one of the significant challenges in this field.

Meta-Learning

Ground Truth Inference for Weakly Supervised Entity Matching

no code implementations13 Nov 2022 Renzhi Wu, Alexander Bendeck, Xu Chu, Yeye He

We also show that a deep learning EM end model (DeepMatcher) trained on labels generated from our weak supervision approach is comparable to an end model trained using tens of thousands of ground-truth labels, demonstrating that our approach can significantly reduce the labeling efforts required in EM.

Domain Generalization through the Lens of Angular Invariance

1 code implementation28 Oct 2022 Yujie Jin, Xu Chu, Yasha Wang, Wenwu Zhu

Based on the proposed term of invariance, we propose a novel deep DG method called Angular Invariance Domain Generalization Network (AIDGN).

Domain Generalization Representation Learning

M$^3$Care: Learning with Missing Modalities in Multimodal Healthcare Data

1 code implementation28 Oct 2022 Chaohe Zhang, Xu Chu, Liantao Ma, Yinghao Zhu, Yasha Wang, Jiangtao Wang, Junfeng Zhao

M3Care is an end-to-end model compensating the missing information of the patients with missing modalities to perform clinical analysis.

iFlipper: Label Flipping for Individual Fairness

1 code implementation15 Sep 2022 Hantian Zhang, Ki Hyun Tae, Jaeyoung Park, Xu Chu, Steven Euijong Whang

We then propose an approximate linear programming algorithm and provide theoretical guarantees on how close its result is to the optimal solution in terms of the number of label flips.

Fairness

Learning Hyper Label Model for Programmatic Weak Supervision

1 code implementation27 Jul 2022 Renzhi Wu, Shen-En Chen, Jieyu Zhang, Xu Chu

We train the model on synthetic data generated in the way that ensures the model approximates the analytical optimal solution, and build the model upon Graph Neural Network (GNN) to ensure the model prediction being invariant (or equivariant) to the permutation of LFs (or data points).

MedFACT: Modeling Medical Feature Correlations in Patient Health Representation Learning via Feature Clustering

no code implementations21 Apr 2022 Xinyu Ma, Xu Chu, Yasha Wang, Hailong Yu, Liantao Ma, Wen Tang, Junfeng Zhao

Thus, to address the issues, we expect to group up strongly correlated features and learn feature correlations in a group-wise manner to reduce the learning complexity without losing generality.

Clustering Representation Learning

Learning to be a Statistician: Learned Estimator for Number of Distinct Values

1 code implementation6 Feb 2022 Renzhi Wu, Bolin Ding, Xu Chu, Zhewei Wei, Xiening Dai, Tao Guan, Jingren Zhou

We derive conditions of the learning framework under which the learned model is workload agnostic, in the sense that the model/estimator can be trained with synthetically generated training data, and then deployed into any data warehouse simply as, e. g., user-defined functions (UDFs), to offer efficient (within microseconds on CPU) and accurate NDV estimations for unseen tables and workloads.

Demonstration of Panda: A Weakly Supervised Entity Matching System

no code implementations21 Jun 2021 Renzhi Wu, Prem Sakala, Peng Li, Xu Chu, Yeye He

Panda's IDE includes many novel features purpose-built for EM, such as smart data sampling, a builtin library of EM utility functions, automatically generated LFs, visual debugging of LFs, and finally, an EM-specific labeling model.

Management

OmniFair: A Declarative System for Model-Agnostic Group Fairness in Machine Learning

no code implementations13 Mar 2021 Hantian Zhang, Xu Chu, Abolfazl Asudeh, Shamkant B. Navathe

Existing techniques for producing fair ML models either are limited to the type of fairness constraints they can handle (e. g., preprocessing) or require nontrivial modifications to downstream ML training algorithms (e. g., in-processing).

BIG-bench Machine Learning Fairness

Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions

1 code implementation11 May 2020 Bojan Karlaš, Peng Li, Renzhi Wu, Nezihe Merve Gürel, Xu Chu, Wentao Wu, Ce Zhang

Machine learning (ML) applications have been thriving recently, largely attributed to the increasing availability of data.

BIG-bench Machine Learning

ZeroER: Entity Resolution using Zero Labeled Examples

1 code implementation16 Aug 2019 Renzhi Wu, Sanya Chaba, Saurabh Sawlani, Xu Chu, Saravanan Thirumuruganathan

We investigate an important problem that vexes practitioners: is it possible to design an effective algorithm for ER that requires Zero labeled examples, yet can achieve performance comparable to supervised approaches?

Entity Resolution

CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks

no code implementations20 Apr 2019 Peng Li, Xi Rao, Jennifer Blase, Yue Zhang, Xu Chu, Ce Zhang

Data quality affects machine learning (ML) model performances, and data scientists spend considerable amount of time on data cleaning before model training.

General Classification Two-sample testing

GOGGLES: Automatic Image Labeling with Affinity Coding

1 code implementation11 Mar 2019 Nilaksh Das, Sanya Chaba, Renzhi Wu, Sakshi Gandhi, Duen Horng Chau, Xu Chu

We build the GOGGLES system that implements affinity coding for labeling image datasets by designing a novel set of reusable affinity functions for images, and propose a novel hierarchical generative model for class inference using a small development set.

Few-Shot Learning

Multi-Label Robust Factorization Autoencoder and its Application in Predicting Drug-Drug Interactions

no code implementations1 Nov 2018 Xu Chu, Yang Lin, Jingyue Gao, Jiangtao Wang, Yasha Wang, Leye Wang

However, the shallow models leveraging bilinear forms suffer from limitations on capturing complicated nonlinear interactions between drug pairs.

Motif-based Rule Discovery for Predicting Real-valued Time Series

no code implementations14 Sep 2017 Yuanduo He, Xu Chu, Juguang Peng, Jingyue Gao, Yasha Wang

Time series prediction is of great significance in many applications and has attracted extensive attention from the data mining community.

Time Series Time Series Prediction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.