Search Results for author: Eli Chien

Found 27 papers, 17 papers with code

Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning

no code implementations11 Dec 2024 Rongzhe Wei, Mufei Li, Mohsen Ghassemi, Eleonora Kreačić, YiFan Li, Xiang Yue, Bo Li, Vamsi K. Potluru, Pan Li, Eli Chien

Given that the right to be forgotten should be upheld for every individual, we advocate for a more rigorous evaluation of LLM unlearning methods.

Language Modeling Language Modelling +2

LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation

1 code implementation4 Nov 2024 Mufei Li, Viraj Shitole, Eli Chien, Changhai Man, Zhaodong Wang, Srinivas Sridharan, Ying Zhang, Tushar Krishna, Pan Li

By interpreting the partial order of nodes as a sequence of bipartite graphs, LayerDAG leverages autoregressive generation to model directional dependencies and employs diffusion models to capture logical dependencies within each bipartite graph.

Benchmarking Graph Generation

Privately Learning from Graphs with Applications in Fine-tuning Large Language Models

1 code implementation10 Oct 2024 Haoteng Yin, Rongzhe Wei, Eli Chien, Pan Li

Additionally, we explore the trade-offs between privacy, utility, and computational efficiency, offering insights into the practical deployment of our approach.

Computational Efficiency Privacy Preserving +1

Convergent Privacy Loss of Noisy-SGD without Convexity and Smoothness

no code implementations1 Oct 2024 Eli Chien, Pan Li

We also provide a strictly better privacy bound compared to state-of-the-art results for smooth strongly convex losses.

LEMMA

Differentially Private Graph Diffusion with Applications in Personalized PageRanks

no code implementations22 Jun 2024 Rongzhe Wei, Eli Chien, Pan Li

Our privacy loss analysis is based on Privacy Amplification by Iteration (PABI), which to our best knowledge, is the first effort that analyzes PABI with Laplace noise and provides relevant applications.

Certified Machine Unlearning via Noisy Stochastic Gradient Descent

2 code implementations25 Mar 2024 Eli Chien, Haoyu Wang, Ziang Chen, Pan Li

``The right to be forgotten'' ensured by laws for user data privacy becomes increasingly important.

Machine Unlearning

Machine Unlearning of Pre-trained Large Language Models

1 code implementation23 Feb 2024 Jin Yao, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, Zezhou Cheng, Xiang Yue

This study investigates the concept of the `right to be forgotten' within the context of large language models (LLMs).

Machine Unlearning

Langevin Unlearning: A New Perspective of Noisy Gradient Descent for Machine Unlearning

2 code implementations18 Jan 2024 Eli Chien, Haoyu Wang, Ziang Chen, Pan Li

Machine unlearning has raised significant interest with the adoption of laws ensuring the ``right to be forgotten''.

Machine Unlearning

Breaking the Trilemma of Privacy, Utility, Efficiency via Controllable Machine Unlearning

1 code implementation28 Oct 2023 Zheyuan Liu, Guangyao Dou, Yijun Tian, Chunhui Zhang, Eli Chien, Ziwei Zhu

Exploring the full spectrum of trade-offs between privacy, model utility, and runtime efficiency is critical for practical unlearning scenarios.

Machine Unlearning

On the Inherent Privacy Properties of Discrete Denoising Diffusion Models

no code implementations24 Oct 2023 Rongzhe Wei, Eleonora Kreačić, Haoyu Wang, Haoteng Yin, Eli Chien, Vamsi K. Potluru, Pan Li

Privacy concerns have led to a surge in the creation of synthetic datasets, with diffusion models emerging as a promising avenue.

Dataset Generation Denoising +1

Federated Classification in Hyperbolic Spaces via Secure Aggregation of Convex Hulls

2 code implementations14 Aug 2023 Saurav Prakash, Jin Sima, Chao Pan, Eli Chien, Olgica Milenkovic

Third, we compute the complexity of the convex hulls in hyperbolic spaces to assess the extent of data leakage; at the same time, in order to limit communication cost for the hulls, we propose a new quantization method for the Poincar\'e disc coupled with Reed-Solomon-like encoding.

Federated Learning graph partitioning +2

Representer Point Selection for Explaining Regularized High-dimensional Models

no code implementations31 May 2023 Che-Ping Tsai, Jiong Zhang, Eli Chien, Hsiang-Fu Yu, Cho-Jui Hsieh, Pradeep Ravikumar

We introduce a novel class of sample-based explanations we term high-dimensional representers, that can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training samples.

Binary Classification Collaborative Filtering +1

PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation

1 code implementation21 May 2023 Eli Chien, Jiong Zhang, Cho-Jui Hsieh, Jyun-Yu Jiang, Wei-Cheng Chang, Olgica Milenkovic, Hsiang-Fu Yu

Unlike most existing XMC frameworks that treat labels and input instances as featureless indicators and independent entries, PINA extracts information from the label metadata and the correlations among training instances.

Extreme Multi-Label Classification MUlTI-LABEL-ClASSIFICATION +1

Unlearning Graph Classifiers with Limited Data Resources

1 code implementation6 Nov 2022 Chao Pan, Eli Chien, Olgica Milenkovic

As the demand for user privacy grows, controlled data removal (machine unlearning) is becoming an important feature of machine learning models for data-sensitive Web applications such as social networks and recommender systems.

Graph Classification Machine Unlearning +1

Certified Graph Unlearning

1 code implementation18 Jun 2022 Eli Chien, Chao Pan, Olgica Milenkovic

For example, when unlearning $20\%$ of the nodes on the Cora dataset, our approach suffers only a $0. 1\%$ loss in test accuracy while offering a $4$-fold speed-up compared to complete retraining.

GPR Machine Unlearning

HyperAid: Denoising in hyperbolic spaces for tree-fitting and hierarchical clustering

1 code implementation19 May 2022 Eli Chien, Puoya Tabaghi, Olgica Milenkovic

Furthermore, it is currently not known how to choose the most suitable approximation objective for noisy fitting.

Clustering Denoising

Provably Accurate and Scalable Linear Classifiers in Hyperbolic Spaces

1 code implementation7 Mar 2022 Chao Pan, Eli Chien, Puoya Tabaghi, Jianhao Peng, Olgica Milenkovic

The excellent performance of the Poincar\'e second-order and strategic perceptrons shows that the proposed framework can be extended to general machine learning problems in hyperbolic spaces.

Time Series Analysis

Node Feature Extraction by Self-Supervised Multi-scale Neighborhood Prediction

4 code implementations ICLR 2022 Eli Chien, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Jiong Zhang, Olgica Milenkovic, Inderjit S Dhillon

We also provide a theoretical analysis that justifies the use of XMC over link prediction and motivates integrating XR-Transformers, a powerful method for solving XMC problems, into the GIANT framework.

Extreme Multi-Label Classification Language Modelling +4

Highly Scalable and Provably Accurate Classification in Poincare Balls

1 code implementation8 Sep 2021 Eli Chien, Chao Pan, Puoya Tabaghi, Olgica Milenkovic

For hierarchical data, the space of choice is a hyperbolic space since it guarantees low-distortion embeddings for tree-like structures.

Classification Time Series Analysis

You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks

1 code implementation ICLR 2022 Eli Chien, Chao Pan, Jianhao Peng, Olgica Milenkovic

We propose AllSet, a new hypergraph neural network paradigm that represents a highly general framework for (hyper)graph neural networks and for the first time implements hypergraph neural network layers as compositions of two multiset functions that can be efficiently learned for each task and each dataset.

Benchmarking Node Classification

Linear Classifiers in Product Space Forms

1 code implementation19 Feb 2021 Puoya Tabaghi, Chao Pan, Eli Chien, Jianhao Peng, Olgica Milenkovic

The results show that classification in low-dimensional product space forms for scRNA-seq data offers, on average, a performance improvement of $\sim15\%$ when compared to that in Euclidean spaces of the same dimension.

Support Estimation with Sampling Artifacts and Errors

no code implementations14 Jun 2020 Eli Chien, Olgica Milenkovic, Angelia Nedich

Here we introduce the first known approach to support estimation in the presence of sampling artifacts and errors where each sample is assumed to arise from a Poisson repeat channel which simultaneously captures repetitions and deletions of samples.

Adaptive Universal Generalized PageRank Graph Neural Network

1 code implementation ICLR 2021 Eli Chien, Jianhao Peng, Pan Li, Olgica Milenkovic

We address these issues by introducing a new Generalized PageRank (GPR) GNN architecture that adaptively learns the GPR weights so as to jointly optimize node feature and topological information extraction, regardless of the extent to which the node labels are homophilic or heterophilic.

GPR Graph Neural Network +2

Active learning in the geometric block model

no code implementations15 Nov 2019 Eli Chien, Antonia Maria Tulino, Jaime Llorca

Galhotra et al. recently proposed a motif-counting algorithm for unsupervised community detection in the geometric block model that is proved to be near-optimal.

Active Learning Community Detection +2

Multi-MotifGAN (MMGAN): Motif-targeted Graph Generation and Prediction

no code implementations8 Nov 2019 Anuththari Gamage, Eli Chien, Jianhao Peng, Olgica Milenkovic

Generative models are successful at retaining pairwise associations in the underlying networks but often fail to capture higher-order connectivity patterns known as network motifs.

Generative Adversarial Network Graph Generation +1

Landing Probabilities of Random Walks for Seed-Set Expansion in Hypergraphs

no code implementations20 Oct 2019 Eli Chien, Pan Li, Olgica Milenkovic

We describe the first known mean-field study of landing probabilities for random walks on hypergraphs.

Cannot find the paper you are looking for? You can Submit a new open access paper.