Search Results for author: Yujia Zheng

Found 34 papers, 11 papers with code

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

1 code implementation26 May 2025 Xiangchen Song, Aashiq Muhamed, Yujia Zheng, Lingjing Kong, Zeyu Tang, Mona T. Diab, Virginia Smith, Kun Zhang

Sparse Autoencoders (SAEs) are a prominent tool in mechanistic interpretability (MI) for decomposing neural network activations into interpretable features.

Position Semantic Similarity +1

Nonparametric Factor Analysis and Beyond

no code implementations21 Mar 2025 Yujia Zheng, Yang Liu, Jiaxiong Yao, Yingyao Hu, Kun Zhang

Nearly all identifiability results in unsupervised representation learning inspired by, e. g., independent component analysis, factor analysis, and causal representation learning, rely on assumptions of additive independent noise or noiseless regimes.

Representation Learning

Type Information-Assisted Self-Supervised Knowledge Graph Denoising

1 code implementation13 Mar 2025 Jiaqi Sun, Yujia Zheng, Xinshuai Dong, Haoyue Dai, Kun Zhang

For the specific task of noise detection, the discrepancy between the reconstruction results and the input knowledge graph provides an opportunity for denoising, which is facilitated by the type consistency embedded in our method.

Denoising Entity Alignment +2

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

1 code implementation1 Mar 2025 Zijian Li, Shunxing Fan, Yujia Zheng, Ignavier Ng, Shaoan Xie, Guangyi Chen, Xinshuai Dong, Ruichu Cai, Kun Zhang

Some approaches rely on sufficient changes on the distribution of latent variables indicated by auxiliary variables such as domain indices, but acquiring enough domains is often challenging.

Representation Learning

Identification of Nonparametric Dynamic Causal Structure and Latent Process in Climate System

no code implementations21 Jan 2025 Minghao Fu, Biwei Huang, Zijian Li, Yujia Zheng, Ignavier Ng, Yingyao Hu, Kun Zhang

The study of learning causal structure with latent variables has advanced the understanding of the world by uncovering causal relationships and latent factors, e. g., Causal Representation Learning (CRL).

Causal Discovery Representation Learning

SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

no code implementations CVPR 2025 Shaoan Xie, Lingjing Lingjing, Yujia Zheng, Yu Yao, Zeyu Tang, Eric P. Xing, Guangyi Chen, Kun Zhang

On the other hand, directly aligning long captions with images can lead to the retention of entangled details, preventing the model from learning disentangled, atomic concepts -- ultimately limiting its generalization on certain downstream tasks involving short prompts.

Contrastive Learning

Causal Representation Learning from Multimodal Biomedical Observations

no code implementations10 Nov 2024 Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P. Xing, Kun Zhang

Theoretically, we consider a nonparametric latent distribution (c. f., parametric assumptions in previous work) that allows for causal relationships across potentially different modalities.

Representation Learning

Identifying Selections for Unsupervised Subtask Discovery

no code implementations28 Oct 2024 Yiwen Qiu, Yujia Zheng, Kun Zhang

When solving long-horizon tasks, it is intriguing to decompose the high-level task into subtasks.

Imitation Learning

Causality for Large Language Models

1 code implementation20 Oct 2024 Anpeng Wu, Kun Kuang, Minqin Zhu, Yingrong Wang, Yujia Zheng, Kairong Han, Baohong Li, Guangyi Chen, Fei Wu, Kun Zhang

How to embed causality into the training process of LLMs and build more general and intelligent models remains unexplored.

Prompt Engineering

SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks

no code implementations2 Oct 2024 TianHao Li, Jingyu Lu, Chuangxin Chu, Tianyu Zeng, Yujia Zheng, Mei Li, Haotian Huang, Bin Wu, Zuoxian Liu, Kai Ma, Xuejing Yuan, Xingkai Wang, Keyan Ding, Huajun Chen, Qiang Zhang

To address these limitations, we introduce SciSafeEval, a comprehensive benchmark designed to evaluate the safety alignment of LLMs across a range of scientific tasks.

Safety Alignment

Causal Temporal Representation Learning with Nonstationary Sparse Transition

1 code implementation5 Sep 2024 Xiangchen Song, Zijian Li, Guangyi Chen, Yujia Zheng, Yewen Fan, Xinshuai Dong, Kun Zhang

Based on the theoretical result, we introduce a novel framework, Causal Temporal Representation Learning with Nonstationary Sparse Transition (CtrlNS), designed to leverage the constraints on transition sparsity and conditional independence to reliably identify both distribution shifts and latent factors.

Representation Learning Temporal Sequences

On the Identifiability of Sparse ICA without Assuming Non-Gaussianity

no code implementations NeurIPS 2023 Ignavier Ng, Yujia Zheng, Xinshuai Dong, Kun Zhang

To accommodate Gaussian sources, we develop an identifiability theory that relies on second-order statistics without imposing further preconditions on the distribution of sources, by introducing novel assumptions on the connective structure from sources to observed variables.

Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight

no code implementations11 Jul 2024 Zhiqiang Xie, Yujia Zheng, Lizi Ottens, Kun Zhang, Christos Kozyrakis, Jonathan Mace

We evaluate Atlas across a range of fault localization scenarios and demonstrate that Atlas is capable of generating causal graphs in a scalable and generalizable manner, with performance that far surpasses that of data-driven algorithms and is commensurate to the ground-truth baseline.

Causal Discovery Fault localization

Detecting and Identifying Selection Structure in Sequential Data

no code implementations29 Jun 2024 Yujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Schölkopf, Kun Zhang

Therefore, rather than merely viewing it as a bias, we explore the causal structure of selection in sequential data to delve deeper into the complete causal process.

Inductive Bias

Local Causal Discovery with Linear non-Gaussian Cyclic Models

1 code implementation21 Mar 2024 Haoyue Dai, Ignavier Ng, Yujia Zheng, Zhengqing Gao, Kun Zhang

Local causal discovery is of great practical significance, as there are often situations where the discovery of the global causal structure is unnecessary, and the interest lies solely on a single target variable.

Causal Discovery

Causal Representation Learning from Multiple Distributions: A General Setting

no code implementations7 Feb 2024 Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng

We show that under the sparsity constraint on the recovered graph over the latent variables and suitable sufficient change conditions on the causal influences, interestingly, one can recover the moralized graph of the underlying directed acyclic graph, and the recovered latent variables and their relations are related to the underlying causal model in a specific, nontrivial way.

Representation Learning

A Versatile Causal Discovery Framework to Allow Causally-Related Hidden Variables

no code implementations18 Dec 2023 Xinshuai Dong, Biwei Huang, Ignavier Ng, Xiangchen Song, Yujia Zheng, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang

Most existing causal discovery methods rely on the assumption of no latent confounders, limiting their applicability in solving real-life problems.

Causal Discovery

A Comparative Study of AI-Generated (GPT-4) and Human-crafted MCQs in Programming Education

no code implementations5 Dec 2023 Jacob Doughty, Zipiao Wan, Anishka Bompelli, Jubahed Qayum, Taozhi Wang, Juran Zhang, Yujia Zheng, Aidan Doyle, Pragnya Sridhar, Arav Agarwal, Christopher Bogart, Eric Keylor, Can Kultur, Jaromir Savelka, Majd Sakr

While there is a growing body of research in computing education on utilizing large language models (LLMs) in generation and engagement with coding exercises, the use of LLMs for generating programming MCQs has not been extensively explored.

Multiple-choice

Causal-learn: Causal Discovery in Python

1 code implementation31 Jul 2023 Yujia Zheng, Biwei Huang, Wei Chen, Joseph Ramsey, Mingming Gong, Ruichu Cai, Shohei Shimizu, Peter Spirtes, Kun Zhang

Causal discovery aims at revealing causal relations from observational data, which is a fundamental task in science and engineering.

Causal Discovery

Partial Identifiability for Domain Adaptation

no code implementations10 Jun 2023 Lingjing Kong, Shaoan Xie, Weiran Yao, Yujia Zheng, Guangyi Chen, Petar Stojanov, Victor Akinwande, Kun Zhang

In general, without further assumptions, the joint distribution of the features and the label is not identifiable in the target domain.

Unsupervised Domain Adaptation

Understanding Breast Cancer Survival: Using Causality and Language Models on Multi-omics Data

no code implementations28 May 2023 Mugariya Farooq, Shahad Hardan, Aigerim Zhumbhayeva, Yujia Zheng, Preslav Nakov, Kun Zhang

The need for more usable and explainable machine learning models in healthcare increases the importance of developing and utilizing causal discovery algorithms, which aim to discover causal relations by analyzing observational data.

Causal Discovery Prognosis

Generalized Precision Matrix for Scalable Estimation of Nonparametric Markov Networks

no code implementations19 May 2023 Yujia Zheng, Ignavier Ng, Yewen Fan, Kun Zhang

A Markov network characterizes the conditional independence structure, or Markov property, among a set of random variables.

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks

1 code implementation1 Nov 2022 Yue Yu, Xuan Kan, Hejie Cui, ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang

To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis.

Time Series Time Series Analysis

Whole Page Unbiased Learning to Rank

no code implementations19 Oct 2022 Haitao Mao, Lixin Zou, Yujia Zheng, Jiliang Tang, Xiaokai Chu, Jiashu Zhao, Qian Wang, Dawei Yin

To address the above challenges, we propose a Bias Agnostic whole-page unbiased Learning to rank algorithm, named BAL, to automatically find the user behavior model with causal discovery and mitigate the biases induced by multiple SERP features with no specific design.

Causal Discovery Information Retrieval +2

On the Identifiability of Nonlinear ICA: Sparsity and Beyond

no code implementations15 Jun 2022 Yujia Zheng, Ignavier Ng, Kun Zhang

We show that under specific instantiations of such constraints, the independent latent sources can be identified from their nonlinear mixtures up to a permutation and a component-wise transformation, thus achieving nontrivial identifiability of nonlinear ICA without auxiliary variables.

Inductive Bias

Reliable Causal Discovery with Improved Exact Search and Weaker Assumptions

1 code implementation NeurIPS 2021 Ignavier Ng, Yujia Zheng, Jiji Zhang, Kun Zhang

Many of the causal discovery methods rely on the faithfulness assumption to guarantee asymptotic correctness.

Causal Discovery

Source Free Unsupervised Graph Domain Adaptation

1 code implementation2 Dec 2021 Haitao Mao, Lun Du, Yujia Zheng, Qiang Fu, Zelin Li, Xu Chen, Shi Han, Dongmei Zhang

To address the non-trivial adaptation challenges in this practical scenario, we propose a model-agnostic algorithm called SOGA for domain adaptation to fully exploit the discriminative ability of the source model while preserving the consistency of structural proximity on the target graph.

Domain Adaptation GRAPH DOMAIN ADAPTATION +1

Learning Elastic Embeddings for Customizing On-Device Recommenders

no code implementations4 Jun 2021 Tong Chen, Hongzhi Yin, Yujia Zheng, Zi Huang, Yang Wang, Meng Wang

The core idea is to compose elastic embeddings for each item, where an elastic embedding is the concatenation of a set of embedding blocks that are carefully chosen by an automated search function.

Recommendation Systems

Cold-start Sequential Recommendation via Meta Learner

no code implementations10 Dec 2020 Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu

As there is generally no side information in the setting of sequential recommendation task, previous cold-start methods could not be applied when only user-item interactions are available.

Meta-Learning Sequential Recommendation

Heterogeneous Graph Collaborative Filtering

no code implementations13 Nov 2020 Zekun Li, Yujia Zheng, Shu Wu, XiaoYu Zhang, Liang Wang

In this work, we propose to model user-item interactions as a heterogeneous graph which consists of not only user-item edges indicating their interaction but also user-user edges indicating their similarity.

Collaborative Filtering

DGTN: Dual-channel Graph Transition Network for Session-based Recommendation

1 code implementation21 Sep 2020 Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu

These item transitions include potential collaborative information and reflect similar behavior patterns, which we assume may help with the recommendation for the target session.

Session-Based Recommendations

Long-tail Session-based Recommendation

no code implementations24 Jul 2020 Siyi Liu, Yujia Zheng

Session-based recommendation focuses on the prediction of user actions based on anonymous sessions and is a necessary method in the lack of user historical data.

Diversity Session-Based Recommendations

Balancing Multi-level Interactions for Session-based Recommendation

no code implementations29 Oct 2019 Yujia Zheng, Siyi Liu, Zailei Zhou

For mining the new data without breaking the equilibrium of the model between different interactions, we construct an intra-session graph and an inter-session graph for the current session.

Session-Based Recommendations

Cannot find the paper you are looking for? You can Submit a new open access paper.