1 code implementation • 26 May 2025 • Xiangchen Song, Aashiq Muhamed, Yujia Zheng, Lingjing Kong, Zeyu Tang, Mona T. Diab, Virginia Smith, Kun Zhang
Sparse Autoencoders (SAEs) are a prominent tool in mechanistic interpretability (MI) for decomposing neural network activations into interpretable features.
no code implementations • 21 Mar 2025 • Yujia Zheng, Yang Liu, Jiaxiong Yao, Yingyao Hu, Kun Zhang
Nearly all identifiability results in unsupervised representation learning inspired by, e. g., independent component analysis, factor analysis, and causal representation learning, rely on assumptions of additive independent noise or noiseless regimes.
1 code implementation • 13 Mar 2025 • Jiaqi Sun, Yujia Zheng, Xinshuai Dong, Haoyue Dai, Kun Zhang
For the specific task of noise detection, the discrepancy between the reconstruction results and the input knowledge graph provides an opportunity for denoising, which is facilitated by the type consistency embedded in our method.
1 code implementation • 1 Mar 2025 • Zijian Li, Shunxing Fan, Yujia Zheng, Ignavier Ng, Shaoan Xie, Guangyi Chen, Xinshuai Dong, Ruichu Cai, Kun Zhang
Some approaches rely on sufficient changes on the distribution of latent variables indicated by auxiliary variables such as domain indices, but acquiring enough domains is often challenging.
no code implementations • 6 Feb 2025 • TianHao Li, Tianyu Zeng, Yujia Zheng, Chulong Zhang, Jingyu Lu, Haotian Huang, Chuangxin Chu, Fang-Fang Yin, Zhenyu Yang
Deep learning-based medical image segmentation models, such as U-Net, rely on high-quality annotated datasets to achieve accurate predictions.
no code implementations • 21 Jan 2025 • Minghao Fu, Biwei Huang, Zijian Li, Yujia Zheng, Ignavier Ng, Yingyao Hu, Kun Zhang
The study of learning causal structure with latent variables has advanced the understanding of the world by uncovering causal relationships and latent factors, e. g., Causal Representation Learning (CRL).
no code implementations • CVPR 2025 • Shaoan Xie, Lingjing Lingjing, Yujia Zheng, Yu Yao, Zeyu Tang, Eric P. Xing, Guangyi Chen, Kun Zhang
On the other hand, directly aligning long captions with images can lead to the retention of entangled details, preventing the model from learning disentangled, atomic concepts -- ultimately limiting its generalization on certain downstream tasks involving short prompts.
no code implementations • 10 Nov 2024 • Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P. Xing, Kun Zhang
Theoretically, we consider a nonparametric latent distribution (c. f., parametric assumptions in previous work) that allows for causal relationships across potentially different modalities.
no code implementations • 28 Oct 2024 • Yiwen Qiu, Yujia Zheng, Kun Zhang
When solving long-horizon tasks, it is intriguing to decompose the high-level task into subtasks.
1 code implementation • 20 Oct 2024 • Anpeng Wu, Kun Kuang, Minqin Zhu, Yingrong Wang, Yujia Zheng, Kairong Han, Baohong Li, Guangyi Chen, Fei Wu, Kun Zhang
How to embed causality into the training process of LLMs and build more general and intelligent models remains unexplored.
no code implementations • 2 Oct 2024 • TianHao Li, Jingyu Lu, Chuangxin Chu, Tianyu Zeng, Yujia Zheng, Mei Li, Haotian Huang, Bin Wu, Zuoxian Liu, Kai Ma, Xuejing Yuan, Xingkai Wang, Keyan Ding, Huajun Chen, Qiang Zhang
To address these limitations, we introduce SciSafeEval, a comprehensive benchmark designed to evaluate the safety alignment of LLMs across a range of scientific tasks.
1 code implementation • 5 Sep 2024 • Xiangchen Song, Zijian Li, Guangyi Chen, Yujia Zheng, Yewen Fan, Xinshuai Dong, Kun Zhang
Based on the theoretical result, we introduce a novel framework, Causal Temporal Representation Learning with Nonstationary Sparse Transition (CtrlNS), designed to leverage the constraints on transition sparsity and conditional independence to reliably identify both distribution shifts and latent factors.
no code implementations • NeurIPS 2023 • Ignavier Ng, Yujia Zheng, Xinshuai Dong, Kun Zhang
To accommodate Gaussian sources, we develop an identifiability theory that relies on second-order statistics without imposing further preconditions on the distribution of sources, by introducing novel assumptions on the connective structure from sources to observed variables.
no code implementations • 11 Jul 2024 • Zhiqiang Xie, Yujia Zheng, Lizi Ottens, Kun Zhang, Christos Kozyrakis, Jonathan Mace
We evaluate Atlas across a range of fault localization scenarios and demonstrate that Atlas is capable of generating causal graphs in a scalable and generalizable manner, with performance that far surpasses that of data-driven algorithms and is commensurate to the ground-truth baseline.
no code implementations • 29 Jun 2024 • Yujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Schölkopf, Kun Zhang
Therefore, rather than merely viewing it as a bias, we explore the causal structure of selection in sequential data to delve deeper into the complete causal process.
1 code implementation • 21 Mar 2024 • Haoyue Dai, Ignavier Ng, Yujia Zheng, Zhengqing Gao, Kun Zhang
Local causal discovery is of great practical significance, as there are often situations where the discovery of the global causal structure is unnecessary, and the interest lies solely on a single target variable.
no code implementations • 7 Feb 2024 • Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng
We show that under the sparsity constraint on the recovered graph over the latent variables and suitable sufficient change conditions on the causal influences, interestingly, one can recover the moralized graph of the underlying directed acyclic graph, and the recovered latent variables and their relations are related to the underlying causal model in a specific, nontrivial way.
no code implementations • 18 Dec 2023 • Xinshuai Dong, Biwei Huang, Ignavier Ng, Xiangchen Song, Yujia Zheng, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang
Most existing causal discovery methods rely on the assumption of no latent confounders, limiting their applicability in solving real-life problems.
no code implementations • 5 Dec 2023 • Jacob Doughty, Zipiao Wan, Anishka Bompelli, Jubahed Qayum, Taozhi Wang, Juran Zhang, Yujia Zheng, Aidan Doyle, Pragnya Sridhar, Arav Agarwal, Christopher Bogart, Eric Keylor, Can Kultur, Jaromir Savelka, Majd Sakr
While there is a growing body of research in computing education on utilizing large language models (LLMs) in generation and engagement with coding exercises, the use of LLMs for generating programming MCQs has not been extensively explored.
1 code implementation • 31 Jul 2023 • Yujia Zheng, Biwei Huang, Wei Chen, Joseph Ramsey, Mingming Gong, Ruichu Cai, Shohei Shimizu, Peter Spirtes, Kun Zhang
Causal discovery aims at revealing causal relations from observational data, which is a fundamental task in science and engineering.
no code implementations • 10 Jun 2023 • Lingjing Kong, Shaoan Xie, Weiran Yao, Yujia Zheng, Guangyi Chen, Petar Stojanov, Victor Akinwande, Kun Zhang
In general, without further assumptions, the joint distribution of the features and the label is not identifiable in the target domain.
no code implementations • 28 May 2023 • Mugariya Farooq, Shahad Hardan, Aigerim Zhumbhayeva, Yujia Zheng, Preslav Nakov, Kun Zhang
The need for more usable and explainable machine learning models in healthcare increases the importance of developing and utilizing causal discovery algorithms, which aim to discover causal relations by analyzing observational data.
no code implementations • 19 May 2023 • Yujia Zheng, Ignavier Ng, Yewen Fan, Kun Zhang
A Markov network characterizes the conditional independence structure, or Markov property, among a set of random variables.
1 code implementation • 1 Nov 2022 • Yue Yu, Xuan Kan, Hejie Cui, ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang
To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis.
no code implementations • 19 Oct 2022 • Haitao Mao, Lixin Zou, Yujia Zheng, Jiliang Tang, Xiaokai Chu, Jiashu Zhao, Qian Wang, Dawei Yin
To address the above challenges, we propose a Bias Agnostic whole-page unbiased Learning to rank algorithm, named BAL, to automatically find the user behavior model with causal discovery and mitigate the biases induced by multiple SERP features with no specific design.
no code implementations • 15 Jun 2022 • Yujia Zheng, Ignavier Ng, Kun Zhang
We show that under specific instantiations of such constraints, the independent latent sources can be identified from their nonlinear mixtures up to a permutation and a component-wise transformation, thus achieving nontrivial identifiability of nonlinear ICA without auxiliary variables.
1 code implementation • NeurIPS 2021 • Ignavier Ng, Yujia Zheng, Jiji Zhang, Kun Zhang
Many of the causal discovery methods rely on the faithfulness assumption to guarantee asymptotic correctness.
1 code implementation • 2 Dec 2021 • Haitao Mao, Lun Du, Yujia Zheng, Qiang Fu, Zelin Li, Xu Chen, Shi Han, Dongmei Zhang
To address the non-trivial adaptation challenges in this practical scenario, we propose a model-agnostic algorithm called SOGA for domain adaptation to fully exploit the discriminative ability of the source model while preserving the consistency of structural proximity on the target graph.
no code implementations • 4 Jun 2021 • Tong Chen, Hongzhi Yin, Yujia Zheng, Zi Huang, Yang Wang, Meng Wang
The core idea is to compose elastic embeddings for each item, where an elastic embedding is the concatenation of a set of embedding blocks that are carefully chosen by an automated search function.
no code implementations • 10 Dec 2020 • Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu
As there is generally no side information in the setting of sequential recommendation task, previous cold-start methods could not be applied when only user-item interactions are available.
no code implementations • 13 Nov 2020 • Zekun Li, Yujia Zheng, Shu Wu, XiaoYu Zhang, Liang Wang
In this work, we propose to model user-item interactions as a heterogeneous graph which consists of not only user-item edges indicating their interaction but also user-user edges indicating their similarity.
1 code implementation • 21 Sep 2020 • Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu
These item transitions include potential collaborative information and reflect similar behavior patterns, which we assume may help with the recommendation for the target session.
Ranked #6 on
Session-Based Recommendations
on Diginetica
no code implementations • 24 Jul 2020 • Siyi Liu, Yujia Zheng
Session-based recommendation focuses on the prediction of user actions based on anonymous sessions and is a necessary method in the lack of user historical data.
no code implementations • 29 Oct 2019 • Yujia Zheng, Siyi Liu, Zailei Zhou
For mining the new data without breaking the equilibrium of the model between different interactions, we construct an intra-session graph and an inter-session graph for the current session.