1 code implementation • 28 Mar 2024 • Ang Lv, Kaiyi Zhang, Yuhan Chen, Yulong Wang, Lifeng Liu, Ji-Rong Wen, Jian Xie, Rui Yan
In this paper, we explore in depth the mechanisms employed by Transformer-based language models in factual recall tasks.
1 code implementation • 12 Jan 2024 • Kaiyi Zhang, Ang Lv, Yuhan Chen, Hansen Ha, Tao Xu, Rui Yan
In this paper, by treating in-context learning (ICL) as a meta-optimization process, we explain why LLMs are sensitive to the order of ICL examples.
1 code implementation • 19 Dec 2023 • Kaiyi Zhang, Yang Chen, Ximing Yang, Weizhong Zhang, Cheng Jin
Based on this process, we introduce SGAS, a model for part editing that employs two strategies: feature disentanglement and feature constraint.
1 code implementation • 13 Nov 2023 • Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan
Recent studies have highlighted a phenomenon in large language models (LLMs) known as "the reversal curse," in which the order of knowledge entities in the training data biases the models' comprehension.
1 code implementation • 7 Mar 2022 • Nathaniel Lahn, Sharath Raghvendra, Kaiyi Zhang
Interestingly, unlike the Sinkhorn algorithm, our method readily provides both a compact transport plan and a solution to an approximate version of the dual formulation of the OT problem, both of which have numerous applications in machine learning.
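The paper's combinatorial method is not reproduced here; for contrast with the Sinkhorn baseline it mentions, the following minimal pure-Python sketch (a hypothetical 2x2 toy instance, not from the paper) shows what a transport plan and the entropic dual potentials look like.

```python
import math

# Toy optimal transport instance: move mass from marginal a to marginal b
# under ground cost C. All values here are illustrative assumptions.
a = [0.5, 0.5]          # source marginal
b = [0.25, 0.75]        # target marginal
C = [[0.0, 1.0],
     [1.0, 0.0]]        # cost of moving mass from source i to target j

eps = 0.05              # entropic regularization strength
K = [[math.exp(-c / eps) for c in row] for row in C]  # Gibbs kernel

u = [1.0, 1.0]
v = [1.0, 1.0]
for _ in range(500):    # Sinkhorn iterations: alternate marginal scalings
    u = [a[i] / sum(K[i][j] * v[j] for j in range(2)) for i in range(2)]
    v = [b[j] / sum(K[i][j] * u[i] for i in range(2)) for j in range(2)]

# Dense transport plan P = diag(u) K diag(v); its rows sum to a, columns to b.
P = [[u[i] * K[i][j] * v[j] for j in range(2)] for i in range(2)]

# Entropic dual potentials recovered from the scaling vectors.
f = [eps * math.log(ui) for ui in u]
g = [eps * math.log(vj) for vj in v]
```

Note the contrast the abstract draws: the Sinkhorn plan `P` above is dense (every entry positive), whereas a compact plan would have support on only a few entries.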
no code implementations • 6 Feb 2022 • Kaiyi Zhang, Ximing Yang, Yuan Wu, Cheng Jin
Moreover, missing patterns in real point clouds are diverse, yet existing methods can only handle fixed ones, which limits their generalization ability.
1 code implementation • 10 Dec 2021 • Kaiyi Zhang, Ximing Yang, Yuan Wu, Cheng Jin
The points generated by AXform do not have the strong 2-manifold constraint, which improves the generation of non-smooth surfaces.