2 code implementations • 26 May 2025 • Bingguang Hao, Maolin Wang, Zengzhuang Xu, Cunyin Peng, Yicheng Chen, Xiangyu Zhao, Jinjie Gu, Chenyi Zhuang
FunReason provides a comprehensive solution for enhancing LLMs' function calling capabilities by introducing a balanced training methodology and a data refinement pipeline.
no code implementations • 6 May 2025 • Chang Xie, Chenyi Zhuang, Pan Gao
In this work, we highlight two factors that affect this alignment: the quality of the randomly initialized noise and the reliability of the generated controlling mask.
no code implementations • 7 Apr 2025 • Zhi Zuo, Chenyi Zhuang, Pan Gao, Jie Qin, Hao Feng, Nicu Sebe
Self-supervised representation learning for point cloud videos remains a challenging problem with two key limitations: (1) existing methods rely on explicit knowledge to learn motion, resulting in suboptimal representations; (2) prior Masked AutoEncoder (MAE) frameworks struggle to bridge the gap between low-level geometry and high-level dynamics in 4D data.
no code implementations • 11 Mar 2025 • Jikai Chen, Leilei Gan, Ziyu Zhao, Zechuan Wang, Dong Wang, Chenyi Zhuang
Existing refinement methods in LLM-based Text-to-SQL systems exhibit limited effectiveness.
no code implementations • 16 Dec 2024 • Hongxuan Zhang, Yao Zhao, Jiaqi Zheng, Chenyi Zhuang, Jinjie Gu, Guihai Chen
The emergence of long-context text applications utilizing large language models (LLMs) has presented significant scalability challenges, particularly in memory footprint.
no code implementations • 30 Oct 2024 • Yanchu Guan, Dong Wang, Yan Wang, Haiqing Wang, Renen Sun, Chenyi Zhuang, Jinjie Gu, Zhixuan Chu
In this paper, we propose an Explainable Behavior Cloning LLM Agent (EBC-LLMAgent), a novel approach that combines large language models (LLMs) with behavior cloning by learning demonstrations to create intelligent and explainable agents for autonomous mobile app interaction.
1 code implementation • 19 Oct 2024 • Ying Hu, Chenyi Zhuang, Pan Gao
Style transfer aims to fuse the artistic representation of a style image with the structural information of a content image.
1 code implementation • 30 Sep 2024 • Chenyi Zhuang, Ying Hu, Pan Gao
In this work, we critically examine the limitations of the CLIP text encoder in understanding attributes and investigate how this affects diffusion models.
no code implementations • 29 May 2024 • Yao Zhao, Zhining Liu, Tianchi Cai, Haipeng Zhang, Chenyi Zhuang, Jinjie Gu
Using both synthetic and industrial datasets, we first show how this widely coexisted ranking bias deteriorates the performance of the existing position bias estimation methods.
1 code implementation • 13 May 2024 • Qingguo Liu, Chenyi Zhuang, Pan Gao, Jie Qin
Existing Blind image Super-Resolution (BSR) methods focus on estimating either kernel or degradation information, but have long overlooked the essential content details.
1 code implementation • 15 Feb 2024 • Pengyang Shao, Yonghui Yang, Chen Gao, Lei Chen, Kun Zhang, Chenyi Zhuang, Le Wu, Yong Li, Meng Wang
Specifically, to explore heterogeneity, we propose a semantic-aware graph neural networks based CD model.
no code implementations • 1 Feb 2024 • Sheng Zhang, Maolin Wang, Yao Zhao, Chenyi Zhuang, Jinjie Gu, Ruocheng Guo, Xiangyu Zhao, Zijian Zhang, Hongzhi Yin
Our principal contribution is the development of a Data-aware Neural Architecture Search for Recommender System (DNS-Rec).
no code implementations • 31 Jan 2024 • Zhitian Xie, Yinger Zhang, Chenyi Zhuang, Qitao Shi, Zhining Liu, Jinjie Gu, Guannan Zhang
However, the gate's routing mechanism also gives rise to narrow vision: the individual MoE's expert fails to use more samples in learning the allocated sub-task, which in turn limits the MoE to further improve its generalization ability.
no code implementations • 7 Jan 2024 • Chengyue Yu, Lei Zang, Jiaotuan Wang, Chenyi Zhuang, Jinjie Gu
A video demonstration of CharPoet is available at https://youtu. be/voZ25qEp3Dc.
1 code implementation • CVPR 2024 • Qingguo Liu, Chenyi Zhuang, Pan Gao, Jie Qin
Existing Blind image Super-Resolution (BSR) methods focus on estimating either kernel or degradation information but have long overlooked the essential content details.
1 code implementation • 20 Dec 2023 • Yao Zhao, Zhitian Xie, Chen Liang, Chenyi Zhuang, Jinjie Gu
Instead of generating a single token at a time, we propose a Trie-based retrieval and verification mechanism to be able to accept several tokens at a forward step.
no code implementations • 15 Dec 2023 • Xingyu Lu, Zhining Liu, Yanchu Guan, Hongxuan Zhang, Chenyi Zhuang, Wenqi Ma, Yize Tan, Jinjie Gu, Guannan Zhang
of a cascade RS, when a user triggers a request, we define two actions that determine the computation: (1) the trained instances of models with different computational complexity; and (2) the number of items to be inferred in the stage.
no code implementations • 10 Dec 2023 • Maolin Wang, Yao Zhao, Jiajia Liu, Jingdong Chen, Chenyi Zhuang, Jinjie Gu, Ruocheng Guo, Xiangyu Zhao
In our research, we constructed a dataset, the Multimodal Advertisement Audition Dataset (MAAD), from real-world scenarios within Alipay, and conducted experiments to validate the reliability of our proposed strategy.
no code implementations • 4 Dec 2023 • Yanchu Guan, Dong Wang, Zhixuan Chu, Shiyu Wang, Feiyue Ni, Ruihua Song, Longfei Li, Jinjie Gu, Chenyi Zhuang
This paper proposes a novel LLM-based virtual assistant that can automatically perform multi-step operations within mobile apps based on high-level user requests.
1 code implementation • 14 Nov 2023 • Hongxuan Zhang, Zhining Liu, Yao Zhao, Jiaqi Zheng, Chenyi Zhuang, Jinjie Gu, Guihai Chen
In this work, we propose FastCoT, a model-agnostic framework based on parallel decoding without any further training of an auxiliary model or modification to the LLM itself.
1 code implementation • 30 Jul 2023 • Chenyi Zhuang, Pan Gao, Aljosa Smolic
We then prove that StylePrompter lies in a more disentangled $\mathcal{W^+}$ and show the controllability of SMART.
no code implementations • 5 Jun 2023 • Maolin Wang, Yaoming Zhen, Yu Pan, Yao Zhao, Chenyi Zhuang, Zenglin Xu, Ruocheng Guo, Xiangyu Zhao
THNN is a faithful hypergraph modeling framework through high-order outer product feature message passing and is a natural tensor extension of the adjacency-matrix-based graph neural networks.
no code implementations • IEEE Access 2019 • Zhining Liu, Weiyi Liu, Pin-Yu Chen, Chenyi Zhuang, Chengyun Song
Graph neural networks (GNNs) have recently made remarkable breakthroughs in the paradigm of learning with graph-structured data.
Ranked #37 on
Node Classification
on Citeseer