1 code implementation • 20 Mar 2025 • Qiang Zou, Shuli Cheng, Jiayi Chen
Cross-modal hashing is a promising approach for efficient data retrieval and storage optimization.
1 code implementation • 9 Mar 2025 • Yanbiao Ma, Wei Dai, Wenke Huang, Jiayi Chen
Subsequently, we propose GGEUR, which leverages global geometric shapes to guide the generation of new samples, enabling a closer approximation to the ideal global distribution.
no code implementations • 4 Mar 2025 • Wenxuan Song, Jiayi Chen, Pengxiang Ding, Han Zhao, Wei Zhao, Zhide Zhong, ZongYuan Ge, Jun Ma, Haoang Li
The performance of VLA models can be improved by integrating with action chunking, a critical technique for effective control.
no code implementations • 27 Feb 2025 • Kuan Lok Zhou, Jiayi Chen, Siddharth Suresh, Reuben Narad, Timothy T. Rogers, Lalit K Jain, Robert D Nowak, Bob Mankoff, Jifan Zhang
Large Language Models (LLMs) have shown significant limitations in understanding creative content, as demonstrated by Hessel et al. (2023)'s influential work on the New Yorker Cartoon Caption Contest (NYCCC).
no code implementations • 17 Feb 2025 • Yanbiao Ma, Bowei Liu, Wei Dai, Jiayi Chen, Shuo Li
Deep neural networks (DNNs) often exhibit biases toward certain categories during object recognition, even under balanced training data conditions.
no code implementations • 6 Feb 2025 • Yanbiao Ma, Wei Dai, Jiayi Chen
However, models still exhibit category bias even in datasets where instance counts are relatively balanced, clearly indicating that instance count alone cannot explain this phenomenon.
no code implementations • 5 Feb 2025 • Jiayi Chen, Ruifeng Gao, Jue Wang, Shu Sun, Yi Wu
The construction of a channel gain map (CGM) is essential for realizing the environment-aware wireless communications expected in 6G, for which a fundamental problem is how to effectively predict the channel gains at unknown locations from a finite number of measurements.
no code implementations • 11 Jan 2025 • TingWei Chen, Jiayi Chen, Zijian Zhao, Haolong Chen, Liang Zhang, Guangxu Zhu
Large Language Models (LLMs) have garnered significant attention for their impressive general-purpose capabilities.
no code implementations • 17 Dec 2024 • Jinhao Jiang, Jiayi Chen, Junyi Li, Ruiyang Ren, Shijie Wang, Wayne Xin Zhao, Yang Song, Tao Zhang
Existing large language models (LLMs) show exceptional problem-solving capabilities but might struggle with complex reasoning tasks.
1 code implementation • 23 Nov 2024 • Jiayi Chen, Chen Wu, ShaoQun Zhang, Nan Li, Liangjie Zhang, Qi Zhang
Embedding models have become essential tools in both natural language processing and computer vision, enabling efficient semantic search, recommendation, clustering, and more.
1 code implementation • 21 Nov 2024 • Dazhi Huang, Pengcheng Xu, Xiaocheng Huang, Jiayi Chen
Topological Data Analysis (TDA) has recently gained significant attention in the field of financial prediction.
no code implementations • 30 Oct 2024 • Jialiang Zhang, Haoran Liu, Danshi Li, Xinqiang Yu, Haoran Geng, Yufei Ding, Jiayi Chen, He Wang
Grasping in cluttered scenes remains highly challenging for dexterous hands due to the scarcity of data.
1 code implementation • 17 Jul 2024 • Zheni Zeng, Jiayi Chen, Huimin Chen, Yukun Yan, Yuxuan Chen, Zhenghao Liu, Zhiyuan Liu, Maosong Sun
Large language models exhibit aspects of human-level intelligence that catalyze their application as human-like agents in domains such as social simulations, human-machine interactions, and collaborative multi-agent systems.
1 code implementation • 15 Jun 2024 • Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, Robert Nowak
We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years.
1 code implementation • 24 May 2024 • Jiayi Chen, Rong Quan, Jie Qin
Instead of completely relying on support images, we propose Self-Matching Transformation (SMT) to construct query-specific transformation matrices based on query images themselves to transform domain-specific query features into domain-agnostic ones.
no code implementations • 12 May 2024 • Jiayi Chen, Chunhua Deng
With the advancement of video analysis technology, the multi-object tracking (MOT) problem in complex scenes involving pedestrians is gaining increasing importance.
1 code implementation • 12 Feb 2024 • Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim
As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalability of distributed training systems.
no code implementations • 13 Dec 2023 • Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel, Joydeep Biswas, Swarat Chaudhuri, Isil Dillig, Alex Dimakis, P. Brighten Godfrey, Daehyeok Kim, Chris Rossbach, Gang Wang
This paper lays down the research agenda for a domain-specific foundation model for operating systems (OSes).
1 code implementation • CVPR 2024 • Jiayi Chen, Benteng Ma, Hengfei Cui, Yong Xia
Extensive experiments and analysis on five real multi-center medical image datasets demonstrate the superiority of FEAL over the state-of-the-art active learning methods in federated scenarios with domain shifts.
no code implementations • 1 Nov 2023 • Jiayi Chen, Hanjun Dai, Bo Dai, Aidong Zhang, Wei Wei
However, prior works for Few-shot VDER mainly address the problem at the document level with a predefined global entity space, which doesn't account for the entity-level few-shot scenario: target entity types are locally personalized by each task and entity occurrences vary significantly among documents.
1 code implementation • 27 Oct 2023 • Keira Behal, Jiayi Chen, Caleb Fikes, Sophia Xiao
Machine learning models trained on class-imbalanced EHR datasets perform significantly worse in deployment for individuals of the minority classes compared to those from majority classes, which may lead to inequitable healthcare outcomes for minority groups.
no code implementations • 15 Jun 2023 • Lijun Yu, Jin Miao, Xiaoyu Sun, Jiayi Chen, Alexander G. Hauptmann, Hanjun Dai, Wei Wei
Document understanding tasks, in particular, Visually-rich Document Entity Retrieval (VDER), have gained significant attention in recent years thanks to their broad applications in enterprise AI.
1 code implementation • CVPR 2023 • Haoran Geng, Ziming Li, Yiran Geng, Jiayi Chen, Hao Dong, He Wang
Learning a generalizable object manipulation policy is vital for an embodied agent to work in complex real-world scenes.
1 code implementation • CVPR 2023 • Yinzhen Xu, Weikang Wan, Jialiang Zhang, Haoran Liu, Zikang Shan, Hao Shen, Ruicheng Wang, Haoran Geng, Yijia Weng, Jiayi Chen, Tengyu Liu, Li Yi, He Wang
Trained on our synthesized large-scale dexterous grasp dataset, this model enables us to sample diverse and high-quality dexterous grasp poses for the object point cloud. For the second stage, we propose to replace the motion planning used in parallel gripper grasping with a goal-conditioned grasp policy, due to the complexity involved in dexterous grasping execution.
no code implementations • 2 Nov 2022 • Jiayi Chen, Wen Wu, Liye Shi, Yu Ji, Wenxin Hu, Xi Chen, Wei Zheng, Liang He
We evaluate the effectiveness of the proposed model in terms of both accurate and calibrated sequential recommendation.
no code implementations • COLING 2022 • Jiayi Chen, Xiao-Yu Guo, Yuan-Fang Li, Gholamreza Haffari
Answering complex questions that require multi-step multi-type reasoning over raw text is challenging, especially when conducting numerical reasoning.
no code implementations • 6 Oct 2022 • Ruicheng Wang, Jialiang Zhang, Jiayi Chen, Yinzhen Xu, Puhao Li, Tengyu Liu, He Wang
Robotic dexterous grasping is the first step to enable human-like dexterous object manipulation and thus a crucial robotic technology.
no code implementations • 24 Sep 2022 • Jiayi Chen, Mi Yan, Jiazhao Zhang, Yinzhen Xu, Xiaolong Li, Yijia Weng, Li Yi, Shuran Song, He Wang
We propose, for the first time, a point cloud-based hand joint tracking network, HandTrackNet, to estimate the inter-frame hand joint motion.
1 code implementation • 31 May 2022 • Fei Shen, Zhe Wang, Zijun Wang, Xiaode Fu, Jiayi Chen, Xiaoyu Du, Jinhui Tang
Vision-based pattern identification (such as face, fingerprint, iris etc.)
no code implementations • 22 Apr 2022 • Jiayi Chen, Wen Wu, Liye Shi, Yu Ji, Wenxin Hu, Wei Zheng, Liang He
In this work, we focus on the calibrated recommendations for sequential recommendation, which is connected to both fairness and diversity.
no code implementations • 5 Dec 2021 • Jiayi Chen, Wen Wu, Wei Zheng, Liang He
Accurate predictions in session-based recommendations have progressed, but few studies have focused on skewed recommendation lists caused by popularity bias.
1 code implementation • CVPR 2022 • Jiayi Chen, Yingda Yin, Tolga Birdal, Baoquan Chen, Leonidas Guibas, He Wang
Regressing rotations on SO(3) manifold using deep neural networks is an important yet unsolved problem.
no code implementations • 17 May 2021 • Jiayi Chen, Aidong Zhang
To deal with task heterogeneity and promote fast within-task adaptation for each type of task, we propose HetMAML, a task-heterogeneous model-agnostic meta-learning framework that captures both type-specific and globally shared knowledge and balances knowledge customization with generalization.
no code implementations • 20 Nov 2019 • Hao Zhang, Jiayi Chen, Haotian Xue, Quanshi Zhang
This paper proposes a set of criteria to evaluate the objectiveness of explanation methods of neural networks, which is crucial for the development of explainable AI, but it also presents significant challenges.