Search Results for author: Jie Peng

Found 26 papers, 11 papers with code

Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching

no code implementations17 Oct 2024 Jie Peng, Zhang Cao, Huaizhi Qu, Zhengyu Zhang, Chang Guo, Yanyong Zhang, Zhichao Cao, Tianlong Chen

To enhance communication efficiency, M2Cache maintains a neuron-level mixed-precision LRU cache in HBM, a larger layer-aware cache in DRAM, and a full model in SSD.

Quantization

Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts

1 code implementation10 Oct 2024 Sukwon Yun, Inyoung Choi, Jie Peng, Yangfan Wu, Jingxuan Bao, Qiyiwen Zhang, Jiayi Xin, Qi Long, Tianlong Chen

The core idea of Flex-MoE is to first address missing modalities using a new missing modality bank that integrates observed modality combinations with the corresponding missing ones.

Glider: Global and Local Instruction-Driven Expert Router

1 code implementation9 Oct 2024 Pingzhi Li, Prateek Yadav, Jaehong Yoon, Jie Peng, Yi-Lin Sung, Mohit Bansal, Tianlong Chen

Our experiments using T5-based models for T0 and FLAN tasks demonstrate that GLIDER achieves substantially improved held-in performance while maintaining strong generalization on held-out tasks.

Accelerating the discovery of low-energy structure configurations: a computational approach that integrates first-principles calculations, Monte Carlo sampling, and Machine Learning

no code implementations8 Oct 2024 Md Rajib Khan Musa, Yichen Qian, Jie Peng, David Cereceda

Finding Minimum Energy Configurations (MECs) is essential in fields such as physics, chemistry, and materials science, as they represent the most stable states of the systems.

Computational Efficiency

AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion

1 code implementation21 Aug 2024 Yunfang Niu, Lingxiang Wu, Dong Yi, Jie Peng, Ning Jiang, Haiying Wu, Jinqiao Wang

Moreover, these methods are limited in the variety of clothing types they can handle, as most datasets focus on people in clean backgrounds and only include generic garments such as tops, pants, and dresses.

Beyond Over-smoothing: Uncovering the Trainability Challenges in Deep Graph Neural Networks

no code implementations7 Aug 2024 Jie Peng, Runlin Lei, Zhewei Wei

In this paper, we systematically analyze the real dominant problem in deep GNNs and identify the issues that these GNNs towards addressing Over-smoothing essentially work on via empirical experiments and theoretical gradient analysis.

Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

1 code implementation25 Jul 2024 Sukwon Yun, Jie Peng, Alexandro E. Trevino, Chanyoung Park, Tianlong Chen

Recent advancements in graph-based approaches for multiplexed immunofluorescence (mIF) images have significantly propelled the field forward, offering deeper insights into patient-level phenotyping.

Graph Neural Network Image Classification

LDP: A Local Diffusion Planner for Efficient Robot Navigation and Collision Avoidance

no code implementations2 Jul 2024 Wenhao Yu, Jie Peng, Huanyu Yang, JunRui Zhang, Yifan Duan, Jianmin Ji, Yanyong Zhang

The complex conditional distribution in local navigation needs training data to include diverse policy in diverse real-world scenarios; (2) Myopic Observation.

Collision Avoidance Robot Navigation

Human-like object concept representations emerge naturally in multimodal large language models

no code implementations1 Jul 2024 Changde Du, Kaicheng Fu, Bincheng Wen, Yi Sun, Jie Peng, Wei Wei, Ying Gao, Shengpei Wang, Chuncheng Zhang, Jinpeng Li, Shuang Qiu, Le Chang, Huiguang He

The conceptualization and categorization of natural objects in the human mind have long intrigued cognitive scientists and neuroscientists, offering crucial insights into human perception and cognition.

Triplet

Multi-label Class Incremental Emotion Decoding with Augmented Emotional Semantics Learning

no code implementations31 May 2024 Kaicheng Fu, Changde Du, Xiaoyu Chen, Jie Peng, Huiguang He

Specifically, we design an augmented emotional relation graph module with label disambiguation to handle the past-missing partial label problem.

class-incremental learning Class Incremental Learning +3

Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion

no code implementations8 May 2024 Yongxue Shan, Jie zhou, Jie Peng, Xin Zhou, Jiaqian Yin, Xiaodong Wang

In the task of Knowledge Graph Completion (KGC), the existing datasets and their inherent subtasks carry a wealth of shared knowledge that can be utilized to enhance the representation of knowledge triplets and overall performance.

Knowledge Graph Completion Multi-Task Learning +2

Mean Aggregator Is More Robust Than Robust Aggregators Under Label Poisoning Attacks

1 code implementation21 Apr 2024 Jie Peng, Weiyu Li, Qing Ling

Robustness to malicious attacks is of paramount importance for distributed learning.

Tuning-Free Accountable Intervention for LLM Deployment -- A Metacognitive Approach

no code implementations8 Mar 2024 Zhen Tan, Jie Peng, Tianlong Chen, Huan Liu

Large Language Models (LLMs) have catalyzed transformative advances across a spectrum of natural language processing tasks through few-shot or zero-shot prompting, bypassing the need for parameter tuning.

Decision Making Hallucination

PathRL: An End-to-End Path Generation Method for Collision Avoidance via Deep Reinforcement Learning

no code implementations20 Oct 2023 Wenhao Yu, Jie Peng, Quecheng Qiu, Hanyu Wang, Lu Zhang, Jianmin Ji

However, two roadblocks arise for training a DRL policy that outputs paths: (1) The action space for potential paths often involves higher dimensions comparing to low-level commands, which increases the difficulties of training; (2) It takes multiple time steps to track a path instead of a single time step, which requires the path to predicate the interactions of the robot w. r. t.

Collision Avoidance Deep Reinforcement Learning +1

Byzantine-Robust Decentralized Stochastic Optimization with Stochastic Gradient Noise-Independent Learning Error

no code implementations10 Aug 2023 Jie Peng, Weiyu Li, Qing Ling

Motivated by this observation, we introduce two variance reduction methods, stochastic average gradient algorithm (SAGA) and loopless stochastic variance-reduced gradient (LSVRG), to Byzantine-robust decentralized stochastic optimization for eliminating the negative effect of the stochastic gradient noise.

Stochastic Optimization

$P^{3}O$: Transferring Visual Representations for Reinforcement Learning via Prompting

no code implementations22 Mar 2023 Guoliang You, Xiaomeng Chu, Yifan Duan, Jie Peng, Jianmin Ji, Yu Zhang, Yanyong Zhang

In particular, we specify a prompt-transformer for representation conversion and propose a two-step training process to train the prompt-transformer for the target environment, while the rest of the DRL pipeline remains unchanged.

Deep Reinforcement Learning reinforcement-learning

Deep Reinforcement Learning for Localizability-Enhanced Navigation in Dynamic Human Environments

no code implementations22 Mar 2023 Yuan Chen, Quecheng Qiu, Xiangyu Liu, Guangda Chen, Shunyi Yao, Jie Peng, Jianmin Ji, Yanyong Zhang

The planner learns to assign different importance to the geometric features and encourages the robot to navigate through areas that are helpful for laser localization.

Deep Reinforcement Learning Navigate +1

TLP: A Deep Learning-based Cost Model for Tensor Program Tuning

1 code implementation7 Nov 2022 Yi Zhai, Yu Zhang, Shuo Liu, Xiaomeng Chu, Jie Peng, Jianmin Ji, Yanyong Zhang

Instead of extracting features from the tensor program itself, TLP extracts features from the schedule primitives.

Multi-Task Learning

Reinforcement Learning for Robot Navigation with Adaptive Forward Simulation Time (AFST) in a Semi-Markov Model

1 code implementation13 Aug 2021 Yu'an Chen, Ruosong Ye, Ziyang Tao, Hongjian Liu, Guangda Chen, Jie Peng, Jun Ma, Yu Zhang, Jianmin Ji, Yanyong Zhang

Deep reinforcement learning (DRL) algorithms have proven effective in robot navigation, especially in unknown environments, by directly mapping perception inputs into robot control commands.

Deep Reinforcement Learning reinforcement-learning +2

One-photon Solutions to Multiqubit Multimode quantum Rabi model

no code implementations22 Feb 2021 Jie Peng, Juncong Zheng, Jing Yu, Pinghua Tang, G. Alvarado Barrios, Jianxin Zhong, Enrique Solano, F. Albarran-Arriagada, Lucas Lamata

General solutions to the quantum Rabi model involve subspaces with unbounded number of photons.

Quantum Physics Optics

Constructing new APN functions through relative trace functions

no code implementations27 Jan 2021 Lijing Zheng, Haibin Kan, Yanjun Li, Jie Peng, Deng Tang

With the help of this characterization, we obtain an infinite family of APN functions for $ n=2m $ with $m$ being an odd positive integer: $ f(x)=a{\rm Tr}^{n}_{m}(bx^3)+a^q{\rm Tr}^{n}_{m}(b^3x^9) $, where $ a\in \mathbb{F}_{2^n}$ such that $ a+a^q\neq 0 $ and $ b $ is a non-cube in $ \mathbb{F}_{2^n} $.

Information Theory Information Theory

Byzantine-Robust Variance-Reduced Federated Learning over Distributed Non-i.i.d. Data

2 code implementations17 Sep 2020 Jie Peng, Zhaoxian Wu, Qing Ling, Tianyi Chen

We prove that the proposed method reaches a neighborhood of the optimal solution at a linear convergence rate and the learning error is determined by the number of Byzantine workers.

Federated Learning

Byzantine-Robust Decentralized Stochastic Optimization over Static and Time-Varying Networks

1 code implementation12 May 2020 Jie Peng, Weiyu Li, Qing Ling

In this paper, we consider the Byzantine-robust stochastic optimization problem defined over decentralized static and time-varying networks, where the agents collaboratively minimize the summation of expectations of stochastic local cost functions, but some of the agents are unreliable due to data corruptions, equipment failures or cyber-attacks.

Stochastic Optimization

Estimating Time-Varying Graphical Models

2 code implementations11 Apr 2018 Jilei Yang, Jie Peng

In this paper, we study time-varying graphical models based on data measured over a temporal grid.

Computational Efficiency

Learning directed acyclic graphs via bootstrap aggregating

no code implementations9 Jun 2014 Ru Wang, Jie Peng

Specifically, an ensemble of DAGs is first learned based on bootstrap resamples of the data and then an aggregated DAG is derived by minimizing the overall distance to the entire ensemble.

Cannot find the paper you are looking for? You can Submit a new open access paper.