Search Results for author: Cheng Qian

Found 34 papers, 13 papers with code

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity

1 code implementation17 Jun 2024 Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Huan-ang Gao, Huimin Chen, Zhiyuan Liu, Maosong Sun

For the first time, we show that zero-shot generalization during instruction tuning is a form of similarity-based generalization between training and test data at the instance level.

Continual Learning Zero-shot Generalization

LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves

no code implementations8 Mar 2024 Jiayan Cao, Xueyu Zhu, Cheng Qian

from object detection and segmentation tasks, while these approaches require manual adjustments for curved objects, involve exhaustive searches on predefined anchors, require complex post-processing steps, and may lack flexibility when applied to real-world scenarios. In this paper, we propose a novel approach, LanePtrNet, which treats lane detection as a process of point voting and grouping on ordered sets: Our method takes backbone features as input and predicts a curve-aware centerness, which represents each lane as a point and assigns the most probable center point to it.

3D Lane Detection Autonomous Driving +2

Reconstruction-Based Anomaly Localization via Knowledge-Informed Self-Training

no code implementations22 Feb 2024 Cheng Qian, Xiaoxian Lao, Chunguang Li

Most existing reconstruction-based methods only use normal samples to construct model.

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

1 code implementation14 Feb 2024 Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

Current language model-driven agents often lack mechanisms for effective user participation, which is crucial given the vagueness commonly found in user instructions.

Language Modelling

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution

no code implementations25 Jan 2024 Cheng Qian, Shihao Liang, Yujia Qin, Yining Ye, Xin Cong, Yankai Lin, Yesai Wu, Zhiyuan Liu, Maosong Sun

This paper introduces Investigate-Consolidate-Exploit (ICE), a novel strategy for enhancing the adaptability and flexibility of AI agents through inter-task self-evolution.

Triple Simplex Matrix Completion for Expense Forecasting

no code implementations23 Oct 2023 Cheng Qian, Lucas Glass, Nikos Sidiropoulos

Forecasting project expenses is a crucial step for businesses to avoid budget overruns and project failures.

Matrix Completion Time Series +1

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model

1 code implementation8 Oct 2023 Cheng Qian, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu

We first validate the efficacy of Toolink in harnessing the model's creativity and CoS ability on ChatGPT.

valid

"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs

1 code implementation15 Sep 2023 Cheng Qian, Xinran Zhao, Sherry Tongshuang Wu

Large language models (LLMs) acquire extensive knowledge during pre-training, known as their parametric knowledge.

Hallucination Knowledge Graphs

The 2nd Place Solution for 2023 Waymo Open Sim Agents Challenge

no code implementations28 Jun 2023 Cheng Qian, Di Xiu, Minghao Tian

In this technical report, we present the 2nd place solution of 2023 Waymo Open Sim Agents Challenge (WOSAC)[4].

Motion Forecasting

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models

2 code implementations23 May 2023 Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji

Additionally, we introduce the Creation Challenge dataset, featuring 2K diverse questions, to emphasize the necessity and benefits of LLMs' tool creation ability.

2k Math +1

Recyclable Tuning for Continual Pre-training

1 code implementation15 May 2023 Yujia Qin, Cheng Qian, Xu Han, Yankai Lin, Huadong Wang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

In pilot studies, we find that after continual pre-training, the upgraded PLM remains compatible with the outdated adapted weights to some extent.

Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants

no code implementations16 Jan 2023 Cheng Qian, Haode Qi, Gengyu Wang, Ladislav Kunc, Saloni Potdar

Out of Scope (OOS) detection in Conversational AI solutions enables a chatbot to handle a conversation gracefully when it is unable to make sense of the end-user query.

Chatbot

Exploring Mode Connectivity for Pre-trained Language Models

1 code implementation25 Oct 2022 Yujia Qin, Cheng Qian, Jing Yi, Weize Chen, Yankai Lin, Xu Han, Zhiyuan Liu, Maosong Sun, Jie zhou

(3) How does the PLM's task knowledge change along the path connecting two minima?

GOCPT: Generalized Online Canonical Polyadic Tensor Factorization and Completion

1 code implementation8 May 2022 Chaoqi Yang, Cheng Qian, Jimeng Sun

Our variant GOCPTE shows up to 1:2% and 5:5% fitness improvement on two datasets with about 20% speedup compared to the best model.

ATD: Augmenting CP Tensor Decomposition by Self Supervision

1 code implementation15 Jun 2021 Chaoqi Yang, Cheng Qian, Navjot Singh, Cao Xiao, M Brandon Westover, Edgar Solomonik, Jimeng Sun

This paper addresses the above challenges by proposing augmented tensor decomposition (ATD), which effectively incorporates data augmentations and self-supervised learning (SSL) to boost downstream classification.

Data Augmentation Dimensionality Reduction +3

MTC: Multiresolution Tensor Completion from Partial and Coarse Observations

1 code implementation14 Jun 2021 Chaoqi Yang, Navjot Singh, Cao Xiao, Cheng Qian, Edgar Solomonik, Jimeng Sun

Our MTC model explores tensor mode properties and leverages the hierarchy of resolutions to recursively initialize an optimization setup, and optimizes on the coupled system using alternating least squares.

Condition Integration Memory Network: An Interpretation of the Meaning of the Neuronal Design

no code implementations21 May 2021 Cheng Qian

When a neuron's activation represents some symbolic element in the environment, each of its synapses can indicate a potential change to the element and its future state.

Multi-version Tensor Completion for Time-delayed Spatio-temporal Data

no code implementations11 May 2021 Cheng Qian, Nikos Kargas, Cao Xiao, Lucas Glass, Nicholas Sidiropoulos, Jimeng Sun

Recovering such missing or noisy (under-reported) elements of the input tensor can be viewed as a generalized tensor completion problem.

Missing Elements

STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological Regularization

no code implementations8 Dec 2020 Nikos Kargas, Cheng Qian, Nicholas D. Sidiropoulos, Cao Xiao, Lucas M. Glass, Jimeng Sun

Accurate prediction of the transmission of epidemic diseases such as COVID-19 is crucial for implementing effective mitigation measures.

Attribute

Learning Barrier Functions with Memory for Robust Safe Navigation

no code implementations3 Nov 2020 Kehan Long, Cheng Qian, Jorge Cortés, Nikolay Atanasov

Control barrier functions are widely used to enforce safety properties in robot motion planning and control.

Motion Planning Robotics

SWIFT: Scalable Wasserstein Factorization for Sparse Nonnegative Tensors

no code implementations8 Oct 2020 Ardavan Afshar, Kejing Yin, Sherry Yan, Cheng Qian, Joyce C. Ho, Haesun Park, Jimeng Sun

In particular, we define the N-th order tensor Wasserstein loss for the widely used tensor CP factorization and derive the optimization algorithm that minimizes it.

Computational Efficiency

On the Compression of Translation Operator Tensors in FMM-FFT-Accelerated SIE Simulators via Tensor Decompositions

no code implementations25 Sep 2020 Cheng Qian, Abdulkadir C. Yucel

Tensor decomposition methodologies are proposed to reduce the memory requirement of translation operator tensors arising in the fast multipole method-fast Fourier transform (FMM-FFT)-accelerated surface integral equation (SIE) simulators.

Tensor Decomposition Translation

Model-aided Deep Neural Network for Source Number Detection

no code implementations29 Sep 2019 Yuwen Yang, Feifei Gao, Cheng Qian, Guisheng Liao

Specifically, we first propose the eigenvalue based regression network (ERNet) and classification network (ECNet) to estimate the number of non-coherent sources, where the eigenvalues of the received signal covariance matrix and the source number are used as the input and the supervise label of the networks, respectively.

REP: Predicting the Time-Course of Drug Sensitivity

no code implementations27 Jul 2019 Cheng Qian, Amin Emad, Nicholas D. Sidiropoulos

Time-course gene expression data is a rich source of information that can be used to unravel these complex processes, identify biomarkers of drug sensitivity and predict the response to a drug.

Drug Response Prediction

High-dimensional Gaussian graphical model for network-linked data

1 code implementation4 Jul 2019 Tianxi Li, Cheng Qian, Elizaveta Levina, Ji Zhu

Graphical models are commonly used to represent conditional dependence relationships between variables.

Vocal Bursts Intensity Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.