Search Results for author: Zekun Li

Found 41 papers, 19 papers with code

Can Editing LLMs Inject Harm?

1 code implementation29 Jul 2024 Canyu Chen, Baixiang Huang, Zekun Li, Zhaorun Chen, Shiyang Lai, Xiongxiao Xu, Jia-Chen Gu, Jindong Gu, Huaxiu Yao, Chaowei Xiao, Xifeng Yan, William Yang Wang, Philip Torr, Dawn Song, Kai Shu

Then, we find that editing attacks can inject both types of misinformation into LLMs, and the effectiveness is particularly high for commonsense misinformation injection.

Fairness General Knowledge +4

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

1 code implementation6 Jul 2024 Zekun Li, Xianjun Yang, Kyuri Choi, Wanrong Zhu, Ryan Hsieh, HyeonJung Kim, Jin Hyuk Lim, Sungyoung Ji, Byungju Lee, Xifeng Yan, Linda Ruth Petzold, Stephen D. Wilson, Woosang Lim, William Yang Wang

The rapid advancement of Large Language Models (LLMs) and Large Multimodal Models (LMMs) has heightened the demand for AI-based scientific assistants capable of understanding scientific articles and figures.

visual instruction following

MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

no code implementations13 Jun 2024 Xuannan Liu, Zekun Li, Peipei Li, Shuhan Xia, Xing Cui, Linzhi Huang, Huaibo Huang, Weihong Deng, Zhaofeng He

Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist.

Misinformation

Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner

no code implementations1 Jun 2024 Xing Cui, Peipei Li, Zekun Li, Xuannan Liu, Yueying Zou, Zhaofeng He

Specifically, semantic guidance is derived by establishing a semantic editing direction based on reasoned intentions, while quality guidance is achieved through classifier guidance using an image fidelity discriminator.

FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

no code implementations4 Mar 2024 Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

The massive generation of multimodal fake news involving both text and images exhibits substantial distribution discrepancies, prompting the need for generalized detectors.

Fake News Detection Image Manipulation +2

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

1 code implementation16 Feb 2024 Zekun Li, Zhiyu Zoey Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook

We also show that by fine-tuning on a small collection of diverse task-oriented dialogues, we can equip modestly sized models, specifically a 13B parameter LLaMA2-Chat model, with function-calling capabilities and DST performance comparable to ChatGPT while maintaining their chat capabilities.

Avg Dialogue State Tracking +1

MANUS: Markerless Grasp Capture using Articulated 3D Gaussians

no code implementations CVPR 2024 Chandradeep Pokhariya, Ishaan N Shah, Angela Xing, Zekun Li, Kefan Chen, Avinash Sharma, Srinath Sridhar

Since our representation uses Gaussian primitives, it enables us to efficiently and accurately estimate contacts between the hand and the object.

Mixed Reality Object

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

1 code implementation25 Nov 2023 Xing Cui, Zekun Li, Pei Pei Li, Huaibo Huang, Xuannan Liu, Zhaofeng He

We employ DDIM inversion to extract this noise from the reference image and leverage a diffusion model to generate new stylized images from the "style" noise.

Text-to-Image Generation

GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding

1 code implementation23 Oct 2023 Zekun Li, Wenxuan Zhou, Yao-Yi Chiang, Muhao Chen

This paper introduces GeoLM, a geospatially grounded language model that enhances the understanding of geo-entities in natural language.

Contrastive Learning Entity Typing +4

AlpaCare:Instruction-tuned Large Language Models for Medical Application

1 code implementation23 Oct 2023 Xinlu Zhang, Chenxin Tian, Xianjun Yang, Lichang Chen, Zekun Li, Linda Ruth Petzold

Instruction-finetuning (IFT) has become crucial in aligning Large Language Models (LLMs) with diverse human needs and has shown great potential in medical applications.

Diversity Instruction Following

Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

no code implementations16 Sep 2023 Zhiyuan Hu, Yue Feng, Yang Deng, Zekun Li, See-Kiong Ng, Anh Tuan Luu, Bryan Hooi

Recently, the development of large language models (LLMs) has been significantly enhanced the question answering and dialogue generation, and makes them become increasingly popular in current practical scenarios.

Dialogue Generation Language Modelling +3

Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection

2 code implementations17 Aug 2023 Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan

In this work, we establish a benchmark to evaluate the robustness of instruction-following LLMs against prompt injection attacks.

Instruction Following

The mapKurator System: A Complete Pipeline for Extracting and Linking Text from Historical Maps

no code implementations29 Jun 2023 Jina Kim, Zekun Li, Yijun Lin, Min Namgung, Leeje Jang, Yao-Yi Chiang

mapKurator empowers automated extraction, post-processing, and linkage of text labels from large numbers of large-dimension historical map scans.

Zero-Shot Learning

Learning Anchor Transformations for 3D Garment Animation

no code implementations CVPR 2023 Fang Zhao, Zekun Li, Shaoli Huang, Junwu Weng, Tianfei Zhou, Guo-Sen Xie, Jue Wang, Ying Shan

Once the anchor transformations are found, per-vertex nonlinear displacements of the garment template can be regressed in a canonical space, which reduces the complexity of deformation space learning.

Position

CHATEDIT: Towards Multi-turn Interactive Facial Image Editing via Dialogue

no code implementations20 Mar 2023 Xing Cui, Zekun Li, Peipei Li, Yibo Hu, Hailin Shi, Zhaofeng He

This paper explores interactive facial image editing via dialogue and introduces the ChatEdit benchmark dataset for evaluating image editing and conversation abilities in this context.

Attribute Facial Editing +1

Time Series as Images: Vision Transformer for Irregularly Sampled Time Series

1 code implementation NeurIPS 2023 Zekun Li, Shiyang Li, Xifeng Yan

This paper introduces a novel perspective by converting irregularly sampled time series into line graph images, then utilizing powerful pre-trained vision transformers for time series classification in the same way as image classification.

Image Classification Time Series +1

Guiding Large Language Models via Directional Stimulus Prompting

1 code implementation NeurIPS 2023 Zekun Li, Baolin Peng, Pengcheng He, Michel Galley, Jianfeng Gao, Xifeng Yan

Our experiments demonstrate that the framework consistently improves LLMs' (e. g., ChatGPT, Codex, InstructGPT) performance on these supervised tasks using minimal labeled data.

Response Generation

SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation

no code implementations21 Oct 2022 Zekun Li, Jina Kim, Yao-Yi Chiang, Muhao Chen

Characterizing geo-entities is integral to various application domains, such as geo-intelligence and map comprehension, while a key challenge is to capture the spatial-varying context of an entity.

Entity Linking Entity Typing +2

Explanations from Large Language Models Make Small Reasoners Better

no code implementations13 Oct 2022 Shiyang Li, Jianshu Chen, Yelong Shen, Zhiyu Chen, Xinlu Zhang, Zekun Li, Hong Wang, Jing Qian, Baolin Peng, Yi Mao, Wenhu Chen, Xifeng Yan

Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations.

Explanation Generation In-Context Learning +1

Controllable Dialogue Simulation with In-Context Learning

1 code implementation9 Oct 2022 Zekun Li, Wenhu Chen, Shiyang Li, Hong Wang, Jing Qian, Xifeng Yan

Experimental results on the MultiWOZ dataset demonstrate that training a model on the simulated dialogues leads to even better performance than using the same amount of human-generated dialogues under the challenging low-resource settings, with as few as 85 dialogues as a seed.

Data Augmentation In-Context Learning +2

Limitations of Language Models in Arithmetic and Symbolic Induction

no code implementations9 Aug 2022 Jing Qian, Hong Wang, Zekun Li, Shiyang Li, Xifeng Yan

LMs with tutor is able to deliver 100% accuracy in situations of OOD and repeating symbols, shedding new insights on the boundary of large LMs in induction.

Eliminating Gradient Conflict in Reference-based Line-Art Colorization

1 code implementation13 Jul 2022 Zekun Li, Zhengyang Geng, Zhao Kang, Wenyu Chen, Yibo Yang

To understand the instability in training, we detect the gradient flow of attention and observe gradient conflict among attention branches.

Line Art Colorization SSIM

Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection

no code implementations12 Dec 2021 Zekun Li, Runyu Guan, Qianmu Yu, Yao-Yi Chiang, Craig A. Knoblock

We show that the state-of-the-art text detection models (e. g., PSENet) can benefit from the synthetic historical maps and achieve significant improvement for historical map text detection.

Style Transfer Text Detection

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction

no code implementations18 Sep 2021 Zekun Li, Yufan Liu, Bing Li, Weiming Hu, Kebin Wu, Pei Wang

CDI builds the global attention and interaction among different levels in decoupled space which also solves the problem of heavy computation.

Diversity

GraphFM: Graph Factorization Machines for Feature Interaction Modeling

1 code implementation25 May 2021 Shu Wu, Zekun Li, Yunyue Su, Zeyu Cui, XiaoYu Zhang, Liang Wang

To solve the problems, we propose a novel approach, Graph Factorization Machine (GraphFM), by naturally representing features in the graph structure.

Graph Neural Network

DyGCN: Dynamic Graph Embedding with Graph Convolutional Network

no code implementations7 Apr 2021 Zeyu Cui, Zekun Li, Shu Wu, XiaoYu Zhang, Qiang Liu, Liang Wang, Mengmeng Ai

We naturally generalizes the embedding propagation scheme of GCN to dynamic setting in an efficient manner, which is to propagate the change along the graph to update node embeddings.

Dynamic graph embedding

PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image

1 code implementation12 Dec 2020 Yuliang Guo, Zhong Li, Zekun Li, Xiangyu Du, Shuxue Quan, Yi Xu

In this paper, a real-time method called PoP-Net is proposed to predict multi-person 3D poses from a depth image.

3D Pose Estimation Data Augmentation

Cold-start Sequential Recommendation via Meta Learner

no code implementations10 Dec 2020 Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu

As there is generally no side information in the setting of sequential recommendation task, previous cold-start methods could not be applied when only user-item interactions are available.

Meta-Learning Sequential Recommendation

Heterogeneous Graph Collaborative Filtering

no code implementations13 Nov 2020 Zekun Li, Yujia Zheng, Shu Wu, XiaoYu Zhang, Liang Wang

In this work, we propose to model user-item interactions as a heterogeneous graph which consists of not only user-item edges indicating their interaction but also user-user edges indicating their similarity.

Collaborative Filtering

DGTN: Dual-channel Graph Transition Network for Session-based Recommendation

1 code implementation21 Sep 2020 Yujia Zheng, Siyi Liu, Zekun Li, Shu Wu

These item transitions include potential collaborative information and reflect similar behavior patterns, which we assume may help with the recommendation for the target session.

Session-Based Recommendations

Semi-supervised Compatibility Learning Across Categories for Clothing Matching

1 code implementation31 Jul 2019 Zekun Li, Zeyu Cui, Shu Wu, Xiao-Yu Zhang, Liang Wang

To achieve the alignment, we minimize the distances between distributions with unsupervised adversarial learning, and also the distances between some annotated compatible items which play the role of anchor points to help align.

Dressing as a Whole: Outfit Compatibility Learning Based on Node-wise Graph Neural Networks

1 code implementation21 Feb 2019 Zeyu Cui, Zekun Li, Shu Wu, Xiao-Yu Zhang, Liang Wang

In this paper, we aim to investigate a practical problem of fashion recommendation by answering the question "which item should we select to match with the given fashion items and form a compatible outfit".

 Ranked #1 on Recommendation Systems on Polyvore (Accuracy metric)

Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.