Search Results for author: Zhen Han

Found 49 papers, 18 papers with code

Multi-Hop Open-Domain Question Answering over Structured and Unstructured Knowledge

no code implementations Findings (NAACL) 2022 Yue Feng, Zhen Han, Mingming Sun, Ping Li

DEHG employs a graph constructor to integrate structured and unstructured information, a context encoder to represent the nodes and the question, a heterogeneous information reasoning layer to conduct multi-hop reasoning on both information sources, and an answer decoder to generate answers for the question.

Decoder Open-Domain Question Answering

Learning Neural Ordinary Equations for Forecasting Future Links on Temporal Knowledge Graphs

no code implementations EMNLP 2021 Zhen Han, Zifeng Ding, Yunpu Ma, Yujia Gu, Volker Tresp

In addition, a novel graph transition layer is applied to capture the transitions on the dynamic graph, i.e., edge formation and dissolution.

Knowledge Graphs

TempCaps: A Capsule Network-based Embedding Model for Temporal Knowledge Graph Completion

1 code implementation spnlp (ACL) 2022 Guirong Fu, Zhao Meng, Zhen Han, Zifeng Ding, Yunpu Ma, Matthias Schubert, Volker Tresp, Roger Wattenhofer

In this paper, we tackle the temporal knowledge graph completion task by proposing TempCaps, which is a Capsule network-based embedding model for Temporal knowledge graph completion.

Entity Embeddings Temporal Knowledge Graph Completion

ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing

no code implementations 18 Mar 2025 Yulin Pan, Xiangteng He, Chaojie Mao, Zhen Han, Zeyinzi Jiang, Jingfeng Zhang, Yu Liu

In this paper, we propose ICE-Bench, a unified and comprehensive benchmark designed to rigorously assess image generation models.

Image Generation

VACE: All-in-One Video Creation and Editing

no code implementations 10 Mar 2025 Zeyinzi Jiang, Zhen Han, Chaojie Mao, Jingfeng Zhang, Yulin Pan, Yu Liu

Further pursuing the unification of generation and editing tasks has yielded significant progress in the domain of image content creation.

All Video Editing +1

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

no code implementations 5 Jan 2025 Chaojie Mao, Jingfeng Zhang, Yulin Pan, Zeyinzi Jiang, Zhen Han, Yu Liu, Jingren Zhou

There are many models in the community based on the post-training of text-to-image foundational models that meet this training paradigm of the first stage.

Image Generation

HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

no code implementations 20 Dec 2024 Meng-Chieh Lee, Qi Zhu, Costas Mavromatis, Zhen Han, Soji Adeshina, Vassilis N. Ioannidis, Huzefa Rangwala, Christos Faloutsos

Given a semi-structured knowledge base (SKB), where text documents are interconnected by relations, how can we effectively retrieve relevant information to answer user questions?

Question Answering RAG +1

PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model

no code implementations 12 Nov 2024 Yilun Liu, Yunpu Ma, Shuo Chen, Zifeng Ding, Bailan He, Zhen Han, Volker Tresp

By combining design choices within our framework, we introduce Parameter-Efficient Routed Fine-Tuning (PERFT) as a flexible and scalable family of PEFT strategies tailored for MoE models.

Arithmetic Reasoning Mixture-of-Experts +1

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer

no code implementations 30 Sep 2024 Zhen Han, Zeyinzi Jiang, Yulin Pan, Jingfeng Zhang, Chaojie Mao, ChenWei Xie, Yu Liu, Jingren Zhou

To comprehensively evaluate the performance of our model, we establish a benchmark of manually annotated data pairs across a variety of visual generation tasks.

All Large Language Model

Visual Question Decomposition on Multimodal Large Language Models

no code implementations 28 Sep 2024 Haowei Zhang, Jianzhe Liu, Zhen Han, Shuo Chen, Bailan He, Volker Tresp, Zhiqiang Xu, Jindong Gu

The finetuning pipeline consists of our proposed dataset and a training objective for selective decomposition.

Visual Question Answering (VQA)

WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

no code implementations 28 Aug 2024 Yao Zhang, Zijian Ma, Yunpu Ma, Zhen Han, Yu Wu, Volker Tresp

LLM-based autonomous agents often fail to execute complex web tasks that require dynamic interaction due to the inherent uncertainty and complexity of these environments.

Decision Making global-optimization

IDRetracor: Towards Visual Forensics Against Malicious Face Swapping

no code implementations 13 Aug 2024 Jikang Cheng, Jiaxin Ai, Zhen Han, Chao Liang, Qin Zou, Zhongyuan Wang, Qian Wang

To achieve visual forensics and target face attribution, we propose a novel task named face retracing, which considers retracing the original target face from the given fake one via inverse mapping.

DeepFake Detection Face Swapping

A rapid approach to urban traffic noise mapping with a generative adversarial network

no code implementations 21 May 2024 Xinhao Yang, Zhen Han, Xiaodong Lu, Yuan Zhang

With rapid urbanisation and the accompanying increase in traffic density, traffic noise has become a major concern in urban planning.

Generative Adversarial Network SSIM

StyleBooth: Image Style Editing with Multimodal Instruction

1 code implementation 18 Apr 2024 Zhen Han, Chaojie Mao, Zeyinzi Jiang, Yulin Pan, Jingfeng Zhang

We integrate the encoded textual instruction and image exemplar as a unified condition for the diffusion model, enabling editing of the original image following multimodal instructions.

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

1 code implementation 4 Apr 2024 Shuo Chen, Zhen Han, Bailan He, Zifeng Ding, Wenqian Yu, Philip Torr, Volker Tresp, Jindong Gu

Various jailbreak attacks have been proposed to red-team Large Language Models (LLMs) and revealed the vulnerable safeguards of LLMs.

Red Teaming

Locate, Assign, Refine: Taming Customized Promptable Image Inpainting

no code implementations 28 Mar 2024 Yulin Pan, Chaojie Mao, Zeyinzi Jiang, Zhen Han, Jingfeng Zhang, Xiangteng He

Prior studies have made significant progress in image inpainting guided by either text description or subject image.

Image Inpainting

Stop Reasoning! When Multimodal LLM with Chain-of-Thought Reasoning Meets Adversarial Image

1 code implementation 22 Feb 2024 Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, Jindong Gu

Based on our findings, we further propose a novel attack method, termed the stop-reasoning attack, that attacks the model while bypassing the CoT reasoning process.

Adversarial Robustness Multimodal Reasoning +1

Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?

no code implementations 29 Nov 2023 Shuo Chen, Zhen Han, Bailan He, Jianzhe Liu, Mark Buckley, Yao Qin, Philip Torr, Volker Tresp, Jindong Gu

Experiments revealed that multimodal ICL is predominantly driven by the textual content whereas the visual information in the demos has little influence.

In-Context Learning

GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models

1 code implementation 12 Oct 2023 Yuanchun Shen, Ruotong Liao, Zhen Han, Yunpu Ma, Volker Tresp

The proposed dataset is designed to evaluate graph-language models' ability to understand graphs and make use of them for answer generation.

Answer Generation Hallucination +4

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

2 code implementations 24 Jul 2023 Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

This paper aims to provide a comprehensive survey of cutting-edge research in prompt engineering on three types of vision-language models: multimodal-to-text generation models (e.g., Flamingo), image-text matching models (e.g.

Image-text matching Language Modeling +5

Logic Diffusion for Knowledge Graph Reasoning

no code implementations 6 Jun 2023 Xiaoying Xie, Biao Gong, Yiliang Lv, Zhen Han, Guoshuai Zhao, Xueming Qian

Most recent works focus on answering first-order logical queries to explore knowledge graph reasoning via multi-hop logic predictions.

A Graph-Guided Reasoning Approach for Open-ended Commonsense Question Answering

no code implementations 18 Mar 2023 Zhen Han, Yue Feng, Mingming Sun

Hence, a new benchmark challenge set for open-ended commonsense reasoning (OpenCSR) has been recently released, which contains natural science questions without any predefined choices.

Multiple-choice Question Answering +1

Mutimodal Ranking Optimization for Heterogeneous Face Re-identification

no code implementations 11 Dec 2022 Hui Hu, Jiawei Zhang, Zhen Han

Secondly, we propose linear and non-linear fusion strategies to aggregate initial ranking lists of multimodal face pairs and acquire the optimized re-ranked list based on modal complementarity.

Few-Shot Inductive Learning on Temporal Knowledge Graphs using Concept-Aware Information

no code implementations 15 Nov 2022 Zifeng Ding, Jingpei Wu, Bailan He, Yunpu Ma, Zhen Han, Volker Tresp

A similar problem exists in temporal knowledge graphs (TKGs), and no previous temporal knowledge graph completion (TKGC) method has been developed for modeling newly emerged entities.

Inductive Learning Link Prediction +2

Deepfake Face Traceability with Disentangling Reversing Network

no code implementations 8 Jul 2022 Jiaxin Ai, Zhongyuan Wang, Baojin Huang, Zhen Han

Deepfake faces not only violate the privacy of personal identity, but also confuse the public and cause huge social harm.

DeepFake Detection Face Swapping

Continuous Temporal Graph Networks for Event-Based Graph Data

no code implementations NAACL (DLG4NLP) 2022 Jin Guo, Zhen Han, Zhou Su, Jiliang Li, Volker Tresp, Yuyi Wang

Hence, we propose Continuous Temporal Graph Networks (CTGNs) to capture the continuous dynamics of temporal graph data.

Graph Neural Network

Learning Meta Representations of One-shot Relations for Temporal Knowledge Graph Link Prediction

no code implementations 21 May 2022 Zifeng Ding, Bailan He, Yunpu Ma, Zhen Han, Volker Tresp

In this paper, we follow the previous work that focuses on few-shot relational learning on static KGs and extend two fundamental TKG reasoning tasks, i.e., interpolated and extrapolated link prediction, to the one-shot setting.

Few-Shot Learning Knowledge Graphs +2

ECOLA: Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations

no code implementations 17 Mar 2022 Zhen Han, Ruotong Liao, Jindong Gu, Yao Zhang, Zifeng Ding, Yujia Gu, Heinz Köppl, Hinrich Schütze, Volker Tresp

Since conventional knowledge embedding models cannot take full advantage of the abundant textual information, there have been extensive research efforts in enhancing knowledge embedding using texts.

Knowledge Graph Embedding Link Prediction +1

Consistent Style Transfer

1 code implementation 6 Jan 2022 Xuan Luo, Zhen Han, Lingkang Yang, Lingling Zhang

Recently, attentional arbitrary style transfer methods have been proposed to achieve fine-grained results; they manipulate the point-wise similarity between content and style features for stylization.

Style Transfer

TimeTraveler: Reinforcement Learning for Temporal Knowledge Graph Forecasting

1 code implementation EMNLP 2021 Haohai Sun, Jialun Zhong, Yunpu Ma, Zhen Han, Kun He

Compared with the completion task, the forecasting task is more difficult and faces two main challenges: (1) how to effectively model the time information to handle future timestamps?

Link Prediction reinforcement-learning +2

Video Similarity and Alignment Learning on Partial Video Copy Detection

no code implementations 4 Aug 2021 Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv

To address the above issues, we propose the Video Similarity and Alignment Learning (VSAL) approach, which jointly models spatial similarity, temporal similarity and partial alignment.

Copy Detection Partial Video Copy Detection +1

When Face Recognition Meets Occlusion: A New Benchmark

1 code implementation 4 Mar 2021 Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Kui Jiang, Kangli Zeng, Zhen Han, Xin Tian, Yuhong Yang

In particular, we first collect a variety of glasses and masks as occlusion, and randomly combine the occlusion attributes (occlusion objects, textures, and colors) to achieve a large number of more realistic occlusion types.

Diversity Face Recognition

Temporal Knowledge Graph Forecasting with Neural ODE

1 code implementation 13 Jan 2021 Zhen Han, Zifeng Ding, Yunpu Ma, Yujia Gu, Volker Tresp

In addition, a novel graph transition layer is applied to capture the transitions on the dynamic graph, i.e., edge formation and dissolution.

Future prediction Knowledge Graphs

Graph Hawkes Neural Network for Forecasting on Temporal Knowledge Graphs

1 code implementation AKBC 2020 Zhen Han, Yunpu Ma, Yuyi Wang, Stephan Günnemann, Volker Tresp

The Hawkes process has become a standard method for modeling self-exciting event sequences with different event types.

Knowledge Graphs

MMD GAN with Random-Forest Kernels

no code implementations ICLR 2020 Tao Huang, Zhen Han, Xu Jia, Hanyuan Hang

In this paper, we propose a novel kind of kernel, random forest kernel, to enhance the empirical performance of MMD GAN.

Ensemble Learning

Unsupervised Image Super-Resolution with an Indirect Supervised Path

no code implementations 7 Oct 2019 Zhen Han, Enyan Dai, Xu Jia, Xiaoying Ren, Shuaijun Chen, Chunjing Xu, Jianzhuang Liu, Qi Tian

The task of single image super-resolution (SISR) aims at reconstructing a high-resolution (HR) image from a low-resolution (LR) image.

Image Super-Resolution Translation

Global Norm-Aware Pooling for Pose-Robust Face Recognition at Low False Positive Rate

no code implementations 1 Aug 2018 Sheng Chen, Jia Guo, Yang Liu, Xiang Gao, Zhen Han

In this paper, we propose a novel Global Norm-Aware Pooling (GNAP) block, which reweights local features in a convolutional neural network (CNN) adaptively according to their L2 norms and outputs a global feature vector with a global average pooling layer.

Face Recognition Robust Face Recognition

Gradient Band-based Adversarial Training for Generalized Attack Immunity of A3C Path Finding

no code implementations 18 Jul 2018 Tong Chen, Wenjia Niu, Yingxiao Xiang, Xiaoxuan Bai, Jiqiang Liu, Zhen Han, Gang Li

In addition, we propose Gradient Band-based Adversarial Training, which trains with a single randomly chosen dominant adversarial example without any modification, to realize "1:N" attack immunity for generalized dominant adversarial examples.

Dynamic Stacked Generalization for Node Classification on Networks

no code implementations 16 Oct 2016 Zhen Han, Alyson Wilson

We propose a novel stacked generalization (stacking) method as a dynamic ensemble technique using a pool of heterogeneous classifiers for node label classification on networks.

Classification General Classification +1
