Search Results for author: Ziyi Liu

Found 51 papers, 18 papers with code

ER-TEST Evaluating Explanation Regularization Methods for NLP Models

no code implementations NAACL (TrustNLP) 2022 Brihi Joshi, Aaron Chan, Ziyi Liu, Xiang Ren

For the latter, explanation regularization (ER) aims to improve NLM generalization by pushing the machine rationales to align with human rationales.

AI Agent Behavioral Science

no code implementations4 Jun 2025 Lin Chen, Yunke Zhang, Jie Feng, Haoye Chai, Honglin Zhang, Bingbing Fan, Yibo Ma, Shiyuan Zhang, Nian Li, Tianhui Liu, Nicholas Sukiennik, Keyu Zhao, Yu Li, Ziyi Liu, Fengli Xu, Yong Li

Recent advances in large language models (LLMs) have enabled the development of AI agents that exhibit increasingly human-like behaviors, including planning, adaptation, and social dynamics across diverse, interactive, and open-ended scenarios.

anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding

no code implementations1 Jun 2025 Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang

However, existing ECG-focused MLLMs primarily focus on report generation tasks, often limited to single 12-lead, short-duration (10s) ECG inputs, thereby underutilizing the potential of MLLMs.

Discovering Pathology Rationale and Token Allocation for Efficient Multimodal Pathology Reasoning

no code implementations21 May 2025 Zhe Xu, Cheng Jin, Yihui Wang, Ziyi Liu, Hao Chen

Multimodal pathological image understanding has garnered widespread interest due to its potential to improve diagnostic accuracy and enable personalized treatment through integrated visual and textual data.

Computational Efficiency Diagnostic +3

Improving Random Forests by Smoothing

no code implementations11 May 2025 Ziyi Liu, Phuc Luong, Mario Boley, Daniel F. Schmidt

Gaussian process regression is a popular model in the small data regime due to its sound uncertainty quantification and the exploitation of the smoothness of the regression function that is encountered in a wide range of practical problems.

Gaussian Processes regression +1

Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer

no code implementations CVPR 2025 Ziyi Liu, Yangcen Liu

While recent approaches employ pseudo labels for training, three key challenges: generating high-quality pseudo labels, making full use of different priors, and optimizing training methods with noisy labels remain unresolved.

Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench

1 code implementation1 Apr 2025 Ziyi Liu, Priyanka Dey, Zhenyu Zhao, Jen-tse Huang, Rahul Gupta, Yang Liu, Jieyu Zhao

To address this gap, we introduce CQ-Bench, a benchmark specifically designed to assess LLMs' capability to infer implicit cultural values from natural conversational contexts.

Benchmarking

Make the Most of Everything: Further Considerations on Disrupting Diffusion-based Customization

no code implementations18 Mar 2025 Long Tang, Dengpan Ye, Sirun Chen, Xiuwen Shi, Yunna Lv, Ziyi Liu

We propose Dual Anti-Diffusion (DADiff), a two-stage adversarial attack targeting diffusion customization, which, for the first time, integrates the adversarial prompt-level attack into the generation process of image-level adversarial examples.

Adversarial Attack

VLMs as GeoGuessr Masters: Exceptional Performance, Hidden Biases, and Privacy Risks

1 code implementation16 Feb 2025 Jingyuan Huang, Jen-tse Huang, Ziyi Liu, Xiaoyuan Liu, Wenxuan Wang, Jieyu Zhao

Evaluating four VLMs, we find that while these models demonstrate the ability to recognize geographic information from images, achieving up to 53. 8% accuracy in city prediction, they exhibit significant biases.

De-biased Multimodal Electrocardiogram Analysis

no code implementations22 Nov 2024 Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang

We analyzed this phenomenon from a causal perspective in the context of ECG MLLMs and discovered that the confounder, severity of illness, introduces a spurious correlation between the question and answer, leading the model to rely on this spurious correlation and ignore the ECG input.

Translating Electrocardiograms to Cardiac Magnetic Resonance Imaging Useful for Cardiac Assessment and Disease Screening: A Multi-Center Study AI for ECG to CMR Translation Study

1 code implementation19 Nov 2024 Zhengyao Ding, Ziyu Li, Yujian Hu, Youyao Xu, Chengchen Zhao, Yiheng Mao, Haitao Li, Zhikang Li, Qian Li, Jing Wang, Yue Chen, Mengjia Chen, Longbo Wang, Xuesen Chu, Weichao Pan, Ziyi Liu, Fei Wu, HongKun Zhang, Ting Chen, Zhengxing Huang

Trained on 159, 819 samples from five cohorts, including the UK Biobank (n=42, 483) and MIMIC-IV-ECG (n=164, 550), and externally validated on independent clinical datasets (n=3, 767), CardioNets achieved strong performance across disease screening and phenotype estimation tasks.

Contrastive Learning Diagnostic +3

Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to Semantics

no code implementations28 Oct 2024 Isabelle Lee, Joshua Lum, Ziyi Liu, Dani Yogatama

While interpretability research has shed light on some internal algorithms utilized by transformer-based LLMs, reasoning in natural language, with its deep contextuality and ambiguity, defies easy categorization.

An Efficient System for Automatic Map Storytelling -- A Case Study on Historical Maps

1 code implementation21 Oct 2024 Ziyi Liu, Claudio Affolter, Sidi Wu, Yizi Chen, Lorenz Hurni

Despite the recent advance of GPT-4 in text recognition and map captioning, it still has a limited understanding of maps, as its performance wanes when texts (e. g., titles and legends) in maps are missing or inaccurate.

Image Captioning

Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood

no code implementations4 Oct 2024 Ziyi Liu, Idan Attias, Daniel M. Roy

Inspired by the seminal work of Shtarkov (1987) and Rakhlin, Sridharan, and Tewari (2010), we introduce a novel complexity measure, the \emph{contextual Shtarkov sum}, corresponding to the Shtarkov sum after projection onto a multiary context tree, and show that the worst case log contextual Shtarkov sum equals the minimax regret.

Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals

no code implementations1 Jul 2024 Ziyi Liu, Idan Attias, Daniel M. Roy

In this work, we investigate the problem of adapting to the presence or absence of causal structure in multi-armed bandit problems.

InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context

1 code implementation18 Jun 2024 Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao

In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by mapping their ability to understand and manage intentions in a game setting.

DIP-Watermark: A Double Identity Protection Method Based on Robust Adversarial Watermark

no code implementations23 Apr 2024 Yunming Zhang, Dengpan Ye, Caiyun Xie, Sipeng Shen, Ziyi Liu, Jiacheng Deng, Long Tang

This strategy enhances the representation of universal carrier features, mitigating multi-objective optimization conflicts in watermarking.

Adversarial Attack Decoder +1

STAT: Towards Generalizable Temporal Action Localization

no code implementations20 Apr 2024 Yangcen Liu, Ziyi Liu, Yuanhao Zhai, Wen Li, David Doerman, Junsong Yuan

To address this problem, we propose the Generalizable Temporal Action Localization task (GTAL), which focuses on improving the generalizability of action localization methods.

Conversational Disease Diagnosis via External Planner-Controlled Large Language Models

1 code implementation4 Apr 2024 Zhoujian Sun, Cheng Luo, Ziyi Liu, Zhengxing Huang

We demonstrated that our system obtained impressive performance in both disease screening and differential diagnoses tasks.

Active Learning Decision Making +4

Self-Contradictory Reasoning Evaluation and Detection

1 code implementation16 Nov 2023 Ziyi Liu, Soumya Sanyal, Isabelle Lee, Yongkang Du, Rahul Gupta, Yang Liu, Jieyu Zhao

For 2), we task the state-of-the-art model GPT-4 with identifying Self-Contra reasoning and finer-grained fallacies.

Dual Defense: Adversarial, Traceable, and Invisible Robust Watermarking against Face Swapping

no code implementations25 Oct 2023 Yunming Zhang, Dengpan Ye, Caiyun Xie, Long Tang, Chuanxi Chen, Ziyi Liu, Jiacheng Deng

Dual Defense invisibly embeds a single robust watermark within the target face to actively respond to sudden cases of malicious face swapping.

Face Swapping Misinformation

Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance

no code implementations16 Oct 2023 Jesse Zhang, Jiahui Zhang, Karl Pertsch, Ziyi Liu, Xiang Ren, Minsuk Chang, Shao-Hua Sun, Joseph J. Lim

Instead, our approach BOSS (BOotStrapping your own Skills) learns to accomplish new tasks by performing "skill bootstrapping," where an agent with a set of primitive skills interacts with the environment to practice new skills without receiving reward feedback for tasks outside of the initial skill set.

Language Modeling Language Modelling +1

SOAR: Scene-debiasing Open-set Action Recognition

1 code implementation ICCV 2023 Yuanhao Zhai, Ziyi Liu, Zhenyu Wu, Yi Wu, Chunluan Zhou, David Doermann, Junsong Yuan, Gang Hua

The former prevents the decoder from reconstructing the video background given video features, and thus helps reduce the background information in feature learning.

Decoder Open Set Action Recognition +1

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales

1 code implementation11 May 2023 Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi, Xiang Ren

Existing metrics like task performance of the LM generating the rationales, or similarity between generated and gold rationales are not good indicators of their human utility.

Handling Concept Drift in Global Time Series Forecasting

1 code implementation4 Apr 2023 Ziyi Liu, Rakshitha Godahewa, Kasun Bandara, Christoph Bergmeir

Handling concept drift in forecasting is essential for many ML methods in use nowadays, however, the prior work only proposes methods to handle concept drift in the classification domain.

Time Series Time Series Forecasting

Clustered Federated Learning based on Nonconvex Pairwise Fusion

1 code implementation8 Nov 2022 Xue Yu, Ziyi Liu, Wu Wang, Yifan Sun

We propose a clustered FL framework that incorporates a nonconvex penalty to pairwise differences of parameters.

Federated Learning

XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models

no code implementations30 Oct 2022 Dong-Ho Lee, Akshen Kadakia, Brihi Joshi, Aaron Chan, Ziyi Liu, Kiran Narahari, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

Explanation-based model debugging aims to resolve spurious biases by showing human users explanations of model behavior, asking users to give feedback on the behavior, then using the feedback to update the model.

text-classification Text Classification

Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering

no code implementations14 Sep 2022 Jingjing Jiang, Ziyi Liu, Nanning Zheng

In this paper, we aim to improve input robustness from an information bottleneck perspective when adapting pretrained VLMs to the downstream VQA task.

Adversarial Robustness Question Answering +1

AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications

1 code implementation COLING 2022 Yusen Zhang, Zhongli Li, Qingyu Zhou, Ziyi Liu, Chao Li, Mina Ma, Yunbo Cao, Hongzhi Liu

To automatically correct handwritten assignments, the traditional approach is to use an OCR model to recognize characters and compare them to answers.

Optical Character Recognition (OCR)

Multi-Marginal Contrastive Learning for Multi-Label Subcellular Protein Localization

1 code implementation CVPR 2022 Ziyi Liu, Zengmao Wang, Bo Du

In this paper, we propose a deep protein subcellular localization method with multi-marginal contrastive learning to perceive the same PSLs in different tissue images and different PSLs within the same tissue image.

Contrastive Learning

LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering

no code implementations29 Nov 2021 Jingjing Jiang, Ziyi Liu, Nanning Zheng

Video Question Answering (VideoQA), aiming to correctly answer the given question based on understanding multi-modal video content, is challenging due to the rich video content.

Diversity Question Answering +3

Machine Learning for Multimodal Electronic Health Records-based Research: Challenges and Perspectives

no code implementations9 Nov 2021 Ziyi Liu, JiaQi Zhang, Yongshuai Hou, Xinran Zhang, Ge Li, Yang Xiang

Background: Electronic Health Records (EHRs) contain rich information of patients' health history, which usually include both structured and unstructured data.

BIG-bench Machine Learning

SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network

no code implementations15 Oct 2021 Ziyi Liu, Minghui Liao, Fulin Luo, Bo Du

This method constructs the graph by the similarity relationship between cells and adopts GCN to analyze the neighbor embedding information of samples, which makes the similar cell closer to each other on the 2D scatter plot.

Dimensionality Reduction Graph Embedding

X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering

1 code implementation24 Jul 2021 Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng

In this paper, we formulate OOD generalization in VQA as a compositional generalization problem and propose a graph generative modeling-based training scheme (X-GGM) to implicitly model the problem.

Attribute Out-of-Distribution Generalization +2

LightFuse: Lightweight CNN based Dual-exposure Fusion

1 code implementation5 Jul 2021 Ziyi Liu, Jie Yang, Svetlana Yanushkevich, Orly Yadid-Pecht

Embedded systems have a huge market, and utilizing DCNNs' powerful functionality into them will further reduce human intervention.

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

no code implementations30 Mar 2021 Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

To address this challenge, we introduce a framework that learns two feature subspaces respectively for actions and their context.

Action Recognition

Mobile-end Tone Mapping based on Integral Image and Integral Histogram

no code implementations2 Feb 2021 Jie Yang, Mengchen Lin, Ziyi Liu, Ulian Shahnovich, Orly Yadid-Pecht

It is especially crucial for mobile devices because most of the images taken today are from mobile phones, hence such technology is highly demanded in the consumer market of mobile devices and is essential for a good customer experience.

Tone Mapping

Deep Reformulated Laplacian Tone Mapping

1 code implementation31 Jan 2021 Jie Yang, Ziyi Liu, Mengchen Lin, Svetlana Yanushkevich, Orly Yadid-Pecht

The reformulated Laplacian pyramid always decompose a WDR image into two frequency bands where the low-frequency band is global feature-oriented, and the high-frequency band is local feature-oriented.

Tone Mapping

Tone Mapping Based on Multi-scale Histogram Synthesis

1 code implementation31 Jan 2021 Jie Yang, Ziyi Liu, Ulian Shahnovich, Orly Yadid-Pecht

HVS perceives luminance differently when under different adaptation levels, and therefore our algorithm uses functions built upon different scales to tone map pixels to different values.

Tone Mapping

A review for Tone-mapping Operators on Wide Dynamic Range Image

no code implementations8 Jan 2021 Ziyi Liu

The dynamic range of our normal life can exceeds 120 dB, however, the smart-phone cameras and the conventional digital cameras can only capture a dynamic range of 90 dB, which sometimes leads to loss of details for the recorded image.

Tone Mapping

Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only

no code implementations EMNLP (Louhi) 2020 Ziyi Liu, Giannis Karamanolakis, Daniel Hsu, Luis Gravano

To improve performance without extra annotations, we create artificial training documents in the target language through machine translation and train mBERT jointly for the source (English) and target language.

Machine Translation text-classification +1

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks

no code implementations ICCV 2019 Ziyi Liu, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua

To address this challenge, we propose the Contrast-based Localization EvaluAtioN Network (CleanNet) with our new action proposal evaluator, which provides pseudo-supervision by leveraging the temporal contrast in snippet-level action classification predictions.

Action Classification Weakly Supervised Action Localization

Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition

no code implementations19 Mar 2018 Jinliang Zang, Le Wang, Ziyi Liu, Qilin Zhang, Zhenxing Niu, Gang Hua, Nanning Zheng

Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs).

Action Recognition Temporal Action Localization

Detecting Drivable Area for Self-driving Cars: An Unsupervised Approach

no code implementations1 May 2017 Ziyi Liu, Siyu Yu, Xiao Wang, Nanning Zheng

Experiments show that our unsupervised approach is efficient and robust for detecting drivable area for self-driving cars.

Self-Driving Cars

Cannot find the paper you are looking for? You can Submit a new open access paper.