no code implementations • NAACL (TrustNLP) 2022 • Brihi Joshi, Aaron Chan, Ziyi Liu, Xiang Ren
For the latter, explanation regularization (ER) aims to improve NLM generalization by pushing the machine rationales to align with human rationales.
no code implementations • 4 Jun 2025 • Lin Chen, Yunke Zhang, Jie Feng, Haoye Chai, Honglin Zhang, Bingbing Fan, Yibo Ma, Shiyuan Zhang, Nian Li, Tianhui Liu, Nicholas Sukiennik, Keyu Zhao, Yu Li, Ziyi Liu, Fengli Xu, Yong Li
Recent advances in large language models (LLMs) have enabled the development of AI agents that exhibit increasingly human-like behaviors, including planning, adaptation, and social dynamics across diverse, interactive, and open-ended scenarios.
no code implementations • 1 Jun 2025 • Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang
However, existing ECG-focused MLLMs primarily focus on report generation tasks, often limited to single 12-lead, short-duration (10s) ECG inputs, thereby underutilizing the potential of MLLMs.
no code implementations • 21 May 2025 • Zhe Xu, Cheng Jin, Yihui Wang, Ziyi Liu, Hao Chen
Multimodal pathological image understanding has garnered widespread interest due to its potential to improve diagnostic accuracy and enable personalized treatment through integrated visual and textual data.
no code implementations • 17 May 2025 • Haitao Li, Che Liu, Zhengyao Ding, Ziyi Liu, Zhengxing Huang
Electrocardiograms (ECGs) are essential for diagnosing cardiovascular diseases.
no code implementations • 11 May 2025 • Ziyi Liu, Phuc Luong, Mario Boley, Daniel F. Schmidt
Gaussian process regression is a popular model in the small data regime due to its sound uncertainty quantification and the exploitation of the smoothness of the regression function that is encountered in a wide range of practical problems.
no code implementations • CVPR 2025 • Ziyi Liu, Yangcen Liu
While recent approaches employ pseudo labels for training, three key challenges: generating high-quality pseudo labels, making full use of different priors, and optimizing training methods with noisy labels remain unresolved.
1 code implementation • 1 Apr 2025 • Ziyi Liu, Priyanka Dey, Zhenyu Zhao, Jen-tse Huang, Rahul Gupta, Yang Liu, Jieyu Zhao
To address this gap, we introduce CQ-Bench, a benchmark specifically designed to assess LLMs' capability to infer implicit cultural values from natural conversational contexts.
no code implementations • 18 Mar 2025 • Long Tang, Dengpan Ye, Sirun Chen, Xiuwen Shi, Yunna Lv, Ziyi Liu
We propose Dual Anti-Diffusion (DADiff), a two-stage adversarial attack targeting diffusion customization, which, for the first time, integrates the adversarial prompt-level attack into the generation process of image-level adversarial examples.
no code implementations • 24 Feb 2025 • Zhoujian Sun, Ziyi Liu, Cheng Luo, Jiebin Chu, Zhengxing Huang
The experimental results indicate that the PPME LLM achieved over 30% improvement compared to baselines.
1 code implementation • 16 Feb 2025 • Jingyuan Huang, Jen-tse Huang, Ziyi Liu, Xiaoyuan Liu, Wenxuan Wang, Jieyu Zhao
Evaluating four VLMs, we find that while these models demonstrate the ability to recognize geographic information from images, achieving up to 53. 8% accuracy in city prediction, they exhibit significant biases.
no code implementations • 22 Dec 2024 • Sipeng Shen, Yunming Zhang, Dengpan Ye, Xiuwen Shi, Long Tang, Haoran Duan, Jiacheng Deng, Ziyi Liu
Moreover, ErasableMask also exhibits outstanding perturbation erasion performance, achieving over 90% erasion success rate.
no code implementations • 22 Nov 2024 • Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang
We analyzed this phenomenon from a causal perspective in the context of ECG MLLMs and discovered that the confounder, severity of illness, introduces a spurious correlation between the question and answer, leading the model to rely on this spurious correlation and ignore the ECG input.
1 code implementation • 19 Nov 2024 • Zhengyao Ding, Ziyu Li, Yujian Hu, Youyao Xu, Chengchen Zhao, Yiheng Mao, Haitao Li, Zhikang Li, Qian Li, Jing Wang, Yue Chen, Mengjia Chen, Longbo Wang, Xuesen Chu, Weichao Pan, Ziyi Liu, Fei Wu, HongKun Zhang, Ting Chen, Zhengxing Huang
Trained on 159, 819 samples from five cohorts, including the UK Biobank (n=42, 483) and MIMIC-IV-ECG (n=164, 550), and externally validated on independent clinical datasets (n=3, 767), CardioNets achieved strong performance across disease screening and phenotype estimation tasks.
no code implementations • 28 Oct 2024 • Isabelle Lee, Joshua Lum, Ziyi Liu, Dani Yogatama
While interpretability research has shed light on some internal algorithms utilized by transformer-based LLMs, reasoning in natural language, with its deep contextuality and ambiguity, defies easy categorization.
1 code implementation • 21 Oct 2024 • Ziyi Liu, Claudio Affolter, Sidi Wu, Yizi Chen, Lorenz Hurni
Despite the recent advance of GPT-4 in text recognition and map captioning, it still has a limited understanding of maps, as its performance wanes when texts (e. g., titles and legends) in maps are missing or inaccurate.
no code implementations • 4 Oct 2024 • Ziyi Liu, Idan Attias, Daniel M. Roy
Inspired by the seminal work of Shtarkov (1987) and Rakhlin, Sridharan, and Tewari (2010), we introduce a novel complexity measure, the \emph{contextual Shtarkov sum}, corresponding to the Shtarkov sum after projection onto a multiary context tree, and show that the worst case log contextual Shtarkov sum equals the minimax regret.
no code implementations • 19 Sep 2024 • Ziyi Liu, Dengpan Ye, Long Tang, Yunming Zhang, Jiacheng Deng
With the development of artificial intelligence, neural networks play a key role in network intrusion detection systems (NIDS).
no code implementations • 1 Jul 2024 • Ziyi Liu, Idan Attias, Daniel M. Roy
In this work, we investigate the problem of adapting to the presence or absence of causal structure in multi-armed bandit problems.
1 code implementation • 18 Jun 2024 • Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao
In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by mapping their ability to understand and manage intentions in a game setting.
no code implementations • 23 Apr 2024 • Yunming Zhang, Dengpan Ye, Caiyun Xie, Sipeng Shen, Ziyi Liu, Jiacheng Deng, Long Tang
This strategy enhances the representation of universal carrier features, mitigating multi-objective optimization conflicts in watermarking.
no code implementations • 20 Apr 2024 • Yangcen Liu, Ziyi Liu, Yuanhao Zhai, Wen Li, David Doerman, Junsong Yuan
To address this problem, we propose the Generalizable Temporal Action Localization task (GTAL), which focuses on improving the generalizability of action localization methods.
1 code implementation • 4 Apr 2024 • Zhoujian Sun, Cheng Luo, Ziyi Liu, Zhengxing Huang
We demonstrated that our system obtained impressive performance in both disease screening and differential diagnoses tasks.
1 code implementation • 16 Nov 2023 • Ziyi Liu, Soumya Sanyal, Isabelle Lee, Yongkang Du, Rahul Gupta, Yang Liu, Jieyu Zhao
For 2), we task the state-of-the-art model GPT-4 with identifying Self-Contra reasoning and finer-grained fallacies.
no code implementations • 25 Oct 2023 • Yunming Zhang, Dengpan Ye, Caiyun Xie, Long Tang, Chuanxi Chen, Ziyi Liu, Jiacheng Deng
Dual Defense invisibly embeds a single robust watermark within the target face to actively respond to sudden cases of malicious face swapping.
no code implementations • 16 Oct 2023 • Jesse Zhang, Jiahui Zhang, Karl Pertsch, Ziyi Liu, Xiang Ren, Minsuk Chang, Shao-Hua Sun, Joseph J. Lim
Instead, our approach BOSS (BOotStrapping your own Skills) learns to accomplish new tasks by performing "skill bootstrapping," where an agent with a set of primitive skills interacts with the environment to practice new skills without receiving reward feedback for tasks outside of the initial skill set.
1 code implementation • ICCV 2023 • Yuanhao Zhai, Ziyi Liu, Zhenyu Wu, Yi Wu, Chunluan Zhou, David Doermann, Junsong Yuan, Gang Hua
The former prevents the decoder from reconstructing the video background given video features, and thus helps reduce the background information in feature learning.
1 code implementation • 11 May 2023 • Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi, Xiang Ren
Existing metrics like task performance of the LM generating the rationales, or similarity between generated and gold rationales are not good indicators of their human utility.
1 code implementation • 4 Apr 2023 • Ziyi Liu, Rakshitha Godahewa, Kasun Bandara, Christoph Bergmeir
Handling concept drift in forecasting is essential for many ML methods in use nowadays, however, the prior work only proposes methods to handle concept drift in the classification domain.
1 code implementation • 8 Nov 2022 • Xue Yu, Ziyi Liu, Wu Wang, Yifan Sun
We propose a clustered FL framework that incorporates a nonconvex penalty to pairwise differences of parameters.
no code implementations • 30 Oct 2022 • Dong-Ho Lee, Akshen Kadakia, Brihi Joshi, Aaron Chan, Ziyi Liu, Kiran Narahari, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren
Explanation-based model debugging aims to resolve spurious biases by showing human users explanations of model behavior, asking users to give feedback on the behavior, then using the feedback to update the model.
no code implementations • 14 Sep 2022 • Jingjing Jiang, Ziyi Liu, Nanning Zheng
In this paper, we aim to improve input robustness from an information bottleneck perspective when adapting pretrained VLMs to the downstream VQA task.
1 code implementation • COLING 2022 • Yusen Zhang, Zhongli Li, Qingyu Zhou, Ziyi Liu, Chao Li, Mina Ma, Yunbo Cao, Hongzhi Liu
To automatically correct handwritten assignments, the traditional approach is to use an OCR model to recognize characters and compare them to answers.
1 code implementation • 25 May 2022 • Brihi Joshi, Aaron Chan, Ziyi Liu, Shaoliang Nie, Maziar Sanjabi, Hamed Firooz, Xiang Ren
to align with human rationales (Which input tokens would humans focus on?).
1 code implementation • CVPR 2022 • Ziyi Liu, Zengmao Wang, Bo Du
In this paper, we propose a deep protein subcellular localization method with multi-marginal contrastive learning to perceive the same PSLs in different tissue images and different PSLs within the same tissue image.
no code implementations • 29 Nov 2021 • Jingjing Jiang, Ziyi Liu, Nanning Zheng
Video Question Answering (VideoQA), aiming to correctly answer the given question based on understanding multi-modal video content, is challenging due to the rich video content.
no code implementations • 9 Nov 2021 • Ziyi Liu, JiaQi Zhang, Yongshuai Hou, Xinran Zhang, Ge Li, Yang Xiang
Background: Electronic Health Records (EHRs) contain rich information of patients' health history, which usually include both structured and unstructured data.
no code implementations • 15 Oct 2021 • Ziyi Liu, Minghui Liao, Fulin Luo, Bo Du
This method constructs the graph by the similarity relationship between cells and adopts GCN to analyze the neighbor embedding information of samples, which makes the similar cell closer to each other on the 2D scatter plot.
1 code implementation • 24 Jul 2021 • Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng
In this paper, we formulate OOD generalization in VQA as a compositional generalization problem and propose a graph generative modeling-based training scheme (X-GGM) to implicitly model the problem.
1 code implementation • 5 Jul 2021 • Ziyi Liu, Jie Yang, Svetlana Yanushkevich, Orly Yadid-Pecht
Embedded systems have a huge market, and utilizing DCNNs' powerful functionality into them will further reduce human intervention.
no code implementations • 30 Mar 2021 • Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua
To address this challenge, we introduce a framework that learns two feature subspaces respectively for actions and their context.
no code implementations • 28 Mar 2021 • Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua
In this paper, we introduce an Action-Context Separation Network (ACSNet) that explicitly takes into account context for accurate action localization.
Ranked #8 on
Weakly Supervised Action Localization
on THUMOS’14
Video Polyp Segmentation
Weakly Supervised Action Localization
no code implementations • 2 Feb 2021 • Jie Yang, Mengchen Lin, Ziyi Liu, Ulian Shahnovich, Orly Yadid-Pecht
It is especially crucial for mobile devices because most of the images taken today are from mobile phones, hence such technology is highly demanded in the consumer market of mobile devices and is essential for a good customer experience.
1 code implementation • 31 Jan 2021 • Jie Yang, Ziyi Liu, Mengchen Lin, Svetlana Yanushkevich, Orly Yadid-Pecht
The reformulated Laplacian pyramid always decompose a WDR image into two frequency bands where the low-frequency band is global feature-oriented, and the high-frequency band is local feature-oriented.
1 code implementation • 31 Jan 2021 • Jie Yang, Ziyi Liu, Ulian Shahnovich, Orly Yadid-Pecht
HVS perceives luminance differently when under different adaptation levels, and therefore our algorithm uses functions built upon different scales to tone map pixels to different values.
no code implementations • 11 Jan 2021 • Ziyi Liu, Jie Yang, Mengchen Lin, Kenneth Kam Fai Lai, Svetlana Yanushkevich, Orly Yadid-Pecht
Furthermore, we show the effect of different face detection procedures on the WDRIs in our database.
no code implementations • 8 Jan 2021 • Ziyi Liu
The dynamic range of our normal life can exceeds 120 dB, however, the smart-phone cameras and the conventional digital cameras can only capture a dynamic range of 90 dB, which sometimes leads to loss of details for the recorded image.
no code implementations • EMNLP (Louhi) 2020 • Ziyi Liu, Giannis Karamanolakis, Daniel Hsu, Luis Gravano
To improve performance without extra annotations, we create artificial training documents in the target language through machine translation and train mBERT jointly for the source (English) and target language.
no code implementations • ICCV 2019 • Ziyi Liu, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua
To address this challenge, we propose the Contrast-based Localization EvaluAtioN Network (CleanNet) with our new action proposal evaluator, which provides pseudo-supervision by leveraging the temporal contrast in snippet-level action classification predictions.
Ranked #15 on
Weakly Supervised Action Localization
on ActivityNet-1.2
(mAP@0.5 metric)
no code implementations • 19 Mar 2018 • Jinliang Zang, Le Wang, Ziyi Liu, Qilin Zhang, Zhenxing Niu, Gang Hua, Nanning Zheng
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs).
no code implementations • 1 May 2017 • Ziyi Liu, Siyu Yu, Xiao Wang, Nanning Zheng
Experiments show that our unsupervised approach is efficient and robust for detecting drivable area for self-driving cars.