Search Results for author: Zhimin Li

Found 9 papers, 0 papers with code

Visual Interrogation of Attention-Based Models for Natural Language Inference and Machine Comprehension

no code implementations EMNLP 2018 Shusen Liu, Tao Li, Zhimin Li, Vivek Srikumar, Valerio Pascucci, Peer-Timo Bremer

Neural networks models have gained unprecedented popularity in natural language processing due to their state-of-the-art performance and the flexible end-to-end training scheme.

Decision Making Natural Language Inference +1

Overview of Tencent Multi-modal Ads Video Understanding Challenge

no code implementations16 Sep 2021 Zhenzhi Wang, Liyu Wu, Zhimin Li, Jiangfeng Xiong, Qinglin Lu

Our challenge includes two tasks: video structuring in the temporal dimension and multi-modal video classification.

Multi-Label Classification Video Classification +1

Effective Actor-centric Human-object Interaction Detection

no code implementations24 Feb 2022 Kunlun Xu, Zhimin Li, Zhijun Zhang, Leizhen Dong, Wenhui Xu, Luxin Yan, Sheng Zhong, Xu Zou

Moreover, we also use an actor branch to get interaction prediction of the actor and propose a novel composition strategy based on center-point indexing to generate the final HOI prediction.

Human-Object Interaction Detection Object

Category-Aware Transformer Network for Better Human-Object Interaction Detection

no code implementations CVPR 2022 Leizhen Dong, Zhimin Li, Kunlun Xu, Zhijun Zhang, Luxin Yan, Sheng Zhong, Xu Zou

Specifically, the Object Query would be initialized via category priors represented by an external object detection model to yield better performance.

Human-Object Interaction Detection Object +2

"Understanding Robustness Lottery": A Geometric Visual Comparative Analysis of Neural Network Pruning Approaches

no code implementations16 Jun 2022 Zhimin Li, Shusen Liu, Xin Yu, Kailkhura Bhavya, Jie Cao, Diffenderfer James Daniel, Peer-Timo Bremer, Valerio Pascucci

We decomposed and evaluated a set of critical geometric concepts from the common adopted classification loss, and used them to design a visualization system to compare and highlight the impact of pruning on model performance and feature representation.

Network Pruning

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

no code implementations9 Dec 2022 Jie Jiang, Zhimin Li, Jiangfeng Xiong, Rongwei Quan, Qinglin Lu, Wei Liu

Therefore, TAVS is distinguished from previous temporal segmentation datasets due to its multi-modal information, holistic view of categories, and hierarchical granularities.

Multi-Label Classification Scene Segmentation +3

Instance-wise Linearization of Neural Network for Model Interpretation

no code implementations25 Oct 2023 Zhimin Li, Shusen Liu, Kailkhura Bhavya, Timo Bremer, Valerio Pascucci

For a neural network model, the non-linear behavior is often caused by non-linear activation units of a model.

Dimensionality Reduction

AVA: Towards Autonomous Visualization Agents through Visual Perception-Driven Decision-Making

no code implementations7 Dec 2023 Shusen Liu, Haichao Miao, Zhimin Li, Matthew Olson, Valerio Pascucci, Peer-Timo Bremer

With recent advances in multi-modal foundation models, the previously text-only large language models (LLM) have evolved to incorporate visual input, opening up unprecedented opportunities for various applications in visualization.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.