Search Results for author: Yuqi Liu

Found 21 papers, 16 papers with code

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

2 code implementations9 Mar 2025 Yuqi Liu, Bohao Peng, Zhisheng Zhong, Zihao Yue, Fanbin Lu, Bei Yu, Jiaya Jia

Traditional methods for reasoning segmentation rely on supervised fine-tuning with categorical labels and simple descriptions, limiting its out-of-domain generalization and lacking explicit reasoning processes.

Domain Generalization Open Vocabulary Object Detection +6

Improving Similar Case Retrieval Ranking Performance By Revisiting RankSVM

1 code implementation16 Feb 2025 Yuqi Liu, Yan Zheng

In our paper, however, we try to improve the ranking performance of current models from the perspective of learning to rank instead of language models.

Learning-To-Rank Retrieval

SVFR: A Unified Framework for Generalized Video Face Restoration

1 code implementation2 Jan 2025 Zhiyao Wang, Xu Chen, Chengming Xu, Junwei Zhu, Xiaobin Hu, Jiangning Zhang, Chengjie Wang, Yuqi Liu, Yiyi Zhou, Rongrong Ji

In this paper, we propose a novel approach for the Generalized Video Face Restoration (GVFR) task, which integrates video BFR, inpainting, and colorization tasks that we empirically show to benefit each other.

Colorization Representation Learning

Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval

1 code implementation26 Dec 2024 Yang Du, Yuqi Liu, Qin Jin

We further enhance the use of harder-negatives in model training, and benchmark a variety of video-text models on RTime.

Image-text Retrieval Information Retrieval +2

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

1 code implementation12 Dec 2024 Zhisheng Zhong, Chengyao Wang, Yuqi Liu, Senqiao Yang, Longxiang Tang, Yuechen Zhang, Jingyao Li, Tianyuan Qu, Yanwei Li, Yukang Chen, Shaozuo Yu, Sitong Wu, Eric Lo, Shu Liu, Jiaya Jia

As Multi-modal Large Language Models (MLLMs) evolve, expanding beyond single-domain capabilities is essential to meet the demands for more versatile and efficient AI.

EgoSchema +6

Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection

1 code implementation1 Nov 2024 Zhipeng Wei, Yuqi Liu, N. Benjamin Erichson

To exploit this bias in Judge LLMs, we introduce the Emoji Attack -- a method that places emojis within tokens to increase the embedding differences between sub-tokens and their originals.

Few-Shot Learning

SCOPE: Sign Language Contextual Processing with Embedding from LLMs

1 code implementation2 Sep 2024 Yuqi Liu, Wenqian Zhang, Sihan Ren, Chengyu Huang, Jingyi Yu, Lan Xu

Current methods in vision-based sign language recognition (SLR) and translation (SLT) struggle with dialogue scenes due to limited dataset diversity and the neglect of contextually relevant information.

Diversity Language Modeling +3

Toward Open-Set Human Object Interaction Detection

1 code implementation Proceedings of the AAAI Conference on Artificial Intelligence 2024 Mingrui Wu, Yuqi Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji

To address this challenge, we introduce a simple Disentangled HOI Detection (DHD) model for detecting novel relationships by integrating an open-set object detector with a Visual Language Model (VLM).

Contrastive Learning Human-Object Interaction Detection +3

Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases

1 code implementation7 Mar 2024 Yuqi Liu, Guanyi Chen, Kees Van Deemter

In this paper, we focus on the omission of the plurality and definiteness markers in Chinese noun phrases (NPs) to investigate the predictability of their intended meaning given the contexts.

Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising

1 code implementation CVPR 2023 Zehua Sheng, Zhu Yu, Xiongwei Liu, Si-Yuan Cao, Yuqi Liu, Hui-Liang Shen, Huaqi Zhang

Instead of aligning the input images via conventional stereo matching, we aggregate structures from the guidance image to estimate a clean structure map for the noisy target image, which is then used to regress the final denoising result with a spatially variant linear representation model.

Deblurring Denoising +2

TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

1 code implementation16 Jul 2022 Yuqi Liu, Pengfei Xiong, Luhui Xu, Shengming Cao, Qin Jin

In this paper, we propose Token Shift and Selection Network (TS2-Net), a novel token shift and selection transformer architecture, which dynamically adjusts the token sequence and selects informative tokens in both temporal and spatial dimensions from input video samples.

Retrieval Video Retrieval

An Efficient End-to-End 3D Voxel Reconstruction based on Neural Architecture Search

1 code implementation27 Feb 2022 Yongdong Huang, Yuanzhan Li, Xulong Cao, Siyu Zhang, Shen Cai, Ting Lu, Jie Wang, Yuqi Liu

However, many previous works employ neural networks with fixed architecture and size to represent different 3D objects, which lead to excessive network parameters for simple objects and limited reconstruction accuracy for complex objects.

Binary Classification Neural Architecture Search +1

Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic

no code implementations19 Feb 2022 Yuqi Liu, Qichao Zhang, Dongbin Zhao

In this paper, we formulate a multi-task safe reinforcement learning with social attention to improve the safety and efficiency when interacting with other traffic participants.

Autonomous Driving reinforcement-learning +3

High-fidelity 3D Model Compression based on Key Spheres

1 code implementation19 Jan 2022 Yuanzhan Li, Yuqi Liu, Yujie Lu, Siyu Zhang, Shen Cai, Yanting Zhang

Compared to previous works, our method achieves the high-fidelity and high-compression 3D object coding and reconstruction.

Model Compression Object +1

Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network

no code implementations6 Jan 2022 Siawpeng Er, Edward Liu, Minshuo Chen, Yan Li, Yuqi Liu, Tuo Zhao, Hua Wang

This paper presents a deep learning assisted synthesis approach for direct end-to-end generation of RF/mm-wave passive matching network with 3D EM structures.

A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios

1 code implementation22 Sep 2021 Yuqi Liu, Qichao Zhang, Dongbin Zhao

The test benchmark and baselines are to provide a fair and comprehensive training and testing platform for the study of RL for autonomous driving in the intersection scenario, advancing the progress of RL-based methods for intersection autonomous driving control.

Autonomous Driving reinforcement-learning +2

Spherical Transformer: Adapting Spherical Signal to CNNs

no code implementations11 Jan 2021 Yuqi Liu, Yin Wang, Haikuan Du, Shen Cai

To this end, the proposed method first uses local structured sampling methods such as HEALPix to construct a transformer grid by using the information of spherical points and its adjacent points, and then transforms the spherical signals to the vectors through the grid.

3D Object Classification General Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.