Search Results for author: Yuchen Yang

Found 44 papers, 21 papers with code

Inductive Link Prediction on N-ary Relational Facts via Semantic Hypergraph Reasoning

1 code implementation • 26 Mar 2025 • Gongzhu Yin, Hongli Zhang, Yuchen Yang, Yi Luo

The results highlight the superiority of the n-ary subgraph reasoning framework and the exceptional inductive ability of NS-HART.

Inductive Link Prediction, Knowledge Graphs

SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic

1 code implementation • 9 Mar 2025 • Yuchen Yang, Wei Wang, Yifei Liu, Linfeng Dong, Hao Wu, Mingxin Zhang, Zhihang Zhong, Xiao Sun

This framework aligns with the feature extraction paradigm in RGB-based methods, enabling direct evaluation of RGB-based models on skeleton-based benchmarks.

Group Activity Recognition, Temporal Group Activity Localization

DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models

no code implementations • 5 Mar 2025 • YiQiu Guo, Yuchen Yang, Zhe Chen, Pingjie Wang, Yusheng Liao, Ya Zhang, Yanfeng Wang, Yu Wang

The reliability of large language models remains a critical challenge, particularly due to their susceptibility to hallucinations and factual inaccuracies during text generation.

Hallucination, Text Generation

Jailbreaking Safeguarded Text-to-Image Models via Large Language Models

no code implementations • 3 Mar 2025 • Zhengyuan Jiang, Yuepeng Hu, Yuchen Yang, Yinzhi Cao, Neil Zhenqiang Gong

Text-to-Image models may generate harmful content, such as pornographic images, particularly when unsafe prompts are submitted.

Language Modeling, Language Modelling, +1

Demographic Attributes Prediction from Speech Using WavLM Embeddings

no code implementations • 17 Feb 2025 • Yuchen Yang, Thomas Thebaud, Najim Dehak

This paper introduces a general classifier based on WavLM features, to infer demographic characteristics, such as age, gender, native language, education, and country, from speech.

Diversity, Gender Classification, +1

HAUR: Human Annotation Understanding and Recognition Through Text-Heavy Images

no code implementations • 24 Dec 2024 • Yuchen Yang, Haoran Yan, Yanhao Chen, Qingqiang Wu, Qingqi Hong

As part of this effort, we introduce the Human Annotation Understanding and Recognition-5 (HAUR-5) dataset, which encompasses five common types of human annotations.

Optical Character Recognition (OCR), Question Answering, +1

Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation

no code implementations • 12 Dec 2024 • Lianrui Mu, Xingze Zhou, Wenjie Zheng, Jiangnan Ye, Xiaoyu Liang, Yuchen Yang, Jianhong Bai, Jiedong Zhuang, Haoji Hu

Existing methods often fail to maintain facial feature consistency due to mismatches between the facial landmarks extracted from source videos and the target facial features in the reference image.

Video Generation

X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation

1 code implementation • 20 Nov 2024 • Yuchen Yang, Xuanyi Liu, Xing Gao, Zhihang Zhong, Xiao Sun

Recent unsupervised methods for monocular 3D pose estimation have endeavored to reduce dependence on limited annotated 3D data, but most are solely formulated in 2D space, overlooking the inherent depth ambiguity issue.

3D Pose Estimation

Pseudo-Probability Unlearning: Towards Efficient and Privacy-Preserving Machine Unlearning

no code implementations • 4 Nov 2024 • Zihao Zhao, Yijiang Li, Yuchen Yang, Wenqing Zhang, Nuno Vasconcelos, Yinzhi Cao

Machine unlearning, which enables a trained model to forget specific data, is crucial for addressing biased data and adhering to privacy regulations like the General Data Protection Regulation (GDPR)'s "right to be forgotten".

Machine Unlearning, Privacy Preserving

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

no code implementations • 31 Oct 2024 • Yuchen Yang, Shubham Ugare, Yifan Zhao, Gagandeep Singh, Sasa Misailovic

Mixed precision quantization has become an important technique for optimizing the execution of deep neural networks (DNNs).

Quantization

Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding

no code implementations • 10 Sep 2024 • Xiaoyu Liang, Jiayuan Yu, Lianrui Mu, Jiedong Zhuang, Jiaqi Hu, Yuchen Yang, Jiangnan Ye, Lu Lu, Jian Chen, Haoji Hu

Concurrently, the visual branch focuses on the selection of significant tokens, refining the attention mechanism to highlight the primary subject.

Hallucination, Image Captioning, +2

CL-DiffPhyCon: Closed-loop Diffusion Control of Complex Physical Systems

1 code implementation • 31 Jul 2024 • Long Wei, Haodong Feng, Yuchen Yang, Ruiqi Feng, Peiyan Hu, Xiang Zheng, Tao Zhang, Dixia Fan, Tailin Wu

The results demonstrate that CL-DiffPhyCon achieves superior control performance with significant improvements in sampling efficiency.

Denoising

Towards Scale-Aware Full Surround Monodepth with Transformers

no code implementations • 15 Jul 2024 • Yuchen Yang, Xinyi Wang, Dong Li, Lu Tian, Ashish Sirasao, Xun Yang

Full surround monodepth (FSM) methods can learn from multiple camera views simultaneously in a self-supervised manner to predict the scale-aware depth, which is more practical for real-world applications in contrast to scale-ambiguous depth from a standalone monocular camera.

Depth Estimation

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

1 code implementation • 14 Jul 2024 • Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo

In the induction stage, the LLM is fed with few-shot normal reference samples and then summarizes these normal patterns to induce a set of rules for detecting anomalies.

Anomaly Detection, Video Anomaly Detection

TPIA: Towards Target-specific Prompt Injection Attack against Code-oriented Large Language Models

no code implementations • 12 Jul 2024 • Yuchen Yang, Hongwei Yao, Bingrun Yang, Yiling He, Yiming Li, Tianwei Zhang, Zhan Qin

We show that our TPIA can successfully attack three representative open-source Code LLMs (with an attack success rate of up to 97.9%) and two mainstream commercial Code LLM-integrated applications (with an attack success rate of over 90%) in all threat cases, using only a 12-token non-functional perturbation.

Code Completion

Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation

1 code implementation • 24 Jun 2024 • Yuchen Yang, Yingdong Shi, Cheems Wang, XianTong Zhen, Yuxuan Shi, Jun Xu

Fine-tuning pretrained large models for downstream tasks is an important problem, but it suffers from huge memory overhead due to large-scale parameters.

Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset

no code implementations • 19 Jun 2024 • Yuchen Yang, Yingxuan Duan

The method's effectiveness in improving language-video representation is evaluated through text-video retrieval using the MSR-VTT dataset and several multi-modal retrieval models.

Language Modeling, Language Modelling, +5

Monocular Localization with Semantics Map for Autonomous Vehicles

no code implementations • 6 Jun 2024 • Jixiang Wan, Xudong Zhang, Shuzhou Dong, Yuwei Zhang, Yuchen Yang, Ruoxi Wu, Ye Jiang, Jijunnan Li, Jinquan Lin, Ming Yang

To balance efficiency and accuracy, we propose a novel lightweight visual semantic localization algorithm that employs stable semantic features instead of low-level texture features.

Autonomous Driving, Computational Efficiency, +1

SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image Models

1 code implementation • 10 Apr 2024 • Xinfeng Li, Yuchen Yang, Jiangyi Deng, Chen Yan, Yanjiao Chen, Xiaoyu Ji, Wenyuan Xu

Text-to-image (T2I) models, such as Stable Diffusion, have exhibited remarkable performance in generating high-quality images from text descriptions in recent years.

Robust Noisy Correspondence Learning with Equivariant Similarity Consistency

no code implementations • CVPR 2024 • Yuchen Yang, Likai Wang, Erkun Yang, Cheng Deng

Accordingly, we first calculate the ESC by comparing image and text semantic variations between a set of elaborated anchor points and other undivided training data.

Triplet

Improving the Reliability of Large Language Models by Leveraging Uncertainty-Aware In-Context Learning

no code implementations • 7 Oct 2023 • Yuchen Yang, Houqiang Li, Yanfeng Wang, Yu Wang

In this study, we introduce an uncertainty-aware in-context learning framework to empower the model to enhance or reject its output in response to uncertainty.

Hallucination, In-Context Learning, +1

CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation

no code implementations • 17 Sep 2023 • Chen Jiang, Yuchen Yang, Martin Jagersand

To generate high-quality segmentation predictions from referring expressions, we propose CLIPUNetr, a new CLIP-driven referring expression segmentation network.

Decoder, Referring Expression, +3

SneakyPrompt: Jailbreaking Text-to-image Generative Models

1 code implementation • 20 May 2023 • Yuchen Yang, Bo Hui, Haolin Yuan, Neil Gong, Yinzhi Cao

Text-to-image generative models such as Stable Diffusion and DALL·E raise many ethical concerns due to the generation of harmful images such as Not-Safe-for-Work (NSFW) ones.

Reinforcement Learning (RL), Semantic Similarity, +1

SGL: Structure Guidance Learning for Camera Localization

no code implementations • 12 Apr 2023 • Xudong Zhang, Shuang Gao, Xiaohu Nan, Haikuan Ning, Yuchen Yang, Yishan Ping, Jixiang Wan, Shuzhou Dong, Jijunnan Li, Yandong Guo

Camera localization is a classical computer vision task that serves various Artificial Intelligence and Robotics applications.

Camera Localization, Visual Localization

HGV4Risk: Hierarchical Global View-guided Sequence Representation Learning for Risk Prediction

1 code implementation • 15 Nov 2022 • Youru Li, Zhenfeng Zhu, Xiaobo Guo, Shaoshuai Li, Yuchen Yang, Yao Zhao

Moreover, the hierarchical representations at both the instance level and the channel level can be coordinated by heterogeneous information aggregation under the guidance of the global view.

Graph Embedding, Prediction, +2

A Real-Time Fusion Framework for Long-term Visual Localization

no code implementations • 18 Oct 2022 • Yuchen Yang, Xudong Zhang, Shuang Gao, Jixiang Wan, Yishan Ping, Yuyue Liu, Jijunnan Li, Yandong Guo

In this paper, we present an efficient client-server visual localization architecture that fuses global and local pose estimations to realize promising precision and efficiency.

Visual Localization

Multi-modal Graph Learning for Disease Prediction

1 code implementation • 11 Mar 2022 • Shuai Zheng, Zhenfeng Zhu, Zhizhe Liu, Zhenyu Guo, Yang Liu, Yuchen Yang, Yao Zhao

For disease prediction tasks, most existing graph-based methods tend to define the graph manually based on a specified modality (e.g., demographic information), and then integrate other modalities to obtain the patient representation by Graph Representation Learning (GRL).

Disease Prediction, Graph Learning, +3

FCNet: A Convolutional Neural Network for Arbitrary-Length Exposure Estimation

1 code implementation • 5 Mar 2022 • Jin Liang, Yuchen Yang, Anran Zhang, Jun Xu, Hui Li, XianTong Zhen

For image exposure enhancement, the tasks of Single-Exposure Correction (SEC) and Multi-Exposure Fusion (MEF) are widely studied in the image processing community.

Exposure Correction

Pose Refinement with Joint Optimization of Visual Points and Lines

no code implementations • 8 Oct 2021 • Shuang Gao, Jixiang Wan, Yishan Ping, Xudong Zhang, Shuzhou Dong, Yuchen Yang, Haikuan Ning, Jijunnan Li, Yandong Guo

High-precision camera re-localization technology in a pre-established 3D environment map is the basis for many tasks, such as Augmented Reality, Robotics and Autonomous Driving.

Autonomous Driving

Practical Blind Membership Inference Attack via Differential Comparisons

1 code implementation • 5 Jan 2021 • Bo Hui, Yuchen Yang, Haolin Yuan, Philippe Burlina, Neil Zhenqiang Gong, Yinzhi Cao

The success of the former heavily depends on the quality of the shadow model, i.e., the transferability between the shadow and the target; the latter, given only blackbox probing access to the target model, cannot make an effective inference of unknowns, compared with MI attacks using shadow models, due to the insufficient number of qualified samples labeled with ground truth membership information.

Inference Attack, Membership Inference Attack

Big-Data Clustering: K-Means or K-Indicators?

1 code implementation • 3 Jun 2019 • Feiyu Chen, Yuchen Yang, Liwei Xu, Taiping Zhang, Yin Zhang

The K-means algorithm is arguably the most popular data clustering method, commonly applied to processed datasets in some "feature spaces", as in spectral clustering.

Clustering

Efficient Traffic-Sign Recognition with Scale-aware CNN

no code implementations • 31 May 2018 • Yuchen Yang, Shuo Liu, Wei Ma, Qiuyuan Wang, Zheng Liu

The paper presents a Traffic Sign Recognition (TSR) system that can quickly and accurately recognize traffic signs of different sizes in images.

General Classification, Traffic Sign Recognition
