Search Results for author: Yu Yao

Found 37 papers, 21 papers with code

A Sample Efficient Conditional Independence Test in the Presence of Discretization

1 code implementation10 Jun 2025 Boyang Sun, Yu Yao, Xinshuai Dong, Zongfang Liu, Tongliang Liu, Yumou Qiu, Kun Zhang

Motivated by this, this paper introduces a sample-efficient CI test that does not rely on the binarization process.

Binarization

Beyond Optimal Transport: Model-Aligned Coupling for Flow Matching

no code implementations29 May 2025 Yexiong Lin, Yu Yao, Tongliang Liu

However, we observe that such geometry-based couplings do not necessarily align with the model's preferred trajectories, making it difficult to learn the vector field induced by these couplings, which prevents the model from learning straight trajectories.

Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space

1 code implementation13 Mar 2025 Yuheng Liang, Zheyu Wang, Feng Liu, Mingzhou Liu, Yu Yao

Experimental results on the Valence-Arousal (VA) Estimation task of the 8th competition on Affective Behavior Analysis in-the-wild (ABAW) demonstrate that the proposed model achieves valence and arousal scores of 0. 5362 (0. 5036) and 0. 4310 (0. 4119) on the validation (test) set, respectively, outperforming the baseline.

Autonomous Driving Emotion Recognition +1

SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models

1 code implementation28 Feb 2025 Jiawei Zhang, Xuan Yang, Taiqi Wang, Yu Yao, Aleksandr Petiushko, Bo Li

Traditional autonomous driving systems often struggle to connect high-level reasoning with low-level control, leading to suboptimal and sometimes unsafe behaviors.

Attribute Autonomous Driving +2

Flow: Modularized Agentic Workflow Automation

1 code implementation14 Jan 2025 Boye Niu, Yiliao Song, Kai Lian, Yifan Shen, Yu Yao, Kun Zhang, Tongliang Liu

In this paper, we define workflows as an activity-on-vertex (AOV) graph, which allows continuous workflow refinement by LLM agents through dynamic subtask allocation adjustment based on historical performance and previous AOVs.

SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

1 code implementation CVPR 2025 Shaoan Xie, Lingjing Lingjing, Yujia Zheng, Yu Yao, Zeyu Tang, Eric P. Xing, Guangyi Chen, Kun Zhang

On the other hand, directly aligning long captions with images can lead to the retention of entangled details, preventing the model from learning disentangled, atomic concepts -- ultimately limiting its generalization on certain downstream tasks involving short prompts.

Contrastive Learning

Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels

no code implementations9 Dec 2024 Weijie Tu, Weijian Deng, Dylan Campbell, Yu Yao, Jiyang Zheng, Tom Gedeon, Tongliang Liu

As large multimodal models (LMMs) are increasingly deployed across diverse applications, the need for adaptable, real-world model ranking has become paramount.

Question Answering Visual Question Answering

Multi-Dimensional Reconfigurable, Physically Composable Hybrid Diffractive Optical Neural Network

no code implementations8 Nov 2024 Ziang Yin, Yu Yao, Jeff Zhang, Jiaqi Gu

To overcome this challenge, we introduce, for the first time, a multi-dimensional reconfigurable hybrid diffractive ONN system (MDR-HDONN), a physically composable architecture that unlocks a new degree of freedom and unprecedented versatility in DONNs.

Enhanced channel estimation for near-field IRS-aided multi-user MIMO system via a large deep residual network

no code implementations28 Oct 2024 Yan Wang, Yongqiang Li, Minghao Chen, Yu Yao, Feng Shu, Jiangzhou Wang

In this paper, the channel estimation (CE) of intelligent reflecting surface-aided near-field (NF) multi-user communication is investigated.

Denoising Federated Learning

MambaTS: Improved Selective State Space Models for Long-term Time Series Forecasting

1 code implementation26 May 2024 Xiuding Cai, Yaoyao Zhu, Xueyao Wang, Yu Yao

A recent model, Mamba, based on selective state space models (SSMs), has emerged as a competitive alternative to Transformer, offering comparable performance with higher throughput and linear complexity related to sequence length.

Mamba State Space Models +2

A Conditional Independence Test in the Presence of Discretization

1 code implementation26 Apr 2024 Boyang Sun, Yu Yao, Guang-Yuan Hao, Yumou Qiu, Kun Zhang

Applying existing test methods to the observations of $X_1$, $\tilde{X}_2$ and $X_3$ can lead to a false conclusion about the underlying conditional independence of variables $X_1$, $X_2$ and $X_3$.

Causal Discovery

MD-Dose: A diffusion model based on the Mamba for radiation dose prediction

2 code implementations13 Mar 2024 Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yali Shen, Yu Yao

To tackle these challenges, we introduce a novel diffusion model, MD-Dose, based on the Mamba architecture for predicting radiation therapy dose distribution in thoracic cancer patients.

Denoising Mamba

HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation

1 code implementation9 Oct 2023 Yaosen Chen, Yu Yao, Zhiqiang Li, Wei Wang, Yanru Zhang, Han Yang, Xuming Wen

First, FaceEncoder is used to obtain latent code by extracting features from the visual face information taken from the video source containing the face frame. Then, HyperConv, which weighting parameters are updated by HyperNet with the audio features as input, will modify the latent code to synchronize the lip movement with the audio.

Decoder Talking Face Generation

Energy-Guided Diffusion Model for CBCT-to-CT Synthesis

no code implementations7 Aug 2023 Linjie Fu, Xia Li, Xiuding Cai, Dong Miao, Yu Yao, Yali Shen

Cone Beam CT (CBCT) plays a crucial role in Adaptive Radiation Therapy (ART) by accurately providing radiation treatment when organ anatomy changes occur.

Anatomy

PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions

1 code implementation26 Jul 2023 Wenjie Xuan, Shanshan Zhao, Yu Yao, Juhua Liu, Tongliang Liu, Yixin Chen, Bo Du, DaCheng Tao

Exploiting the estimated noise transitions, our model, named PNT-Edge, is able to fit the prediction to clean labels.

Edge Detection

Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control

no code implementations17 Mar 2023 Xiuding Cai, Jiao Chen, Yaoyao Zhu, Beimin Wang, Yu Yao

In this paper, Policy Constraint Q-Learning (PCQL), a data-driven reinforcement learning algorithm for solving the problem of learning anesthesia strategies on real clinical datasets, is proposed.

Q-Learning reinforcement-learning +1

Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation

1 code implementation20 Nov 2022 Xiuding Cai, Yaoyao Zhu, Dong Miao, Linjie Fu, Yu Yao

In this paper, we propose EnCo, a simple but efficient way to maintain the content by constraining the representational similarity in the latent space of patch-level features from the same stage of the \textbf{En}coder and de\textbf{Co}der of the generator.

Contrastive Learning Image-to-Image Translation +1

A Many-ported and Shared Memory Architecture for High-Performance ADAS SoCs

no code implementations13 Sep 2022 Hao Luan, Yu Yao, Chang Huang

A domain specific memory architecture is essential to achieve the above goals.

Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos

no code implementations16 Jun 2022 Lianyang Ma, Yu Yao, Tao Liang, Tongliang Liu

On the whole, the "multi-scale" mechanism is capable of exploiting the different levels of semantic information of each modality which are used for fine-grained crossmodal interactions.

Multimodal Sentiment Analysis

Emulating Quantum Dynamics with Neural Networks via Knowledge Distillation

1 code implementation19 Mar 2022 Yu Yao, Chao Cao, Stephan Haas, Mahak Agarwal, Divyam Khanna, Marcin Abram

We focus on the question of how the emulator learns the rules of quantum dynamics from the curriculum of simple training examples and to which extent it can generalize the acquired knowledge to solve more challenging cases.

Knowledge Distillation

Can Label-Noise Transition Matrix Help to Improve Sample Selection and Label Correction?

no code implementations29 Sep 2021 Yu Yao, Xuefeng Li, Tongliang Liu, Alan Blair, Mingming Gong, Bo Han, Gang Niu, Masashi Sugiyama

Existing methods for learning with noisy labels can be generally divided into two categories: (1) sample selection and label correction based on the memorization effect of neural networks; (2) loss correction with the transition matrix.

Learning with noisy labels Memorization

Instance-dependent Label-noise Learning under a Structural Causal Model

2 code implementations NeurIPS 2021 Yu Yao, Tongliang Liu, Mingming Gong, Bo Han, Gang Niu, Kun Zhang

In particular, we show that properly modeling the instances will contribute to the identifiability of the label noise transition matrix and thus lead to a better classifier.

Coupling Intent and Action for Pedestrian Crossing Behavior Prediction

1 code implementation10 May 2021 Yu Yao, Ella Atkins, Matthew Johnson Roberson, Ram Vasudevan, Xiaoxiao Du

In this work, we follow the neuroscience and psychological literature to define pedestrian crossing behavior as a combination of an unobserved inner will (a probabilistic representation of binary intent of crossing vs. not crossing) and a set of multi-class actions (e. g., walking, standing, etc.).

Action Detection Autonomous Vehicles +3

Markov chain Monte Carlo methods for hierarchical clustering of dynamic causal models

no code implementations10 Dec 2020 Yu Yao, Klaas E. Stephan

Specifically, we introduce a class of proposal distributions which aims to capture the interdependencies between the parameters of the clustering and subject-wise generative models and helps to reduce random walk behaviour of the MCMC scheme.

Clustering

Unsupervised Regionalization of Particle-resolved Aerosol Mixing State Indices on the Global Scale

no code implementations6 Dec 2020 Zhonghua Zheng, Joseph Ching, Jeffrey H. Curtis, Yu Yao, Peng Xu, Matthew West, Nicole Riemer

Here we developed a simple but effective unsupervised learning approach to regionalize predictions of global aerosol mixing state indices.

BiTraP: Bi-directional Pedestrian Trajectory Prediction with Multi-modal Goal Estimation

1 code implementation29 Jul 2020 Yu Yao, Ella Atkins, Matthew Johnson-Roberson, Ram Vasudevan, Xiaoxiao Du

BiTraP estimates the goal (end-point) of trajectories and introduces a novel bi-directional decoder to improve longer-term trajectory prediction accuracy.

Autonomous Driving Collision Avoidance +6

Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning

1 code implementation NeurIPS 2020 Yu Yao, Tongliang Liu, Bo Han, Mingming Gong, Jiankang Deng, Gang Niu, Masashi Sugiyama

By this intermediate class, the original transition matrix can then be factorized into the product of two easy-to-estimate transition matrices.

Rethinking Class-Prior Estimation for Positive-Unlabeled Learning

no code implementations ICLR 2022 Yu Yao, Tongliang Liu, Bo Han, Mingming Gong, Gang Niu, Masashi Sugiyama, DaCheng Tao

Hitherto, the distributional-assumption-free CPE methods rely on a critical assumption that the support of the positive data distribution cannot be contained in the support of the negative data distribution.

valid

Unsupervised Traffic Accident Detection in First-Person Videos

2 code implementations2 Mar 2019 Yu Yao, Mingze Xu, Yuchen Wang, David J. Crandall, Ella M. Atkins

Recognizing abnormal events such as traffic violations and accidents in natural driving scenes is essential for successful autonomous driving and advanced driver assistance systems.

Autonomous Driving Object Localization +4

Egocentric Vision-based Future Vehicle Localization for Intelligent Driving Assistance Systems

2 code implementations19 Sep 2018 Yu Yao, Mingze Xu, Chiho Choi, David J. Crandall, Ella M. Atkins, Behzad Dariush

Predicting the future location of vehicles is essential for safety-critical applications such as advanced driver assistance systems (ADAS) and autonomous driving.

Autonomous Driving Decoder +3

Cannot find the paper you are looking for? You can Submit a new open access paper.