Search Results for author: Ying Hu

Found 19 papers, 4 papers with code

DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer

no code implementations19 Oct 2024 Ying Hu, Chenyi Zhuang, Pan Gao

Style transfer aims to fuse the artistic representation of a style image with the structural information of a content image.

Style Transfer

Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function

1 code implementation30 Sep 2024 Chenyi Zhuang, Ying Hu, Pan Gao

In this work, we critically examine the limitations of the CLIP text encoder in understanding attributes and investigate how this affects diffusion models.

Attribute Disentanglement

Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE

no code implementations26 Sep 2024 Xun Zhu, Ying Hu, Fanbin Mo, Miao Li, Ji Wu

Multi-modal large language models (MLLMs) have shown impressive capabilities as a general-purpose interface for various visual and linguistic tasks.

Image Classification Multi-Task Learning +5

SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

no code implementations2 Jul 2024 Jintu Zheng, Yi Ding, Qizhe Liu, Yi Cao, Ying Hu, Zenan Wang

Traditional fluorescence staining is phototoxic to live cells, slow, and expensive; thus, the subcellular structure prediction (SSP) from transmitted light (TL) images is emerging as a label-free, faster, low-cost alternative.

Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

no code implementations2 Jun 2024 Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation process.

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

2 code implementations20 May 2024 Junlong Jia, Ying Hu, Xi Weng, Yiming Shi, Miao Li, Xingjian Zhang, Baichuan Zhou, Ziyu Liu, Jie Luo, Lei Huang, Ji Wu

We present TinyLLaVA Factory, an open-source modular codebase for small-scale large multimodal models (LMMs) with a focus on simplicity of code implementations, extensibility of new features, and reproducibility of training results.

Philosophy

Personalized Forgetting Mechanism with Concept-Driven Knowledge Tracing

no code implementations18 Apr 2024 Shanshan Wang, Ying Hu, Xun Yang, Zhongzhou Zhang, Keyang Wang, Xingyi Zhang

To address these problems, we propose a Concept-driven Personalized Forgetting knowledge tracing model (CPF) which integrates hierarchical relationships between knowledge concepts and incorporates students' personalized cognitive abilities.

Knowledge Tracing

TinyLLaVA: A Framework of Small-scale Large Multimodal Models

2 code implementations22 Feb 2024 Baichuan Zhou, Ying Hu, Xi Weng, Junlong Jia, Jie Luo, Xien Liu, Ji Wu, Lei Huang

We present the TinyLLaVA framework that provides a unified perspective in designing and analyzing the small-scale Large Multimodal Models (LMMs).

Visual Question Answering

Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations

1 code implementation7 Jul 2023 Zhongliang Jiang, Yuan Bi, Mingchuan Zhou, Ying Hu, Michael Burke, Nassir Navab

The results demonstrated that the proposed advanced framework can robustly work on a variety of seen and unseen phantoms as well as in-vivo human carotid data.

Navigate

Constrained monotone mean-variance problem with random coefficients

no code implementations29 Dec 2022 Ying Hu, Xiaomin Shi, Zuo Quan Xu

This paper studies the monotone mean-variance (MMV) problem and the classical mean-variance (MV) problem with convex cone trading constraints in a market with random coefficients.

Math

Twin identification over viewpoint change: A deep convolutional neural network surpasses humans

no code implementations12 Jul 2022 Connor J. Parde, Virginia E. Strehle, Vivekjyoti Banerjee, Ying Hu, Jacqueline G. Cavazos, Carlos D. Castillo, Alice J. O'Toole

These findings also contribute to our understanding of DCNN performance for discriminating high-resemblance faces, demonstrate that the DCNN performs at a level at or above humans, and suggest a degree of parity between the features used by humans and the DCNN.

Face Identification

Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation

no code implementations IEEE Signal Processing Letters 2022 Ying Hu, Yadong Chen, Wenzhong Yang, Liang He, Hao Huang

In this paper, we propose a model which combines the complexed spectrogram domain feature and time-domain feature by a cross-domain encoder (CDE) and adopts the hierarchic temporal convolutional network (HTCN) for multiple music sources separation.

Audio Source Separation Music Source Separation +2

A Self-Guided Framework for Radiology Report Generation

no code implementations19 Jun 2022 Jun Li, Shibo Li, Ying Hu, Huiren Tao

Moreover, SGF successfully improves the accuracy and length of medical report generation by incorporating a similarity comparison mechanism that imitates the process of human self-improvement through compar-ative practice.

Image Captioning Medical Report Generation

Quadratic $G$-BSDEs with convex generators and unbounded terminal conditions

no code implementations27 Jan 2021 Ying Hu, Shanjian Tang, Falei Wang

In this paper, we first study one-dimensional quadratic backward stochastic differential equations driven by $G$-Brownian motions ($G$-BSDEs) with unbounded terminal values.

Probability 60H10

Dynamic sensitivity of quantum Rabi model with quantum criticality

no code implementations5 Jan 2021 Ying Hu, Jian Huang, Jin-Feng Huang, Qiong-Tao Xie, Jie-Qiao Liao

We study the dynamic sensitivity of the quantum Rabi model, which exhibits quantum criticality in the finite-component-system case.

Quantum Physics

Consistent Investment of Sophisticated Rank-Dependent Utility Agents in Continuous Time

no code implementations2 Jun 2020 Ying Hu, Hanqing Jin, Xun Yu Zhou

We study portfolio selection in a complete continuous-time market where the preference is dictated by the rank-dependent utility.

Euler class of taut foliations and Dehn filling

no code implementations3 Dec 2019 Ying Hu

Lastly, we prove that given any $\mathbb{Q}$-homology solid torus, the set of slopes for which the corresponding Dehn fillings admit a taut foliation transverse to the core with zero Euler class is nowhere dense in $\mathbb{R}\cup \{\frac{1}{0}\}$.

Geometric Topology 57M50, 57M25, 57R30, 20F60

Cannot find the paper you are looking for? You can Submit a new open access paper.