Search Results for author: Hao Liang

Found 41 papers, 15 papers with code

Fast-RF-Shimming: Accelerate RF Shimming in 7T MRI using Deep Learning

no code implementations21 Jan 2025 Zhengyi Lu, Hao Liang, Ming Lu, Xiao Wang, Xinqiang Yan, Yuankai Huo

This approach offers a faster and more efficient solution to RF shimming challenges in UHF MRI.

SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation

no code implementations17 Dec 2024 Shuangping Huang, Hao Liang, Qingfeng Wang, Chulong Zhong, Zijian Zhou, Miaojing Shi

First, to avoid the potential conflict between binary and semantic predictions, we introduce a semantic-aware decoder independent of SAM's original decoder, specialized for both semantic segmentation on the prompted object and classification on unprompted objects in images.

Decoder Image Segmentation +3

MC-LLaVA: Multi-Concept Personalized Vision-Language Model

1 code implementation18 Nov 2024 Ruichuan An, Sihan Yang, Ming Lu, Kai Zeng, Yulin Luo, Ying Chen, Jiajun Cao, Hao Liang, Qi She, Shanghang Zhang, Wentao Zhang

Specifically, MC-LLaVA uses a joint training strategy incorporating multiple concepts in a single training step, allowing VLMs to perform accurately in multi-concept personalization.

Language Modeling Language Modelling +2

EVQAScore: Efficient Video Question Answering Data Evaluation

no code implementations11 Nov 2024 Hao Liang, Zirong Chen, Wentao Zhang

To address this gap, we introduce EVQAScore, a reference-free method that leverages keyword extraction to assess both video caption and video QA data quality.

Keyword Extraction Question Answering +2

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

no code implementations28 Oct 2024 Qintong Zhang, Victor Shea-Jay Huang, Bin Wang, Junyuan Zhang, Zhengren Wang, Hao Liang, Shawn Wang, Matthieu Lin, Conghui He, Wentao Zhang

Document parsing is essential for converting unstructured and semi-structured documents-such as contracts, academic papers, and invoices-into structured, machine-readable data.

Data Integration Knowledge Base Construction

Baichuan Alignment Technical Report

no code implementations19 Oct 2024 MingAn Lin, Fan Yang, Yanjun Shen, Haoze Sun, Tianpeng Li, Chenzheng Zhu, Tao Zhang, Miao Zheng, Xu Li, Yijie Zhou, Mingyang Chen, Yanzhao Qin, Youquan Li, Hao Liang, Fei Li, Yadong Li, Mang Wang, Guosheng Dong, Kun Fang, Jianhua Xu, Bin Cui, Wentao Zhang, Zenan Zhou, WeiPeng Chen

Baichuan-Instruct is an internal model, while Qwen2-Nova-72B and Llama3-PBM-Nova-70B are instruct versions of the Qwen2-72B and Llama-3-70B base models, optimized through Baichuan Alignment.

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

1 code implementation16 Oct 2024 Mingyang Chen, Haoze Sun, Tianpeng Li, Fan Yang, Hao Liang, Keer Lu, Bin Cui, Wentao Zhang, Zenan Zhou, WeiPeng Chen

While current research on function calling by LLMs primarily focuses on single-turn interactions, this paper addresses the overlooked necessity for LLMs to engage in multi-turn function calling--critical for handling compositional, real-world queries that require planning with functions but not only use functions.

8k

Data Proportion Detection for Optimized Data Management for Large Language Models

1 code implementation26 Sep 2024 Hao Liang, Keshi Zhao, Yajie Yang, Bin Cui, Guosheng Dong, Zenan Zhou, Wentao Zhang

Large language models (LLMs) have demonstrated exceptional performance across a wide range of tasks and domains, with data preparation playing a critical role in achieving these results.

Management

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

1 code implementation2 Aug 2024 Yanjun Shen, Wenjing Luo, Yan Zhang, Hao Liang, Tao Zhang, Fan Yang, MingAn Lin, Yujing Qiao, WeiPeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou

The adeptness of Large Language Models (LLMs) in comprehending and following natural language instructions is critical for their deployment in sophisticated real-world applications.

Are Bigger Encoders Always Better in Vision Large Models?

no code implementations1 Aug 2024 Bozhou Li, Hao Liang, Zimo Meng, Wentao Zhang

Moreover, we analyzed the effects of LLM backbone parameter size and data quality on the pretraining outcomes.

Language Modeling Language Modelling +1

Synth-Empathy: Towards High-Quality Synthetic Empathy Data

1 code implementation31 Jul 2024 Hao Liang, Linzhuang Sun, Jingxuan Wei, Xijie Huang, Linkun Sun, Bihui Yu, Conghui He, Wentao Zhang

In recent years, with the rapid advancements in large language models (LLMs), achieving excellent empathetic response capabilities has become a crucial prerequisite.

Diversity

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

1 code implementation30 Jul 2024 Zheng Liu, Hao Liang, Xijie Huang, Wentao Xiong, Qinhan Yu, Linzhuang Sun, Chong Chen, Conghui He, Bin Cui, Wentao Zhang

Crucially, our method's reliance on purely generated data ensures the preservation of privacy, achieving SoTA performance with just 100k data points (only 18% of the official dataset size).

Caption Generation Question Answering

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

no code implementations8 Jul 2024 Miao Zheng, Hao Liang, Fan Yang, Haoze Sun, Tianpeng Li, Lingchu Xiong, Yan Zhang, Youzhen Wu, Kun Li, Yanjun Shen, MingAn Lin, Tao Zhang, Guosheng Dong, Yujing Qiao, Kun Fang, WeiPeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou

This combination of high performance, efficiency, and flexibility makes PAS a valuable system for enhancing the usability and effectiveness of LLMs through improved prompt engineering.

Prompt Engineering

KeyVideoLLM: Towards Large-scale Video Keyframe Selection

no code implementations3 Jul 2024 Hao Liang, Jiapeng Li, Tianyi Bai, Xijie Huang, Linzhuang Sun, Zhengren Wang, Conghui He, Bin Cui, Chong Chen, Wentao Zhang

Recently, with the rise of web videos, managing and understanding large-scale video datasets has become increasingly important.

Data Compression Management +3

Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data

no code implementations2 Jul 2024 Linzhuang Sun, Hao Liang, Jingxuan Wei, Linkun Sun, Bihui Yu, Bin Cui, Wentao Zhang

By integrating sensibility and rationality data with a MoE structure, we achieve even higher performance, demonstrating the effectiveness of our Efficient-Empathy algorithm.

Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

1 code implementation26 May 2024 Xijie Huang, Xinyuan Wang, Hantao Zhang, Yinghao Zhu, Jiawen Xi, Jingkun An, Hao Wang, Hao Liang, Chengwei Pan

Security concerns related to Large Language Models (LLMs) have been extensively explored, yet the safety implications for Multimodal Large Language Models (MLLMs), particularly in medical contexts (MedMLLMs), remain insufficiently studied.

A Survey of Multimodal Large Language Model from A Data-centric Perspective

1 code implementation26 May 2024 Tianyi Bai, Hao Liang, Binwang Wan, Yanran Xu, Xi Li, Shiyu Li, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Ping Huang, Jiulong Shan, Conghui He, Binhang Yuan, Wentao Zhang

Multimodal large language models (MLLMs) enhance the capabilities of standard large language models by integrating and processing data from multiple modalities, including text, vision, audio, video, and 3D environments.

Language Modeling Language Modelling +3

Optimistic Thompson Sampling for No-Regret Learning in Unknown Games

no code implementations7 Feb 2024 Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo

This work tackles the complexities of multi-player scenarios in \emph{unknown games}, where the primary challenge lies in navigating the uncertainty of the environment through bandit feedback alongside strategic decision-making.

Decision Making Thompson Sampling

MC-NeRF: Multi-Camera Neural Radiance Fields for Multi-Camera Image Acquisition Systems

no code implementations14 Sep 2023 Yu Gao, Lutong Su, Hao Liang, Yufeng Yue, Yi Yang, Mengyin Fu

In this paper, we propose MC-NeRF, a method that enables joint optimization of both intrinsic and extrinsic parameters alongside NeRF.

NeRF

A Distribution Optimization Framework for Confidence Bounds of Risk Measures

no code implementations12 Jun 2023 Hao Liang, Zhi-Quan Luo

Unlike traditional approaches that add or subtract a confidence radius from the empirical risk measures, our proposed schemes evaluate a specific transformation of the empirical distribution based on the distance.

Decision Making

Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures

no code implementations4 Jun 2023 Hao Liang, Zhi-Quan Luo

We study finite episodic Markov decision processes incorporating dynamic risk measures to capture risk sensitivity.

reinforcement-learning

Visualizing chest X-ray dataset biases using GANs

no code implementations29 Apr 2023 Hao Liang, Kevin Ni, Guha Balakrishnan

Recent work demonstrates that images from various chest X-ray datasets contain visual features that are strongly correlated with protected demographic attributes like race and gender.

Fairness

X-ray Recognition: Patient identification from X-rays using a contrastive objective

no code implementations29 Apr 2023 Hao Liang, Kevin Ni, Guha Balakrishnan

Recent research demonstrates that deep learning models are capable of precisely extracting bio-information (e. g. race, gender and age) from patients' Chest X-Rays (CXRs).

Deep Learning

Linking convolutional kernel size to generalization bias in face analysis CNNs

no code implementations7 Feb 2023 Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan

Our framework is experimental, in that we train several versions of a network with an intervention to a specific hyperparameter, and measure the resulting causal effect of this choice on performance bias when a particular out-of-distribution image perturbation is applied.

Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds

no code implementations25 Oct 2022 Hao Liang, Zhi-Quan Luo

We study the regret guarantee for risk-sensitive reinforcement learning (RSRL) via distributional reinforcement learning (DRL) methods.

Computational Efficiency Distributional Reinforcement Learning +3

Acoustic-Net: A Novel Neural Network for Sound Localization and Quantification

no code implementations31 Mar 2022 Guanxing Zhou, Hao Liang, Xinghao Ding, Yue Huang, Xiaotong Tu, Saqlain Abbas

Acoustic source localization has been applied in different fields, such as aeronautics and ocean science, generally using multiple microphones array data to reconstruct the source location.

RareGAN: Generating Samples for Rare Classes

1 code implementation20 Mar 2022 Zinan Lin, Hao Liang, Giulia Fanti, Vyas Sekar

We study the problem of learning generative adversarial networks (GANs) for a rare class of an unlabeled dataset subject to a labeling budget.

Active Learning Diversity

False Data Injection Attack on Electric Vehicle-Assisted Voltage Regulation

no code implementations9 Mar 2022 YuAn Liu, Omid Ardakanian, Ioanis Nikolaidis, Hao Liang

With the large scale penetration of electric vehicles (EVs) and the advent of bidirectional chargers, EV aggregators will become a major player in the voltage regulation market.

Capacity Estimation Stochastic Optimization

Controlled AutoEncoders to Generate Faces from Voices

no code implementations16 Jul 2021 Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh

With this in perspective, we propose a framework to morph a target face in response to a given voice in a way that facial features are implicitly guided by learned voice-face correlation in this paper.

MORPH Retrieval

Homomorphisms to 3-manifold groups

no code implementations10 Mar 2021 Daniel Groves, Michael Hull, Hao Liang

We prove foundational results about the set of homomorphisms from a finitely generated group to the collection of all fundamental groups of compact 3-manifolds and answer questions of Reid-Wang-Zhou and Agol-Liu.

Geometric Topology Group Theory

Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization

1 code implementation NeurIPS 2020 Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, H. Vincent Poor

In federated optimization, heterogeneity in the clients' local datasets and computation speeds results in large variations in the number of local updates performed by each client in each communication round.

DTCA: Decision Tree-based Co-Attention Networks for Explainable Claim Verification

no code implementations ACL 2020 Lianwei Wu, Yuan Rao, Yongqiang Zhao, Hao Liang, Ambreen Nazir

Simultaneously, the discovered evidence only roughly aims at the interpretability of the whole sequence of claims but insufficient to focus on the false parts of claims.

Claim Verification

Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD

1 code implementation21 Feb 2020 Jianyu Wang, Hao Liang, Gauri Joshi

In this paper, we propose an algorithmic approach named Overlap-Local-SGD (and its momentum variant) to overlap the communication and computation so as to speedup the distributed training procedure.

Cannot find the paper you are looking for? You can Submit a new open access paper.