Search Results for author: Haobo Wang

Found 23 papers, 17 papers with code

RECOST: External Knowledge Guided Data-efficient Instruction Tuning

no code implementations27 Feb 2024 Qi Zhang, Yiming Zhang, Haobo Wang, Junbo Zhao

When it comes to datasets synthesized by LLMs, a common scenario in this field, dirty samples will even be selected with a higher probability than other samples.

Re-Ranking

Debiased Sample Selection for Combating Noisy Labels

1 code implementation24 Jan 2024 Qi Wei, Lei Feng, Haobo Wang, Bo An

To address this limitation, we propose a noIse-Tolerant Expert Model (ITEM) for debiased learning in sample selection.

Learning with noisy labels

Energy-based Automated Model Evaluation

1 code implementation23 Jan 2024 Ru Peng, Heming Zou, Haobo Wang, Yawen Zeng, Zenan Huang, Junbo Zhao

The core of the MDE is to establish a meta-distribution statistic, on the information (energy) associated with individual samples, then offer a smoother representation enabled by energy-based learning.

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

1 code implementation27 Nov 2023 Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang

While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context learning in the era of large language models (LLMs), have been proposed and alleviate the labeling burden to some extent, their performances are still subject to human intervention.

Active Learning In-Context Learning

Revisiting the Knowledge Injection Frameworks

no code implementations2 Nov 2023 Peng Fu, Yiming Zhang, Haobo Wang, Weikang Qiu, Junbo Zhao

Briefly, the core of this technique is rooted in an ideological emphasis on the pruning and purification of the external knowledge base to be injected into LLMs.

CAME: Contrastive Automated Model Evaluation

1 code implementation ICCV 2023 Ru Peng, Qiuyang Duan, Haobo Wang, Jiachen Ma, Yanbo Jiang, Yongjun Tu, Xiu Jiang, Junbo Zhao

In this work, we propose Contrastive Automatic Model Evaluation (CAME), a novel AutoEval framework that is rid of involving training set in the loop.

Rethinking Noisy Label Learning in Real-world Annotation Scenarios from the Noise-type Perspective

1 code implementation28 Jul 2023 Renyu Zhu, Haoyu Liu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Haobo Wang

In this paper, we investigate the problem of learning with noisy labels in real-world annotation scenarios, where noise can be categorized into two types: factual noise and ambiguity noise.

Learning with noisy labels

Towards Cross-Table Masked Pretraining for Web Data Mining

1 code implementation10 Jul 2023 Chao Ye, Guoshan Lu, Haobo Wang, Liyao Li, Sai Wu, Gang Chen, Junbo Zhao

Tabular data pervades the landscape of the World Wide Web, playing a foundational role in the digital architecture that underpins online information.

Contrastive Learning

A Generalized Unbiased Risk Estimator for Learning with Augmented Classes

1 code implementation12 Jun 2023 Senlin Shu, Shuo He, Haobo Wang, Hongxin Wei, Tao Xiang, Lei Feng

In this paper, we propose a generalized URE that can be equipped with arbitrary loss functions while maintaining the theoretical guarantees, given unlabeled data for LAC.

Multi-class Classification

Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility

1 code implementation15 May 2023 Wentao Ye, Mingfeng Ou, Tianyi Li, Yipeng chen, Xuetao Ma, Yifan Yanggong, Sai Wu, Jie Fu, Gang Chen, Haobo Wang, Junbo Zhao

With most of the related literature in the era of LLM uncharted, we propose an automated workflow that copes with an upscaled number of queries/responses.

Memorization

Latent Processes Identification From Multi-View Time Series

1 code implementation14 May 2023 Zenan Huang, Haobo Wang, Junbo Zhao, Nenggan Zheng

Understanding the dynamics of time series data typically requires identifying the unique latent factors for data generation, \textit{a. k. a.

Contrastive Learning Time Series

Deep Partial Multi-Label Learning with Graph Disambiguation

no code implementations10 May 2023 Haobo Wang, Shisong Yang, Gengyu Lyu, Weiwei Liu, Tianlei Hu, Ke Chen, Songhe Feng, Gang Chen

In partial multi-label learning (PML), each data example is equipped with a candidate label set, which consists of multiple ground-truth labels and other false-positive labels.

Multi-Label Learning

Controllable Textual Inversion for Personalized Text-to-Image Generation

1 code implementation11 Apr 2023 Jianan Yang, Haobo Wang, YanMing Zhang, Ruixuan Xiao, Sai Wu, Gang Chen, Junbo Zhao

The recent large-scale generative modeling has attained unprecedented performance especially in producing high-fidelity images driven by text prompts.

Active Learning Text-to-Image Generation

iDAG: Invariant DAG Searching for Domain Generalization

1 code implementation ICCV 2023 Zenan Huang, Haobo Wang, Junbo Zhao, Nenggan Zheng

In this work, we first characterize that this failure of conventional ML models in DG is attributed to an inadequate identification of causal structures.

Contrastive Learning Domain Generalization

SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning

1 code implementation21 Sep 2022 Haobo Wang, Mingxuan Xia, Yixuan Li, YUREN MAO, Lei Feng, Gang Chen, Junbo Zhao

Partial-label learning (PLL) is a peculiar weakly-supervised learning task where the training samples are generally associated with a set of candidate labels instead of single ground truth.

Partial Label Learning Weakly-supervised Learning

ProMix: Combating Label Noise via Maximizing Clean Sample Utility

1 code implementation21 Jul 2022 Ruixuan Xiao, Yiwen Dong, Haobo Wang, Lei Feng, Runze Wu, Gang Chen, Junbo Zhao

To overcome the potential side effect of excessive clean set selection procedure, we further devise a novel SSL framework that is able to train balanced and unbiased classifiers on the separated clean and noisy samples.

Learning with noisy labels

PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning

1 code implementation22 Jan 2022 Haobo Wang, Ruixuan Xiao, Yixuan Li, Lei Feng, Gang Niu, Gang Chen, Junbo Zhao

Partial label learning (PLL) is an important problem that allows each training example to be labeled with a coarse candidate set, which well suits many real-world data annotation scenarios with label ambiguity.

Contrastive Learning Partial Label Learning +2

Contrastive Label Disambiguation for Partial Label Learning

1 code implementation ICLR 2022 Haobo Wang, Ruixuan Xiao, Sharon Li, Lei Feng, Gang Niu, Gang Chen, Junbo Zhao

Partial label learning (PLL) is an important problem that allows each training example to be labeled with a coarse candidate set, which well suits many real-world data annotation scenarios with label ambiguity.

Contrastive Learning Partial Label Learning +2

The Emerging Trends of Multi-Label Learning

no code implementations23 Nov 2020 Weiwei Liu, Haobo Wang, Xiaobo Shen, Ivor W. Tsang

Exabytes of data are generated daily by humans, leading to the growing need for new efforts in dealing with the grand challenges for multi-label learning brought by big data.

Classification Extreme Multi-Label Classification +2

Unified framework for modeling multivariate distributions in biological sequences

2 code implementations6 Jun 2019 Justas Dauparas, Haobo Wang, Avi Swartz, Peter Koo, Mor Nitzan, Sergey Ovchinnikov

Revealing the functional sites of biological sequences, such as evolutionary conserved, structurally interacting or co-evolving protein sites, is a fundamental, and yet challenging task.

Quantitative Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.