Search Results for author: Masashi Sugiyama

Found 260 papers, 86 papers with code

Variational Imitation Learning with Diverse-quality Demonstrations

1 code implementation • ICML 2020 • Voot Tangkaratt, Bo Han, Mohammad Emtiyaz Khan, Masashi Sugiyama

Learning from demonstrations can be challenging when the quality of demonstrations is diverse, and even more so when the quality is unknown and there is no additional information to estimate the quality.

Continuous Control Imitation Learning +2

Paper
Code

Accelerating the diffusion-based ensemble sampling by non-reversible dynamics

no code implementations • ICML 2020 • Futoshi Futami, Issei Sato, Masashi Sugiyama

Compared with the naive parallel-chain SGLD that updates multiple particles independently, ensemble methods update particles with their interactions.

Bayesian Inference

Paper
Add Code

Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains

no code implementations • 11 Apr 2024 • Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama

In this paper, we investigate an offline reinforcement learning (RL) problem where datasets are collected from two domains.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

no code implementations • 9 Apr 2024 • Ming-Kun Xie, Jia-Hao Xiao, Pei Peng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

In this paper, we provide a causal inference framework to show that the correlative features caused by the target object and its co-occurring objects can be regarded as a mediator, which has both positive and negative impacts on model predictions.

Causal Inference counterfactual +3

Paper
Add Code

Reinforcement Learning with Options and State Representation

no code implementations • 16 Mar 2024 • Ayoub Ghriss, Masashi Sugiyama, Alessandro Lazaric

The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments.

Decision Making Hierarchical Reinforcement Learning +1

Paper
Add Code

Learning with Noisy Foundation Models

no code implementations • 11 Mar 2024 • Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.

Paper
Add Code

VEC-SBM: Optimal Community Detection with Vectorial Edges Covariates

1 code implementation • 29 Feb 2024 • Guillaume Braun, Masashi Sugiyama

Social networks are often associated with rich side information, such as texts and images.

Community Detection Stochastic Block Model

Paper
Code

Generating Chain-of-Thoughts with a Direct Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought

no code implementations • 10 Feb 2024 • Zhen-Yu Zhang, Siwei Han, Huaxiu Yao, Gang Niu, Masashi Sugiyama

In this paper, motivated by Vapnik's principle, we propose a novel comparison-based CoT generation algorithm that directly identifies the most promising thoughts with the noisy feedback from the LLM.

Language Modelling Large Language Model

Paper
Add Code

Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution

no code implementations • 6 Feb 2024 • Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

Instead, the learner only obtains rewards at the ends of bags, where a bag is defined as a partial sequence of a complete trajectory.

Reinforcement Learning (RL)

Paper
Add Code

A General Framework for Learning from Weak Supervision

1 code implementation • 2 Feb 2024 • Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment.

Weakly-supervised Learning

Paper
Code

Direct Distillation between Different Domains

no code implementations • 12 Jan 2024 • Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

Then, we build a fusion-activation mechanism to transfer the valuable domain-invariant knowledge to the student network, while simultaneously encouraging the adapter within the teacher network to learn the domain-specific knowledge of the target data.

Domain Adaptation Knowledge Distillation

Paper
Add Code

The Selected-completely-at-random Complementary Label is a Practical Weak Supervision for Multi-class Classification

no code implementations • 27 Nov 2023 • Wei Wang, Takashi Ishida, Yu-Jie Zhang, Gang Niu, Masashi Sugiyama

Complementary-label learning is a weakly supervised learning problem in which each training example is associated with one or multiple complementary labels indicating the classes to which it does not belong.

Binary Classification Multi-class Classification +1

Paper
Add Code

Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

no code implementations • 24 Oct 2023 • Shintaro Nakamura, Masashi Sugiyama

We show that the upper bound of the probability of error of the CSA algorithm matches a lower bound up to a logarithmic factor in the exponent.

Paper
Add Code

Atom-Motif Contrastive Transformer for Molecular Property Prediction

no code implementations • 11 Oct 2023 • Wentao Yu, Shuo Chen, Chen Gong, Gang Niu, Masashi Sugiyama

As motifs in a molecule are significant patterns that are of great importance for determining molecular properties (e. g., toxicity and solubility), overlooking motif interactions inevitably hinders the effectiveness of MPP.

Molecular Property Prediction Property Prediction

Paper
Add Code

Thompson Exploration with Best Challenger Rule in Best Arm Identification

no code implementations • 1 Oct 2023 • Jongyeong Lee, Junya Honda, Masashi Sugiyama

This paper studies the fixed-confidence best arm identification (BAI) problem in the bandit framework in the canonical single-parameter exponential models.

Thompson Sampling

Paper
Add Code

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

no code implementations • 29 Sep 2023 • Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks.

Paper
Add Code

Unified Risk Analysis for Weakly Supervised Learning

no code implementations • 15 Sep 2023 • Chao-Kai Chiang, Masashi Sugiyama

The analysis component of the framework, viewed as a decontamination process, provides a systematic method of conducting risk rewrite.

Weakly-supervised Learning

Paper
Add Code

Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

no code implementations • 20 Aug 2023 • Shintaro Nakamura, Masashi Sugiyama

We introduce an algorithm named the Generalized Thompson Sampling Explore (GenTS-Explore) algorithm, which is the first algorithm that can work even when the size of the action set is exponentially large in $d$.

Thompson Sampling

Paper
Add Code

Multi-Label Knowledge Distillation

1 code implementation • ICCV 2023 • Penghui Yang, Ming-Kun Xie, Chen-Chen Zong, Lei Feng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

Existing knowledge distillation methods typically work by imparting the knowledge of output logits or intermediate feature maps from the teacher network to the student network, which is very successful in multi-class single-label learning.

Binary Classification Knowledge Distillation +1

Paper
Code

Distribution Shift Matters for Knowledge Distillation with Webly Collected Images

no code implementations • ICCV 2023 • Jialiang Tang, Shuo Chen, Gang Niu, Masashi Sugiyama, Chen Gong

Knowledge distillation aims to learn a lightweight student network from a pre-trained teacher network.

Contrastive Learning Data-free Knowledge Distillation +2

Paper
Add Code

Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation

no code implementations • 12 Jul 2023 • Ruijiang Dong, Feng Liu, Haoang Chi, Tongliang Liu, Mingming Gong, Gang Niu, Masashi Sugiyama, Bo Han

In this paper, we propose a diversity-enhancing generative network (DEG-Net) for the FHA problem, which can generate diverse unlabeled data with the help of a kernel independence measure: the Hilbert-Schmidt independence criterion (HSIC).

Paper
Add Code

An Optimal Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

no code implementations • 15 Jun 2023 • Shintaro Nakamura, Masashi Sugiyama

In such a case, the R-CPE-MAB can be seen as a special case of the so-called transductive linear bandits.

Decision Making

Paper
Add Code

Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision

no code implementations • 12 Jun 2023 • Yuhao Wu, Xiaobo Xia, Jun Yu, Bo Han, Gang Niu, Masashi Sugiyama, Tongliang Liu

Training a classifier exploiting a huge amount of supervised data is expensive or even prohibited in a situation, where the labeling cost is high.

Binary Classification Pseudo Label

Paper
Add Code

BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise Learning

1 code implementation • 28 May 2023 • Jingfeng Zhang, Bo Song, Haohan Wang, Bo Han, Tongliang Liu, Lei Liu, Masashi Sugiyama

To address the challenge posed by BadLabel, we further propose a robust LNL method that perturbs the labels in an adversarial manner at each epoch to make the loss values of clean and noisy labels again distinguishable.

Paper
Code

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

no code implementations • 22 May 2023 • Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

In this paper, we introduce imprecise label learning (ILL), a framework for the unification of learning with various imprecise label configurations.

Ranked #1 on Learning with noisy labels on mini WebVision 1.0

Learning with noisy labels Partial Label Learning

Paper
Add Code

Enriching Disentanglement: Definitions to Metrics

no code implementations • 19 May 2023 • Yivan Zhang, Masashi Sugiyama

Disentangled representation learning is a challenging task that involves separating multiple factors of variation in complex data.

Disentanglement

Paper
Add Code

Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation

no code implementations • 15 May 2023 • Wei-I Lin, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

Our analysis reveals that the efficiency of implicit label sharing is closely related to the performance of existing CLL models.

Weakly-supervised Learning

Paper
Add Code

A Category-theoretical Meta-analysis of Definitions of Disentanglement

no code implementations • 11 May 2023 • Yivan Zhang, Masashi Sugiyama

Disentangling the factors of variation in data is a fundamental concept in machine learning and has been studied in various ways by different researchers, leading to a multitude of definitions.

Disentanglement Navigate

Paper
Add Code

Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

1 code implementation • 4 May 2023 • Ming-Kun Xie, Jia-Hao Xiao, Hao-Zhe Liu, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

Pseudo-labeling has emerged as a popular and effective approach for utilizing unlabeled data.

Multi-Label Learning Pseudo Label

Paper
Code

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization

1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

To improve transferability, the existing work introduced the standard invariant regularization (SIR) to impose style-independence property to SCL, which can exempt the impact of nuisance style factors in the standard representation.

Contrastive Learning

Paper
Code

Fairness Improves Learning from Noisily Labeled Long-Tailed Data

no code implementations • 22 Mar 2023 • Jiaheng Wei, Zhaowei Zhu, Gang Niu, Tongliang Liu, Sijia Liu, Masashi Sugiyama, Yang Liu

Both long-tailed and noisily labeled data frequently appear in real-world applications and impose significant challenges for learning.

Fairness

Paper
Add Code

The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models

no code implementations • 28 Feb 2023 • Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama

Although the uniform prior is shown to be optimal, we highlight the inherent limitation of its optimality, which is limited to specific parameterizations and emphasizes the significance of the invariance property of priors.

Multi-Armed Bandits Thompson Sampling

Paper
Add Code

Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection

1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

Adversarial contrastive learning (ACL) does not require expensive data annotations but outputs a robust representation that withstands adversarial attacks and also generalizes to a wide range of downstream tasks.

Contrastive Learning

Paper
Code

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

1 code implementation • 6 Feb 2023 • Salah Ghamizi, Jingfeng Zhang, Maxime Cordy, Mike Papadakis, Masashi Sugiyama, Yves Le Traon

While leveraging additional training data is well established to improve adversarial robustness, it incurs the unavoidable cost of data collection and the heavy computation to train models.

Adversarial Robustness Data Augmentation +1

Paper
Code

Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits

no code implementations • 3 Feb 2023 • Jongyeong Lee, Junya Honda, Chao-Kai Chiang, Masashi Sugiyama

In addition to the empirical performance, TS has been shown to achieve asymptotic problem-dependent lower bounds in several models.

Thompson Sampling

Paper
Add Code

Robust computation of optimal transport by $β$-potential regularization

no code implementations • 26 Dec 2022 • Shintaro Nakamura, Han Bao, Masashi Sugiyama

Optimal transport (OT) has become a widely used tool in the machine learning field to measure the discrepancy between probability distributions.

Paper
Add Code

Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning

no code implementations • 23 Nov 2022 • Tingting Zhao, Ying Wang, Wei Sun, Yarui Chen, Gang Niub, Masashi Sugiyama

Meanwhile, we divide the whole learning task into learning with the large-scale representation models in an unsupervised manner and learning with the small-scale policy model in the RL manner. The small policy model facilitates policy learning, while not sacrificing generalization and expressiveness via the large representation model.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks

1 code implementation • 1 Nov 2022 • Jianan Zhou, Jianing Zhu, Jingfeng Zhang, Tongliang Liu, Gang Niu, Bo Han, Masashi Sugiyama

Adversarial training (AT) with imperfect supervision is significant but receives limited attention.

Adversarial Robustness Pseudo Label

Paper
Code

Generalized Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses

2 code implementations • Conference 2022 • Yuzhou Cao, Tianchi Cai, Lei Feng, Lihong Gu, Jinjie Gu, Bo An, Gang Niu, Masashi Sugiyama

\emph{Classification with rejection} (CwR) refrains from making a prediction to avoid critical misclassification when encountering test samples that are difficult to classify.

Classification Multi-class Classification

Paper
Code

Equivariant Disentangled Transformation for Domain Generalization under Combination Shift

no code implementations • 3 Aug 2022 • Yivan Zhang, Jindong Wang, Xing Xie, Masashi Sugiyama

To formally analyze this issue, we provide a unique algebraic formulation of the combination shift problem based on the concepts of homomorphism, equivariance, and a refined definition of disentanglement.

Disentanglement Domain Generalization

Paper
Add Code

Adapting to Online Label Shift with Provable Guarantees

no code implementations • 5 Jul 2022 • Yong Bai, Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama, Zhi-Hua Zhou

In this paper, we formulate and investigate the problem of \emph{online label shift} (OLaS): the learner trains an initial model from the labeled offline data and then deploys it to an unlabeled online environment where the underlying label distribution changes over time but the label-conditional density does not.

Paper
Add Code

Multi-class Classification from Multiple Unlabeled Datasets with Partial Risk Regularization

1 code implementation • 4 Jul 2022 • Yuting Tang, Nan Lu, Tianyi Zhang, Masashi Sugiyama

Recent years have witnessed a great success of supervised deep learning, where predictive models were trained from a large amount of fully labeled data.

Multi-class Classification

Paper
Code

The Survival Bandit Problem

no code implementations • 7 Jun 2022 • Charles Riou, Junya Honda, Masashi Sugiyama

For that purpose, we identify two key components in the survival regret: the regret given no ruin (which corresponds to the regret in the MAB), and the probability that the procedure is interrupted, called the probability of ruin.

Paper
Add Code

Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation

no code implementations • CVPR 2022 • De Cheng, Tongliang Liu, Yixiong Ning, Nannan Wang, Bo Han, Gang Niu, Xinbo Gao, Masashi Sugiyama

In label-noise learning, estimating the transition matrix has attracted more and more attention as the matrix plays an important role in building statistically consistent classifiers.

Paper
Add Code

Excess risk analysis for epistemic uncertainty with application to variational inference

no code implementations • 2 Jun 2022 • Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

Bayesian deep learning plays an important role especially for its ability evaluating epistemic uncertainty (EU).

Bayesian Inference Variational Inference

Paper
Add Code

Universal approximation property of invertible neural networks

no code implementations • 15 Apr 2022 • Isao Ishikawa, Takeshi Teshima, Koichi Tojo, Kenta Oono, Masahiro Ikeda, Masashi Sugiyama

Invertible neural networks (INNs) are neural network architectures with invertibility by design.

Representation Learning

Paper
Add Code

Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients

1 code implementation • 7 Apr 2022 • Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

We propose federation of unsupervised learning (FedUL), where the unlabeled data are transformed into surrogate labeled data for each of the clients, a modified model is trained by supervised FL, and the wanted model is recovered from the modified model.

Federated Learning

Paper
Code

On the Effectiveness of Adversarial Training against Backdoor Attacks

no code implementations • 22 Feb 2022 • Yinghua Gao, Dongxian Wu, Jingfeng Zhang, Guanhao Gan, Shu-Tao Xia, Gang Niu, Masashi Sugiyama

To explore whether adversarial training could defend against backdoor attacks or not, we conduct extensive experiments across different threat models and perturbation budgets, and find the threat model in adversarial training matters.

Paper
Add Code

Adversarial Attack and Defense for Non-Parametric Two-Sample Tests

1 code implementation • 7 Feb 2022 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

Furthermore, we theoretically find that the adversary can also degrade the lower bound of a TST's test power, which enables us to iteratively minimize the test criterion in order to search for adversarial pairs.

Adversarial Attack Vocal Bursts Valence Prediction

Paper
Code

Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification

1 code implementation • 1 Feb 2022 • Takashi Ishida, Ikko Yamane, Nontawat Charoenphakdee, Gang Niu, Masashi Sugiyama

In contrast to others, our method is model-free and even instance-free.

Binary Classification

Paper
Code

Towards Adversarially Robust Deep Image Denoising

no code implementations • 12 Jan 2022 • Hanshu Yan, Jingfeng Zhang, Jiashi Feng, Masashi Sugiyama, Vincent Y. F. Tan

Secondly, to robustify DIDs, we propose an adversarial training strategy, hybrid adversarial training ({\sc HAT}), that jointly trains DIDs with adversarial and non-adversarial noisy data to ensure that the reconstruction quality is high and the denoisers around non-adversarial data are locally smooth.

Adversarial Attack Adversarial Robustness +1

Paper
Add Code

Learning with Proper Partial Labels

no code implementations • 23 Dec 2021 • Zhenguo Wu, Jiaqi Lv, Masashi Sugiyama

Recently, various approaches on partial-label learning have been proposed under different generation models of candidate label sets.

Partial Label Learning Weakly-supervised Learning

Paper
Add Code

Rethinking Importance Weighting for Transfer Learning

no code implementations • 19 Dec 2021 • Nan Lu, Tianyi Zhang, Tongtong Fang, Takeshi Teshima, Masashi Sugiyama

A key assumption in supervised learning is that training and test data follow the same probability distribution.

Selection bias Transfer Learning

Paper
Add Code

Can Label-Noise Transition Matrix Help to Improve Sample Selection and Label Correction?

no code implementations • 29 Sep 2021 • Yu Yao, Xuefeng Li, Tongliang Liu, Alan Blair, Mingming Gong, Bo Han, Gang Niu, Masashi Sugiyama

Existing methods for learning with noisy labels can be generally divided into two categories: (1) sample selection and label correction based on the memorization effect of neural networks; (2) loss correction with the transition matrix.

Learning with noisy labels Memorization

Paper
Add Code

Collaborate to Defend Against Adversarial Attacks

no code implementations • 29 Sep 2021 • Sen Cui, Jingfeng Zhang, Jian Liang, Masashi Sugiyama, ChangShui Zhang

However, an ensemble still wastes the limited capacity of multiple models.

Paper
Add Code

Exploiting Class Activation Value for Partial-Label Learning

3 code implementations • ICLR 2022 • Fei Zhang, Lei Feng, Bo Han, Tongliang Liu, Gang Niu, Tao Qin, Masashi Sugiyama

As the first contribution, we empirically show that the class activation map (CAM), a simple technique for discriminating the learning patterns of each class in images, is surprisingly better at making accurate predictions than the model itself on selecting the true label from candidate labels.

Multi-class Classification Partial Label Learning

Paper
Code

Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum

no code implementations • 29 Sep 2021 • Zeke Xie, Xinrui Wang, Huishuai Zhang, Issei Sato, Masashi Sugiyama

Specifically, we disentangle the effects of Adaptive Learning Rate and Momentum of the Adam dynamics on saddle-point escaping and flat minima selection.

Paper
Add Code

Unsupervised Federated Learning is Possible

no code implementations • ICLR 2022 • Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

Federated Learning

Paper
Add Code

Does Adversarial Robustness Really Imply Backdoor Vulnerability?

no code implementations • 29 Sep 2021 • Yinghua Gao, Dongxian Wu, Jingfeng Zhang, Shu-Tao Xia, Gang Niu, Masashi Sugiyama

Based on thorough experiments, we find that such trade-off ignores the interactions between the perturbation budget of adversarial training and the magnitude of the backdoor trigger.

Adversarial Robustness

Paper
Add Code

Active Refinement for Multi-Label Learning: A Pseudo-Label Approach

no code implementations • 29 Sep 2021 • Cheng-Yu Hsieh, Wei-I Lin, Miao Xu, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

The goal of multi-label learning (MLL) is to associate a given instance with its relevant labels from a set of concepts.

Active Learning Multi-Label Learning +1

Paper
Add Code

Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences

1 code implementation • 16 Jul 2021 • Ikko Yamane, Junya Honda, Florian Yger, Masashi Sugiyama

In this paper, we consider the task of predicting $Y$ from $X$ when we have no paired data of them, but we have two separate, independent datasets of $X$ and $Y$ each observed with some mediating variable $U$, that is, we have two datasets $S_X = \{(X_i, U_i)\}$ and $S_Y = \{(U'_j, Y'_j)\}$.

Paper
Code

Positive-Unlabeled Classification under Class-Prior Shift: A Prior-invariant Approach Based on Density Ratio Estimation

1 code implementation • 11 Jul 2021 • Shota Nakajima, Masashi Sugiyama

Learning from positive and unlabeled (PU) data is an important problem in various applications.

Density Ratio Estimation

Paper
Code

Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning

no code implementations • 17 Jun 2021 • Xin-Qiang Cai, Yao-Xiang Ding, Zi-Xuan Chen, Yuan Jiang, Masashi Sugiyama, Zhi-Hua Zhou

In many real-world imitation learning tasks, the demonstrator and the learner have to act under different observation spaces.

Imitation Learning

Paper
Add Code

Multi-Class Classification from Single-Class Data with Confidences

no code implementations • 16 Jun 2021 • Yuzhou Cao, Lei Feng, Senlin Shu, Yitian Xu, Bo An, Gang Niu, Masashi Sugiyama

We show that without any assumptions on the loss functions, models, and optimizers, we can successfully learn a multi-class classifier from only data of a single class with a rigorous consistency guarantee when confidences (i. e., the class-posterior probabilities for all the classes) are available.

Classification Multi-class Classification

Paper
Add Code

Probabilistic Margins for Instance Reweighting in Adversarial Training

1 code implementation • NeurIPS 2021 • Qizhou Wang, Feng Liu, Bo Han, Tongliang Liu, Chen Gong, Gang Niu, Mingyuan Zhou, Masashi Sugiyama

Reweighting adversarial data during training has been recently shown to improve adversarial robustness, where data closer to the current decision boundaries are regarded as more critical and given larger weights.

Adversarial Robustness

Paper
Code

On the Robustness of Average Losses for Partial-Label Learning

no code implementations • 11 Jun 2021 • Jiaqi Lv, Biao Liu, Lei Feng, Ning Xu, Miao Xu, Bo An, Gang Niu, Xin Geng, Masashi Sugiyama

Partial-label learning (PLL) utilizes instances with PLs, where a PL includes several candidate labels but only one is the true label (TL).

Partial Label Learning Weakly Supervised Classification

Paper
Add Code

Loss function based second-order Jensen inequality and its application to particle variational inference

no code implementations • NeurIPS 2021 • Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

First, we provide a new second-order Jensen inequality, which has the repulsion term based on the loss function.

Ensemble Learning Model Selection +1

Paper
Add Code

To Smooth or Not? When Label Smoothing Meets Noisy Labels

1 code implementation • 8 Jun 2021 • Jiaheng Wei, Hangyu Liu, Tongliang Liu, Gang Niu, Masashi Sugiyama, Yang Liu

We provide understandings for the properties of LS and NLS when learning with noisy labels.

Ranked #9 on Learning with noisy labels on CIFAR-10N-Random3

Learning with noisy labels

Paper
Code

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels

no code implementations • NeurIPS 2021 • Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Jun Yu, Gang Niu, Masashi Sugiyama

In this way, we also give large-loss but less selected data a try; then, we can better distinguish between the cases (a) and (b) by seeing if the losses effectively decrease with the uncertainty after the try.

Ranked #26 on Image Classification on mini WebVision 1.0

Learning with noisy labels

Paper
Add Code

Instance Correction for Learning with Open-set Noisy Labels

no code implementations • 1 Jun 2021 • Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Jun Yu, Gang Niu, Masashi Sugiyama

Lots of approaches, e. g., loss correction and label correction, cannot handle such open-set noisy labels well, since they need training data and test data to share the same label space, which does not hold for learning with open-set noisy labels.

Paper
Add Code

A unified view of likelihood ratio and reparameterization gradients

no code implementations • 31 May 2021 • Paavo Parmas, Masashi Sugiyama

Reparameterization (RP) and likelihood ratio (LR) gradient estimators are used to estimate gradients of expectations throughout machine learning and reinforcement learning; however, they are usually explained as simple mathematical tricks, with no insight into their nature.

Paper
Add Code

NoiLIn: Improving Adversarial Training and Correcting Stereotype of Noisy Labels

1 code implementation • 31 May 2021 • Jingfeng Zhang, Xilie Xu, Bo Han, Tongliang Liu, Gang Niu, Lizhen Cui, Masashi Sugiyama

First, we thoroughly investigate noisy labels (NLs) injection into AT's inner maximization and outer minimization, respectively and obtain the observations on when NL injection benefits AT.

Adversarial Robustness

Paper
Code

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

1 code implementation • 31 Mar 2021 • Zeke Xie, Li Yuan, Zhanxing Zhu, Masashi Sugiyama

It is well-known that stochastic gradient noise (SGN) acts as implicit regularization for deep learning and is essentially important for both optimization and generalization of deep networks.

Paper
Code

Approximating Instance-Dependent Noise via Instance-Confidence Embedding

1 code implementation • 25 Mar 2021 • Yivan Zhang, Masashi Sugiyama

Label noise in multiclass classification is a major obstacle to the deployment of learning systems.

text-classification Text Classification

Paper
Code

Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information

1 code implementation • 12 Mar 2021 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama

In our method, a policy conditioned on a continuous or discrete latent variable is trained by directly maximizing the variational lower bound of the mutual information, instead of using the mutual information as unsupervised rewards as in previous studies.

Continuous Control

Paper
Code

Lower-Bounded Proper Losses for Weakly Supervised Classification

1 code implementation • 4 Mar 2021 • Shuhei M. Yoshida, Takashi Takenouchi, Masashi Sugiyama

To this end, we derive a representation theorem for proper losses in supervised learning, which dualizes the Savage representation.

Classification General Classification +1

Paper
Code

LocalDrop: A Hybrid Regularization for Deep Neural Networks

no code implementations • 1 Mar 2021 • Ziqing Lu, Chang Xu, Bo Du, Takashi Ishida, Lefei Zhang, Masashi Sugiyama

In neural networks, developing regularization algorithms to settle overfitting is one of the major study areas.

Paper
Add Code

Incorporating Causal Graphical Prior Knowledge into Predictive Modeling via Simple Data Augmentation

1 code implementation • 27 Feb 2021 • Takeshi Teshima, Masashi Sugiyama

Causal graphs (CGs) are compact representations of the knowledge of the data generating processes behind the data distributions.

Data Augmentation

Paper
Code

Guided Interpolation for Adversarial Training

no code implementations • 15 Feb 2021 • Chen Chen, Jingfeng Zhang, Xilie Xu, Tianlei Hu, Gang Niu, Gang Chen, Masashi Sugiyama

To enhance adversarial robustness, adversarial training learns deep neural networks on the adversarial variants generated by their natural data.

Adversarial Robustness

Paper
Add Code

Learning from Similarity-Confidence Data

no code implementations • 13 Feb 2021 • Yuzhou Cao, Lei Feng, Yitian Xu, Bo An, Gang Niu, Masashi Sugiyama

Weakly supervised learning has drawn considerable attention recently to reduce the expensive time and labor consumption of labeling massive data.

Weakly-supervised Learning

Paper
Add Code

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection

2 code implementations • 10 Feb 2021 • Hanshu Yan, Jingfeng Zhang, Gang Niu, Jiashi Feng, Vincent Y. F. Tan, Masashi Sugiyama

By comparing \textit{non-robust} (normally trained) and \textit{robustified} (adversarially trained) models, we observe that adversarial training (AT) robustifies CNNs by aligning the channel-wise activations of adversarial data with those of their natural counterparts.

Adversarial Robustness feature selection

Paper
Code

Meta Discovery: Learning to Discover Novel Classes given Very Limited Data

1 code implementation • ICLR 2022 • Haoang Chi, Feng Liu, Bo Han, Wenjing Yang, Long Lan, Tongliang Liu, Gang Niu, Mingyuan Zhou, Masashi Sugiyama

In this paper, we demystify assumptions behind NCD and find that high-level semantic features should be shared among the seen and unseen classes.

Clustering Meta-Learning +1

Paper
Code

Understanding the Interaction of Adversarial Training with Noisy Labels

no code implementations • 6 Feb 2021 • Jianing Zhu, Jingfeng Zhang, Bo Han, Tongliang Liu, Gang Niu, Hongxia Yang, Mohan Kankanhalli, Masashi Sugiyama

A recent adversarial training (AT) study showed that the number of projected gradient descent (PGD) steps to successfully attack a point (i. e., find an adversarial example in its proximity) is an effective measure of the robustness of this point.

Paper
Add Code

Provably End-to-end Label-Noise Learning without Anchor Points

1 code implementation • 4 Feb 2021 • Xuefeng Li, Tongliang Liu, Bo Han, Gang Niu, Masashi Sugiyama

In label-noise learning, the transition matrix plays a key role in building statistically consistent classifiers.

Ranked #14 on Learning with noisy labels on CIFAR-100N

Learning with noisy labels

Paper
Code

Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization

1 code implementation • 4 Feb 2021 • Yivan Zhang, Gang Niu, Masashi Sugiyama

To estimate the transition matrix from noisy data, existing methods often need to estimate the noisy class-posterior, which could be unreliable due to the overconfidence of neural networks.

Weakly Supervised Classification

Paper
Code

Learning Diverse-Structured Networks for Adversarial Robustness

1 code implementation • 3 Feb 2021 • Xuefeng Du, Jingfeng Zhang, Bo Han, Tongliang Liu, Yu Rong, Gang Niu, Junzhou Huang, Masashi Sugiyama

In adversarial training (AT), the main focus has been the objective and optimizer while the model has been less studied, so that the models being used are still those classic ones in standard training (ST).

Adversarial Robustness

Paper
Code

Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification

1 code implementation • 1 Feb 2021 • Nan Lu, Shida Lei, Gang Niu, Issei Sato, Masashi Sugiyama

SSC can be solved by a standard (multi-class) classification method, and we use the SSC solution to obtain the final binary classifier through a certain linear-fractional transformation.

Binary Classification Classification +2

Paper
Code

Source-free Domain Adaptation via Distributional Alignment by Matching Batch Normalization Statistics

no code implementations • 19 Jan 2021 • Masato Ishii, Masashi Sugiyama

In this setting, we cannot access source data during adaptation, while unlabeled target data and a model pretrained with source data are given.

Source-Free Domain Adaptation

Paper
Add Code

A Symmetric Loss Perspective of Reliable Machine Learning

no code implementations • 5 Jan 2021 • Nontawat Charoenphakdee, Jongyeong Lee, Masashi Sugiyama

When minimizing the empirical risk in binary classification, it is a common practice to replace the zero-one loss with a surrogate loss to make the learning objective feasible to optimize.

BIG-bench Machine Learning Binary Classification +2

Paper
Add Code

On the Role of Pre-training for Meta Few-Shot Learning

no code implementations • 1 Jan 2021 • Chia-You Chen, Hsuan-Tien Lin, Gang Niu, Masashi Sugiyama

One is to (pre-)train a classifier with examples from known classes, and then transfer the pre-trained classifier to unknown classes using the new examples.

Disentanglement Few-Shot Learning

Paper
Add Code

Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation

no code implementations • 31 Dec 2020 • Yuko Kuroki, Junya Honda, Masashi Sugiyama

Combinatorial optimization is one of the fundamental research fields that has been extensively studied in theoretical computer science and operations research.

Combinatorial Optimization Multi-Armed Bandits +1

Paper
Add Code

On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective

1 code implementation • NeurIPS 2023 • Zeke Xie, Zhiqiang Xu, Jingzhao Zhang, Issei Sato, Masashi Sugiyama

Weight decay is a simple yet powerful regularization technique that has been very widely used in training of deep neural networks (DNNs).

Paper
Code

On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective

no code implementations • CVPR 2021 • Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama

In this paper, we first prove that the focal loss is classification-calibrated, i. e., its minimizer surely yields the Bayes-optimal classifier and thus the use of the focal loss in classification can be theoretically justified.

Classification General Classification +3

Paper
Add Code

Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting

1 code implementation • 12 Nov 2020 • Zeke Xie, Fengxiang He, Shaopeng Fu, Issei Sato, DaCheng Tao, Masashi Sugiyama

Thus it motivates us to design a similar mechanism named {\it artificial neural variability} (ANV), which helps artificial neural networks learn some advantages from ``natural'' neural networks.

Memorization

Paper
Code

A Survey of Label-noise Representation Learning: Past, Present and Future

1 code implementation • 9 Nov 2020 • Bo Han, Quanming Yao, Tongliang Liu, Gang Niu, Ivor W. Tsang, James T. Kwok, Masashi Sugiyama

Classical machine learning implicitly assumes that labels of the training data are sampled from a clean distribution, which can be too restrictive for real-world scenarios.

BIG-bench Machine Learning Learning Theory +1

Paper
Code

Binary classification with ambiguous training data

no code implementations • 5 Nov 2020 • Naoya Otani, Yosuke Otsubo, Tetsuya Koike, Masashi Sugiyama

This problem is substantially different from semi-supervised learning since unlabeled samples are not necessarily difficult samples.

Binary Classification Classification +1

Paper
Add Code

Maximum Mean Discrepancy Test is Aware of Adversarial Attacks

2 code implementations • 22 Oct 2020 • Ruize Gao, Feng Liu, Jingfeng Zhang, Bo Han, Tongliang Liu, Gang Niu, Masashi Sugiyama

However, it has been shown that the MMD test is unaware of adversarial attacks -- the MMD test failed to detect the discrepancy between natural and adversarial data.

Adversarial Attack Detection

Paper
Code

Classification with Rejection Based on Cost-sensitive Classification

no code implementations • 22 Oct 2020 • Nontawat Charoenphakdee, Zhenghang Cui, Yivan Zhang, Masashi Sugiyama

The goal of classification with rejection is to avoid risky misclassification in error-critical applications such as medical diagnosis and product inspection.

Classification General Classification +1

Paper
Add Code

Robust Imitation Learning from Noisy Demonstrations

1 code implementation • 20 Oct 2020 • Voot Tangkaratt, Nontawat Charoenphakdee, Masashi Sugiyama

Robust learning from noisy demonstrations is a practical but highly challenging problem in imitation learning.

Classification Continuous Control +2

Paper
Code

Pointwise Binary Classification with Pairwise Confidence Comparisons

no code implementations • 5 Oct 2020 • Lei Feng, Senlin Shu, Nan Lu, Bo Han, Miao Xu, Gang Niu, Bo An, Masashi Sugiyama

To alleviate the data requirement for training effective binary classifiers in binary classification, many weakly supervised learning settings have been proposed.

Binary Classification Classification +2

Paper
Add Code

Geometry-aware Instance-reweighted Adversarial Training

2 code implementations • ICLR 2021 • Jingfeng Zhang, Jianing Zhu, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli

The belief was challenged by recent studies where we can maintain the robustness and improve the accuracy.

Paper
Code

Stable Weight Decay Regularization

no code implementations • 28 Sep 2020 • Zeke Xie, Issei Sato, Masashi Sugiyama

\citet{loshchilov2018decoupled} demonstrated that $L_{2}$ regularization is not identical to weight decay for adaptive gradient methods, such as Adaptive Momentum Estimation (Adam), and proposed Adam with Decoupled Weight Decay (AdamW).

Paper
Add Code

Provably Consistent Partial-Label Learning

no code implementations • NeurIPS 2020 • Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama

Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels.

Multi-class Classification Partial Label Learning

Paper
Add Code

A One-step Approach to Covariate Shift Adaptation

no code implementations • 8 Jul 2020 • Tianyi Zhang, Ikko Yamane, Nan Lu, Masashi Sugiyama

A default assumption in many machine learning scenarios is that the training and test samples are drawn from the same probability distribution.

Paper
Add Code

Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

no code implementations • ICML 2020 • Yu-Ting Chou, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

In weakly supervised learning, unbiased risk estimator(URE) is a powerful tool for training classifiers when training and test data are drawn from different distributions.

Weakly-supervised Learning

Paper
Add Code

Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia

1 code implementation • 29 Jun 2020 • Zeke Xie, Xinrui Wang, Huishuai Zhang, Issei Sato, Masashi Sugiyama

Specifically, we disentangle the effects of Adaptive Learning Rate and Momentum of the Adam dynamics on saddle-point escaping and minima selection.

131

Paper
Code

Online Dense Subgraph Discovery via Blurred-Graph Feedback

no code implementations • ICML 2020 • Yuko Kuroki, Atsushi Miyauchi, Junya Honda, Masashi Sugiyama

Dense subgraph discovery aims to find a dense component in edge-weighted graphs.

Graph Mining valid

Paper
Add Code

Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent

1 code implementation • 21 Jun 2020 • Mehdi Abbana Bennani, Thang Doan, Masashi Sugiyama

In this framework, we prove that OGD is robust to Catastrophic Forgetting then derive the first generalisation bound for SGD and OGD for Continual Learning.

Continual Learning Transfer Learning

Paper
Code

Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators

no code implementations • NeurIPS 2020 • Takeshi Teshima, Isao Ishikawa, Koichi Tojo, Kenta Oono, Masahiro Ikeda, Masashi Sugiyama

We answer this question by showing a convenient criterion: a CF-INN is universal if its layers contain affine coupling and invertible linear functions as special cases.

Image Generation Representation Learning

Paper
Add Code

Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring

no code implementations • NeurIPS 2020 • Taira Tsuchiya, Junya Honda, Masashi Sugiyama

We investigate finite stochastic partial monitoring, which is a general model for sequential learning with limited feedback.

Decision Making Thompson Sampling

Paper
Add Code

LFD-ProtoNet: Prototypical Network Based on Local Fisher Discriminant Analysis for Few-shot Learning

no code implementations • 15 Jun 2020 • Kei Mukaiyama, Issei Sato, Masashi Sugiyama

The prototypical network (ProtoNet) is a few-shot learning framework that performs metric learning and classification using the distance to prototype representations of each class.

Few-Shot Learning General Classification +1

Paper
Add Code

Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning

1 code implementation • NeurIPS 2020 • Yu Yao, Tongliang Liu, Bo Han, Mingming Gong, Jiankang Deng, Gang Niu, Masashi Sugiyama

By this intermediate class, the original transition matrix can then be factorized into the product of two easy-to-estimate transition matrices.

Paper
Code

Part-dependent Label Noise: Towards Instance-dependent Label Noise

1 code implementation • NeurIPS 2020 • Xiaobo Xia, Tongliang Liu, Bo Han, Nannan Wang, Mingming Gong, Haifeng Liu, Gang Niu, DaCheng Tao, Masashi Sugiyama

Learning with the \textit{instance-dependent} label noise is challenging, because it is hard to model such real-world noise.

Paper
Code

$γ$-ABC: Outlier-Robust Approximate Bayesian Computation Based on a Robust Divergence Estimator

no code implementations • 13 Jun 2020 • Masahiro Fujisawa, Takeshi Teshima, Issei Sato, Masashi Sugiyama

Approximate Bayesian computation (ABC) is a likelihood-free inference method that has been employed in various applications.

Paper
Add Code

Pairwise Supervision Can Provably Elicit a Decision Boundary

no code implementations • 11 Jun 2020 • Han Bao, Takuya Shimada, Liyuan Xu, Issei Sato, Masashi Sugiyama

A classifier built upon the representations is expected to perform well in downstream classification; however, little theory has been given in literature so far and thereby the relationship between similarity and classification has remained elusive.

Binary Classification Classification +5

Paper
Add Code

Rethinking Importance Weighting for Deep Learning under Distribution Shift

1 code implementation • NeurIPS 2020 • Tongtong Fang, Nan Lu, Gang Niu, Masashi Sugiyama

Under distribution shift (DS) where the training data distribution differs from the test one, a powerful technique is importance weighting (IW) which handles DS in two separate steps: weight estimation (WE) estimates the test-over-training density ratio and weighted classification (WC) trains the classifier from weighted training data.

Paper
Code

Calibrated Surrogate Losses for Adversarially Robust Classification

no code implementations • 28 May 2020 • Han Bao, Clayton Scott, Masashi Sugiyama

Adversarially robust classification seeks a classifier that is insensitive to adversarial perturbations of test patterns.

Classification General Classification +1

Paper
Add Code

Learning from Aggregate Observations

1 code implementation • NeurIPS 2020 • Yivan Zhang, Nontawat Charoenphakdee, Zhenguo Wu, Masashi Sugiyama

We study the problem of learning from aggregate observations where supervision signals are given to sets of instances instead of individual instances, while the goal is still to predict labels of unseen individuals.

Binary Classification Classification +3

Paper
Code

Do Public Datasets Assure Unbiased Comparisons for Registration Evaluation?

no code implementations • 20 Mar 2020 • Jie Luo, Guangshen Ma, Sarah Frisken, Parikshit Juvekar, Nazim Haouchine, Zhe Xu, Yiming Xiao, Alexandra Golby, Patrick Codd, Masashi Sugiyama, William Wells III

In this study, we use the variogram to screen the manually annotated landmarks in two datasets used to benchmark registration in image-guided neurosurgeries.

Image Registration

Paper
Add Code

Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time

no code implementations • 10 Mar 2020 • Hideaki Imamura, Nontawat Charoenphakdee, Futoshi Futami, Issei Sato, Junya Honda, Masashi Sugiyama

If the black-box function varies with time, then time-varying Bayesian optimization is a promising framework.

Bayesian Optimization Recommendation Systems

Paper
Add Code

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger

1 code implementation • ICML 2020 • Jingfeng Zhang, Xilie Xu, Bo Han, Gang Niu, Lizhen Cui, Masashi Sugiyama, Mohan Kankanhalli

Adversarial training based on the minimax formulation is necessary for obtaining adversarial robustness of trained models.

Adversarial Robustness

118

Paper
Code

Do We Need Zero Training Loss After Achieving Zero Training Error?

1 code implementation • ICML 2020 • Takashi Ishida, Ikko Yamane, Tomoya Sakai, Gang Niu, Masashi Sugiyama

We experimentally show that flooding improves performance and, as a byproduct, induces a double descent curve of the test loss.

Memorization

Paper
Code

Progressive Identification of True Labels for Partial-Label Learning

1 code implementation • ICML 2020 • Jiaqi Lv, Miao Xu, Lei Feng, Gang Niu, Xin Geng, Masashi Sugiyama

Partial-label learning (PLL) is a typical weakly supervised learning problem, where each training instance is equipped with a set of candidate labels among which only one is the true label.

Partial Label Learning Stochastic Optimization +1

Paper
Code

A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima

no code implementations • ICLR 2021 • Zeke Xie, Issei Sato, Masashi Sugiyama

Stochastic Gradient Descent (SGD) and its variants are mainstream methods for training deep networks in practice.

Stochastic Optimization

Paper
Add Code

Few-shot Domain Adaptation by Causal Mechanism Transfer

1 code implementation • ICML 2020 • Takeshi Teshima, Issei Sato, Masashi Sugiyama

We take the structural equations in causal modeling as an example and propose a novel DA method, which is shown to be useful both theoretically and experimentally.

Domain Adaptation

Paper
Code

Rethinking Class-Prior Estimation for Positive-Unlabeled Learning

no code implementations • ICLR 2022 • Yu Yao, Tongliang Liu, Bo Han, Mingming Gong, Gang Niu, Masashi Sugiyama, DaCheng Tao

Hitherto, the distributional-assumption-free CPE methods rely on a critical assumption that the support of the positive data distribution cannot be contained in the support of the negative data distribution.

valid

Paper
Add Code

Learning from Noisy Similar and Dissimilar Data

no code implementations • 3 Feb 2020 • Soham Dan, Han Bao, Masashi Sugiyama

We perform a detailed investigation of this problem under two realistic noise models and propose two algorithms to learn from noisy S-D data.

Paper
Add Code

Binary Classification from Positive Data with Skewed Confidence

no code implementations • 29 Jan 2020 • Kazuhiko Shinoda, Hirotaka Kaji, Masashi Sugiyama

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly-supervised learning method which trains a binary classifier only from positive data equipped with confidence.

Binary Classification Classification +2

Paper
Add Code

Confidence Scores Make Instance-dependent Label-noise Learning Possible

no code implementations • 11 Jan 2020 • Antonin Berthon, Bo Han, Gang Niu, Tongliang Liu, Masashi Sugiyama

We find with the help of confidence scores, the transition distribution of each instance can be approximately estimated.

Learning with noisy labels

Paper
Add Code

Learning with Multiple Complementary Labels

no code implementations • ICML 2020 • Lei Feng, Takuo Kaneko, Bo Han, Gang Niu, Bo An, Masashi Sugiyama

In this paper, we propose a novel problem setting to allow MCLs for each example and two ways for learning with MCLs.

Paper
Add Code

Where is the Bottleneck of Adversarial Learning with Unlabeled Data?

no code implementations • 20 Nov 2019 • Jingfeng Zhang, Bo Han, Gang Niu, Tongliang Liu, Masashi Sugiyama

Deep neural networks (DNNs) are incredibly brittle due to adversarial examples.

Paper
Add Code

Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning

1 code implementation • EACL 2021 • Alon Jacovi, Gang Niu, Yoav Goldberg, Masashi Sugiyama

We consider the situation in which a user has collected a small set of documents on a cohesive topic, and they want to retrieve additional documents on this topic from a large collection.

Information Retrieval Retrieval

Paper
Code

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach

no code implementations • 20 Oct 2019 • Nan Lu, Tianyi Zhang, Gang Niu, Masashi Sugiyama

The recently proposed unlabeled-unlabeled (UU) classification method allows us to train a binary classifier only from two unlabeled datasets with different class priors.

Classification General Classification

Paper
Add Code

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme

no code implementations • 14 Oct 2019 • Paavo Parmas, Masashi Sugiyama

Reparameterization (RP) and likelihood ratio (LR) gradient estimators are used throughout machine and reinforcement learning; however, they are usually explained as simple mathematical tricks without providing any insight into their nature.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Learning Only from Relevant Keywords and Unlabeled Documents

no code implementations • IJCNLP 2019 • Nontawat Charoenphakdee, Jongyeong Lee, Yiping Jin, Dittaya Wanvarie, Masashi Sugiyama

We consider a document classification problem where document labels are absent but only relevant keywords of a target class and unlabeled documents are given.

Document Classification General Classification +1

Paper
Add Code

Learning from Indirect Observations

1 code implementation • 10 Oct 2019 • Yivan Zhang, Nontawat Charoenphakdee, Masashi Sugiyama

Weakly-supervised learning is a paradigm for alleviating the scarcity of labeled data by leveraging lower-quality but larger-scale supervision signals.

Weakly-supervised Learning

Paper
Code

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

2 code implementations • 3 Oct 2019 • Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama

Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.

Ranked #1 on Multi-agent Reinforcement Learning on ParticleEnvs Cooperative Communication

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

110

Paper
Code

Wildly Unsupervised Domain Adaptation and Its Powerful and Efficient Solution

no code implementations • 25 Sep 2019 • Feng Liu, Jie Lu, Bo Han, Gang Niu, Guangquan Zhang, Masashi Sugiyama

Hence, we consider a new, more realistic and more challenging problem setting, where classifiers have to be trained with noisy labeled data from SD and unlabeled data from TD---we name it wildly UDA (WUDA).

Unsupervised Domain Adaptation Wildly Unsupervised Domain Adaptation

Paper
Add Code

VILD: Variational Imitation Learning with Diverse-quality Demonstrations

no code implementations • 15 Sep 2019 • Voot Tangkaratt, Bo Han, Mohammad Emtiyaz Khan, Masashi Sugiyama

However, the quality of demonstrations in reality can be diverse, since it is easier and cheaper to collect demonstrations from a mix of experts and amateurs.

Continuous Control Imitation Learning

Paper
Add Code

Constraint Learning for Control Tasks with Limited Duration Barrier Functions

no code implementations • 26 Aug 2019 • Motoya Ohnishi, Gennaro Notomista, Masashi Sugiyama, Magnus Egerstedt

When deploying autonomous agents in unstructured environments over sustained periods of time, adaptability and robustness oftentimes outweigh optimality as a primary consideration.

Paper
Add Code

Are Registration Uncertainty and Error Monotonically Associated

no code implementations • 21 Aug 2019 • Jie Luo, Sarah Frisken, Duo Wang, Alexandra Golby, Masashi Sugiyama, William M. Wells III

Probabilistic image registration (PIR) methods provide measures of registration uncertainty, which could be a surrogate for assessing the registration error.

Image Registration

Paper
Add Code

Classification from Triplet Comparison Data

1 code implementation • 24 Jul 2019 • Zhenghang Cui, Nontawat Charoenphakdee, Issei Sato, Masashi Sugiyama

Although learning from triplet comparison data has been considered in many applications, an important fundamental question of whether we can learn a classifier only from triplet comparison data has remained unanswered.

Classification General Classification +1

Paper
Code

Direction Matters: On Influence-Preserving Graph Summarization and Max-cut Principle for Directed Graphs

no code implementations • 22 Jul 2019 • Wenkai Xu, Gang Niu, Aapo Hyvärinen, Masashi Sugiyama

On the other hand, compressing the vertices while preserving the directed edge information provides a way to learn the small-scale representation of a directed graph.

Clustering

Paper
Add Code

Are Anchor Points Really Indispensable in Label-Noise Learning?

1 code implementation • NeurIPS 2019 • Xiaobo Xia, Tongliang Liu, Nannan Wang, Bo Han, Chen Gong, Gang Niu, Masashi Sugiyama

Existing theories have shown that the transition matrix can be learned by exploiting \textit{anchor points} (i. e., data points that belong to a specific class almost surely).

Ranked #18 on Learning with noisy labels on CIFAR-10N-Random3

Learning with noisy labels

Paper
Code

Uncoupled Regression from Pairwise Comparison Data

1 code implementation • NeurIPS 2019 • Liyuan Xu, Junya Honda, Gang Niu, Masashi Sugiyama

We propose two practical methods for uncoupled regression from pairwise comparison data and show that the learned regression model converges to the optimal model with the optimal parametric convergence rate when the target variable distributes uniformly.

Learning-To-Rank regression

Paper
Code

Fast and Robust Rank Aggregation against Model Misspecification

1 code implementation • 29 May 2019 • Yuangang Pan, WeiJie Chen, Gang Niu, Ivor W. Tsang, Masashi Sugiyama

Specifically, the properties of our CoarsenRank are summarized as follows: (1) CoarsenRank is designed for mild model misspecification, which assumes there exist the ideal preferences (consistent with model assumption) that locates in a neighborhood of the actual preferences.

Bayesian Inference

Paper
Code

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

no code implementations • 29 May 2019 • Han Bao, Masashi Sugiyama

A clue to tackle their direct optimization is a calibrated surrogate utility, which is a tractable lower bound of the true utility function representing a given metric.

Binary Classification Classification +5

Paper
Add Code

Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero

2 code implementations • 28 May 2019 • Kenshin Abe, Zijian Xu, Issei Sato, Masashi Sugiyama

There have been increasing challenges to solve combinatorial optimization problems by machine learning.

Combinatorial Optimization Q-Learning

Paper
Code

Butterfly: One-step Approach towards Wildly Unsupervised Domain Adaptation

1 code implementation • 19 May 2019 • Feng Liu, Jie Lu, Bo Han, Gang Niu, Guangquan Zhang, Masashi Sugiyama

Hence, we consider a new, more realistic and more challenging problem setting, where classifiers have to be trained with noisy labeled data from SD and unlabeled data from TD -- we name it wildly UDA (WUDA).

Ranked #1 on Wildly Unsupervised Domain Adaptation on Noisy-MNIST-to-SYND

Unsupervised Domain Adaptation Wildly Unsupervised Domain Adaptation

Paper
Code

Classification from Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization

no code implementations • 26 Apr 2019 • Takuya Shimada, Han Bao, Issei Sato, Masashi Sugiyama

In this paper, we derive an unbiased risk estimator which can handle all of similarities/dissimilarities and unlabeled data.

Clustering General Classification

Paper
Add Code

A Pseudo-Label Method for Coarse-to-Fine Multi-Label Learning with Limited Supervision

no code implementations • ICLR Workshop LLD 2019 • Cheng-Yu Hsieh, Miao Xu, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

To address the need, we propose a special weakly supervised MLL problem that not only focuses on the situation of limited fine-grained supervision but also leverages the hierarchical relationship between the coarse concepts and the fine-grained ones.

Meta-Learning Multi-Label Learning +1

Paper
Add Code

Zero-shot Domain Adaptation Based on Attribute Information

no code implementations • 13 Mar 2019 • Masato Ishii, Takashi Takenouchi, Masashi Sugiyama

In this paper, we propose a novel domain adaptation method that can be applied without target data.

Attribute Domain Adaptation

Paper
Add Code

Polynomial-time Algorithms for Multiple-arm Identification with Full-bandit Feedback

no code implementations • 27 Feb 2019 • Yuko Kuroki, Liyuan Xu, Atsushi Miyauchi, Junya Honda, Masashi Sugiyama

Based on our approximation algorithm, we propose novel bandit algorithms for the top-k selection problem, and prove that our algorithms run in polynomial time.

Paper
Add Code

Online Multiclass Classification Based on Prediction Margin for Partial Feedback

no code implementations • 4 Feb 2019 • Takuo Kaneko, Issei Sato, Masashi Sugiyama

We consider the problem of online multiclass classification with partial feedback, where an algorithm predicts a class for a new instance in each round and only receives its correctness.

Classification General Classification

Paper
Add Code

New Tricks for Estimating Gradients of Expectations

no code implementations • 31 Jan 2019 • Christian J. Walder, Paul Roussel, Richard Nock, Cheng Soon Ong, Masashi Sugiyama

We introduce a family of pairwise stochastic gradient estimators for gradients of expectations, which are related to the log-derivative trick, but involve pairwise interactions between samples.

Paper
Add Code

Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization

no code implementations • 31 Jan 2019 • Taira Tsuchiya, Nontawat Charoenphakdee, Issei Sato, Masashi Sugiyama

We further provide an estimation error bound to show that our risk estimator is consistent.

regression

Paper
Add Code

Domain Discrepancy Measure for Complex Models in Unsupervised Domain Adaptation

no code implementations • 30 Jan 2019 • Jongyeong Lee, Nontawat Charoenphakdee, Seiichi Kuroki, Masashi Sugiyama

Appropriately evaluating the discrepancy between domains is essential for the success of unsupervised domain adaptation.

Binary Classification Classification +3

Paper
Add Code

On the Calibration of Multiclass Classification with Rejection

1 code implementation • NeurIPS 2019 • Chenri Ni, Nontawat Charoenphakdee, Junya Honda, Masashi Sugiyama

First, we consider an approach based on simultaneous training of a classifier and a rejector, which achieves the state-of-the-art performance in the binary case.

Classification General Classification

Paper
Code

Revisiting Sample Selection Approach to Positive-Unlabeled Learning: Turning Unlabeled Data into Positive rather than Negative

no code implementations • 29 Jan 2019 • Miao Xu, Bingcong Li, Gang Niu, Bo Han, Masashi Sugiyama

May there be a new sample selection method that can outperform the latest importance reweighting method in the deep learning age?

Memorization

Paper
Add Code

Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric

1 code implementation • 28 Jan 2019 • Yongchan Kwon, Wonyoung Kim, Masashi Sugiyama, Myunghee Cho Paik

We consider the problem of learning a binary classifier from only positive and unlabeled observations (called PU learning).

Hyperparameter Optimization

Paper
Code

Imitation Learning from Imperfect Demonstration

no code implementations • 27 Jan 2019 • Yueh-Hua Wu, Nontawat Charoenphakdee, Han Bao, Voot Tangkaratt, Masashi Sugiyama

Imitation learning (IL) aims to learn an optimal policy from demonstrations.

Imitation Learning

Paper
Add Code

On Symmetric Losses for Learning from Corrupted Labels

1 code implementation • 27 Jan 2019 • Nontawat Charoenphakdee, Jongyeong Lee, Masashi Sugiyama

This paper aims to provide a better understanding of a symmetric loss.

General Classification

Paper
Code

Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis

no code implementations • ICML 2020 • Yusuke Tsuzuku, Issei Sato, Masashi Sugiyama

However, existing definitions of the flatness are known to be sensitive to the rescaling of parameters.

Paper
Add Code

How does Disagreement Help Generalization against Label Corruption?

3 code implementations • 14 Jan 2019 • Xingrui Yu, Bo Han, Jiangchao Yao, Gang Niu, Ivor W. Tsang, Masashi Sugiyama

Learning with noisy labels is one of the hottest problems in weakly-supervised learning.

Ranked #13 on Learning with noisy labels on CIFAR-100N

Learning with noisy labels Memorization +1

Paper
Code

Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization

1 code implementation • ICLR 2019 • Takayuki Osa, Voot Tangkaratt, Masashi Sugiyama

However, identifying the hierarchical policy structure that enhances the performance of RL is not a trivial task.

Continuous Control Hierarchical Reinforcement Learning +2

Paper
Code

Active Deep Q-learning with Demonstration

no code implementations • 6 Dec 2018 • Si-An Chen, Voot Tangkaratt, Hsuan-Tien Lin, Masashi Sugiyama

In this work, we propose Active Reinforcement Learning with Demonstration (ARLD), a new framework to streamline RL in terms of demonstration efforts by allowing the RL agent to query for demonstration actively during training.

Q-Learning reinforcement-learning +1

Paper
Add Code

Complementary-Label Learning for Arbitrary Losses and Models

1 code implementation • Proceedings of the 36th International Conference on Machine Learning, 2019 • Takashi Ishida, Gang Niu, Aditya Krishna Menon, Masashi Sugiyama

In contrast to the standard classification paradigm where the true class is given to each training pattern, complementary-label learning only uses training patterns each equipped with a complementary label, which only specifies one of the classes that the pattern does not belong to.

Ranked #22 on Image Classification on Kuzushiji-MNIST

General Classification Image Classification

Paper
Code

Classification from Positive, Unlabeled and Biased Negative Data

1 code implementation • ICLR 2019 • Yu-Guan Hsieh, Gang Niu, Masashi Sugiyama

In binary classification, there are situations where negative (N) data are too diverse to be fully labeled and we often resort to positive-unlabeled (PU) learning in these scenarios.

Binary Classification Classification +1

Paper
Code

SIGUA: Forgetting May Make Learning with Noisy Labels More Robust

1 code implementation • ICML 2020 • Bo Han, Gang Niu, Xingrui Yu, Quanming Yao, Miao Xu, Ivor Tsang, Masashi Sugiyama

Given data with noisy labels, over-parameterized deep networks can gradually memorize the data, and fit everything in the end.

Learning with noisy labels Memorization

Paper
Code

Improving Generative Adversarial Imitation Learning with Non-expert Demonstrations

no code implementations • 27 Sep 2018 • Voot Tangkaratt, Masashi Sugiyama

Imitation learning aims to learn an optimal policy from expert demonstrations and its recent combination with deep learning has shown impressive performance.

Continuous Control Imitation Learning

Paper
Add Code

Pumpout: A Meta Approach for Robustly Training Deep Neural Networks with Noisy Labels

no code implementations • 27 Sep 2018 • Bo Han, Gang Niu, Jiangchao Yao, Xingrui Yu, Miao Xu, Ivor Tsang, Masashi Sugiyama

To handle these issues, by using the memorization effects of deep neural networks, we may train deep neural networks on the whole dataset only the first few iterations.

Memorization

Paper
Add Code

Positive-Unlabeled Classification under Class Prior Shift and Asymmetric Error

no code implementations • 19 Sep 2018 • Nontawat Charoenphakdee, Masashi Sugiyama

Based on the analysis of the Bayes optimal classifier, we show that given a test class prior, PU classification under class prior shift is equivalent to PU classification with asymmetric error.

Binary Classification Classification +2

Paper
Add Code

Alternate Estimation of a Classifier and the Class-Prior from Positive and Unlabeled Data

no code implementations • 15 Sep 2018 • Masahiro Kato, Liyuan Xu, Gang Niu, Masashi Sugiyama

In this paper, we propose a novel unified approach to estimating the class-prior and training a classifier alternately.

Paper
Add Code

Dueling Bandits with Qualitative Feedback

no code implementations • 14 Sep 2018 • Liyuan Xu, Junya Honda, Masashi Sugiyama

We formulate and study a novel multi-armed bandit problem called the qualitative dueling bandit (QDB) problem, where an agent observes not numeric but qualitative feedback by pulling each arm.

Paper
Add Code

Clipped Matrix Completion: A Remedy for Ceiling Effects

no code implementations • 13 Sep 2018 • Takeshi Teshima, Miao Xu, Issei Sato, Masashi Sugiyama

On the other hand, matrix completion (MC) methods can recover a low-rank matrix from various information deficits by using the principle of low-rank completion.

Matrix Completion Recommendation Systems

Paper
Add Code

Unsupervised Domain Adaptation Based on Source-guided Discrepancy

no code implementations • 11 Sep 2018 • Seiichi Kuroki, Nontawat Charoenphakdee, Han Bao, Junya Honda, Issei Sato, Masashi Sugiyama

A previously proposed discrepancy that does not use the source domain labels requires high computational cost to estimate and may lead to a loose generalization error bound in the target domain.

Unsupervised Domain Adaptation

Paper
Add Code

On the Minimal Supervision for Training Any Binary Classifier from Only Unlabeled Data

1 code implementation • ICLR 2019 • Nan Lu, Gang Niu, Aditya Krishna Menon, Masashi Sugiyama

In this paper, we study training arbitrary (from linear to deep) binary classifier from only unlabeled (U) data by ERM.

Paper
Code

Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces

no code implementations • NeurIPS 2018 • Motoya Ohnishi, Masahiro Yukawa, Mikael Johansson, Masashi Sugiyama

Motivated by the success of reinforcement learning (RL) for discrete-time tasks such as AlphaGo and Atari games, there has been a recent surge of interest in using RL for continuous-time control of physical systems (cf.

Atari Games Gaussian Processes +2

Paper
Add Code

Matrix Co-completion for Multi-label Classification with Missing Features and Labels

no code implementations • 23 May 2018 • Miao Xu, Gang Niu, Bo Han, Ivor W. Tsang, Zhi-Hua Zhou, Masashi Sugiyama

We consider a challenging multi-label classification problem where both feature matrix $\X$ and label matrix $\Y$ have missing entries.

General Classification Matrix Completion +1

Paper
Add Code

Masking: A New Perspective of Noisy Supervision

2 code implementations • NeurIPS 2018 • Bo Han, Jiangchao Yao, Gang Niu, Mingyuan Zhou, Ivor Tsang, Ya zhang, Masashi Sugiyama

It is important to learn various types of classifiers given training data with noisy labels.

Ranked #42 on Image Classification on Clothing1M (using extra training data)

Image Classification

481

Paper
Code

Bayesian posterior approximation via greedy particle optimization

no code implementations • 21 May 2018 • Futoshi Futami, Zhenghang Cui, Issei Sato, Masashi Sugiyama

Another example is the Stein points (SP) method, which minimizes kernelized Stein discrepancy directly.

Bayesian Inference Variational Inference

Paper
Add Code

Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels

5 code implementations • NeurIPS 2018 • Bo Han, Quanming Yao, Xingrui Yu, Gang Niu, Miao Xu, Weihua Hu, Ivor Tsang, Masashi Sugiyama

Deep learning with noisy labels is practically challenging, as the capacity of deep models is so high that they can totally memorize these noisy labels sooner or later during training.

Ranked #8 on Learning with noisy labels on CIFAR-10N-Random3

Learning with noisy labels Memorization

481

Paper
Code

A Feature-Driven Active Framework for Ultrasound-Based Brain Shift Compensation

no code implementations • 20 Mar 2018 • Jie Luo, Matt Toews, Ines Machado, Sarah Frisken, Miaomiao Zhang, Frank Preiswerk, Alireza Sedghi, Hongyi Ding, Steve Pieper, Polina Golland, Alexandra Golby, Masashi Sugiyama, William M. Wells III

Kernels of the GP are estimated by using variograms and a discrete grid search method.

Paper
Add Code

Uplift Modeling from Separate Labels

1 code implementation • NeurIPS 2018 • Ikko Yamane, Florian Yger, Jamal Atif, Masashi Sugiyama

Uplift modeling is aimed at estimating the incremental impact of an action on an individual's behavior, which is useful in various application domains such as targeted marketing (advertisement campaigns) and personalized medicine (medical treatments).

Marketing

Paper
Code

On the Applicability of Registration Uncertainty

no code implementations • 14 Mar 2018 • Jie Luo, Alireza Sedghi, Karteek Popuri, Dana Cobzas, Miaomiao Zhang, Frank Preiswerk, Matthew Toews, Alexandra Golby, Masashi Sugiyama, William M. Wells III, Sarah Frisken

For probabilistic image registration (PIR), the predominant way to quantify the registration uncertainty is using summary statistics of the distribution of transformation parameters.

Image Registration

Paper
Add Code

Binary Matrix Completion Using Unobserved Entries

no code implementations • 13 Mar 2018 • Masayoshi Hayashi, Tomoya Sakai, Masashi Sugiyama

In this paper, motivated by a semi-supervised classification method recently proposed by Sakai et al. (2017), we develop a method for the BMC problem which can use all of positive, negative, and unobserved entries, by combining the risks of Davenport et al. (2014) and Hsieh et al. (2015).

General Classification Matrix Completion +1

Paper
Add Code

Variational Inference for Gaussian Process with Panel Count Data

no code implementations • 12 Mar 2018 • Hongyi Ding, Young Lee, Issei Sato, Masashi Sugiyama

We present the first framework for Gaussian-process-modulated Poisson processes when the temporal data appear in the form of panel counts.

Variational Inference

Paper
Add Code

Active Feature Acquisition with Supervised Matrix Completion

no code implementations • 15 Feb 2018 • Sheng-Jun Huang, Miao Xu, Ming-Kun Xie, Masashi Sugiyama, Gang Niu, Songcan Chen

Feature missing is a serious problem in many applications, which may lead to low quality of training data and further significantly degrade the learning performance.

Matrix Completion

Paper
Add Code

Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

1 code implementation • ICML 2018 • Hideaki Imamura, Issei Sato, Masashi Sugiyama

In this paper, we derive a minimax error rate under more practical setting for a broader class of crowdsourcing models including the DS model as a special case.

Clustering

Paper
Code

Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks

2 code implementations • NeurIPS 2018 • Yusuke Tsuzuku, Issei Sato, Masashi Sugiyama

High sensitivity of neural networks against malicious perturbations on inputs causes security concerns.

Paper
Code

Gaussian Process Classification with Privileged Information by Soft-to-Hard Labeling Transfer

no code implementations • 12 Feb 2018 • Ryosuke Kamesawa, Issei Sato, Masashi Sugiyama

A state-of-the-art method of Gaussian process classification (GPC) with privileged information is GPC+, which incorporates privileged information into a noise term of the likelihood.

Gaussian Processes General Classification +1

Paper
Add Code

Classification from Pairwise Similarity and Unlabeled Data

2 code implementations • ICML 2018 • Han Bao, Gang Niu, Masashi Sugiyama

Supervised learning needs a huge amount of labeled data, which can be a big bottleneck under the situation where there is a privacy concern or labeling cost is high.

Classification General Classification +1

Paper
Code

Generative Local Metric Learning for Kernel Regression

no code implementations • NeurIPS 2017 • Yung-Kyun Noh, Masashi Sugiyama, Kee-Eung Kim, Frank Park, Daniel D. Lee

This paper shows how metric learning can be used with Nadaraya-Watson (NW) kernel regression.

Metric Learning regression

Paper
Add Code

Hierarchical Policy Search via Return-Weighted Density Estimation

no code implementations • 28 Nov 2017 • Takayuki Osa, Masashi Sugiyama

Learning an optimal policy from a multi-modal reward function is a challenging problem in reinforcement learning (RL).

Density Estimation Motion Planning +1

Paper
Add Code

Binary Classification from Positive-Confidence Data

1 code implementation • NeurIPS 2018 • Takashi Ishida, Gang Niu, Masashi Sugiyama

Can we learn a binary classifier from only positive data, without any negative data or unlabeled data?

Binary Classification Classification +3

Paper
Code

Variational Inference based on Robust Divergences

1 code implementation • 18 Oct 2017 • Futoshi Futami, Issei Sato, Masashi Sugiyama

In this paper, based on Zellner's optimization and variational formulation of Bayesian inference, we propose an outlier-robust pseudo-Bayesian variational method by replacing the Kullback-Leibler divergence used for data fitting to a robust divergence such as the beta- and gamma-divergences.

Bayesian Inference Variational Inference

Paper
Code

Good Arm Identification via Bandit Feedback

no code implementations • 17 Oct 2017 • Hideaki Kano, Junya Honda, Kentaro Sakamaki, Kentaro Matsuura, Atsuyoshi Nakamura, Masashi Sugiyama

We consider a novel stochastic multi-armed bandit problem called {\em good arm identification} (GAI), where a good arm is defined as an arm with expected reward greater than or equal to a given threshold.

Paper
Add Code

Fully adaptive algorithm for pure exploration in linear bandits

no code implementations • 16 Oct 2017 • Liyuan Xu, Junya Honda, Masashi Sugiyama

We propose the first fully-adaptive algorithm for pure exploration in linear bandits---the task to find the arm with the largest expected reward, which depends on an unknown parameter linearly.

Paper
Add Code

Information-Theoretic Representation Learning for Positive-Unlabeled Classification

no code implementations • 15 Oct 2017 • Tomoya Sakai, Gang Niu, Masashi Sugiyama

Recent advances in weakly supervised classification allow us to train a classifier only from positive and unlabeled (PU) data.

Classification Dimensionality Reduction +3

Paper
Add Code

Mode-Seeking Clustering and Density Ridge Estimation via Direct Estimation of Density-Derivative-Ratios

no code implementations • 6 Jul 2017 • Hiroaki Sasaki, Takafumi Kanamori, Aapo Hyvärinen, Gang Niu, Masashi Sugiyama

Based on the proposed estimator, novel methods both for mode-seeking clustering and density ridge estimation are developed, and the respective convergence rates to the mode and ridge of the underlying density are also established.

Clustering Density Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.