Selection bias

78 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

PanNuke Dataset Extension, Insights and Baselines

TIA-Lab/PanNuke-metrics 24 Mar 2020

The emerging area of computational pathology (CPath) is ripe ground for the application of deep learning (DL) methods to healthcare due to the sheer volume of raw pixel data in whole-slide images (WSIs) of cancerous tissue slides.

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

shenweichen/DeepCTR 21 Apr 2018

To the best of our knowledge, this is the first public dataset which contains samples with sequential dependence of click and conversion labels for CVR modeling.

A Debiased MDI Feature Importance Measure for Random Forests

shifwang/paper-debiased-feature-importance NeurIPS 2019

Based on the original definition of MDI by Breiman et al. for a single tree, we derive a tight non-asymptotic bound on the expected bias of MDI importance of noisy features, showing that deep trees have higher (expected) feature selection bias than shallow ones.

Active Structure Learning of Causal DAGs via Directed Clique Tree

csquires/dct-policy 1 Nov 2020

Most existing works focus on worst-case or average-case lower bounds for the number of interventions required to orient a DAG.

Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets

arthua196/Leakage-Neutral-Learning-for-QuoraQP ACL 2019

Natural Language Sentence Matching (NLSM) has gained substantial attention from both academics and the industry, and rich public datasets contribute a lot to this process.

To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions

HarrieO/OnlineLearningToRank 15 Jul 2019

At the moment, two methodologies for dealing with bias prevail in the field of LTR: counterfactual methods that learn from historical data and model user behavior to deal with biases; and online methods that perform interventions to deal with bias but use no explicit user models.

Automated Dependence Plots

davidinouye/adp 2 Dec 2019

To address these drawbacks, we formalize a method for automating the selection of interesting PDPs and extend PDPs beyond showing single features to show the model response along arbitrary directions, for example in raw feature space or a latent space arising from some generative model.

Algorithm is Experiment: Machine Learning, Market Design, and Policy Eligibility Rules

rfgong/IVaps 26 Apr 2021

We use this observation to develop a treatment-effect estimator for a class of stochastic and deterministic decision-making algorithms.

Alleviating the Sample Selection Bias in Few-shot Learning by Removing Projection to the Centroid

kikimormay/fsl-tcbr 30 Oct 2022

In this paper, we find that this problem usually occurs when the positions of support samples are in the vicinity of task centroid -- the mean of all class centroids in the task.

Discovering and Explaining the Non-Causality of Deep Learning in SAR ATR

waterdisappear/data-bias-in-mstar 3 Apr 2023

Therefore, the degree of overfitting for clutter reflects the non-causality of deep learning in SAR ATR.