Search Results for author: Haoran Zhang

Found 31 papers, 13 papers with code

Essay Quality Signals as Weak Supervision for Source-based Essay Scoring

no code implementations EACL (BEA) 2021 Haoran Zhang, Diane Litman

However, because AES typically uses supervised machine learning, a human-graded essay corpus is still required to train the AES model.

Automated Essay Scoring

Change is Hard: A Closer Look at Subpopulation Shift

1 code implementation 23 Feb 2023 Yuzhe Yang, Haoran Zhang, Dina Katabi, Marzyeh Ghassemi

Machine learning models often perform poorly on subgroups that are underrepresented in the training data.

Model Selection

SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling

no code implementations 2 Feb 2023 Jiaxiang Dong, Haixu Wu, Haoran Zhang, Li Zhang, Jianmin Wang, Mingsheng Long

By relating masked modeling to manifold learning, SimMTM proposes to recover masked time points by the weighted aggregation of multiple neighbors outside the manifold, which eases the reconstruction task by assembling ruined but complementary temporal variations from multiple masked series.

Representation Learning Time Series Analysis

Efficient Estimation for Longitudinal Network via Adaptive Merging

no code implementations 15 Nov 2022 Haoran Zhang, Junhui Wang

A longitudinal network consists of a sequence of temporal edges among multiple nodes, where the temporal edges are observed in real time.

Tensor Decomposition

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

no code implementations 19 Oct 2022 Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi

In this work, we introduce the problem of attributing performance differences between environments to distribution shifts in the underlying data generating mechanisms.

Signed Network Embedding with Application to Simultaneous Detection of Communities and Anomalies

no code implementations 8 Jul 2022 Haoran Zhang, Junhui Wang

This paper develops a unified embedding model for signed networks to disentangle the intertwined balance structure and anomaly effect, which can greatly facilitate the downstream analysis, including community detection, anomaly detection, and network inference.

Anomaly Detection Community Detection +1

The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations

no code implementations 6 May 2022 Aparna Balagopalan, Haoran Zhang, Kimia Hamidieh, Thomas Hartvigsen, Frank Rudzicz, Marzyeh Ghassemi

Across two different blackbox model architectures and four popular explainability methods, we find that the approximation quality of explanation models, also known as the fidelity, differs significantly between subgroups.

BIG-bench Machine Learning Fairness

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches

no code implementations 4 Apr 2022 Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou

However, its performance is often inferior to that of a blind source separation (BSS) counterpart with a similar network architecture, because the auxiliary speaker encoder may sometimes generate ambiguous speaker embeddings.

Metric Learning Speaker Separation +1

Improving the Fairness of Chest X-ray Classifiers

1 code implementation 23 Mar 2022 Haoran Zhang, Natalie Dullerud, Karsten Roth, Lauren Oakden-Rayner, Stephen Robert Pfohl, Marzyeh Ghassemi

We also find that methods which achieve group fairness do so by worsening performance for all groups.

Fairness

Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel

no code implementations 23 Feb 2022 Haoran Zhang, Chenkun Yin, Yanxin Zhang, Shangtai Jin, Zhenxuan Li

A new expert data generation method, called Model Predictive Based Expert (MPBE), which combines Model Predictive Control and Deep Deterministic Policy Gradient, is developed to provide high-quality supervision data for RLfD algorithms.

reinforcement-learning reinforcement Learning

Learning Optimal Predictive Checklists

1 code implementation NeurIPS 2021 Haoran Zhang, Quaid Morris, Berk Ustun, Marzyeh Ghassemi

Our results show that our method can fit simple predictive checklists that perform well and that can easily be customized to obey a rich class of custom constraints.

Fairness

Differentiable Projection for Constrained Deep Learning

no code implementations 21 Nov 2021 Dou Huang, Haoran Zhang, Xuan Song, Ryosuke Shibasaki

In this paper, we propose to use a differentiable projection layer in DNN instead of directly solving time-consuming KKT conditions.

Image Segmentation Semantic Segmentation

OneFlow: Redesign the Distributed Deep Learning Framework from Scratch

1 code implementation 28 Oct 2021 Jinhui Yuan, Xinqi Li, Cheng Cheng, Juncheng Liu, Ran Guo, Shenghang Cai, Chi Yao, Fei Yang, Xiaodong Yi, Chuan Wu, Haoran Zhang, Jie Zhao

Aiming at a simple, neat redesign of distributed deep learning frameworks for various parallelism paradigms, we present OneFlow, a novel distributed training framework based on an SBP (split, broadcast and partial-value) abstraction and the actor model.

An open GPS trajectory dataset and benchmark for travel mode detection

no code implementations 17 Sep 2021 Jinyu Chen, Haoran Zhang, Xuan Song, Ryosuke Shibasaki

In this study, we propose an open GPS trajectory dataset marked with travel mode and a benchmark for travel mode detection.

Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing

1 code implementation 27 Aug 2021 Sindhu C. M. Gowda, Shalmali Joshi, Haoran Zhang, Marzyeh Ghassemi

This systematic investigation underlines the importance of accounting for the underlying data-generating mechanisms and fortifying data-preprocessing pipelines with a causal framework to develop methods robust to confounding biases.

Benchmarking Data Augmentation +1

A comparison of approaches to improve worst-case predictive model performance over patient subpopulations

1 code implementation 27 Aug 2021 Stephen R. Pfohl, Haoran Zhang, Yizhe Xu, Agata Foryciarz, Marzyeh Ghassemi, Nigam H. Shah

Predictive models for clinical outcomes that are accurate on average in a patient population may underperform drastically for some subpopulations, potentially introducing or reinforcing inequities in care access and quality.

Invariance-based Multi-Clustering of Latent Space Embeddings for Equivariant Learning

no code implementations 25 Jul 2021 Chandrajit Bajaj, Avik Roy, Haoran Zhang

Variational Autoencoders (VAEs) have been shown to be remarkably effective in recovering model latent spaces for several computer vision tasks.

Reading Race: AI Recognises Patient's Racial Identity In Medical Images

no code implementations 21 Jul 2021 Imon Banerjee, Ananth Reddy Bhimireddy, John L. Burns, Leo Anthony Celi, Li-Ching Chen, Ramon Correa, Natalie Dullerud, Marzyeh Ghassemi, Shih-Cheng Huang, Po-Chih Kuo, Matthew P Lungren, Lyle Palmer, Brandon J Price, Saptarshi Purkayastha, Ayis Pyrros, Luke Oakden-Rayner, Chima Okechukwu, Laleh Seyyed-Kalantari, Hari Trivedi, Ryan Wang, Zachary Zaiman, Haoran Zhang, Judy W Gichoya

Methods: Using private and public datasets we evaluate: A) performance quantification of deep learning models to detect race from medical images, including the ability of these models to generalize to external environments and across multiple imaging modalities, B) assessment of possible confounding anatomic and phenotype population features, such as disease distribution and body habitus as predictors of race, and C) investigation into the underlying mechanism by which AI models can recognize race.

An Empirical Framework for Domain Generalization in Clinical Settings

1 code implementation 20 Mar 2021 Haoran Zhang, Natalie Dullerud, Laleh Seyyed-Kalantari, Quaid Morris, Shalmali Joshi, Marzyeh Ghassemi

In this work, we benchmark the performance of eight domain generalization methods on multi-site clinical time series and medical imaging data.

Domain Generalization Time Series Analysis

Incorporating Inner-word and Out-word Features for Mongolian Morphological Segmentation

no code implementations COLING 2020 Na Liu, Xiangdong Su, Haoran Zhang, Guanglai Gao, Feilong Bao

The inner-word encoder uses self-attention mechanisms to capture the inner-word features of the target word.

An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare

1 code implementation 23 Nov 2020 Taylor W. Killian, Haoran Zhang, Jayakumar Subramanian, Mehdi Fatemi, Marzyeh Ghassemi

Reinforcement Learning (RL) has recently been applied to sequential estimation and prediction problems identifying and developing hypothetical treatment strategies for septic patients, with a particular focus on offline learning with observational data.

reinforcement-learning reinforcement Learning +1

Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring

no code implementations ACL 2020 Haoran Zhang, Diane Litman

While automated essay scoring (AES) can reliably grade essays at scale, automated writing evaluation (AWE) additionally provides formative feedback to guide essay revision.

Automated Essay Scoring Automated Writing Evaluation

Hurtful Words: Quantifying Biases in Clinical Contextual Word Embeddings

1 code implementation 11 Mar 2020 Haoran Zhang, Amy X. Lu, Mohamed Abdalla, Matthew McDermott, Marzyeh Ghassemi

In this work, we examine the extent to which embeddings may encode marginalized populations differently, and how this may lead to a perpetuation of biases and worsened performance on clinical tasks.

Fairness Word Embeddings

Word Embedding for Response-To-Text Assessment of Evidence

no code implementations ACL 2017 Haoran Zhang, Diane Litman

Our long-term goal is to also use this scoring method to provide formative feedback to students and teachers about students' writing quality.

Automated Essay Scoring

Co-Attention Based Neural Network for Source-Dependent Essay Scoring

1 code implementation WS 2018 Haoran Zhang, Diane Litman

This paper presents an investigation of using a co-attention based neural network for source-dependent essay scoring.

Automated Essay Scoring

Dose-response modeling in high-throughput cancer drug screenings: An end-to-end approach

1 code implementation 13 Dec 2018 Wesley Tansey, Kathy Li, Haoran Zhang, Scott W. Linderman, Raul Rabadan, David M. Blei, Chris H. Wiggins

Personalized cancer treatments based on the molecular profile of a patient's tumor are an emerging and exciting class of treatments in oncology.

Applications

The Holdout Randomization Test for Feature Selection in Black Box Models

3 code implementations 1 Nov 2018 Wesley Tansey, Victor Veitch, Haoran Zhang, Raul Rabadan, David M. Blei

We propose the holdout randomization test (HRT), an approach to feature selection using black box predictive models.

Methodology
