TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Medical Code Prediction	MIMIC-III	RAC	Macro-AUC	94.8	# 3
Medical Code Prediction	MIMIC-III	RAC	Micro-AUC	99.2	# 2
Medical Code Prediction	MIMIC-III	RAC	Macro-F1	12.7	# 2
Medical Code Prediction	MIMIC-III	RAC	Micro-F1	58.6	# 4
Medical Code Prediction	MIMIC-III	RAC	Precision@5	82.9	# 1
Medical Code Prediction	MIMIC-III	RAC	Precision@8	75.4	# 4
Medical Code Prediction	MIMIC-III	RAC	Precision@15	60.1	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/read-attend-and-code-pushing-the-limits-of/medical-code-prediction-on-mimic-iii)](https://paperswithcode.com/sota/medical-code-prediction-on-mimic-iii?p=read-attend-and-code-pushing-the-limits-of)`

Read, Attend, and Code: Pushing the Limits of Medical Codes Prediction from Clinical Notes by Machines

10 Jul 2021 · Byung-Hak Kim, Varun Ganapathi ·

Prediction of medical codes from clinical notes is both a practical and essential need for every healthcare delivery organization within current medical systems. Automating annotation will save significant time and excessive effort spent by human coders today. However, the biggest challenge is directly identifying appropriate medical codes out of several thousands of high-dimensional codes from unstructured free-text clinical notes. In the past three years, with Convolutional Neural Networks (CNN) and Long Short-Term Memory (LTSM) networks, there have been vast improvements in tackling the most challenging benchmark of the MIMIC-III-full-label inpatient clinical notes dataset. This progress raises the fundamental question of how far automated machine learning (ML) systems are from human coders' working performance. We assessed the baseline of human coders' performance on the same subsampled testing set. We also present our Read, Attend, and Code (RAC) model for learning the medical code assignment mappings. By connecting convolved embeddings with self-attention and code-title guided attention modules, combined with sentence permutation-based data augmentations and stochastic weight averaging training, RAC establishes a new state of the art (SOTA), considerably outperforming the current best Macro-F1 by 18.7%, and reaches past the human-level coding baseline. This new milestone marks a meaningful step toward fully autonomous medical coding (AMC) in machines reaching parity with human coders' performance in medical code prediction.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Medical Code Prediction

Multi-Label Classification Of Biomedical Texts

Sentence

Datasets

MIMIC-III

Results from the Paper

Edit

Ranked #4 on Medical Code Prediction on MIMIC-III

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Medical Code Prediction	MIMIC-III	RAC	Macro-AUC	94.8	# 3	Compare
			Micro-AUC	99.2	# 2	Compare
			Macro-F1	12.7	# 2	Compare
			Micro-F1	58.6	# 4	Compare
			Precision@5	82.9	# 1	Compare
			Precision@8	75.4	# 4	Compare
			Precision@15	60.1	# 4	Compare

Methods

Add Remove

1D CNN • Linear Layer • Multi-Head Attention • Scaled Dot-Product Attention • Softmax • Stochastic Weight Averaging

Edit Social Preview

Read, Attend, and Code: Pushing the Limits of Medical Codes Prediction from Clinical Notes by Machines

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove