TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	V_A (trained on T_H)	M_C	Median Relative Edit Distance	0.28	# 1
Machine Translation	V_B (trained on T_H)	M_C	Median Relative Edit Distance	0.25	# 1
Machine Translation	V_C (trained on T_H)	M_C	Median Relative Edit Distance	0.27	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-automatic-parsing-of-log-records/machine-translation-on-v-a-trained-on-t-h)](https://paperswithcode.com/sota/machine-translation-on-v-a-trained-on-t-h?p=on-automatic-parsing-of-log-records)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-automatic-parsing-of-log-records/machine-translation-on-v-b-trained-on-t-h)](https://paperswithcode.com/sota/machine-translation-on-v-b-trained-on-t-h?p=on-automatic-parsing-of-log-records)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-automatic-parsing-of-log-records/machine-translation-on-v-c-trained-on-t-h)](https://paperswithcode.com/sota/machine-translation-on-v-c-trained-on-t-h?p=on-automatic-parsing-of-log-records)`

On Automatic Parsing of Log Records

12 Feb 2021 · Jared Rand, Andriy Miranskyy

Software log analysis helps maintain the health of software solutions and ensure compliance and security. Existing software systems consist of heterogeneous components that emit logs in various formats. A typical solution is to unify the logs using manually built parsers, which is laborious. Instead, we explore the possibility of automating the parsing task by employing machine translation (MT). We create a tool that generates synthetic Apache log records, which we use to train recurrent-neural-network-based MT models. Evaluation of the models on real-world logs shows that they can learn the Apache log format and parse individual log records. The median relative edit distance between an actual real-world log record and the MT prediction is less than or equal to 28%. Thus, we show that log parsing using an MT approach is promising.
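
The synthetic-data idea in the abstract can be illustrated with a minimal sketch. It assumes records in the Apache Common Log Format (`%h %l %u %t "%r" %>s %b`) and pairs each raw line with a dictionary of its fields as a possible translation target. The function name `synthetic_record`, the value pools, and the source/target pairing are hypothetical choices made for illustration; they are not the interface of the authors' log_generator tool.

```python
import random
from datetime import datetime, timedelta, timezone
from typing import Dict, Tuple

# Hypothetical value pools, chosen only for illustration.
METHODS = ["GET", "POST", "PUT", "DELETE"]
PATHS = ["/", "/index.html", "/api/items", "/images/logo.png"]
STATUSES = [200, 301, 404, 500]
USERS = ["-", "alice", "bob"]


def synthetic_record(rng: random.Random) -> Tuple[str, Dict[str, str]]:
    """Return one synthetic Common Log Format line and its field values."""
    start = datetime(2021, 1, 1, tzinfo=timezone.utc)
    moment = start + timedelta(seconds=rng.randint(0, 30 * 86_400))
    fields = {
        "host": ".".join(str(rng.randint(1, 254)) for _ in range(4)),
        "ident": "-",
        "user": rng.choice(USERS),
        "time": moment.strftime("%d/%b/%Y:%H:%M:%S %z"),
        "request": f"{rng.choice(METHODS)} {rng.choice(PATHS)} HTTP/1.1",
        "status": str(rng.choice(STATUSES)),
        "size": str(rng.randint(0, 50_000)),
    }
    # Assumed layout: host ident user [time] "request" status size
    line = '{host} {ident} {user} [{time}] "{request}" {status} {size}'.format(**fields)
    return line, fields


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sample is reproducible
    for _ in range(3):
        line, fields = synthetic_record(rng)
        print(line)
```

Because the generator knows the field values before it formats the line, every synthetic record comes with a ground-truth parse at no labelling cost, which is what makes this kind of data usable for supervised MT training.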

Code

WulffHunter/log_generator official

Tasks

Log Parsing

Machine Translation

Translation

Datasets

Introduced in the Paper:

Synthetic and Real Apache Log Records

Results from the Paper

Ranked #1 on Machine Translation on V_A (trained on T_H)

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Machine Translation	V_A (trained on T_H)	M_C	Median Relative Edit Distance	0.28	# 1
Machine Translation	V_B (trained on T_H)	M_C	Median Relative Edit Distance	0.25	# 1
Machine Translation	V_C (trained on T_H)	M_C	Median Relative Edit Distance	0.27	# 1
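
As a rough illustration of how a metric like the one in the table can be computed, the sketch below takes the character-level Levenshtein distance between each reference record and its prediction, divides it by the reference length, and reports the median over all pairs. Both the character-level comparison and the normalisation by reference length are assumptions; the paper may define the relative distance differently. The function names are hypothetical.

```python
from statistics import median
from typing import Iterable, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def relative_edit_distance(reference: str, prediction: str) -> float:
    # Assumption: normalise by the length of the reference record.
    if not reference:
        return 0.0 if not prediction else 1.0
    return levenshtein(reference, prediction) / len(reference)


def median_relative_edit_distance(pairs: Iterable[Tuple[str, str]]) -> float:
    """Median of the relative edit distances over (reference, prediction) pairs."""
    return median(relative_edit_distance(ref, pred) for ref, pred in pairs)


if __name__ == "__main__":
    pairs = [("abcd", "abcd"), ("abcd", "abcf"), ("abcd", "abff")]
    print(median_relative_edit_distance(pairs))
```

Under this definition, a value of 0.28 would mean that for half of the evaluated records the prediction differs from the reference by at most 28% of the reference length.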

Methods

GRU • LSTM
