Search Results for author: Wonseok Hwang

Found 20 papers, 12 papers with code

LARGE: Legal Retrieval Augmented Generation Evaluation Tool

1 code implementation • 2 Apr 2025 • Minhu Park, Hongseok Oh, Eunkyung Choi, Wonseok Hwang

Recently, building retrieval-augmented generation (RAG) systems to enhance the capability of large language models (LLMs) has become a common practice.

RAG, Retrieval

Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models

1 code implementation • 11 Oct 2024 • Yeeun Kim, Young Rok Choi, Eunkyung Choi, Jinhwan Choi, Hai Jin Park, Wonseok Hwang

Here, we introduce KBL, a benchmark for assessing the Korean legal language understanding of LLMs, consisting of (1) 7 legal knowledge tasks (510 examples), (2) 4 legal reasoning tasks (288 examples), and (3) the Korean bar exam (4 domains, 53 tasks, 2,510 examples).

Legal Reasoning, RAG

Does Alignment Tuning Really Break LLMs' Internal Confidence?

1 code implementation • 31 Aug 2024 • Hongseok Oh, Wonseok Hwang

Large Language Models (LLMs) have shown remarkable progress, but their real-world application necessitates reliable calibration.

Instruction Following

On the Consideration of AI Openness: Can Good Intent Be Abused?

no code implementations • 11 Mar 2024 • Yeeun Kim, Hyunseo Shin, Eunkyung Choi, Hongseok Oh, Hyunjun Kim, Wonseok Hwang

Open source is a driving force behind scientific advancement. However, this openness is also a double-edged sword, with the inherent risk that innovative technologies can be misused for purposes harmful to society.

Domain Generalization

SymBa: Symbolic Backward Chaining for Structured Natural Language Reasoning

no code implementations • 20 Feb 2024 • Jinu Lee, Wonseok Hwang

To improve the performance and explainability of LLM-based natural language reasoning, structured reasoning can be applied to generate explicitly structured proofs.

Arithmetic Reasoning, GSM8K, +1 more

Data-efficient End-to-end Information Extraction for Statistical Legal Analysis

1 code implementation • 3 Nov 2022 • Wonseok Hwang, Saehee Eom, Hanuhl Lee, Hai Jin Park, Minjoon Seo

Lawyers, for instance, search for appropriate precedents favorable to their clients, while the number of legal precedents is ever-growing.

Instance Search

A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction

1 code implementation • 10 Jun 2022 • Wonseok Hwang, Dongjun Lee, Kyoungyeon Cho, Hanuhl Lee, Minjoon Seo

Here we present the first large-scale benchmark of Korean legal AI datasets, LBOX OPEN, that consists of one legal corpus, two classification tasks, two legal judgement prediction (LJP) tasks, and one summarization task.

Language Modelling

OCR-free Document Understanding Transformer

5 code implementations • 30 Nov 2021 • Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park

Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs.

Document Image Classification, Document Understanding, +4 more

Cost-effective End-to-end Information Extraction for Semi-structured Document Images

no code implementations • EMNLP 2021 • Wonseok Hwang, Hyunji Lee, Jinyeong Yim, Geewook Kim, Minjoon Seo

A real-world information extraction (IE) system for semi-structured document images often involves a long pipeline of multiple modules, whose complexity dramatically increases its development and maintenance cost.

BROS: A Pre-trained Language Model for Understanding Texts in Document

no code implementations • 1 Jan 2021 • Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park

Although the recent advance in OCR enables the accurate extraction of text segments, it is still challenging to extract key information from documents due to the diversity of layouts.

Decoder, Diversity, +5 more

Tractable loss function and color image generation of multinary restricted Boltzmann machine

no code implementations • 27 Nov 2020 • Juno Hwang, Wonseok Hwang, Junghyo Jo

The restricted Boltzmann machine (RBM) is a representative generative model based on the concept of statistical mechanics.

Image Generation

Spatial Dependency Parsing for Semi-Structured Document Information Extraction

1 code implementation • Findings (ACL) 2021 • Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Sohee Yang, Minjoon Seo

Information Extraction (IE) for semi-structured document images is often approached as a sequence tagging problem by classifying each recognized input token into one of the IOB (Inside, Outside, and Beginning) categories.

Dependency Parsing

Syntactic Question Abstraction and Retrieval for Data-Scarce Semantic Parsing

no code implementations • AKBC 2020 • Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Minjoon Seo

Deep learning approaches to semantic parsing require a large amount of labeled data, but annotating complex logical forms is costly.

Retrieval, Semantic Parsing

A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization

5 code implementations • 4 Feb 2019 • Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Minjoon Seo

We present SQLova, the first Natural-language-to-SQL (NL2SQL) model to achieve human performance in WikiSQL dataset.

Decoder, Semantic Parsing
