Search Results for author: Yu Su

Found 55 papers, 28 papers with code

Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion

no code implementations ACL 2022 Chen Zhao, Yu Su, Adam Pauls, Emmanouil Antonios Platanios

Text-to-SQL parsers map natural language questions to programs that are executable over tables to generate answers, and are typically evaluated on large-scale datasets like Spider (Yu et al., 2018).

Domain Generalization SQL Parsing +1

Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms

1 code implementation26 May 2023 Tianshu Zhang, Changchang Liu, Wei-Han Lee, Yu Su, Huan Sun

By leveraging data from multiple clients, the FL paradigm can be especially beneficial for clients that have little training data to develop a data-hungry neural semantic parser on their own.

Federated Learning Semantic Parsing +1

Error Detection for Text-to-SQL Semantic Parsing

no code implementations23 May 2023 Shijie Chen, Ziru Chen, Huan Sun, Yu Su

Despite remarkable progress in text-to-SQL semantic parsing in recent years, the performance of existing parsers is still far from perfect.

Language Modelling Semantic Parsing +1

Adaptive Chameleon or Stubborn Sloth: Unraveling the Behavior of Large Language Models in Knowledge Clashes

1 code implementation22 May 2023 Jian Xie, Kai Zhang, Jiangjie Chen, Renze Lou, Yu Su

By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory.

Retrieval

Text-to-SQL Error Correction with Language Models of Code

1 code implementation22 May 2023 Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su, Huan Sun

Thus, we propose a novel representation for SQL queries and their edits that adheres more closely to the pre-training corpora of language models of code.

SQL Parsing Text-To-SQL

Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors

1 code implementation18 May 2023 Kai Zhang, Bernal Jiménez Gutiérrez, Yu Su

Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting.

Instruction Following Question Answering +1

Memorization for Good: Encryption with Autoregressive Language Models

1 code implementation15 May 2023 Samuel Stevens, Yu Su

Over-parameterized neural language models (LMs) can memorize and recite long sequences of training data.

Cryptanalysis Memorization

Automatic Evaluation of Attribution by Large Language Models

1 code implementation10 May 2023 Xiang Yue, Boshi Wang, Kai Zhang, Ziru Chen, Yu Su, Huan Sun

To facilitate the evaluation, we manually curate a set of test examples covering 12 domains from a generative search engine, New Bing.

Fact Checking Language Modelling +2

PiML Toolbox for Interpretable Machine Learning Model Development and Validation

1 code implementation7 May 2023 Agus Sudjianto, Aijun Zhang, Zebin Yang, Yu Su, Ningzhou Zeng

PiML (read $\pi$-ML, /`pai.`em.`el/) is an integrated and open-access Python toolbox for interpretable machine learning model development and model diagnostics.

Fairness Interpretable Machine Learning

Quiz-based Knowledge Tracing

no code implementations5 Apr 2023 Shuanghong Shen, Enhong Chen, Bihan Xu, Qi Liu, Zhenya Huang, Linbo Zhu, Yu Su

In this paper, we present the Quiz-based Knowledge Tracing (QKT) model to monitor students' knowledge states according to their quiz-based learning interactions.

Decision Making Knowledge Tracing

A Retrieve-and-Read Framework for Knowledge Graph Link Prediction

no code implementations19 Dec 2022 Vardaan Pahuja, Boshi Wang, Hugo Latapie, Jayanth Srinivasa, Yu Su

To address the limitations of existing KG link prediction frameworks, we propose a novel retrieve-and-read framework, which first retrieves a relevant subgraph context for the query and then jointly reasons over the context and the query with a high-capacity reader.

Link Prediction

Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments

no code implementations19 Dec 2022 Yu Gu, Xiang Deng, Yu Su

Most existing work for grounded language understanding uses LMs to directly generate plans that can be executed in the environment to achieve the desired effects.

Knowledge Base Question Answering

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

no code implementations8 Dec 2022 Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei-Lun Chao, Yu Su

In this work, we propose a novel method, LLM-Planner, that harnesses the power of large language models to do few-shot planning for embodied agents.

Hand Hygiene Assessment via Joint Step Segmentation and Key Action Scorer

no code implementations25 Sep 2022 Chenglong Li, Qiwen Zhu, Tubiao Liu, Jin Tang, Yu Su

To address this issue, we design a multi-stage convolution-transformer network for step segmentation.

Action Assessment

Knowledge Base Question Answering: A Semantic Parsing Perspective

no code implementations12 Sep 2022 Yu Gu, Vardaan Pahuja, Gong Cheng, Yu Su

In this survey, we situate KBQA in the broader literature of semantic parsing and give a comprehensive account of how existing KBQA approaches attempt to address the unique challenges.

Knowledge Base Question Answering Semantic Parsing +1

Bootstrapping a User-Centered Task-Oriented Dialogue System

no code implementations11 Jul 2022 Shijie Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Lingbo Mo, Samuel Stevens, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu Su, Huan Sun

We present TacoBot, a task-oriented dialogue system built for the inaugural Alexa Prize TaskBot Challenge, which assists users in completing multi-step cooking and home improvement tasks.

Data Augmentation Dialogue Management +2

When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems

1 code implementation24 May 2022 Elias Stengel-Eskin, Emmanouil Antonios Platanios, Adam Pauls, Sam Thomson, Hao Fang, Benjamin Van Durme, Jason Eisner, Yu Su

Rejecting class imbalance as the sole culprit, we reveal that the trend is closely associated with an effect we call source signal dilution, where strong lexical cues for the new symbol become diluted as the training dataset grows.

Intent Recognition Natural Language Understanding +1

ArcaneQA: Dynamic Program Induction and Contextualized Encoding for Knowledge Base Question Answering

1 code implementation COLING 2022 Yu Gu, Yu Su

Question answering on knowledge bases (KBQA) poses a unique challenge for semantic parsing research due to two intertwined challenges: large search space and ambiguities in schema linking.

Knowledge Base Question Answering Program induction +1

Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again

1 code implementation16 Mar 2022 Bernal Jiménez Gutiérrez, Nikolas McNeal, Clay Washington, You Chen, Lang Li, Huan Sun, Yu Su

In this paper, we present the first systematic and comprehensive study to compare the few-shot performance of GPT-3 in-context learning with fine-tuning smaller (i. e., BERT-sized) PLMs on two highly representative biomedical information extraction tasks, named entity recognition and relation extraction.

Model Selection named-entity-recognition +4

One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones

1 code implementation CVPR 2022 Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su

We study the problem of developing autonomous agents that can follow human instructions to infer and perform a sequence of actions to complete the underlying task.

Vision and Language Navigation

Compositional Generalization for Natural Language Interfaces to Web APIs

no code implementations9 Dec 2021 Saghar Hosseini, Ahmed Hassan Awadallah, Yu Su

We define new compositional generalization tasks for NL2API which explore the models' ability to extrapolate from simple API calls in the training set to new and more complex API calls in the inference phase.

Semantic Parsing

ReasonBERT: Pre-trained to Reason with Distant Supervision

1 code implementation EMNLP 2021 Xiang Deng, Yu Su, Alyssa Lees, You Wu, Cong Yu, Huan Sun

We present ReasonBert, a pre-training method that augments language models with the ability to reason over long-range relations and multiple, possibly hybrid contexts.

Extractive Question-Answering Question Answering +1

A Systematic Investigation of KB-Text Embedding Alignment at Scale

1 code implementation ACL 2021 Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen, Yu Su

Knowledge bases (KBs) and text often contain complementary knowledge: KBs store structured knowledge that can support long range reasoning, while text stores more comprehensive and timely knowledge in an unstructured way.

Link Prediction

Quality meets Diversity: A Model-Agnostic Framework for Computerized Adaptive Testing

no code implementations15 Jan 2021 Haoyang Bi, Haiping Ma, Zhenya Huang, Yu Yin, Qi Liu, Enhong Chen, Yu Su, Shijin Wang

In this paper, we study a novel model-agnostic CAT problem, where we aim to propose a flexible framework that can adapt to different cognitive models.

Active Learning

Explainable Recommendation Systems by Generalized Additive Models with Manifest and Latent Interactions

no code implementations15 Dec 2020 Yifeng Guo, Yu Su, Zebin Yang, Aijun Zhang

In this paper, we propose the explainable recommendation systems based on a generalized additive model with manifest and latent interactions (GAMMLI).

Additive models Collaborative Filtering +2

An Investigation of Language Model Interpretability via Sentence Editing

2 code implementations EMNLP (BlackboxNLP) 2021 Samuel Stevens, Yu Su

Pre-trained language models (PLMs) like BERT are being used for almost all language-related tasks, but interpreting their behavior still remains a significant challenge and many important questions remain largely unanswered.

General Classification Language Modelling

Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases

1 code implementation16 Nov 2020 Yu Gu, Sue Kase, Michelle Vanni, Brian Sadler, Percy Liang, Xifeng Yan, Yu Su

To facilitate the development of KBQA models with stronger generalization, we construct and release a new large-scale, high-quality dataset with 64, 331 questions, GrailQA, and provide evaluation settings for all three levels of generalization.

Knowledge Base Question Answering

Marcus' electron transfer rate revisited via a Rice-Ramsperger-Kassel-Marcus analogue: A unified formalism for linear and nonlinear solvation scenarios

no code implementations10 Oct 2020 Yao Wang, Yu Su, Rui-Xue Xu, Xiao Zheng, YiJing Yan

In this work, on the basis of the thermodynamic solvation potentials analysis, we reexamine Marcus' formula with respect to the Rice-Ramsperger-Kassel-Marcus (RRKM) theory.

Chemical Physics

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

1 code implementation EMNLP 2020 Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang

We propose a knowledge-grounded pre-training (KGPT), which consists of two parts, 1) a general knowledge-grounded generation model to generate knowledge-enriched text.

General Knowledge KG-to-Text Generation +1

Document Classification for COVID-19 Literature

1 code implementation NLP-COVID19 (ACL) 2020 Bernal Jiménez Gutiérrez, Juncheng Zeng, Dong-dong Zhang, Ping Zhang, Yu Su

The global pandemic has made it more important than ever to quickly and accurately retrieve relevant scientific literature for effective consumption by researchers in a wide range of fields.

Classification Document Classification +1

An Imitation Game for Learning Semantic Parsers from User Interaction

1 code implementation EMNLP 2020 Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su

Despite the widely successful applications, bootstrapping and fine-tuning semantic parsers are still a tedious process with challenges such as costly data annotation and privacy risks.

Imitation Learning Text-To-SQL

Logical Natural Language Generation from Open-Domain Tables

1 code implementation ACL 2020 Wenhu Chen, Jianshu Chen, Yu Su, Zhiyu Chen, William Yang Wang

To facilitate the study of the proposed logical NLG problem, we use the existing TabFact dataset \cite{chen2019tabfact} featured with a wide range of logical/symbolic inferences as our testbed, and propose new automatic metrics to evaluate the fidelity of generation models w. r. t.\ logical inference.

Text Generation

Decision Propagation Networks for Image Classification

no code implementations27 Nov 2019 Keke Tang, Peng Song, Yuexin Ma, Zhaoquan Gu, Yu Su, Zhihong Tian, Wenping Wang

High-level (e. g., semantic) features encoded in the latter layers of convolutional neural networks are extensively exploited for image classification, leaving low-level (e. g., color) features in the early layers underexplored.

Classification General Classification +1

Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

2 code implementations IJCNLP 2019 Ziyu Yao, Yu Su, Huan Sun, Wen-tau Yih

As a promising paradigm, interactive semantic parsing has shown to improve both semantic parsing accuracy and user confidence in the results.

Semantic Parsing Text-To-SQL

EKT: Exercise-aware Knowledge Tracing for Student Performance Prediction

1 code implementation7 Jun 2019 Qi Liu, Zhenya Huang, Yu Yin, Enhong Chen, Hui Xiong, Yu Su, Guoping Hu

In EERNN, we simply summarize each student's state into an integrated vector and trace it with a recurrent neural network, where we design a bidirectional LSTM to learn the encoding of each exercise's content.

Knowledge Tracing

Global Textual Relation Embedding for Relational Understanding

1 code implementation ACL 2019 Zhiyu Chen, Hanwen Zha, Honglei Liu, Wenhu Chen, Xifeng Yan, Yu Su

Pre-trained embeddings such as word embeddings and sentence embeddings are fundamental tools facilitating a wide range of downstream NLP tasks.

Action Classification Sentence Embeddings +1

QuesNet: A Unified Representation for Heterogeneous Test Questions

no code implementations27 May 2019 Yu Yin, Qi Liu, Zhenya Huang, Enhong Chen, Wei Tong, Shijin Wang, Yu Su

Then we propose a two-level hierarchical pre-training algorithm to learn better understanding of test questions in an unsupervised way.

Language Modelling

Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning

no code implementations7 Nov 2018 Xin Wang, Jiawei Wu, Da Zhang, Yu Su, William Yang Wang

Although promising results have been achieved in video captioning, existing models are limited to the fixed inventory of activities in the training corpus, and do not generalize to open vocabulary scenarios.

Video Captioning

What It Takes to Achieve 100\% Condition Accuracy on WikiSQL

no code implementations EMNLP 2018 Semih Yavuz, Izzeddin Gur, Yu Su, Xifeng Yan

The SQL queries in WikiSQL are simple: Each involves one relation and does not have any join operation.

Translation

XL-NBT: A Cross-lingual Neural Belief Tracking Framework

1 code implementation EMNLP 2018 Wenhu Chen, Jianshu Chen, Yu Su, Xin Wang, Dong Yu, Xifeng Yan, William Yang Wang

Then, we pre-train a state tracker for the source language as a teacher, which is able to exploit easy-to-access parallel data.

Transfer Learning

DialSQL: Dialogue Based Structured Query Generation

no code implementations ACL 2018 Izzeddin Gur, Semih Yavuz, Yu Su, Xifeng Yan

The recent advance in deep learning and semantic parsing has significantly improved the translation accuracy of natural language questions to structured queries.

Semantic Parsing Translation

Aggregated Channels Network for Real-Time Pedestrian Detection

no code implementations1 Jan 2018 Farzin Ghorban, Javier Marín, Yu Su, Alessandro Colombo, Anton Kummert

Convolutional neural networks (CNNs) have demonstrated their superiority in numerous computer vision tasks, yet their computational cost results prohibitive for many real-time applications such as pedestrian detection which is usually performed on low-consumption hardware.

Pedestrian Detection

Recovering Question Answering Errors via Query Revision

no code implementations EMNLP 2017 Semih Yavuz, Izzeddin Gur, Yu Su, Xifeng Yan

The existing factoid QA systems often lack a post-inspection component that can help models recover from their own mistakes.

Question Answering Semantic Parsing

An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective

no code implementations EMNLP 2017 Jie Zhao, Yu Su, Ziyu Guan, Huan Sun

Given a question and a set of answer candidates, answer triggering determines whether the candidate set contains any correct answers.

Multiple Instance Learning Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.