Search Results for author: Ke Shen

Found 7 papers, 0 papers with code

A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores

no code implementations • 5 Oct 2023 • Ke Shen, Mayank Kejriwal

We also propose a risk-centric evaluation framework, and four novel metrics, for assessing LLMs on these risks in both in-domain and out-of-domain settings.

Natural Language Inference

Paper
Add Code

Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes

no code implementations • 3 Oct 2022 • Ke Shen, Mayank Kejriwal

Recent work on transformer-based neural networks has led to impressive advances on multiple-choice natural language understanding (NLU) problems, such as Question Answering (QA) and abductive reasoning.

Decision Making Multiple-choice +2

Paper
Add Code

Understanding Substructures in Commonsense Relations in ConceptNet

no code implementations • 3 Oct 2022 • Ke Shen, Mayank Kejriwal

A potential source of structured commonsense knowledge that could be used to derive insights is ConceptNet.

Graph Representation Learning

Paper
Add Code

A Theoretically Grounded Benchmark for Evaluating Machine Commonsense

no code implementations • 23 Mar 2022 • Henrique Santos, Ke Shen, Alice M. Mulvehill, Yasaman Razeghi, Deborah L. McGuinness, Mayank Kejriwal

Preliminary results suggest that the benchmark is challenging even for advanced language representation models designed for discriminative CSR question answering tasks.

Generative Question Answering Multiple-choice

Paper
Add Code

A Data-Driven Study of Commonsense Knowledge using the ConceptNet Knowledge Base

no code implementations • 28 Nov 2020 • Ke Shen, Mayank Kejriwal

Acquiring commonsense knowledge and reasoning is recognized as an important frontier in achieving general Artificial Intelligence (AI).

Clustering Graph Representation Learning +2

Paper
Add Code

Do Fine-tuned Commonsense Language Models Really Generalize?

no code implementations • 18 Nov 2020 • Mayank Kejriwal, Ke Shen

According to influential leaderboards hosted by the Allen Institute (evaluating state-of-the-art performance on commonsense reasoning benchmarks), models based on such transformer methods are approaching human-like performance and have average accuracy well over 80% on many benchmarks.

Multiple-choice Question Answering

Paper
Add Code

Ranking sentences from product description & bullets for better search

no code implementations • 15 Jul 2019 • Prateek Verma, Aliasgar Kutiyanawala, Ke Shen

Products in an ecommerce catalog contain information-rich fields like description and bullets that can be useful to extract entities (attributes) using NER based systems.

Extractive Summarization NER +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.