Search Results for author: Ke Shen

Found 7 papers, 0 papers with code

A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores

no code implementations5 Oct 2023 Ke Shen, Mayank Kejriwal

We also propose a risk-centric evaluation framework, and four novel metrics, for assessing LLMs on these risks in both in-domain and out-of-domain settings.

Natural Language Inference

Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes

no code implementations3 Oct 2022 Ke Shen, Mayank Kejriwal

Recent work on transformer-based neural networks has led to impressive advances on multiple-choice natural language understanding (NLU) problems, such as Question Answering (QA) and abductive reasoning.

Decision Making Multiple-choice +2

Understanding Substructures in Commonsense Relations in ConceptNet

no code implementations3 Oct 2022 Ke Shen, Mayank Kejriwal

A potential source of structured commonsense knowledge that could be used to derive insights is ConceptNet.

Graph Representation Learning

A Theoretically Grounded Benchmark for Evaluating Machine Commonsense

no code implementations23 Mar 2022 Henrique Santos, Ke Shen, Alice M. Mulvehill, Yasaman Razeghi, Deborah L. McGuinness, Mayank Kejriwal

Preliminary results suggest that the benchmark is challenging even for advanced language representation models designed for discriminative CSR question answering tasks.

Generative Question Answering Multiple-choice

A Data-Driven Study of Commonsense Knowledge using the ConceptNet Knowledge Base

no code implementations28 Nov 2020 Ke Shen, Mayank Kejriwal

Acquiring commonsense knowledge and reasoning is recognized as an important frontier in achieving general Artificial Intelligence (AI).

Clustering Graph Representation Learning +2

Do Fine-tuned Commonsense Language Models Really Generalize?

no code implementations18 Nov 2020 Mayank Kejriwal, Ke Shen

According to influential leaderboards hosted by the Allen Institute (evaluating state-of-the-art performance on commonsense reasoning benchmarks), models based on such transformer methods are approaching human-like performance and have average accuracy well over 80% on many benchmarks.

Multiple-choice Question Answering

Ranking sentences from product description & bullets for better search

no code implementations15 Jul 2019 Prateek Verma, Aliasgar Kutiyanawala, Ke Shen

Products in an ecommerce catalog contain information-rich fields like description and bullets that can be useful to extract entities (attributes) using NER based systems.

Extractive Summarization NER +3

Cannot find the paper you are looking for? You can Submit a new open access paper.