no code implementations • 5 Oct 2023 • Ke Shen, Mayank Kejriwal
We also propose a risk-centric evaluation framework, and four novel metrics, for assessing LLMs on these risks in both in-domain and out-of-domain settings.
no code implementations • 3 Oct 2022 • Ke Shen, Mayank Kejriwal
Recent work on transformer-based neural networks has led to impressive advances on multiple-choice natural language understanding (NLU) problems, such as Question Answering (QA) and abductive reasoning.
no code implementations • 3 Oct 2022 • Ke Shen, Mayank Kejriwal
A potential source of structured commonsense knowledge that could be used to derive insights is ConceptNet.
no code implementations • 23 Mar 2022 • Henrique Santos, Ke Shen, Alice M. Mulvehill, Yasaman Razeghi, Deborah L. McGuinness, Mayank Kejriwal
Preliminary results suggest that the benchmark is challenging even for advanced language representation models designed for discriminative CSR question answering tasks.
no code implementations • 28 Nov 2020 • Ke Shen, Mayank Kejriwal
Acquiring commonsense knowledge and reasoning is recognized as an important frontier in achieving general Artificial Intelligence (AI).
no code implementations • 18 Nov 2020 • Mayank Kejriwal, Ke Shen
According to influential leaderboards hosted by the Allen Institute (evaluating state-of-the-art performance on commonsense reasoning benchmarks), models based on such transformer methods are approaching human-like performance and have average accuracy well over 80% on many benchmarks.
no code implementations • 15 Jul 2019 • Prateek Verma, Aliasgar Kutiyanawala, Ke Shen
Products in an ecommerce catalog contain information-rich fields like description and bullets that can be useful to extract entities (attributes) using NER based systems.