no code implementations • NAACL (DaSH) 2021 • Peter Griggs, Cagatay Demiralp, Sajjadur Rahman
From tweets to product reviews, text is ubiquitous on the web and often contains valuable information for both enterprises and consumers.
1 code implementation • Findings (NAACL) 2022 • Yutong Shao, Nikita Bhutani, Sajjadur Rahman, Estevam Hruschka
Entity set expansion (ESE) aims at obtaining a more complete set of entities given a textual corpus and a seed set of entities of a concept.
1 code implementation • 24 Dec 2024 • Yanlin Feng, Simone Papicchio, Sajjadur Rahman
Retrieval from graph data is crucial for augmenting large language models (LLM) with both open-domain knowledge and private enterprise data, and it is also a key component in the recent GraphRAG system (edge et al., 2024).
no code implementations • 8 Nov 2024 • Kushan Mitra, Dan Zhang, Sajjadur Rahman, Estevam Hruschka
We introduce FactLens, a benchmark for evaluating fine-grained fact verification, with metrics and automated evaluators of sub-claim quality.
no code implementations • 2 Jun 2024 • Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka
Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases.
1 code implementation • 2 Jun 2024 • Yanlin Feng, Sajjadur Rahman, Aaron Feng, Vincent Chen, Eser Kandogan
While these systems have the potential to supplement typical analysis workflows of data analysts in enterprise data platforms, unfortunately, CASs are subject to the same data discovery challenges that analysts have encountered over the years -- silos of multimodal data sources, created across teams and departments within an organization, make it difficult to identify appropriate data sources for accomplishing the task at hand.
no code implementations • 28 Feb 2024 • Hannah Kim, Kushan Mitra, Rafael Li Chen, Sajjadur Rahman, Dan Zhang
Large language models (LLMs) can label data faster and cheaper than humans for various NLP tasks.
no code implementations • 2 Feb 2024 • Pouya Pezeshkpour, Eser Kandogan, Nikita Bhutani, Sajjadur Rahman, Tom Mitchell, Estevam Hruschka
We present a formal definition of reasoning capacity and illustrate its utility in identifying limitations within each component of the system.
no code implementations • 9 Nov 2023 • Aditi Mishra, Sajjadur Rahman, Hannah Kim, Kushan Mitra, Estevam Hruschka
We consider the task of generating knowledge-guided rationalization in natural language by using expert-written examples in a few-shot manner.
no code implementations • 9 Jan 2023 • Sajjadur Rahman, Hannah Kim, Dan Zhang, Estevam Hruschka, Eser Kandogan
Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks.
no code implementations • 8 Sep 2020 • Sajjadur Rahman, Peter Griggs, Çağatay Demiralp
Text data analysis is an iterative, non-linear process with diverse workflows spanning multiple stages, from data cleaning to visualization.