Search Results for author: Armineh Nourbakhsh

Found 15 papers, 0 papers with code

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

no code implementations • 5 Apr 2024 • Ran Zmigrod, Dongsheng Wang, Mathieu Sibue, Yulong Pei, Petr Babkin, Ivan Brugere, Xiaomo Liu, Nacho Navarro, Antony Papadimitriou, William Watson, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah

Several datasets exist for research on specific tasks of VRDU such as document classification (DC), key entity extraction (KEE), entity linking, visual question answering (VQA), inter alia.

Document Classification document understanding +5

Paper
Add Code

TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing

no code implementations • 7 Feb 2024 • Ran Zmigrod, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah

Visually Rich Form Understanding (VRFU) poses a complex research problem due to the documents' highly structured nature and yet highly variable style and content.

Paper
Add Code

DocGraphLM: Documental Graph Language Model for Information Extraction

no code implementations • 5 Jan 2024 • Dongsheng Wang, Zhiqiang Ma, Armineh Nourbakhsh, Kang Gu, Sameena Shah

Advances in Visually Rich Document Understanding (VrDU) have enabled information extraction and question answering over documents with complex layouts.

document understanding Language Modelling +2

Paper
Add Code

DocLLM: A layout-aware generative language model for multimodal document understanding

no code implementations • 31 Dec 2023 • Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities.

document understanding Language Modelling

Paper
Add Code

Parameterized Explanations for Investor / Company Matching

no code implementations • 27 Oct 2021 • Simerjot Kaur, Ivan Brugere, Andrea Stefanucci, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

We compare the performance of our system with human generated recommendations and demonstrate the ability of our algorithm to perform extremely well on this task.

Decision Making Explainable Recommendation +2

Paper
Add Code

A Framework for Institutional Risk Identification using Knowledge Graphs and Automated News Profiling

no code implementations • 19 Sep 2021 • Mahmoud Mahfouz, Armineh Nourbakhsh, Sameena Shah

Organizations around the world face an array of risks impacting their operations globally.

Knowledge Graphs

Paper
Add Code

Robust Document Representations using Latent Topics and Metadata

no code implementations • 23 Oct 2020 • Natraj Raman, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems.

Document Classification Language Modelling

Paper
Add Code

DocuBot : Generating financial reports using natural language interactions

no code implementations • 2 Oct 2020 • Vineeth Ravi, Selim Amrouni, Andrea Stefanucci, Armineh Nourbakhsh, Prashant Reddy, Manuela Veloso

Digital reports are often created based on tedious manual analysis as well as visualization of the underlying trends and characteristics of data.

Paper
Add Code

SPot: A tool for identifying operating segments in financial tables

no code implementations • 17 May 2020 • Zhiqiang Ma, Steven Pomerville, Mingyang Di, Armineh Nourbakhsh

In this paper we present SPot, an automated tool for detecting operating segments and their related performance indicators from earnings reports.

Benchmarking

Paper
Add Code

A framework for anomaly detection using language modeling, and its applications to finance

no code implementations • 24 Aug 2019 • Armineh Nourbakhsh, Grace Bang

In the finance sector, studies focused on anomaly detection are often associated with time-series and transactional data analytics.

Anomaly Detection Language Modelling +2

Paper
Add Code

Reuters Tracer: Toward Automated News Production Using Large Scale Social Media Data

no code implementations • 11 Nov 2017 • Xiaomo Liu, Armineh Nourbakhsh, Quanzhi Li, Sameena Shah, Robert Martin, John Duprey

It has a bottom-up approach to news detection, and does not rely on a predefined set of sources or subjects.

Social and Information Networks

Paper
Add Code

Data Sets: Word Embeddings Learned from Tweets and General Data

no code implementations • 14 Aug 2017 • Quanzhi Li, Sameena Shah, Xiaomo Liu, Armineh Nourbakhsh

In addition to the data sets learned from just tweet data, we also built embedding sets from the general data and the combination of tweets with the general data.

Sentiment Analysis Topic Classification +1