Search Results for author: Armineh Nourbakhsh

Found 15 papers, 0 papers with code

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

no code implementations5 Apr 2024 Ran Zmigrod, Dongsheng Wang, Mathieu Sibue, Yulong Pei, Petr Babkin, Ivan Brugere, Xiaomo Liu, Nacho Navarro, Antony Papadimitriou, William Watson, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah

Several datasets exist for research on specific tasks of VRDU such as document classification (DC), key entity extraction (KEE), entity linking, visual question answering (VQA), inter alia.

Document Classification document understanding +5

TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing

no code implementations7 Feb 2024 Ran Zmigrod, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah

Visually Rich Form Understanding (VRFU) poses a complex research problem due to the documents' highly structured nature and yet highly variable style and content.

DocGraphLM: Documental Graph Language Model for Information Extraction

no code implementations5 Jan 2024 Dongsheng Wang, Zhiqiang Ma, Armineh Nourbakhsh, Kang Gu, Sameena Shah

Advances in Visually Rich Document Understanding (VrDU) have enabled information extraction and question answering over documents with complex layouts.

document understanding Language Modelling +2

DocLLM: A layout-aware generative language model for multimodal document understanding

no code implementations31 Dec 2023 Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities.

document understanding Language Modelling

Parameterized Explanations for Investor / Company Matching

no code implementations27 Oct 2021 Simerjot Kaur, Ivan Brugere, Andrea Stefanucci, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

We compare the performance of our system with human generated recommendations and demonstrate the ability of our algorithm to perform extremely well on this task.

Decision Making Explainable Recommendation +2

Robust Document Representations using Latent Topics and Metadata

no code implementations23 Oct 2020 Natraj Raman, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems.

Document Classification Language Modelling

DocuBot : Generating financial reports using natural language interactions

no code implementations2 Oct 2020 Vineeth Ravi, Selim Amrouni, Andrea Stefanucci, Armineh Nourbakhsh, Prashant Reddy, Manuela Veloso

Digital reports are often created based on tedious manual analysis as well as visualization of the underlying trends and characteristics of data.

SPot: A tool for identifying operating segments in financial tables

no code implementations17 May 2020 Zhiqiang Ma, Steven Pomerville, Mingyang Di, Armineh Nourbakhsh

In this paper we present SPot, an automated tool for detecting operating segments and their related performance indicators from earnings reports.

Benchmarking

A framework for anomaly detection using language modeling, and its applications to finance

no code implementations24 Aug 2019 Armineh Nourbakhsh, Grace Bang

In the finance sector, studies focused on anomaly detection are often associated with time-series and transactional data analytics.

Anomaly Detection Language Modelling +2

Reuters Tracer: Toward Automated News Production Using Large Scale Social Media Data

no code implementations11 Nov 2017 Xiaomo Liu, Armineh Nourbakhsh, Quanzhi Li, Sameena Shah, Robert Martin, John Duprey

It has a bottom-up approach to news detection, and does not rely on a predefined set of sources or subjects.

Social and Information Networks

Data Sets: Word Embeddings Learned from Tweets and General Data

no code implementations14 Aug 2017 Quanzhi Li, Sameena Shah, Xiaomo Liu, Armineh Nourbakhsh

In addition to the data sets learned from just tweet data, we also built embedding sets from the general data and the combination of tweets with the general data.

Sentiment Analysis Topic Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.