Search Results for author: Prasenjit Mitra

Found 44 papers, 13 papers with code

STAPI: An Automatic Scraper for Extracting Iterative Title-Text Structure from Web Documents

1 code implementation LREC 2022 Nan Zhang, Shomir Wilson, Prasenjit Mitra

Therefore, we propose the first title-text dataset on web documents that incorporates a wide variety of domains to facilitate downstream training.

Headline Generation

Are BERTs Sensitive to Native Interference in L2 Production?

no code implementations EMNLP (insights) 2021 Zixin Tang, Prasenjit Mitra, David Reitter

With the essays part from The International Corpus Network of Asian Learners of English (ICNALE) and the TOEFL11 corpus, we fine-tuned neural language models based on BERT to predict English learners’ native languages.

WildGraph: Realistic Graph-based Trajectory Generation for Wildlife

1 code implementation11 Apr 2024 Ali Al-Lawati, Elsayed Eshra, Prasenjit Mitra

Trajectory generation is an important task in movement studies; it circumvents the privacy, ethical, and technical challenges of collecting real trajectories from the target population.

PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents

1 code implementation23 Mar 2024 Nan Zhang, Connor Heaton, Sean Timothy Okonsky, Prasenjit Mitra, Hilal Ezgi Toraman

To mitigate this gap, we present the Printed English and Chemical Equations (PEaCE) dataset, containing both synthetic and real-world records, and evaluate the efficacy of transformer-based OCR models when trained on this resource.

Optical Character Recognition Optical Character Recognition (OCR)

Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health Records

1 code implementation6 Mar 2024 Suhan Cui, Prasenjit Mitra

To reduce human intervention and improve the framework design, we propose an automated approach named AutoDP, which can search for the optimal configuration of task grouping and architectures simultaneously.

Disease Prediction Multi-Task Learning

Milestones in Bengali Sentiment Analysis leveraging Transformer-models: Fundamentals, Challenges and Future Directions

no code implementations15 Jan 2024 Saptarshi Sengupta, Shreya Ghosh, Prasenjit Mitra, Tarikul Islam Tamiti

Sentiment Analysis (SA) refers to the task of associating a view polarity (usually, positive, negative, or neutral; or even fine-grained such as slightly angry, sad, etc.)

Sentiment Analysis

Leveraging External Knowledge Resources to Enable Domain-Specific Comprehension

no code implementations15 Jan 2024 Saptarshi Sengupta, Connor Heaton, Prasenjit Mitra, Soumalya Sarkar

Machine Reading Comprehension (MRC) has been a long-standing problem in NLP and, with the recent introduction of the BERT family of transformer based language models, it has come a long way to getting solved.

Knowledge Graphs Machine Reading Comprehension +1

WildGEN: Long-horizon Trajectory Generation for Wildlife

no code implementations30 Dec 2023 Ali Al-Lawati, Elsayed Eshra, Prasenjit Mitra

Trajectory generation is an important concern in pedestrian, vehicle, and wildlife movement studies.

FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization

1 code implementation3 Nov 2023 Nan Zhang, Yusen Zhang, Wu Guo, Prasenjit Mitra, Rui Zhang

In this paper, we investigate and improve faithfulness in summarization on a broad range of medical summarization tasks.

Contrastive Learning

Analysis of Elephant Movement in Sub-Saharan Africa: Ecological, Climatic, and Conservation Perspectives

no code implementations21 Jul 2023 Matthew Hines, Gregory Glatzer, Shreya Ghosh, Prasenjit Mitra

The interaction between elephants and their environment has profound implications for both ecology and conservation strategies.

Management

Spatio-temporal Storytelling? Leveraging Generative Models for Semantic Trajectory Analysis

no code implementations24 Jun 2023 Shreya Ghosh, Saptarshi Sengupta, Prasenjit Mitra

In this paper, we lay out a vision for analysing semantic trajectory traces and generating synthetic semantic trajectory data (SSTs) using generative language model.

Language Modelling

Lumos in the Night Sky: AI-enabled Visual Tool for Exploring Night-Time Light Patterns

no code implementations5 Jun 2023 Jakob Hederich, Shreya Ghosh, Zeyu He, Prasenjit Mitra

We introduce NightPulse, an interactive tool for Night-time light (NTL) data visualization and analytics, which enables researchers and stakeholders to explore and analyze NTL data with a user-friendly platform.

Clustering Data Visualization +2

Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

no code implementations5 Nov 2022 Amogh Subbakrishna Adishesha, Lily Jakielaszek, Fariha Azhar, Peixuan Zhang, Vasant Honavar, Fenglong Ma, Chandra Belani, Prasenjit Mitra, Sharon Xiaolei Huang

Specifically, we pose the problem of predicting topic tags or keywords that describe the future information needs of users based on their profiles, traces of their online interactions within the community (past posts, replies) and the profiles and traces of online interactions of other users with similar profiles and similar traces of past interaction with the target users.

Recommendation Systems Text2text Generation

Exploring Descriptions of Movement Through Geovisual Analytics

1 code implementation1 Mar 2022 Scott Pezanowski, Prasenjit Mitra, Alan M. MacEachren

We present GeoMovement, a system that is based on combining machine learning and rule-based extraction of movement-related information with state-of-the-art visualization techniques.

Negation

Recognition of Implicit Geographic Movement in Text

no code implementations LREC 2020 Scott Pezanowski, Prasenjit Mitra

Analyzing the geographic movement of humans, animals, and other phenomena is a growing field of research.

Word Embeddings

Federated Unlearning with Knowledge Distillation

no code implementations24 Jan 2022 Chen Wu, Sencun Zhu, Prasenjit Mitra

Federated Learning (FL) is designed to protect the data privacy of each client during the training process by transmitting only models instead of the original data.

Federated Learning Knowledge Distillation

Differentiating Geographic Movement Described in Text Documents

no code implementations12 Jan 2022 Scott Pezanowski, Alan M. MacEachren, Prasenjit Mitra

Understanding movement described in text documents is important since text descriptions of movement contain a wealth of geographic and contextual information about the movement of people, wildlife, goods, and much more.

An Analysis of Elephants' Movement Data in Sub-Saharan Africa Using Clustering

no code implementations5 Nov 2021 Gregory Glatzer, Prasenjit Mitra, Johnson Kinyua

We explore the use of clustering to identify locations of interest to African Elephants in regions of Sub-Saharan Africa.

Clustering

Learning To Describe Player Form in The MLB

1 code implementation11 Sep 2021 Connor Heaton, Prasenjit Mitra

Major League Baseball (MLB) has a storied history of using statistics to better understand and discuss the game of baseball, with an entire discipline of statistics dedicated to the craft, known as sabermetrics.

Contrastive Learning

Extractive Research Slide Generation Using Windowed Labeling Ranking

1 code implementation NAACL (sdp) 2021 Athar Sefid, Jian Wu, Prasenjit Mitra, Lee Giles

Presentation slides describing the content of scientific and technical papers are an efficient and effective way to present that work.

Extractive Summarization Sentence

Mitigating Backdoor Attacks in Federated Learning

no code implementations28 Oct 2020 Chen Wu, Xian Yang, Sencun Zhu, Prasenjit Mitra

To minimize the pruning influence on test accuracy, we can fine-tune after pruning, and the attack success rate drops to 6. 4%, with only a 1. 7% loss of test accuracy.

Federated Learning

Repurposing TREC-COVID Annotations to Answer the Key Questions of CORD-19

no code implementations27 Aug 2020 Connor T. Heaton, Prasenjit Mitra

Seeing the related endeavors, we set out to repurpose the relevancy annotations for TREC-COVID tasks to identify journal articles in CORD-19 which are relevant to the key questions posed by CORD-19.

Information Retrieval Retrieval

Extractive Summarizer for Scholarly Articles

1 code implementation25 Aug 2020 Athar Sefid, Clyde Lee Giles, Prasenjit Mitra

We introduce an extractive method that will summarize long scientific papers.

Investigating and Mitigating Degree-Related Biases in Graph Convolutional Networks

no code implementations28 Jun 2020 Xianfeng Tang, Huaxiu Yao, Yiwei Sun, Yiqi Wang, Jiliang Tang, Charu Aggarwal, Prasenjit Mitra, Suhang Wang

Pseudo labels increase the chance of connecting to labeled neighbors for low-degree nodes, thus reducing the biases of GCNs from the data perspective.

Self-Supervised Learning

Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach

1 code implementation8 Oct 2019 Rajeev Bhatt Ambati, Saptarashmi Bandyopadhyay, Prasenjit Mitra

In this paper, we propose a method based on extracting the highlights of a document; a key concept that is conveyed in a few sentences.

Abstractive Text Summarization Hard Attention +4

Transferring Robustness for Graph Neural Network Against Poisoning Attacks

1 code implementation20 Aug 2019 Xianfeng Tang, Yandong Li, Yiwei Sun, Huaxiu Yao, Prasenjit Mitra, Suhang Wang

To optimize PA-GNN for a poisoned graph, we design a meta-optimization algorithm that trains PA-GNN to penalize perturbations using clean graphs and their adversarial counterparts, and transfers such ability to improve the robustness of PA-GNN on the poisoned graph.

Node Classification Transfer Learning

Applications of Online Deep Learning for Crisis Response Using Social Media Information

no code implementations4 Oct 2016 Dat Tien Nguyen, Shafiq Joty, Muhammad Imran, Hassan Sajjad, Prasenjit Mitra

During natural or man-made disasters, humanitarian response organizations look for useful information to support their decision-making processes.

Decision Making Disaster Response +3

Generating Abstractive Summaries from Meeting Transcripts

no code implementations22 Sep 2016 Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama

The most informative and well-formed sub-graph obtained by integer linear programming (ILP) is selected to generate a one-sentence summary for each topic segment.

Document Summarization Sentence +1

Multi-document abstractive summarization using ILP based multi-sentence compression

no code implementations22 Sep 2016 Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama

The sentences in the most important document are aligned to sentences in other documents to generate clusters of similar sentences.

Abstractive Text Summarization Document Summarization +4

Abstractive Meeting Summarization UsingDependency Graph Fusion

no code implementations22 Sep 2016 Siddhartha Banerjee, Prasenjit Mitra, Kazunari Sugiyama

Automatic summarization techniques on meeting conversations developed so far have been primarily extractive, resulting in poor summaries.

Meeting Summarization Sentence +1

Rapid Classification of Crisis-Related Data on Social Networks using Convolutional Neural Networks

no code implementations12 Aug 2016 Dat Tien Nguyen, Kamela Ali Al Mannai, Shafiq Joty, Hassan Sajjad, Muhammad Imran, Prasenjit Mitra

The current state-of-the-art classification methods require a significant amount of labeled data specific to a particular event for training plus a lot of feature engineering to achieve best results.

BIG-bench Machine Learning Classification +2

Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages

1 code implementation LREC 2016 Muhammad Imran, Prasenjit Mitra, Carlos Castillo

Microblogging platforms such as Twitter provide active communication channels during mass convergence and emergency events such as earthquakes, typhoons.

Disaster Response Humanitarian +1

A neural probabilistic model for context based citation recommendation

no code implementations AAAI 2015 Wenyi Huang, Zhaohui Wu, Chen Liang, Prasenjit Mitra, C. Lee Giles

It is not always easy for knowledgeable researchers to give an accurate citation context for a cited paper or to find the right paper to cite given context.

Citation Recommendation

Cannot find the paper you are looking for? You can Submit a new open access paper.