Search Results for author: Yujie He

Found 15 papers, 7 papers with code

Right for the Right Reason: Evidence Extraction for Trustworthy Tabular Reasoning

no code implementations ACL 2022 Vivek Gupta, Shuo Zhang, Alakananda Vempala, Yujie He, Temma Choji, Vivek Srikumar

On the downstream tabular inference task, using only the automatically extracted evidence as the premise, our approach outperforms prior benchmarks.

EmoAgent: Multi-Agent Collaboration of Plan, Edit, and Critic, for Affective Image Manipulation

no code implementations14 Mar 2025 Qi Mao, Haobo Hu, Yujie He, Difei Gao, Haokun Chen, Libiao Jin

Affective Image Manipulation (AIM) aims to alter an image's emotional impact by adjusting multiple visual elements to evoke specific feelings. Effective AIM is inherently complex, necessitating a collaborative approach that involves identifying semantic cues within source images, manipulating these elements to elicit desired emotional responses, and verifying that the combined adjustments successfully evoke the target emotion. To address these challenges, we introduce EmoAgent, the first multi-agent collaboration framework for AIM.

Decision Making Image Manipulation

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

no code implementations7 Nov 2024 Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, Mohit Bansal

However, there are difficulties in applying these methods in real-world scenarios: (a) questions often require information across different pages or documents, where MLMs cannot handle many long documents; (b) documents often have important information in visual elements such as figures, but text extraction tools ignore them.

document understanding Optical Character Recognition +6

GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification

1 code implementation11 Jul 2024 Aitao Yang, Min Li, Yao Ding, Leyuan Fang, Yaoming Cai, Yujie He

Efficient extraction of spectral sequences and geospatial information has always been a hot topic in hyperspectral image classification.

Computational Efficiency Graph structure learning +2

Enhancing Question Answering on Charts Through Effective Pre-training Tasks

no code implementations14 Jun 2024 Ashim Gupta, Vivek Gupta, Shuo Zhang, Yujie He, Ning Zhang, Shalin Shah

To address these issues, we propose three simple pre-training tasks that enforce the existing model in terms of both structural-visual knowledge, as well as its understanding of numerical questions.

document understanding Optical Character Recognition (OCR) +1

TempTabQA: Temporal Question Answering for Semi-Structured Tables

no code implementations14 Nov 2023 Vivek Gupta, Pranshu Kandoi, Mahek Bhavesh Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar

Given these results, our dataset has the potential to serve as a challenging benchmark to improve the temporal reasoning capabilities of NLP models.

Question Answering

Pedestrian-Robot Interactions on Autonomous Crowd Navigation: Reactive Control Methods and Evaluation Metrics

1 code implementation3 Aug 2022 Diego Paez-Granados, Yujie He, David Gonon, Dan Jia, Bastian Leibe, Kenji Suzuki, Aude Billard

Autonomous navigation in highly populated areas remains a challenging task for robots because of the difficulty in guaranteeing safe interactions with pedestrians in unstructured situations.

Autonomous Navigation

Automatic Construction of Enterprise Knowledge Base

no code implementations EMNLP (ACL) 2021 Junyi Chai, Yujie He, Homa Hashemi, Bing Li, Daraksha Parveen, Ranganath Kondapally, Wenjin Xu

In this paper, we present an automatic knowledge base construction system from large scale enterprise documents with minimal efforts of human intervention.

Knowledge Base Construction

MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning

1 code implementation26 Mar 2021 Akshay Smit, Damir Vrabac, Yujie He, Andrew Y. Ng, Andrew L. Beam, Pranav Rajpurkar

We propose a selective learning method using meta-learning and deep reinforcement learning for medical image interpretation in the setting of limited labeling resources.

Deep Reinforcement Learning General Classification +4

MULLS: Versatile LiDAR SLAM via Multi-metric Linear Least Square

1 code implementation7 Feb 2021 Yue Pan, Pengchuan Xiao, Yujie He, Zhenlei Shao, Zesong Li

The rapid development of autonomous driving and mobile mapping calls for off-the-shelf LiDAR SLAM solutions that are adaptive to LiDARs of different specifications on various complex scenarios.

Autonomous Driving Simultaneous Localization and Mapping

Cross-Lingual Named Entity Recognition Using Parallel Corpus: A New Approach Using XLM-RoBERTa Alignment

no code implementations26 Jan 2021 Bing Li, Yujie He, Wenjin Xu

We built an entity alignment model on top of XLM-RoBERTa to project the entities detected on the English part of the parallel data to the target language sentences, whose accuracy surpasses all previous unsupervised models.

Cross-Lingual NER Entity Alignment +4

Optimal Action Extraction for Random Forests and Boosted Trees

1 code implementation13 Aug 2015 Zhicheng Cui, Wenlin Chen, Yujie He, Yixin Chen

To address this problem, we present a novel framework to post-process any ATM classifier to extract an optimal actionable plan that can change a given input to a desired class with a minimum cost.

Cannot find the paper you are looking for? You can Submit a new open access paper.