1 code implementation • 30 Jan 2025 • Haejin Lee, Shubhanshu Mishra, Apratim Mishra, Zhiwen You, Jinseok Kim, Jana Diesner
These cards will provide a structured framework for documenting and reporting key methodological choices in scholarly data analysis, including author name disambiguation and gender identification procedures.
1 code implementation • 24 Jan 2025 • Sullam Jeoung, Yubin Ge, Haohan Wang, Jana Diesner
Drawing on cognitive science findings related to representativeness heuristics -- where individuals readily recall the representative attribute of a target group in a way that leads to exaggerated beliefs -- we scrutinize LLM responses through this heuristics lens.
1 code implementation • 8 Oct 2024 • Amir Hossein Kargaran, Ali Modarressi, Nafiseh Nikeghbal, Jana Diesner, François Yvon, Hinrich Schütze
This suggests that MEXA is a reliable method for estimating the multilingual capabilities of English-centric LLMs, providing a clearer understanding of their multilingual potential and the inner workings of LLMs.
1 code implementation • 2 Oct 2024 • Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludäscher, Jana Diesner
For multi-class classification tasks, prompt-based fine-tuning under low-resource scenarios has resulted in performance levels comparable to those of fully fine-tuning methods.
no code implementations • 7 Jul 2024 • Zhiwen You, Haejin Lee, Shubhanshu Mishra, Sullam Jeoung, Apratim Mishra, Jinseok Kim, Jana Diesner
The experimental results show that incorporating the birth year does not improve the overall accuracy of gender prediction, especially for names with evolving gender associations.
1 code implementation • 20 Oct 2023 • Sullam Jeoung, Yubin Ge, Jana Diesner
Based on the SCM theory, StereoMap maps LLMs' perceptions of social groups (defined by socio-demographic features) using the dimensions of Warmth and Competence.
1 code implementation • 1 Jun 2023 • Sullam Jeoung, Jana Diesner, Halil Kilicoglu
As language models continue to be integrated into applications of personal and societal relevance, ensuring these models' trustworthiness is crucial, particularly with respect to producing consistent outputs regardless of sensitive attributes.
1 code implementation • 24 Nov 2022 • Shubhanshu Mishra, Jana Diesner
For natural language processing (NLP) tasks that utilize a collection of features based on lexicons and rules, it is important to adapt these features to the changing data.
no code implementations • NAACL (GeBNLP) 2022 • Sullam Jeoung, Jana Diesner
Previous work has examined how debiasing language models affect downstream tasks, specifically, how debiasing techniques influence task performance and whether debiased models also make impartial predictions in downstream tasks or not.
no code implementations • 23 May 2022 • Yubin Ge, Ziang Xiao, Jana Diesner, Heng Ji, Karrie Karahalios, Hari Sundaram
We constructed a new human-annotated dataset of human-written follow-up questions with dialogue history and labeled knowledge in the context of conversational surveys.
no code implementations • ACL 2021 • Yubin Ge, Ly Dinh, Xiaofeng Liu, Jinsong Su, Ziyao Lu, Ante Wang, Jana Diesner
In this paper, we focus on the problem of citing sentence generation, which entails generating a short text to capture the salient information in a cited paper and the connection between the citing and cited paper.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • M. Janina Sarol, Ly Dinh, Rezvaneh Rezapour, Chieh-Li Chin, Pingjing Yang, Jana Diesner
However, the sparsity of the information as well as the amount of noisy content present a challenge to practitioners to effectively identify shared information on these platforms.
1 code implementation • 20 May 2020 • Samin Aref, Ly Dinh, Rezvaneh Rezapour, Jana Diesner
We expand this modeling approach to incorporate directionality of edges, and consider three levels of analysis: triads, subgroups, and the whole network.
Social and Information Networks Optimization and Control Physics and Society 05C22, 90C90, 90C09, 90C10, 90C35, 05C15, 65K05
no code implementations • LREC 2020 • Rezvaneh Rezapour, Jutta Bopp, Norman Fiedler, Diana Steffen, Andreas Witt, Jana Diesner
This paper proposes, implements and evaluates a novel, corpus-based approach for identifying categories indicative of the impact of research via a deductive (top-down, from theory to data) and an inductive (bottom-up, from data to theory) approach.
no code implementations • WS 2019 • Ming Jiang, Jana Diesner
We present a simple, rule-based method for extracting entity networks from the abstracts of scientific literature.
1 code implementation • IJCNLP 2019 • Ming Jiang, Junjie Hu, Qiuyuan Huang, Lei Zhang, Jana Diesner, Jianfeng Gao
In this study, we present a fine-grained evaluation method REO for automatically measuring the performance of image captioning systems.
1 code implementation • IJCNLP 2019 • Ming Jiang, Qiuyuan Huang, Lei Zhang, Xin Wang, Pengchuan Zhang, Zhe Gan, Jana Diesner, Jianfeng Gao
This paper presents a new metric called TIGEr for the automatic evaluation of image captioning systems.
no code implementations • WS 2019 • Rezvaneh Rezapour, Saumil H. Shah, Jana Diesner
We investigate the relationship between basic principles of human morality and the expression of opinions in user-generated text data.
no code implementations • WS 2017 • Aseel Addawood, Rezvaneh Rezapour, Omid Abdar, Jana Diesner
Using a combination of emphatic, language-specific, and Twitter-specific features for supervised learning resulted in 87{\%} accuracy (F1) for cross-validation of the training set and 63. 4{\%} accuracy when using the test set.
1 code implementation • WS 2016 • Shubhanshu Mishra, Jana Diesner
In this paper, we report on the solution [ST] we submitted to the WNUT 2016 NER shared task.
no code implementations • COLING 2016 • Ming Jiang, Jana Diesner
We extend classic review mining work by building a binary classifier that predicts whether a review of a documentary film was written by an expert or a layman with 90. 70{\%} accuracy (F1 score), and compare the characteristics of the predicted classes.