no code implementations • SemEval (NAACL) 2022 • Rishik Lad, Weicheng Ma, Soroush Vosoughi
This paper introduces the result of Team Dartmouth’s experiments on each of the five subtasks for the detection of sarcasm in English and Arabic tweets.
no code implementations • EMNLP 2020 • Weicheng Ma, Ruibo Liu, Lili Wang, Soroush Vosoughi
The lack of multi-label and aspect-level emoji prediction datasets is one of the bottlenecks for this task.
no code implementations • SemEval (NAACL) 2022 • Joseph Hajjar, Weicheng Ma, Soroush Vosoughi
This paper presents our approach for tackling SemEval-2022 Task 8: Multilingual News Article Similarity.
no code implementations • 1 Nov 2024 • Weicheng Ma, Luyang Zhao, Chun-Yi She, Yitao Jiang, Alan Sun, Bo Zhu, Devin Balkcom, Soroush Vosoughi
Recent large language models (LLMs) have demonstrated promising capabilities in modeling real-world knowledge and enhancing knowledge-based generation tasks.
no code implementations • 12 Jun 2024 • Lin Shi, Chiyu Ma, Wenhua Liang, Weicheng Ma, Soroush Vosoughi
LLM-as-a-Judge presents a promising alternative to human evaluators across various tasks, but inherent biases, especially position bias - a tendency to favor solutions based on their position in the prompt - have compromised its effectiveness.
no code implementations • 1 Jun 2023 • Lili Wang, Chenghan Huang, Chongyang Gao, Weicheng Ma, Soroush Vosoughi
In the pursuit of accurate and scalable quantitative methods for financial market analysis, the focus has shifted from individual stock models to those capturing interrelations between companies and their stocks.
no code implementations • 1 Jun 2023 • Lili Wang, Chenghan Huang, Weicheng Ma, Xinyuan Cao, Soroush Vosoughi
We evaluate our proposed model on five publicly available datasets for the task of temporal graph similarity ranking, and our model outperforms baseline methods.
no code implementations • 7 Feb 2023 • Xiaobo Guo, Weicheng Ma, Soroush Vosoughi
Differential framing of issues can lead to divergent world views on important issues.
no code implementations • Findings (ACL) 2022 • Weicheng Ma, Samiha Datta, Lili Wang, Soroush Vosoughi
While cultural backgrounds have been shown to affect linguistic expressions, existing natural language processing (NLP) research on culture modeling is overly coarse-grained and does not examine cultural differences among speakers of the same language.
Cultural Vocal Bursts Intensity Prediction Language Modelling +5
no code implementations • 14 Sep 2021 • Lili Wang, Chenghan Huang, Weicheng Ma, Xinyuan Cao, Soroush Vosoughi
Recent years have seen a rise in the development of representational learning methods for graph data.
no code implementations • 14 Sep 2021 • Lili Wang, Chenghan Huang, Weicheng Ma, Ying Lu, Soroush Vosoughi
In this paper, we present a novel and flexible framework using stress majorization, to transform the high-dimensional role identities in networks directly (without approximation or indirect modeling) to a low-dimensional embedding space.
no code implementations • EMNLP 2021 • Weicheng Ma, Renze Lou, Kai Zhang, Lili Wang, Soroush Vosoughi
Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0. 33% to 17. 93% on 8 natural language understanding (NLU) tasks in the GLUE benchmarks.
no code implementations • ACL 2021 • Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang, Soroush Vosoughi
Through extensive experiments, we show that (1) pruning a number of attention heads in a multi-lingual Transformer-based model has, in general, positive effects on its performance in cross-lingual and multi-lingual tasks and (2) the attention heads to be pruned can be ranked using gradients and identified with a few trial experiments.
no code implementations • 18 Jun 2021 • Lili Wang, Chongyang Gao, Chenghan Huang, Ruibo Liu, Weicheng Ma, Soroush Vosoughi
A common type of network is the heterogeneous network, where the nodes (and edges) can be of different types.
1 code implementation • SEMEVAL 2021 • Aadil Islam, Weicheng Ma, Soroush Vosoughi
This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context.
1 code implementation • SEMEVAL 2021 • Yakoob Khan, Weicheng Ma, Soroush Vosoughi
This paper describes our approach to the Toxic Spans Detection problem (SemEval-2021 Task 5).
no code implementations • ACL (unimplicit) 2021 • Weicheng Ma, Ruibo Liu, Lili Wang, Soroush Vosoughi
Finally, we clean up the improper or outdated annotations in one of the MD benchmark datasets and re-benchmark it with our Transformer-based model.
no code implementations • EMNLP (WNUT) 2020 • Lili Wang, Chongyang Gao, Jason Wei, Weicheng Ma, Ruibo Liu, Soroush Vosoughi
The field of NLP has seen unprecedented achievements in recent years.
no code implementations • EMNLP 2020 • Ruibo Liu, Guangxuan Xu, Chenyan Jia, Weicheng Ma, Lili Wang, Soroush Vosoughi
For instance, Data Boost improves F1 for the three tasks by 8. 7% on average when given only 10% of the whole data for training.
no code implementations • 30 Sep 2020 • Weicheng Ma, Ruibo Liu, Li-Li Wang, Soroush Vosoughi
While other tasks based on linguistic style understanding benefit from deep learning methods, these methods have not behaved as well as traditional machine learning methods in many authorship-based tasks.
BIG-bench Machine Learning Natural Language Understanding +1
1 code implementation • 14 Jul 2020 • Weicheng Ma, Ruibo Liu, Lili Wang, Soroush Vosoughi
In this paper, we extend the existing setting of the emoji prediction task to include a richer set of emojis and to allow multi-label classification on the task.
1 code implementation • WS 2019 • Yuanhe Tian, Weicheng Ma, Fei Xia, Yan Song
Question answering (QA) is a challenging task in natural language processing (NLP), especially when it is applied to specific domains.