no code implementations • COLING 2022 • Yu Yu, Abdul Rafae Khan, Jia Xu
The quality of Natural Language Processing (NLP) models is typically measured by the accuracy or error rate of a predefined test set.
1 code implementation • EMNLP 2020 • Abdul Rafae Khan, Jia Xu, Weiwei Sun
Natural Language Processing (NLP) tasks are usually performed word by word on textual inputs.
no code implementations • 12 Nov 2022 • Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu
We use an unsupervised method to discover concepts learned in these models and enable a graphical interface for humans to generate explanations for the concepts.
no code implementations • 21 Oct 2022 • Abdul Rafae Khan, Hrishikesh Kanade, Girish Amar Budhrani, Preet Jhanglani, Jia Xu
This paper describes the Stevens Institute of Technology's submission for the WMT 2022 Shared Task: Code-mixed Machine Translation (MixMT).
1 code implementation • NAACL 2022 • Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
We propose a novel framework ConceptX, to analyze how latent concepts are encoded in representations learned within pre-trained language models.
no code implementations • ICLR 2022 • Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad
We address this limitation by discovering and analyzing latent concepts learned in neural network models in an unsupervised fashion and provide interpretations from the model's perspective.
no code implementations • 25 Jun 2021 • Abdul Rafae Khan, Jia Xu, Peter Varsanyi, Rachit Pabreja
Our analysis of the importance of each input feature shows the critical causal impact on decision-making, suggesting that criminal histories are statistically significant factors, while identifiers, such as race and age, are not.
1 code implementation • NAACL 2021 • Karine Chubarian, Abdul Rafae Khan, Anastasios Sidiropoulos, Jia Xu
Deep Learning-based NLP systems can be sensitive to unseen tokens and hard to learn with high-dimensional inputs, which critically hinder learning generalization.
1 code implementation • 31 Mar 2020 • Abdul Rafae Khan, Asim Karim, Hassan Sajjad, Faisal Kamiran, Jia Xu
Roman Urdu is an informal form of the Urdu language written in Roman script, which is widely used in South Asia for online textual content.
no code implementations • 11 Nov 2019 • Abdul Rafae Khan, Jia Xu
We achieve significant and consistent improvements overall language pairs and datasets: French-English, German-English, and Chinese-English in medium task IWSLT'17 and French-English in large task WMT'18 Bio, with up to 4 BLEU points over the state-of-the-art.
1 code implementation • SEMEVAL 2019 • Weimin Lyu, Sheng Huang, Abdul Rafae Khan, Shengqiang Zhang, Weiwei Sun, Jia Xu
This paper describes the systems of the CUNY-PKU team in SemEval 2019 Task 1: Cross-lingual Semantic Parsing with UCCA.