no code implementations • 30 Mar 2024 • Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak, Aleksandr Drozd, Jordan Clive, Kshitij Gupta, Liangyu Chen, Qi Sun, Ken Tsui, Noah Persaud, Nour Fahmy, Tianlong Chen, Mohit Bansal, Nicolo Monti, Tai Dang, Ziyang Luo, Tien-Tung Bui, Roberto Navigli, Virendra Mehta, Matthew Blumberg, Victor May, Huu Nguyen, Sampo Pyysalo
Despite these efforts, such models encounter challenges such as limited multilingual capabilities, risks of catastrophic forgetting during continual pretraining, and the high costs of training models from scratch, alongside the need to align with AI safety standards and regulatory frameworks.
1 code implementation • 23 Oct 2023 • Khanh-Tung Tran, Truong Son Hy, Lili Jiang, Xuan-Son Vu
This integration provides rich indicators of pandemic dynamics through learning with temporal graph neural networks.
no code implementations • 30 Aug 2023 • Elena Volodina, Simon Dobnik, Therese Lindström Tiedemann, Xuan-Son Vu
Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e. g names or political opinions.
no code implementations • VLSP 2020 • Duc-Trong Le, Xuan-Son Vu, Nhu-Dung To, Huu-Quang Nguyen, Thuy-Trinh Nguyen, Linh Le, Anh-Tuan Nguyen, Minh-Duc Hoang, Nghia Le, Huyen Nguyen, Hoang D. Nguyen
This paper reports on the ReINTEL Shared Task for Responsible Information Identification on social network sites, which is hosted at the seventh annual workshop on Vietnamese Language and Speech Processing (VLSP 2020).
1 code implementation • COLING 2020 • Xuan-Son Vu, Thanh-Son Nguyen, Duc-Trong Le, Lili Jiang
Users express their opinions towards entities (e. g., restaurants) via online reviews which can be in diverse forms such as text, ratings, and images.
1 code implementation • 20 Oct 2020 • Xuan-Son Vu, Duc-Trong Le, Christoffer Edlund, Lili Jiang, Hoang D. Nguyen
With the rapid growth of Internet media, content tagging has become an important topic with many multimedia understanding applications, including efficient organisation and search.
1 code implementation • 20 Oct 2020 • Xuan-Son Vu, Duc-Trong Le, Christoffer Edlund, Lili Jiang, Hoang D. Nguyen
With the rapid growth of Internet media, content tagging has become an important topic with many multimedia understanding applications, including efficient organisation and search.
1 code implementation • 13 Jul 2020 • Xuan-Son Vu, Thanh Vu, Mai-Vu Tran, Thanh Le-Cong, Huyen T M. Nguyen
The paper describes the organisation of the "HateSpeech Detection" (HSD) task at the VLSP workshop 2019 on detecting the fine-grained presence of hate speech in Vietnamese textual items (i. e., messages) extracted from Facebook, which is the most popular social network site (SNS) in Vietnam.
1 code implementation • 12 Jun 2020 • Hoang D. Nguyen, Xuan-Son Vu, Quoc-Tuan Truong, Duc-Trong Le
With the rising number of machine learning competitions, the world has witnessed an exciting race for the best algorithms.
no code implementations • 21 May 2019 • Xuan-Son Vu, Abhishek Santra, Sharma Chakravarthy, Lili Jiang
Multi-feature data analysis (e. g., on Facebook, LinkedIn) is challenging especially if one wants to do it efficiently and retain the flexibility by choosing features of interest for analysis.
1 code implementation • 25 Mar 2019 • Xuan-Son Vu, Son N. Tran, Lili Jiang
To our best knowledge, this is the first work of learning user-level differentially private word embedding model from text for sharing.
2 code implementations • RANLP 2019 • Xuan-Son Vu, Thanh Vu, Son N. Tran, Lili Jiang
We demonstrate the effectiveness of the proposed approach on our pre-trained word embedding models in Vietnamese to select which models are suitable for a named entity recognition (NER) task.
no code implementations • 19 Jun 2018 • Xuan-Son Vu, Lili Jiang
To protect user privacy in data analysis, a state-of-the-art strategy is differential privacy in which scientific noise is injected into the real analysis output.
1 code implementation • SEMEVAL 2018 • Thanh Vu, Dat Quoc Nguyen, Xuan-Son Vu, Dai Quoc Nguyen, Michael Catt, Michael Trenell
This paper describes our NIHRIO system for SemEval-2018 Task 3 "Irony detection in English tweets".
no code implementations • GWC 2018 • Xuan-Son Vu, Lucie Flekova, Lili Jiang, Iryna Gurevych
In this paper, we aim to reveal the impact of lexical-semantic resources, used in particular for word sense disambiguation and sense-level semantic categorization, on automatic personality classification task.
no code implementations • 9 Feb 2017 • Xuan-Son Vu, Seong-Bae Park
However, most of the studies beyond one aspect user generated- content such as user ratings, user feedback and so on to state user preferences.
no code implementations • 27 Dec 2014 • Xuan-Son Vu, Seong-Bae Park
Therefore, we propose a method to construct VSWN from a Vietnamese dictionary, not from WordNet.