no code implementations • 24 Jul 2024 • Thomas Manzini, Priyankari Perali, Raisa Karnik, Robin Murphy
This document presents the Center for Robot Assisted Search And Rescue - Uncrewed Aerial Systems - Disaster Response Overhead Inspection Dataset (CRASAR-U-DROIDs) for building damage assessment and spatial alignment collected from small uncrewed aerial systems (sUAS) geospatial imagery.
no code implementations • 10 May 2024 • Thomas Manzini, Priyankari Perali, Raisa Karnik, Mihir Godbole, Hasnat Abdullah, Robin Murphy
This work presents the first quantitative study of alignment errors between small uncrewed aerial systems (sUAS) geospatial imagery and a priori building polygons and finds that alignment errors are non-uniform and irregular.
1 code implementation • 5 Sep 2023 • Robin Murphy, Thomas Manzini
This paper describes gaps in acquisition of drone imagery that impair the use with computer vision/machine learning (CV/ML) models and makes five recommendations to maximize image suitability for CV/ML post-processing.
2 code implementations • 26 Jul 2023 • Thomas Manzini, Robin Murphy
This paper details the challenges in applying two computer vision systems, an EfficientDET supervised learning model and the unsupervised RX spectral classifier, to 98. 9 GB of drone imagery from the Wu-Murad wilderness search and rescue (WSAR) effort in Japan and identifies 3 directions for future research.
1 code implementation • NAACL 2019 • Thomas Manzini, Yao Chong Lim, Yulia Tsvetkov, Alan W. black
Online texts -- across genres, registers, domains, and styles -- are riddled with human stereotypes, expressed in overt or subtle ways.
2 code implementations • 19 Dec 2018 • Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency, Barnabas Poczos
Our method is based on the key insight that translation from a source to a target modality provides a method of learning joint representations using only the source modality as input.
no code implementations • WS 2018 • Hai Pham, Thomas Manzini, Paul Pu Liang, Barnabas Poczos
Multimodal machine learning is a core research area spanning the language, visual and acoustic modalities.
no code implementations • WS 2018 • Ch, Khyathi u, Thomas Manzini, Sumeet Singh, Alan W. black
Code-switching (CS), the practice of alternating between two or more languages in conversations, is pervasive in most multi-lingual communities.
no code implementations • WS 2017 • Ravich, Abhilasha er, Thomas Manzini, Matthias Grabmair, Graham Neubig, Jonathan Francis, Eric Nyberg
Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording.