An Entity Resolution approach to isolate instances of Human Trafficking online

WS 2017  ·  Nagpal Chirag, Miller Kyle, Boecking Benedikt, Dubrawski Artur ·

Human trafficking is a challenging law enforcement problem, and a large amount of such activity manifests itself on various online forums. Given the large, heterogeneous and noisy structure of this data, building models to predict instances of trafficking is an even more convolved a task. In this paper we propose and entity resolution pipeline using a notion of proxy labels, in order to extract clusters from this data with prior history of human trafficking activity. We apply this pipeline to 5M records from backpage.com and report on the performance of this approach, challenges in terms of scalability, and some significant domain specific characteristics of our resolved entities.

PDF Abstract WS 2017 PDF WS 2017 Abstract
No code implementations yet. Submit your code now

Categories


Social and Information Networks

Datasets


  Add Datasets introduced or used in this paper