This repository contains code and data for the following paper:
@inproceedings{choshen-etal-2019-language,
title = "The Language of Legal and Illegal Activity on the {D}arknet",
author = "Choshen, Leshem and
Eldad, Dan and
Hershcovich, Daniel and
Sulem, Elior and
Abend, Omri",
booktitle = "Proceedings of the 57th Conference of the Association for Computational Linguistics",
month = jul,
year = "2019",
address = "Florence, Italy",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/P19-1419",
pages = "4271--4279"
}
csvs: Onion labels (e.g., legal/illegal) per websitecyber: code to read and classify documentsebay: documents from eBay (product descriptions)ebay_clean: documents from eBay (product descriptions), after cleaningexperiments: AllenNLP configuration filesonion: documents from Onion (website text), classified by labelonion_clean: documents from Onion, classified by label, after cleaningpaper: source code for the paper