The dangers of Webcrawled datasets
Keywords: internet, webcrawler, webcrawling, data gathering, image-processing, information forensics
Abstract
This article highlights legal, ethical and scientific problems arising from the use of large experimental datasets gathered from the Internet, in particular image datasets. Such datasets are currently used in research on topics such as information forensics and image processing. The article strongly recommends against webcrawling as a means of generating experimental datasets, and proposes safer alternatives.
How to Cite
Bell, G. B. (2010). The dangers of Webcrawled datasets. First Monday, 15(2). https://doi.org/10.5210/fm.v15i2.2739
Authors retain copyright to their work published in First Monday. Please see the footer of each article for details.