Supplemental material to the paper "Deep Tags: Toward a Quantitative Analysis of Online Pornography" published in the journal Porn Studies.
Unless specified otherwise, the following datasets are released under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
This in an exhaustive dataset of metadatas of all videos published on the site from its creation - 2007 - until february 2013. This represents almost 800,000 entries.
For each entry, the following metadatas are available:
Metadata | Description | Example | % of Dataset |
---|---|---|---|
upload_date |
Day when the video was uploaded | 4/30/2011 |
NA |
title |
Title of the video | "Tea party at Dick's house" |
NA |
channels |
List of the video's tags | ['Tea', 'Spoon', 'Sugar'] |
NA |
description |
Description of the video | "What a spoon !" |
NA |
nb_views |
Number of times the video has been displayed | 69 |
NA |
nb_votes |
Number of users who voted for or against this video | 42 |
NA |
nb_comments |
Number of comments posted on this video | 666 |
NA |
runtime |
Length of the video in seconds | 4815 |
NA |
uploader |
Anonymized identifier of the uploader's username | 6f60cbef5b891f80 |
NA |
JSON | CSV - 786,121 entries (50M)
This is a non-exhaustive dataset of metadatas for approximately one third of all videos published on the site until february 2013. This represents almost 1,200,000 entries.
For each entry, the following metadatas are available:
Metadata | Description | Example | % of Dataset |
---|---|---|---|
title |
Title of the video | "Tea party at Dick's house" |
NA |
nb_comments |
Number of comments posted on this video | 666 |
NA |
tags |
List of the video's tags | ['Tea', 'Spoon', 'Sugar'] |
NA |
The interest of this dataset is its Tag ecosystem. Unlike other pornographic sites, Uploaders can tag the videos at will. Xnxx has got more than 6,000 tags for describing its videos.
JSON | CSV - 1,166,278 entries (50M)