This download form is a temporary solution. Usually, the dataset can be created using the information available here: https://github.com/mediatechnologycenter/aestheval. However, since Reddit's API has changed, the code is no longer working. In the meantime, you can request to download the full dataset using this form.
More information about the dataset can be found here: https://mtc.ethz.ch/publications/open-source/rpcd.html
If you use this dataset, please cite the following paper:
Daniel Vera Nieto, Luigi Celona, and Clara Fernandez-Labrador. "Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment." In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (2022)
@inproceedings{nieto2022understanding,
title={Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment},
author={Daniel Vera Nieto and Luigi Celona and Clara Fernandez Labrador},
booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
year={2022},
url={https://openreview.net/forum?id=-VyJim9UBxQ}
}
Important info about the licence
We comply with Reddit User Agreement (1), Reddit API terms of use (2) and PushShift database Creative Commons License (3). In particular, we refer to the Section 2.d of Reddit API Terms of Use, which states: "User Content. Reddit user photos, text and videos ("User Content") are owned by the users and not by Reddit. Subject to the terms and conditions of these Terms, Reddit grants You a non-exclusive, non-transferable, non-sublicensable, and revocable license to copy and display the User Content using the Reddit API through your application, website, or service to end users. You may not modify the User Content except to format it for such display. You will comply with any requirements or restrictions imposed on usage of User Content by their respective owners, which may include "all rights reserved" notices, Creative Commons licenses or other terms and conditions that may be agreed upon between you and the owners." Moreover, we do not modify the original content by no means, while we provide the necessary tools to process the data and run the same experiments we carried out. We release the dataset under the Creative Commons Attribution 4.0 International license.