Social Multiple Upload-Download Datasets


Online images are often shared multiple times across social network platforms. To facilitate research on tracing the social network origin of images, we collected two large-scale datasets: i) R-SMUD (RAISE Social Multiple Up-Download), and ii) V-SMUD (VISION Social Multiple Up-Download). Each image is shared up to three times through three platforms: Facebook (FB), Flickr (FL), and Twitter (TW).

The folders are named following this convention:

  [P1]: original images are uploaded to platform [P1] and then downloaded.
  [P1]-[P2]: images in [P1] are uploaded to platform [P2] and then downloaded.
  [P1]-[P2]-[P3]: images in [P1]-[P2] are uploaded to platform [P3] and then downloaded.

1) R-SMUD (download link). 50 RAW images are taken from the RAISE dataset [1], which can be downloaded from here. These images are cropped from the top-left corner to three sizes, 377x600, 1012x1800, and 1687x3000, respecting a 9:16 aspect ratio. Cropped images are named original-[id]-[h]x[w].jpg, where id is the sequence number and h and w are the height and width, respectively. All cropped images are compressed with the Independent JPEG Group's JPEG software at six quality factors {50,60,70,80,90,100}. Compressed images are stored as original/QF-[qf]/original-[id]-[h]x[w].jpg, where qf is the corresponding quality factor.

2) V-SMUD (download link). 510 JPEG images are taken from the VISION dataset [2], which can be downloaded from here. For each of 34 cameras, we select 15 images. Camera D12_Sony_XperiaZ1Compact is excluded because all of its images exceed 5 MB, the upper size limit allowed by Twitter. Images are kept exactly as they are in VISION. The list of images used is stored in original/original-jpeg.txt.
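The folder and file naming conventions described above can be handled programmatically. The following Python sketch is illustrative only and is not part of the dataset distribution. It assumes that folder names use the abbreviations FB, FL, and TW joined by hyphens, and that the id field consists of digits; the id is kept as a string because any zero-padding is not specified. All function names are hypothetical helpers introduced for this example.

```python
import re

# Assumption: folders use these abbreviations, joined by "-".
PLATFORMS = ("FB", "FL", "TW")
# Quality factors listed for R-SMUD.
QUALITY_FACTORS = (50, 60, 70, 80, 90, 100)

# Matches the cropped-image naming convention original-[id]-[h]x[w].jpg.
CROPPED_RE = re.compile(r"^original-(\d+)-(\d+)x(\d+)\.jpg$")


def parse_chain(folder_name):
    """Split a folder name such as 'FB-TW-FL' into its ordered sharing chain."""
    chain = folder_name.split("-")
    if not 1 <= len(chain) <= 3:
        raise ValueError(f"expected 1 to 3 platforms, got {folder_name!r}")
    for platform in chain:
        if platform not in PLATFORMS:
            raise ValueError(f"unknown platform {platform!r} in {folder_name!r}")
    return chain


def parent_folder(folder_name):
    """Return the folder whose images were re-uploaded to produce this one.

    Returns None for a single-platform folder, whose source is the originals.
    """
    chain = parse_chain(folder_name)
    return "-".join(chain[:-1]) or None


def parse_cropped_name(filename):
    """Extract (id, height, width) from a cropped R-SMUD file name."""
    match = CROPPED_RE.match(filename)
    if match is None:
        raise ValueError(f"not a cropped-image name: {filename!r}")
    image_id, height, width = match.groups()
    return image_id, int(height), int(width)


def compressed_path(image_id, height, width, qf):
    """Build the relative path of a compressed R-SMUD original."""
    if qf not in QUALITY_FACTORS:
        raise ValueError(f"unsupported quality factor: {qf}")
    return f"original/QF-{qf}/original-{image_id}-{height}x{width}.jpg"
```

For example, parse_chain("FB-TW-FL") gives the sharing order Facebook, Twitter, Flickr, and parent_folder("FB-TW-FL") gives "FB-TW". The sketch does not reject chains that repeat a platform, since the description above does not say whether such chains occur.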

If you use one of these datasets, please cite our work as:

@inproceedings{Phan2019,
  title={Tracking Multiple Image Sharing On Social Networks},
  author={Q.-T. Phan and G. Boato and R. Caldelli and I. Amerini},
  booktitle={IEEE International Conference on Acoustics, Speech, and Signal Processing},
  year={2019}
}

References

[1] D.-T. Dang-Nguyen, C. Pasquini, V. Conotter, G. Boato, RAISE - A Raw Images Dataset for Digital Image Forensics, ACM Multimedia Systems, Mar. 2015.
[2] D. Shullani, M. Fontani, M. Iuliani, O. A. Shaya, A. Piva, VISION: a video and image dataset for source identification, EURASIP Journal on Information Security, Dec. 2017.


All rights reserved. Multimedia Signal Processing and Understanding Lab. - University of Trento