Description

After the previous videos scraping we noticed that the video-news is often spread using a screenshot of the original video. This visualization allows us to connect the images to the integral videos. In this way we observed which parts of the videos were used as widespread images and we organized them by Google rank. Each video was converted into images using one frame per second, so we could find the better range of time to connect with the Google images. Afterwards, all the videos were compressed in a strip to provide an overview about them and their duration.

Considering the sensitivity of contents we decided to give the possibility to the users to choose how to explore these contents, assigning some labels:
- ‘no explicit violence’: these contents aren’t clearly graphic but the language or the subjects’ conditions bring back to violence; - ‘danger’: these contents are considered dangerous as clearly violent, offensive and they correspond to the minutes before the execution;
- ‘sensitive contents’: the images show graphic content as explicit violence against someone and blood.

In most cases we found out that the main images used to communicate the video-news in the web corresponded to the last couple of labels and that they are often placed in the last minutes of the videos.

Protocol

google imagesadvanced search queriesdefinition image database any videoconverter google image searchurl extractor excel 1 downthemall 1. download videos— 7 original videos— .avi extention2. creation of a folder VIDEO 3 from video to png— 1 fps manual correspondancebetween pictures and filmstrips filmstrip creation advanced options— url source— url image— google ranking DATABASE 3 creation7 Spreadsheets about 7 executions with:— ranking— image domain— image url— source domain— source url exportation options— *inum_*flaturl*.*ext*— jpg, png— 200 images> 1400 results after effects movie barcodegenerator illustrator 2 urldatabase visualization videodatabase 1. incognito search2. seven queries— james foley execution— steven sotloff isis— david haines isis— herve gourdel beheaded— jordan pilot execution— coptic christians isis— ethiopian christians isis 1. manual scraping> 371 results2. creation of a folder IMG 3— 7 folders with pictures 7 excel corpus definition YTDT video+info comments chrome extention scraper open refine excel illustrator visualization comments extractionfrom +18 videos 1. from DATABASE 2— selecting "other" videos— detecting "fake videos" file tab extraction from tab to csv from DATABASE 2 creating DATABASE 7 containing5 Spreadsheets1. General informations— ranking— query— site domain— site typology— censorship level— notes— url— video typology— n. views— n. comments— fake theories2,3,4,5. Video comments— ranking— polarization— comments corpusdefinition 7 excel corpus definition YTDT video+info comments chrome extention scraper open refine excel illustrator visualization comments extractionfrom +18 videos 1. from DATABASE 2— selecting "other" videos— detecting "fake videos" file tab extraction from tab to csv from DATABASE 2 creating DATABASE 7 containing5 Spreadsheets1. General informations— ranking— query— site domain— site typology— censorship level— notes— url— video typology— n. views— n. comments— fake theories2,3,4,5. Video comments— ranking— polarization— comments excel 1 032_http://www.prescdn.pagely... 1. images database/url databse matching 032 http://www.prescdn.pagely.netdna... Rank Image url illustrator 2 042_cdn.thefiscalti-mes.com... A. Picture from images DB Pilot frame_143 Pilot frame_144 Pilot frame_145 B. Video in FPS 2:23 - 2:26 minutes C. Filmstrip 2. timing pictures and video frames HOW TO READ: website terminal software tool corpusdefinition visualization networkdefinition data-sourcedefinition queriesdefinition
1. Queries definition

In Google Image Advanced Search we found the right queries to search the images on Google. We filtered the results for United States, in order to get only the pictures spread in that country. We wanted to analyze which were the video-frames used to report the news from different websites, and the same time where the attention of people was focused on.

2. Corpues definition

The corpus was made of three parts: an image database, a video database and an url database. In particular we dowloaded 1400 pictures from Google using Downthemall, scraped manually to 371 results, and organized by similarities in folders. Then we used Google Image Search Url Extractor, and after the url download, we created an excel file writing the correspondence between url and pictures’ name, adding the google ranking as well. In the end, we extracted the frames from the video using After Effects, importing the video and downloading 1fps. The frames were then associated to the pictures. The last step consisted on the conversion of the videos to strip-line, using Movie Barcode Generator, in order to star the visualization process.

Data

Timestamp: 01/12/2016 - 08/12/2016

Data source: Google images advanced

The data consist of two folders and an .xls file. The excel file contains 7 spreadsheets for each group of image referred to each video, they are organized in 5 columns: growing rank, image domain, image url, source domain and source url. The images folder is organized in 7 sub-folders for each event, containing pictures renamed with the Google rank number first. The videos folders contain the integral videos for each execution. Warning, these folders contain graphic images and videos that may upset some viewers. If you are sure to download these contents you need to use this password: gruattro.