Description
The second stage of our research was to investigate what is the most discussed topic online in relation to “Gender Equality” and “Gender Inequality”. To find out the answer we decided to study which words are more associated with the topic.
Using the same results from the first chapter queries, we then used the Keyword Density Tool to extract a list of important single-word and two-word keywords from our top 100 results for each query excluding irrelevant words. By counting the keywords manually and merging any similar ones together we created broader clusters. We were then able to identify and rank the most discussed topics for both “Gender Inequality” and “Gender Equality” searches.
For both the queries the most relevant cluster of keywords are about work. The results showed that the links from the “Inequality” search are mostly centred around this topic and economic issues. The “Equality” search also generated similar results, however we found an increased importance on rights with only a narrow difference between the first and second place. This could be explained by the fact that “Gender Equality” focuses more on improvement and working towards the goal of equality, whereas “Gender Inequality” mainly focuses on more economic statistics and factual reporting.
Protocol
We searched for both queries on google.com/nrc (incognito) to be sure that the results were generated using anonymous browsing and with no nationality to influence the outcome. We selected the first 100 results for each query manually, excluding Google books and Wikipedia pages.
Then using the keyword density tool, we inserted the links individually into the programme to create a list of single and two worded phrases, which excluded our queries, any articles and conjunctions. We selected the first 50 single words and first 50 double barrelled phrases to total 100 phrases for each query.
Later we manually identified clusters to identify the most discussed topic online.
Data
Timestamp: 01/12/2016 - 05/12/2016
Download data
Our data consists of two tables of information. One is with the first 100 links from the “Gender Equality” query and the other is the first 100 links from the “Gender Inequality” query. For every link the tool gave us a list of keywords, one word and two-words together. Starting with this dataset, we grouped keywords into clusters to identify what topic is most discussed.