Description
The treemap shows how recurring words in the posts analyzed in the previous protocol are used in relation to the approach of the post itself. Each rectangle is a word, blue if used on Twitter and red if used on Weibo; the ones with a marked rectangle around them are the ones in common. The size represents how many times a word was written. When each button is clicked, the main recurring related words used in the posts with that type of approach are highlighted. We matched these information to better understand the relation between specific words and the approaches to the theme. We decided to extrapolate words so that we could understand the vocabulary used within every approach and also in comparison between the two platforms, so the two cultures. It’s interesting to see how the word honest is used only by Chinese people, while words like die or orwellian are used only by the western world.
Protocol
From the previous protocol, we extracted the recurring words for both Weibo and Twitter, with Wordcounter, Text analyzer and Zhonghuayuwen. Then, we selected the first 10 recurring words and the most relevant posts for each kind of approach for both western and Chinese social networks. Finally we matched these two information in a treemap made with Raw Graph that we reworked with Illustrator.
Data
Data Source: Twitter, Weibo
Timestamp: 01/12/2018
View Data (500Kb)
The file focuses on how many times a word is written and the relation between these words and the approach of the posts in which they were used. For that reason, the dataset is made by two columns: keywords (filtered words) and occurrences.