Description

In this section, we study the keywords of these 100 books and use them to understand the books' themes and focus. By comparing the keywords, we can identify the topic tendencies of the 50 Chinese books and the 50 American books.

These keywords are extracted from the brief introduction of each book. We identified the keywords that Chinese and American books have in common, which makes it easier to analyze and compare where Chinese and American concerns overlap and where they differ. We also divided the keywords into five categories (actor, field, climate related, controversy, other) so that the comparison can be multi-dimensional.

Through this analysis we learned that the 50 Chinese books cover more fields than the 50 American books. There are large differences between China and the United States in the actor and climate related keywords. In the controversy category, the United States has eight keywords while China has only one, which suggests that the American books discuss the controversy around climate change more than the Chinese books do.

Protocol

Data source: DangDang, Amazon

Go to each online bookstore, search for the keywords “Climate Change”, sort the book list by popularity, and collect the top 50 books from each data source. Manually record the reader rating, classification, summary, and author of each book.

Use Voyant Tools to get the keywords: load all the Chinese book summaries into Voyant Tools, which returns the high-frequency words/terms, then export all the words in text format.

Manually filter out meaningless high-frequency words/terms such as “it”, “a”, “very”, and keep words/terms such as “economy”, “resource”, “EU”, “energy crisis”… to obtain the final keywords. Repeat the same process for the American keywords.
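The counting and filtering steps above can be approximated in a few lines of Python. This is only a sketch: the study itself used Voyant Tools and filtered the words by hand. The function name `keyword_frequencies`, the stop-word list, and the sample summaries are hypothetical placeholders, not the project's data.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the study filtered meaningless words by hand.
STOP_WORDS = {"it", "a", "very", "the", "of", "and", "is", "to", "in"}

def keyword_frequencies(summaries, stop_words=STOP_WORDS):
    """Count how often each non-stop-word appears across all summaries."""
    counts = Counter()
    for text in summaries:
        # Lowercase the text and keep runs of letters/apostrophes as tokens.
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in stop_words)
    return counts

# Made-up sample summaries, for illustration only.
sample = [
    "It is a very clear account of the economy and energy.",
    "Energy policy and the economy of the EU.",
]
freq = keyword_frequencies(sample)
print(freq.most_common(3))
```

The sketch counts single words only, so a multi-word term such as “energy crisis” would be split into two tokens, and Chinese summaries would need word segmentation before counting.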

Manually divide the keywords into five categories (actor, field, climate related, controversy, other) and organize them in Excel.
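As a rough illustration of the categorization step, the manual assignments can be held in a simple mapping and grouped by category. The category names come from the protocol; the function names and the keyword-to-category assignments shown here are assumptions made for the example, not the assignments used in the study.

```python
CATEGORIES = ("actor", "field", "climate related", "controversy", "other")

def group_by_category(assignments):
    """Group keywords under the five categories, preserving input order."""
    grouped = {category: [] for category in CATEGORIES}
    for keyword, category in assignments.items():
        if category not in grouped:
            raise ValueError(f"unknown category: {category!r}")
        grouped[category].append(keyword)
    return grouped

def category_counts(grouped):
    """Number of keywords in each category."""
    return {category: len(words) for category, words in grouped.items()}

# Hypothetical assignments, for illustration only.
assignments = {
    "EU": "actor",
    "economy": "field",
    "resource": "field",
    "energy crisis": "climate related",
}
grouped = group_by_category(assignments)
counts = category_counts(grouped)
print(counts)
```

Counting keywords per category in this way gives the kind of figure used in the description, such as the number of controversy keywords on each side.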

Use Illustrator to visualize the data. Each keyword is represented by a circle, and the size of the circle represents the frequency of the word. The words common to Chinese and American books are placed in the middle and the words that differ are placed on either side, sorted by size. By clicking at the bottom, we can see the words of each category.
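The layout described above (common words in the middle, differing words on each side, sorted by size) amounts to a set comparison followed by a sort. A minimal sketch, assuming each side is available as a keyword-to-frequency mapping; the function name `split_keywords`, the sample frequencies, and the choice to rank common words by their combined frequency are assumptions made for the example.

```python
def split_keywords(chinese, american):
    """Split two keyword->frequency dicts into common and one-sided lists.

    Each list is sorted by descending frequency, with ties broken
    alphabetically so the result is deterministic.
    """
    common_keys = set(chinese) & set(american)
    # Assumption: common words are ranked by their combined frequency.
    common = sorted(common_keys, key=lambda k: (-(chinese[k] + american[k]), k))
    only_chinese = sorted(set(chinese) - common_keys, key=lambda k: (-chinese[k], k))
    only_american = sorted(set(american) - common_keys, key=lambda k: (-american[k], k))
    return common, only_chinese, only_american

# Made-up frequencies, for illustration only.
chinese = {"economy": 12, "energy": 9, "policy": 4}
american = {"energy": 15, "science": 8, "economy": 3, "debate": 5}

common, only_chinese, only_american = split_keywords(chinese, american)
print(common, only_chinese, only_american)
```

The three lists correspond to the middle column and the two side columns of the visualization, in the order the circles would be drawn.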

Data

Data source: Amazon, DangDang