Image Gallery: Why Humanists Should Fall in Love with “Big Data,” and How?

March 20th, 2016

This is the image gallery for “Why Humanists Should Fall in Love with “Big Data,” and How?,” published in the Talking Shop series of Dissertation Review on March 15, 2016 (http://dissertationreviews.org/archives/13643). Click on each image to see it in full resolution.

Map 1. High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots). Credit: Lex Berman, ed., China’s History in Maps, powered by WorldMap.

Map 1. High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots). Credit: Lex Berman, ed., China’s History in Maps, powered by WorldMap.

 

Map 2a. From where did prefects of the 1040s hail? Data from the China Biographical Database (CBDB), China Historical GIS (CHGIS), and additional research, mapped in ArcGIS. Both CBDB and CHGIS are digital projects developed by multinational and multidisciplinary teams chaired by Peter K. Bol.

Map 2a. From where did prefects of the 1040s hail? Data from the China Biographical Database (CBDB), China Historical GIS (CHGIS), and additional research, mapped in ArcGIS. Both CBDB and CHGIS are digital projects developed by multinational and multidisciplinary teams chaired by Peter K. Bol.

 

Map 2b. From where did prefects of the 1210s hail? Data from the CBDB, CHGIS, and additional research, mapped in ArcGIS. (For full analysis of maps 2a and 2b, see Song Chen, “Governing a Multicentered Empire: Prefects and Their Networks in the 1040s and 1210s,” in State Power in China, 900-1325, edited by Patricia Buckley Ebrey and Paul Jakov Smith. Seattle, WA: University of Washington Press, forthcoming.)

Map 2b. From where did prefects of the 1210s hail? Data from the CBDB, CHGIS, and additional research, mapped in ArcGIS. (For full analysis of maps 2a and 2b, see Song Chen, “Governing a Multicentered Empire: Prefects and Their Networks in the 1040s and 1210s,” in State Power in China, 900-1325, edited by Patricia Buckley Ebrey and Paul Jakov Smith. Seattle, WA: University of Washington Press, forthcoming.)

 

Map 3a. Close-up on central China in Map 1a. “High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots).”

Map 3a. Close-up on central China in Map 1a. “High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots).”

 

Map 3b. Close-up on Sichuan basin in Map 1a. “High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots),” showing also Ming courier routes (green lines).

Map 3b. Close-up on Sichuan basin in Map 1a. “High speed railways in China as of 2011 (red lines) and Ming courier stations (blue dots),” showing also Ming courier routes (green lines).

 

Map 4a. Close-up on the Southeast Coast in Map 2a. “From where did prefects of the 1040s hail?”

Map 4a. Close-up on the Southeast Coast in Map 2a. “From where did prefects of the 1040s hail?”

 

Map 4b. Close-up on the Southeast Coast in Map 2b. “From where did prefects of the 1210s hail?”

Map 4b. Close-up on the Southeast Coast in Map 2b. “From where did prefects of the 1210s hail?”

 

“Bag of Words”

Figure 1. Automated Markups on Wu chuan lu using MARKUS, showing only the first five passages. The text of Wu chuan lu is retrieved from Donald Sturgeon’s Chinese Text Project site. MARKUS is an online markup platform developed by Hilde De Weerdt and Brent Hou Ieong Ho at Universiteit Leiden.

Figure 1. Automated Markups on Wu chuan lu using MARKUS, showing only the first five passages. The text of Wu chuan lu is retrieved from Donald Sturgeon’s Chinese Text Project site. MARKUS is an online markup platform developed by Hilde De Weerdt and Brent Hou Ieong Ho at Universiteit Leiden.

 

Figure 2. Word Cloud of Wu chuan lu using Guan-tin Chien’s HTML5 Word Cloud Generator. Words with a frequency of four or less are excluded for clarity.

Figure 2. Word Cloud of Wu chuan lu using Guan-tin Chien’s HTML5 Word Cloud Generator. Words with a frequency of four or less are excluded for clarity.

 

Figure 3. Network graph showing text reuse in the Korean Buddhist Canons by Donald Sturgeon. Buddhist canons are labeled by numbers according to Lewis R. Lancaster’s catalogue (http://www.acmuller.net/descriptive_catalogue/index.html) and divided into different clusters (shown in different colors) based on the degree of text reuse. Reproduced with permission of the author.

Figure 3. Network graph showing text reuse in the Korean Buddhist Canons by Donald Sturgeon. Buddhist canons are labeled by numbers according to Lewis R. Lancaster’s catalogue (http://www.acmuller.net/descriptive_catalogue/index.html) and divided into different clusters (shown in different colors) based on the degree of text reuse. Reproduced with permission of the author.

 

The Entity-Relationship Model

Figure 4. Automated markups of full personal names in Wu chuan lu, exported from MARKUS after editorial corrections.

Figure 4. Automated markups of full personal names in Wu chuan lu, exported from MARKUS after editorial corrections.

 

Figure 5. Automated markups of place names in Wu chuan lu, exported from MARKUS after editorial corrections.

Figure 5. Automated markups of place names in Wu chuan lu, exported from MARKUS after editorial corrections.

 

There’s Something for Everyone…

Figure 7. Query results of all degree holders from 1000 to 1040 using the querying utility available in the desktop version of CBDB

Figure 7. Query results of all degree holders from 1000 to 1040 using the querying utility available in the desktop version of CBDB

 

Figure 8. Map created in CartoDB showing from where the degree holders in the above query hailed

Figure 8. Map created in CartoDB showing from where the degree holders in the above query hailed

 

Figure 9. Network graph created in Palladio showing scholarly associations found in a CBDB query involving people who flourished between 1030 and 1070. The size of a node is in proportion to the number of other nodes with which it has connections.

Figure 9. Network graph created in Palladio showing scholarly associations found in a CBDB query involving people who flourished between 1030 and 1070. The size of a node is in proportion to the number of other nodes with which it has connections.

 

 Figure 10. Map created in Palladio showing scholarly associations found in the above CBDB query involving people who flourished between 1030 and 1070. The size of a node is in proportion to the number of other nodes with which it has connections.


Figure 10. Map created in Palladio showing scholarly associations found in the above CBDB query involving people who flourished between 1030 and 1070. The size of a node is in proportion to the number of other nodes with which it has connections.

 

Figure 11. Regexes submitted in MARKUS to tag only places on Fan’s travel route in 1177.

Figure 11. Regexes submitted in MARKUS to tag only places on Fan’s travel route in 1177.

 

Figure 12. Keyword markup results on Wu chuan lu using the above regexes in MARKUS, showing only the first five passages.

Figure 12. Keyword markup results on Wu chuan lu using the above regexes in MARKUS, showing only the first five passages.

 

Song Chen
Assistant Professor of Chinese History
Department of East Asian Studies
Bucknell University
songchen@post.harvard.edu

For link to the original post on Dissertation Review, please go to: http://dissertationreviews.org/archives/13643

Comments are closed.