Benjamin M Schmidt Assistant Professor of History, Northeastern University Core Faculty, NuLab for Texts, Maps, and Networks
Approaches to modeling plot
Plot arcs
- Corpus: Television shows and scripts
 - Method: Characterization of aggregate patterns through dimensionality reduction (topic modeling).
 
- Applicable to other corpora
 
CBS internal program analysis
Ernst Dichter papers, Hagley Museum & Archive
Linechart of different media over time
{
"database": "movies",
"plotType": "linechart",
"method": "return_json",
"search_limits": {"MovieYear":{"$lte":2020}},
"aesthetic": {
    "x": "MovieYear",
    "y": "TotalTexts",
    "color": "medium"
}   
}
A subtitle chunking algorithm:
{
"database": "screenworm",
"plotType": "linechart",
"method": "return_json",
"search_limits": {"6th":{"$gte":1},
"word":["love you"]},
"aesthetic": {
    "x": "6th",
    "y": "WordsPerMillion",
    "color": "medium"
}
}
Interactives
benschmidt.org/arceology
Example topics
| topic | label | 
|---|---|
| 2 | Wait wait minute Let’s Look Hurry let’s | 
| 10 | clean smell water wash use bath bathroom | 
| 11 | film movie show TV movies scene play | 
| 31 | hair funny look joke laugh big teeth | 
| 34 | sir course Thank dear London quite Ah | 
| 60 | talk talking Look crazy understand Listen problem | 
| 61 | animals animal bear food wild hunting lion | 
| 74 | hear voice heard sound radio noise listen | 
| 78 | game play ball team playing win football | 
| 80 | girl girls boy look name beautiful pretty | 
| 85 | drink wine beer drunk bottle drinking glass | 
| 90 | years world land water sea ago life | 
| 110 | Madame de Monsieur French dear course evening | 
| 113 | God cool Whoa Look look dude Wow | 
| 119 | married wife wedding husband love marriage woman | 
| 121 | Agent agent security FBI team CIA agents |