Hi. My name is Jonathan Reeve. I'm a graduate student and researcher in computational literary analysis at Columbia University. I write computer programs that help us understand novels and poetry. This is my blog.

+=
  • Fingerprinting the Chapter
  • Computationally Identifying Similar Books in Project Gutenberg
    corporapython
  • A Project Gutenberg Database for Text Mining
    corpora
  • The Henry James Sentence: New Quantitative Approaches
    syntaxsentencesJames
  • A Macro-Etymological Analysis of The Canterbury Tales
    etymologymedievalChaucer
  • Probabilistic Detection of Character Voices in Fiction
  • A Generator of Socratic Dialogues
  • Chapterize: a Tool for Automatically Splitting Electronic Texts into Chapters
  • A Macro-Etymological Analysis of Milton’s Paradise Lost
  • Macroetym: a Command-Line Tool for Macro-Etymological Textual Analysis
  • A Programmatic Generator for Pound's Cantos
  • Tag, Catalog, Iterate: A Non-Linear Analysis of Katherine Mansfield's "The Garden Party"
  • Introducing Git-Lit
  • A Macro-Etymological Analysis of James Joyce's A Portrait of the Artist as a Young Man
  • A Bookmarklet For Creating Annotags
    annotationopen source
  • A Proposal for a Corpus Sharing Protocol
  • Imperial Voices: Gender and Social Class among Shakespeare's Characters, a Stylometric Approach
  • Detecting Literary Chiaroscuro in Eliot, Dickens, and other Victorian Novelists
    wordnetpythonNLTK
  • A Macro-Etymological Analysis of Character Dialog in Virginia Woolf's The Waves
    etymologyWoolfgender
  • Annotag Calculator 1.0
    annotation
  • A Comparative Macro-Etymology of Whitman Editions
    etymologyWhitman
  • Hacking at the Open Syllabus Project: Collocations by Subject
  • Introducing the Macro-Etymological Analyzer
  • Announcing Annotags: A Concept for a Decentralized Literary Annotation Protocol
    annotation
  • Virginia Woolf's The Waves: A TEI Edition