What is text analysis?
Text analysis, sometimes referred to as text mining or text analytics, is an umbrella term for research methods that use computers to surface information from large amounts of unstructured text. Novels, newspapers, journals, blog posts, emails, tweets, and metadata are all examples of unstructured text. Text analysis methods aid researchers in sifting through large text corpora to identify patterns and make connections.
What is distant reading and how is it different from close reading?
Distant reading refers to the use of computers to ingest and analyze large text corpora — more than would otherwise be possible for a single person. As opposed to close reading which prioritizes focusing on the details of a certain text, distant reading facilitates observation and analysis of recurring patterns and concepts that recur over time in large text corpora. In literary studies and comparative literature, distant reading and text analysis are closely aligned in their meaning.
Tools for Text Analytics Projects
Mallet (machine learning applications to text)
Beautiful Soup (Python library designed for screen scraping)
TAPoR (Xml, Html, and plain text analysis)
Juxta (comparison/collation tool)
- WordSeer (comparison tool)
- Lancsbox (software package for the analysis of language data and corpora)
Articles for Reference
Benito-Santos, A. and Sanchez, R.T., “A Data-Driven Introduction to Authors, Readings, and Techniques in Visualization for the Digital Humanities,” IEEE Computer Graphics and Applications IEEE Comput. Grap. Appl. Computer Graphics and Applications, IEEE. 40(3):45-57 Jun, 2020
Gruenwald, Oskar, “Taming the Digital Behemoth: Rethinking the Digital-Human Divide,” Journal of Interdisciplinary Studies. 2020, Vol. 32 Issue 1/2, p1-16.
Cheng Li and Menuta, Fedeke, “Big Digital Humanities: Imagining a Meeting Place for the Humanities and the Digital,” Word & Text: A Journal of Literary Studies & Linguistics. 2020, Vol. 10, p173-182
Poole, Alex H., “Now is the Future Now? The Urgency of Digital Curation in the Digital Humanities,” Digital Humanities Quarterly. 2013
Svensson, Patrick, “Introduction to Digital Humanities,” Digital Humanities Quarterly, Vol. 4 No. 1 (2010), http://digitalhumanities.org/dhq/vol/4/1/000080/000080.html
Cordell, Ryan, “Speculative Bibliography,” https://www.degruyter.com/view/journals/angl/138/3/article-p519.xml?tab_body=abstract
- Ed. Gold, Matthew, Debates in the Digital Humanities, University of Minnesota Press (2012) [available here, open source]