About the Viral Texts Project

“Nothing but a newspaper can drop the same thought into a thousand minds at the same moment…”

—Alexis de Tocqueville, Democracy in America

This site presents data, visualizations, and both computational and literary publications drawn from the Viral Texts project, which seeks to develop theoretical models that will help scholars better understand what qualities—both textual and thematic—helped particular news stories, short fiction, and poetry “go viral” in nineteenth-century newspapers and magazines. During this period, texts published in newspapers and magazines were not typically protected as intellectual property, and so literary texts as well as other non-fiction prose texts circulated promiscuously among newspapers as editors freely reprinted materials borrowed from other venues. In the Viral Texts project, we’re asking: What texts were reprinted and why? How did ideas—literary, political, scientific, economic, religious—circulate in the public sphere and achieve critical force among audiences? By employing and developing computational linguistics tools to analyze the large textual databases of nineteenth-century newspapers newly available to scholars, this project will generate new knowledge of the nineteenth-century print public sphere.

In its first phase (2012-2014), the Viral Texts project focused on developing its text reuse discovery algorithms and investigating reprinting in the nineteenth-century United States. Links to publications from this phase can be found on our publications page. In 2015-2016, the Viral Texts team will focus on improving the project’s algorithms and investigating international reprinting among English-language newspapers in the United States, United Kingdom, and Australia. In addition, researchers are developing methods for analyzing reprinting across languages, beginning with German, the most common non-English language for nineteenth-century US newspapers.

Viral Texts is sponsored by Northeastern University’s NULab for Texts, Maps, and Networks and generously funded by the National Endowment for the Humanities’ Office of Digital Humanities and the American Council of Learned Societies’ Digital Innovation Fellowship Program. The project team includes Professors Ryan Cordell, Elizabeth Maddock Dillon, and David Smith, as well as Ph.D. students Abby Mullen, Peter Roby, Kevin Smith, and Matthew Williamson. This website was designed by Ryan Cordell, who you can contact at @ryancordell on Twitter.