News, Resources

Resource: Principal Component Analysis, Step by Step

In this article I want to explain how a Principal Component Analysis (PCA) works by implementing it in Python step by step. At the end we will compare the results to the more convenient Python PCA() classes that are available through the popular matplotlib and scipy libraries and discuss how they differ. The main purposes of […]
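For readers who want a feel for what "step by step" means here, the following is a minimal sketch of a PCA in NumPy. It is not the linked article's code: the function name `pca`, the synthetic data, and the choice of an eigendecomposition of the covariance matrix are all assumptions made for illustration.

```python
import numpy as np


def pca(X, n_components):
    """Illustrative PCA via eigendecomposition of the covariance matrix.

    `pca` is a hypothetical helper for this sketch, not a library function.
    X has shape (n_samples, n_features).
    """
    X = np.asarray(X, dtype=float)

    # Step 1: center each feature on its mean.
    X_centered = X - X.mean(axis=0)

    # Step 2: covariance matrix of the features (columns are variables).
    cov = np.cov(X_centered, rowvar=False)

    # Step 3: eigendecomposition; eigh is suited to symmetric matrices
    # and returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(cov)

    # Step 4: sort eigenpairs by decreasing eigenvalue (explained variance).
    order = np.argsort(eigvals)[::-1]
    eigvals = eigvals[order]
    eigvecs = eigvecs[:, order]

    # Step 5: keep the leading eigenvectors and project the centered data.
    components = eigvecs[:, :n_components]
    projected = X_centered @ components
    return projected, components, eigvals


# Synthetic data (an assumption): three features with very different spreads.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) * np.array([3.0, 1.0, 0.2])

projected, components, eigvals = pca(X, n_components=2)
print(projected.shape)
print(eigvals)
```

Library implementations may differ from a sketch like this in the sign of the components or in how they scale the data, which is one reason a hand-rolled result and a library result do not always match number for number.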

Resource: Ballad Sheet Forensics, Preservation, and the Digital Archive

Attached are the slides from my recent talk, “Ballad Sheet Forensics, Preservation, and the Digital Archive,” the final presentation at the Huntington Library’s Living English Broadside Ballads conference, April 4-5, 2014 (http://www.huntington.org/uploadedFiles/Files/PDFs/broadside_conf.pdf). The talk focused on the need to reconsider our understanding of what constitutes the “information” that we are trying to capture and/or preserve […]

Resource: Using Census Survey Data Properly

The American Community Survey, an ongoing survey that the Census administers to millions per year, provides detailed information about how Americans live now and how they lived decades ago. There are tons of data tables on topics such as housing situations, education, and commute. The natural thing to do is to download the data, take it at face […]

Resource: The Next Giant List of Digitised Manuscript Hyperlinks

It’s that time of year again, friends – when we inflict our quarterly massive list of manuscript hyperlinks upon an unsuspecting public. As always, this list contains everything that has been digitised up to this point by the Medieval and Earlier Manuscripts department, complete with hyperlinks to each record on our Digitised Manuscripts site. There […]

Resources: Tutorials on Text Analysis and Topic Modeling in Python

A series of tutorials on quantitative text analysis with Python is now available on the DARIAH-DE website. The tutorials were written by Allen Riddell with help from Christof Schöch. The tutorials assume familiarity with the Python programming language. If you’re new to Python and would like to learn the basics, head straight over to the excellent (and recently expanded) […]

Resource: Sustaining Digital Collections

There has been much debate about how we might help secure the future of digital scholarship for the next generation of learners, teachers and researchers. The business end of how this might be achieved, with regard to revenue generation beyond host institutional support, remains a challenge.

Resource: The Walt Whitman Archive

In the past few months, MITH has been developing software for a project related to the Walt Whitman Archive. The Walt Whitman Archive is an electronic research and teaching tool that sets out to make Whitman’s vast work, for the first time, easily and conveniently accessible to scholars, students, and general readers. Working in collaboration […]