(This post is based on “Improving Software Sustainability: Lessons Learned from Profiles in Science“, an interactive paper (pdf) at the Society for Imaging Science and Technology’s Archiving 2013 conference, April 2-5, 2013.) This story begins in the early 1990s at the National Library of Medicine, when our group experimented with arranging, describing and digitizing historical manuscript collections to make…

Read More

Topic modeling is a catchall term for a group of computational techniques that, at a very high level, find patterns of co-occurrence in data (broadly conceived). In many cases, but not always, the data in question are words. More specifically, the frequency of words in documents. In natural language processing this is often called a…

Read More

The digital age and the tools it provides allow for a different mediation of knowledge than standard forms of scholarly communications. As noted by Abby Smith Rumsey these new methods have brought “fundamental operational changes and epistemological challenges [that] generate new possibilities for analysis, presentation, and reach into new audiences”.[2] The exhibit format in Omeka…

Read More

“Non-consumptive research” is the term digital humanities scholars use to describe the large-scale analysis of a texts—say topic modeling millions of books or data-mining tens of thousands of court cases. In non-consumptive research, a text is not read by a scholar so much as it is processed by a machine. The phrase frequently appears in…

Read More

Metadata Games is an online game system for gathering useful data on photo, audio, and moving image artifacts, enticing those who might not visit archives to explore humanities content while contributing to vital records. Furthermore, the suite enables archivists to gather and analyze information for image archives in novel and possibly unexpected ways. Check out…

Read More

Big Data can have enormous appeal. Who wants to be thought of as a small thinker when there is an opportunity to go BIG? The positivistic bias in favor of Big Data (a term often used to describe the quantitative data that is produced through analysis of enormous datasets) as an objective way to understand our…

Read More