This Amicus Brief was filed in the United States Court of Appeal for the Second Circuit in the case of Authors Guild v. Hathitrust on June 4, 2013. The case is on Appeal from the United States District Court for the Southern District of New York, No. 11 CV 6351 (Baer, J.) Amici are over…
The Internet is finally beginning to penetrate historical practice. At the recent North American Society for Sports History (NASSH) Conference, held May 24-26, 2013 at Saint Mary’s University, Douglas Booth and Gary Osmond provided a fascinating primer on the impact digital history is starting to exert on a field like the study of international sports…
(This post is based on “Improving Software Sustainability: Lessons Learned from Profiles in Science“, an interactive paper (pdf) at the Society for Imaging Science and Technology’s Archiving 2013 conference, April 2-5, 2013.) This story begins in the early 1990s at the National Library of Medicine, when our group experimented with arranging, describing and digitizing historical manuscript collections to make…
When I was asked to participate on this panel about digital humanities and the alternative publishing needs of faculty, I felt obliged to temper my delighted acceptance of the invitation with a caveat that “ scholarly publishing” (and its associated challenges) is not something I usually see as central to the work I do. One…
Topic modeling is a catchall term for a group of computational techniques that, at a very high level, find patterns of co-occurrence in data (broadly conceived). In many cases, but not always, the data in question are words. More specifically, the frequency of words in documents. In natural language processing this is often called a…
The digital age and the tools it provides allow for a different mediation of knowledge than standard forms of scholarly communications. As noted by Abby Smith Rumsey these new methods have brought “fundamental operational changes and epistemological challenges [that] generate new possibilities for analysis, presentation, and reach into new audiences”.[2] The exhibit format in Omeka…
“Non-consumptive research” is the term digital humanities scholars use to describe the large-scale analysis of a texts—say topic modeling millions of books or data-mining tens of thousands of court cases. In non-consumptive research, a text is not read by a scholar so much as it is processed by a machine. The phrase frequently appears in…
Thinking about modeling your data using Resource Description Framework (RDF)? As with any choice of technology, there are benefits and downsides, appropriate situations for Linked Data and use cases that would be fulfilled more effectively by other frameworks. This presentation will focus on the pitfalls to avoid and the challenges of using graphs that are…
Metadata Games is an online game system for gathering useful data on photo, audio, and moving image artifacts, enticing those who might not visit archives to explore humanities content while contributing to vital records. Furthermore, the suite enables archivists to gather and analyze information for image archives in novel and possibly unexpected ways. Check out…
Big Data can have enormous appeal. Who wants to be thought of as a small thinker when there is an opportunity to go BIG? The positivistic bias in favor of Big Data (a term often used to describe the quantitative data that is produced through analysis of enormous datasets) as an objective way to understand our…