News, Resources, Uncategorized

Resources: Tutorials on Text Analysis and Topic Modeling in Python

A series tutorials on quantitative text analysis with Python are now available on the DARIAH-DE  website. The tutorials were written by Allen Riddell with help from Christof Schöch. The tutorials assume familiarity with the Python programming language. If you’re new to Python and would like to learn the basics, head straight over to the excellent (and recently expanded) […]

News, Resources

Resource: Sustaining Digital Collections

There has been much debate about how we might help secure the future of digital scholarship for the next generation of learners, teachers and researchers. The business end of how this might be achieved in regards revenue generation beyond host institutional support remains a challenge. Read full post here.

News, Resources

Resource: The Walt Whitman Archive

In the past few months, MITH has been developing software for a project related to the Walt Whitman Archive. The Walt Whitman Archive is an electronic research and teaching tool that sets out to make Whitman’s vast work, for the first time, easily and conveniently accessible to scholars, students, and general readers. Working in collaboration […]

News, Resources

RESOURCE: Data Mining the Internet Archive Collection

A new lesson by Caleb McDaniel on The Programming Historian focuses on downloading and analyzing records from the Internet Archive. In this lesson, you’ll learn how to download files from such collections using a Python module specifically designed for the Internet Archive. You will also learn how to use another Python module designed for parsing […]

News, Resources

Resource: Guidelines for Digital Newspaper Preservation Readiness

The Educopia Institute has released Guidelines for Digital Newspaper Preservation Readiness. (PDF) These Guidelines are an effort to distill preservation-readiness steps into incremental processes that an institution of almost any size or type can deploy to begin maturing its digital newspaper content management practices. Digital Scholarship Overview | DigitalKoans  

News, Resources

Resource: Web Scraping: Working with APIs

APIs present researchers with a diverse set of data sources through a standardised access mechanism: send a pasted together HTTP request, receive JSON or XML in return. Today we tap into a range of APIs to get comfortable sending queries and processing responses. These are the slides from the final class in Web Scraping through […]

News, Resources

Resource: Semantic Web application

This posting outlines the implementation of a Semantic Web application. It is a sort of inference engine feed with a URI and integer, both supplied by a person. Its ultimate goal is to find relationships between URIs that were not immediately or readily apparent. Source: Morgan, Eric Lease: Semantic Web application