News, Resources

Resource: Doing OCR Using Command Line Tools in Linux

In previous posts, we looked at a variety of Linux command line techniques for analyzing text and finding patterns in it, including word frequencies, permuted term indexes, regular expressions, simple search engines and named entity recognition. In this post we focus on a preliminary issue: converting images of texts into text files that we can […]

News, Resources

Resource: Digital Humanities GIS projects revisited

A milestone: my list of Digital Humanities GIS projects has now topped 100 entries. It currently stands at 103 entries, the latest to be added being the Google-sponsored Routes of Sefarad, mapping Jewish Heritage in Spain, and Placing Literature, an ambitious attempt to crowd-source the locations of novels. The original aim was to catalogue academic […]

News, Resources

Resource: Making Forms Accessible

I’ve written on making forms accessible before in the WCAG series, but I thought I’d document some real examples using the work that I’ve been doing. This one is a fairly simple, but important example especially since we’re moving to PDA (patron driven acquisitions). http://dhnow.org/14zLIqp

News, Resources

Resource: Batch Downloading and Building Simple Search Engines with Command Line Tools in Linux

In previous posts we downloaded a single book from the Internet Archive, calculated word frequencies, searched through it with regular expressions, and created a permuted term index. In this post, we extend our command line methods to include automatically downloading an arbitrarily large batch of files and building a simple search engine for our collection of sources. […]

News, Resources

Resource: Digital Preservation Tool Grid

Digital Preservation Tool Grid. This tool grid is the product of researching digital preservation tools by Digital POWRR team members in early 2013.  The information included is accurate to the best of our knowledge, but some information may be incorrect or have changed.  We learned the information from various sources including tool websites, contacting the […]

News, Resources

Resource: Scalar Webinars, Register Now

http://laurientaylor.org/2013/06/15/new-scalar-webinars-register-now/ Curious about Scalar? Sign up for a free webinar! Following up on their recent Beta release, the Scalar development team is offering free online webinars to help new users and the curious learn the platform more easily. The “Intro” webinars will cover the basics of Scalar: fundamental concepts, a review of existing Scalar books, […]

News, Resources

Resource: WordSeer 3.0

WordSeer is a web-based text analysis and sensemaking environment for humanists and social scientists. It’s a a research project at UC Berkeley’s Computer Science Division and School of Information.