From the resource: Jupyter notebooks have seen enthusiastic adoption in the data science community, to an extent where they are increasingly replacing Microsoft Word as the default authoring environment for research. Within digital humanities literature, one can find references to Jupyter notebooks (split off from iPython, or interactive Python, notebooks in 2014) dating to 2015….


From the resource: Second, you need to communicate your research to whomever you can find, wherever and whenever you can. Just like an entrepreneur, you need to have 30-second, 1-minute, 3-minute and 5-minute versions of your research project. Think of it as pitching your research project and include all the essential parts (topic and…


From the resource: Over the summer of 2019, inspired by the promising results in articles like Romanov et al. 2017, I set out to use the Kraken OCR software on a variety of texts. Kraken, see their website or their repository, is open-source command line software that is capable of reaching accuracy rates in the high nineties for Arabic and Persian printed text. Kraken is not equipped to handle every text – I recommend…


About the resource: In April 2019, the “What is a Feminist Lab?” symposium was held at the University of Colorado Boulder, organized by Maya Livio, Lori Emerson, and Thea Lindquist. The event included a range of speakers from interdisciplinary research labs and explored ways in which intersectional feminist approaches can be integrated into labs and…


About the resource: The fall 2019 programme of Sunoikisis Digital Classics, which focusses on Digital Cultural Heritage, has now begun. The nine common sessions, which are broadcast live (and then archived indefinitely) on YouTube, cover three broad strands: imaging technologies, geographic methods and ethical issues. This collaboratively taught semester includes contributions from 20 scholars from…


About the resource: Over the past two years, my colleagues and I in the Digital Content Management section have been working with experts from across many divisions of the Library of Congress to collate and assemble guidance and policy that guide or reflect the practices that the Library uses to manage digital collections. I am…


From the resource: This post will cover how to measure the relationship between two numeric variables with the corrr package. We will look at how to assess a variable’s distribution using skewness and normality. Then we’ll examine the relationship between two variables by looking at the covariance and the correlation coefficient. Read the full resource…
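For readers who want a feel for the three quantities named in this excerpt, here is a minimal sketch in plain Python. It is not the corrr package (which is an R library) and not the post's own code; the function names and the `heights` and `weights` values are hypothetical, chosen only for illustration. It computes the moment-based skewness of one variable, then the sample covariance and the Pearson correlation coefficient of two variables.

```python
from math import sqrt


def mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)


def skewness(xs):
    """Moment-based skewness: third central moment over variance ** 1.5."""
    m = mean(xs)
    n = len(xs)
    m2 = sum((x - m) ** 2 for x in xs) / n
    m3 = sum((x - m) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5


def covariance(xs, ys):
    """Sample covariance (divides by n - 1)."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def correlation(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))


# Hypothetical illustration data, not taken from the resource.
heights = [150.0, 160.0, 170.0, 180.0, 190.0]
weights = [52.0, 58.0, 69.0, 77.0, 90.0]

print("skewness of heights:", skewness(heights))
print("covariance:", covariance(heights, weights))
print("correlation:", correlation(heights, weights))
```

Covariance depends on the units of both variables, whereas the correlation coefficient is unitless and always falls between -1 and 1, which is why the two are usually reported together.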


From the resource: While techniques for text mining, sentiment analysis and other natural language processing are ubiquitous on the Internet, they’re not always accessible to students. I recently ran this tutorial in my journalism class at Northeastern University to help students answer some questions they generated around the Twitter timelines of the Democratic candidates running…
