Extracting Illustrated Pages from Digital Libraries with Python

Machine learning and API extensions by HathiTrust and Internet Archive are making it easier to extract page regions of visual interest from digitized volumes. This lesson shows how to efficiently extract those regions and, in doing so, prompt new, visual research questions.

Bibliographic Details
Main Author: Stephen Krewson
Format: Article
Language: English
Published: Editorial Board of the Programming Historian 2019-01-01
Series: The Programming Historian
Online Access: https://programminghistorian.org/en/lessons/extracting-illustrated-pages