Projects by Phil Reed at The University of Manchester
This project is maintained by PhilReedData
Analyse a visual aspect of an entire historical collection using a simple tool and begin thinking of questions to ask. Repeat the process with another collection and a developed set of tools from another institution.
The University of Manchester subscribes to the Gale Cengage collection Illustrated London News Historical Archive 1842 to 2003. Access to Illustrated London News is provided online through usual browse and search tools within a web interface.
You can search for keywords or any text within the documents, or browse the issues’ front covers by year. This interface suits many uses but does not, for example, allow one to look at the covers of all issues at once.
We have a back-up hard drive with all issues, as OCR-transcribed text files and image scans. I wrote a tool to quantify one aspect of the cover image, its (crude) average colour, and show this in a calendar view. It covers data from 1964 onwards, when colour was first used on the cover.
(Note that the crude method to determine single mode colour the image is used to keep the workshop simple. A better method would be to use k-means clustering for the k main colours of the image.)
The tool has controls to alter the display, which may appear at the bottom or right of the screen. You can adjust scale, gridlines, wrapping and background colour (black or white). It might help to zoom in or out in the browser as well.
The image below shows the tool adjusted to fit all the issues on-screen, with a black background for clarity. Click the image to view full-size.
An alternative, advanced table view is also available, with each issue is accompanied by its red, green and blue values, plus the equivalent hue, luminance and saturation values. There are controls to alter the display of the table, and links to open the issue in the regular Gale Cengage web interface (if you are on-campus). Be aware, the average colour calculation is very crude, so anything you interpret from the hue, luminance and saturation numbers should be used with care.
More tasks follow after the next part.
There were two stages to the process, both involving writing a little Python (version 2.7.6).
make_csv.py, create_calendara_page.py, create_table_page.py, dummy-values.csv
This version of Python is available on your University desktop through the Software Centre (search “Anaconda”).
An established Digital Humanities Lab at Yale University Library has developed a series of projects titled Robots Reading Vogue.
These projects cover text mining and image mining of the digitised ProQuest collection of the Vogue Archive, the entire run of Vogue magazine (US Edition), from the first edition in 1892 to the current month, reproduced in high resolution colour page images. We (currently) have standard web access to the collection; the projects that form Robots Reading Vogue depend on bulk access to the full text and images.
At the start of this Lab we looked at topic modelling and N-gram viewers, concerned with text mining the collection. Here, we turn to visualisation using the cover images, in particular, colourmetric space.
The project Vogue Covers in Colormetric Space uses the wonderful free tool ImagePlot to quantitatively visualise and display the covers of all issues of Vogue in an interactive chart. We can see how colourful Vogue covers were over time – from the 1890s on the left to the 2010s on the right. The most colourful covers are higher on the y-axis.
Back to top | Parent | Prev | Next |