mu:zines Blog

Home -> Blog -> Display Blog

From Print To Screen

Part One - Overview

by Ben | 1st Nov 2017



This is the first part in a series of articles in which I intend to go through the process I use to "convert" a complete print magazine into an "Active" issue on the site.

This series will be a deep dive, so for most readers, this first overview part will be enough of an insight into the process to get an idea of the effort involved. Subsequent articles will go deep into the weeds for the people who are interested, or who maybe are considering tackling something similar and would find it useful to get an idea of what's involved, what problems they may encounter, what solutions I came up with, and what tools to use.

Overview



So, let's take a broad overview of the the main task groups, once we have a physical issue in our hands to process:


1) - Scanning

(Guide time: 1.5 hours for typical 80-page issue, could be 3 hours or more for 280 page SOS)

2) - Importing to the Scan Library

(Guide time: 5m)

3) - Processing the scans

(Guide time: 10-30m typically)

4) - Outputting the scans for the site

(Guide time: 10m)

5) - OCR Part 1 - Contents & Metadata

(Guide time: 30m)

6) - Article Images

(Guide time: 30-60m typically)

7) - OCR Part 2 - Text Processing & Proofing

(Guide time: 4-6 hours typically)

Guide times are based on a typical 80-page issue in good condition. Older manually-typeset issues, damaged or poor quality issues, lots of image processing and large page-count issues like IM&RW and the later SOS will all increase the time taken on various tasks.

Keeping It Manageable



With any large, complex, multi-step task, the key to keeping it manageable is to make the process as simple, quick, and automated as possible. Every few minutes that can be shaved off a task with an improved tool or workflow, and every task you can get the computer to do for you instead of doing it manually pays off when the process needs to be repeated hundreds, or thousands of times. Workflow really is the key.

I try to keep flexible - I won't necessarily do all steps 1-7 (above) per issue in order. I might do 10 issues of scanning (step 1), or I might process the scans for a few issues in a row (step 3), or get the next three issues up to the proofing stage (step 7). The reason is that the different steps require different resources and mindsets - if I'm sitting in a cafe for an hour, I can't be scanning, or outputting from the scan library (which is on an external drive I don't have with me), but I can be OCRing, or proofing. If I've got a podcast to listen to, I can't be proof-reading articles, but I can be image processing scans or working on article images. Got some TV to watch? I can't really do visual things like image processing, but I can scan another issue, or do some text processing - and so on.

Sometimes items within a task can be done in a different order, or a little differently with the same result, and this also helps break up the monotony.

Keeping the tasks flexible is also key to staying motivated, as I can choose to do tasks that suit the current situation/time available/mood, rather than be forced into doing a particular step that's not convenient at that time, causing a workflow block. This helps keep the project moving forward.

Workflow Notes



Workflows should generally evolve and improve over time. The workflow I have now has evolved immensely since I started, with the result being that the time and effort it takes to get an issue done - while still significant - is significantly less than it used to be.

(As a rough idea, an issue of MT, a reasonably "easy" magazine to process, took around 20 hours to do when I started. With my current workflow, those issues take around 5-6 hours - with better results and quite a bit more meta data options.)

You can't build a workflow until you know what it is you need to do. And you won't know what you need to do until you have experience actually doing the work. So, it's advisable to start with an initial survey to get an idea of the basic workflow, and then put that into action. As you come across new problems, or ways of optimising the workflow, you solve those problems, implement and improve them, and evolve the workflow so it can handle those things better.

In the case of mu:zines, I started first by looking at one issue of Music Technology, and running through processing some example articles to get an idea of the process. As I came across new things, I would adapt the workflow accordingly. And then I'd get onto a new publication, and find this threw up some new things I wasn't expecting - forcing new solutions to be made. I'll go into these in the later parts of this series.

It's important as you work to constantly identify areas that could be done faster, better, improved or automated, and implement them as you go. The net effect is that the more content you do, the more efficient and evolved the workflow is, so it gets *easier* over time, rather than getting more and more bogged down as the process gets inevitably more complex.

So, flexibility, and efficient workflow is what we're going for.


Next time, for part two, we'll dive into the Scanning process...



Back to blog list





mu:zines Blog
- From Print To Screen


    Part One - Overview | Nov 2017

- The Long Lost Patch


    ...and how I rediscovered it | Oct 2017

- One.


    (Happy Birthday to us!) | Mar 2017

- Origins


    You Always Fondly Remember Your First... | Feb 2017

- Electronics & Music


    E&MM Issue 1 is now online | Jan 2017

Blog entries from 2016...

- Contributions Update


    Nov 2016

- Site Updates and Improvements


    Nov 2016

- Added More Publications...


    Sep 2016

- 'Patchwork' articles now available...


     (Synth Patches) | Jun 2016

- Two Months [and ten years] In...


    A quick catch up of activity since our launch. | May 2016

- Welcome to mu:zines!


    It took a while, but we've finally reached a beginning...
    Mar 2016

Follow mu:zines on Twitter: @mu_zines
for updates and other bits and pieces

Please Contribute to mu:zines by supplying magazines, scanning or donating funds. Thanks!

We currently are running with a balance of £50+, with total outgoings so far of £670.50. More details...
muzines_logo_02

Small Print

Terms of usePrivacy