I have a hoarding problem. Not, I hasten to add, in the physical world. There, with the exception of kitchen equipment, I tend to the minimal. My hoarding problem is in the virtual world of information. I hate the idea of losing content. It’s so easy to copy around – it happens for every visit on every page for every image – yet we do such a terrible job of preserving old digital sources.
When tumblr announced it was banning all adult content I was therefore particularly perturbed, and set about locally archiving what I could. The initial idea was to just save sites I liked to use as image sources for this blog. I figured I could sort and catalog it all later. That seemed like a fun way to spend the holiday period. However, once I had the pipeline setup and ticking along, things got maybe a little out of hand.
When I finally pulled it all together and de-duplicated, I discovered I’d got 5.1 million images from about 400 sites occupying over 3TB of space. Oops. The cataloging process might therefore be a touch more time consuming than I first thought. Let’s say I want to run through all images just once and spend just 4 seconds per image. With a lot of animated gifs involved, that seems like a pretty fast average pace. If I devote 2 hours every day, 7 days a week, 52 weeks a year, I should be done in a little over 7 years and 9 months. Alternatively, I could quit my job entirely, put in a solid 8 hours a day, 5 days a week, and be done in a bit over 2 years and 8 months. Piece of cake.
I guess the good news is that there’s no danger of me running out of images to post. The bad news is that I’m not sure how I’ll be able to sort through them all to find the good stuff.
Talking of archiving content, here’s an old image from the Leda / NuWest company. I’d guess its from the mid to late 80’s. I found it via the now unavailable ‘x ray blue eyes’ tumblr. That was the single largest tumblr site I archived, with 355,247 images. That’s a lot of femdom.