Yay, this thread bumped up again. Which reminded me, I now could afford to buy that copy I linked to. It was still available (amazingly) and I got it. I *hope* it was still actually in the shop, rather than just a dead listing. Touch wood, that nothing goes wrong in the post and reshipping.
Edit to add:
So, I've had the book disassembled at a local bookbinder (under 3 EUR, they have great services and prices) and started scanning the individual page blocks, then I will have it reassembled (which will cost 7 EUR with repairs to the binding included). So far I've managed to scan about 1/4 or 1/5th of the book, it went pretty quickly. Though some postprocessing (any volunteers, hints for automated OCR? ) will be needed.
I've uploaded what I've got so far. Note: The pages are pretty much ordered randomly.
WARNING: 90 MEG file
http://daqq.eu/temp/1of4.zip
Wow, that really is an amazing book. I'm eagerly awaiting having a copy in hand.
Daqq, you're going to a lot of trouble to scan it and kudos to you for that. But I think the result could be better with some improvements in your technique.
Main points:
* You have to get the pages flat on the glass. A springy backing, pressed hard on the document, is not optional. Holding the paper down with a few fingers doesn't work. Yes I know that's what you're doing, see pic 04 below - there's your hand. The dark shading on most pages is due to not having the paper flat.
* TIFF is a non-lossy file format - that's good. But doing the scan in two-tone B&W (Fax mode) then letting the scanner software dither dots to approximate fine tonal shading, is a disaster. It also results in character edge jaggies. The large file sizes despite fairly lo-res scan and Fax mode, is due to coding all those dots for the dithered shading. The data loss is not recoverable via post processing.
I hope you didn't yet scan the other three quarters of the book the same way. Because, I'm sad to say and knowing you won't like to hear this, it should be all done over again. In gray scale, and at high enough resolution to avoid moire pattering in the few toned photographs in the book. Since you went to the effort and expense of dissecting the book, it's a shame to get sub-par scanning results.
* You have bleed-through of the text on the back of the pages. This is highly undesirable too. Can be avoided.
Sorry to be blunt. I'm trying to save you putting in effort that will only result in griping from people who read the document you create.
Last year I started writing an article on the techniques of scanning and digital book creation. It's one of those background projects that goes very slowly, not least because I'm still struggling with the techniques for dealing with print photo-screening. Which will be a section of the article, so I want to get that right before posting the article. Long way to go yet.
But for now I've put a draft copy here:
http://everist.org/temp/On_scanning.htmAlso related:
http://everist.org/temp/20140812_disconnecting_the_dots.htmMaybe a few useful points for you.
Ha ha... and that 126 page HP computer manual - I've decided I have to scan it all again, because at that time I hadn't figured out how to eliminate bleedthrough of the text on the back of sheets, and there's too much of it.