Question for the book scanners
02-17-2021, 09:14 PM
Post: #7
RE: Question for the book scanners
I've been scanning a lot of books for my new site lately, and everything has been possible with free tools.

I use ImageMagick to split each two-page scan into two separate pages. I script it so I don't have to do it manually for each file, but here are sample commands for one file:

convert -crop 1656x2540+5+0 hp42s-prog-de-000.png out/hp42s-prog-de-000a.png;
convert -crop 1656x2540+1656+0 hp42s-prog-de-000.png out/hp42s-prog-de-000b.png

That starts at position 5,0 and grabs 1656x2540 for the left page, and starts at position 1656,0 and grabs 1656x2540 for the right page, and saves them in files with "a" and "b" appended to the original filenames.
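
For anyone who wants to batch this, here is a minimal sketch of how the two commands above could be looped over a directory of scans. It is not the actual script; the function name split_spread and the two geometry variables are made up for illustration, and the crop values are simply the ones from the sample commands, so they will need adjusting for other books.

```shell
#!/bin/sh
# Sketch only: split every two-page PNG scan in the current directory
# into a left ("a") and right ("b") page under out/.
# Geometry values are copied from the sample commands; adjust per book.
LEFT_GEOM="1656x2540+5+0"
RIGHT_GEOM="1656x2540+1656+0"

# split_spread is a hypothetical helper name, not part of ImageMagick.
split_spread() {
    src=$1
    base=$(basename "$src" .png)
    convert -crop "$LEFT_GEOM" "$src" "out/${base}a.png"
    convert -crop "$RIGHT_GEOM" "$src" "out/${base}b.png"
}

mkdir -p out
for f in *.png; do
    [ -e "$f" ] || continue   # skip the literal pattern when nothing matches
    split_spread "$f"
done
```

Keeping the geometry in variables means only two lines change when a book was scanned at a different size or offset.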

I then do a mix of automatic and manual cleanup of the pages, optimize them with optipng and some tools I wrote myself, then run them through tesseract-ocr (a modified build I made to fix bugs) to convert each page to PDF, and finally use the Poppler PDF tools to combine the pages into a single PDF.
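
As a rough sketch of the last few steps using only stock tools: this assumes an unmodified tesseract, skips the cleanup and the custom optimization tools entirely, and assumes pdfunite is the Poppler tool used for combining. The function names ocr_page and combine_pages, the pdf/ directory, and the book.pdf output name are all made up for illustration.

```shell
#!/bin/sh
# Sketch only: optimize each split page, OCR it to a one-page PDF,
# then merge the per-page PDFs. Stock optipng, tesseract and pdfunite assumed.

# ocr_page is a hypothetical helper: optimize one PNG and OCR it to pdf/<name>.pdf
ocr_page() {
    page=$1
    base=$(basename "$page" .png)
    optipng -quiet "$page"
    tesseract "$page" "pdf/$base" pdf   # "pdf" config makes tesseract write pdf/$base.pdf
}

# combine_pages is a hypothetical helper: merge pdf/*.pdf in glob (alphabetical) order
combine_pages() {
    set -- pdf/*.pdf
    if [ -e "$1" ]; then
        pdfunite "$@" book.pdf          # pdfunite takes inputs first, output last
    fi
}

mkdir -p pdf
for page in out/*.png; do
    [ -e "$page" ] || continue          # nothing to do if no pages were split yet
    ocr_page "$page"
done
combine_pages
```

Because the "a" and "b" suffixes sort in page order, the shell glob hands the pages to pdfunite in the right sequence as long as the scan numbers are zero-padded.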


Messages In This Thread
RE: Question for the book scanners - gege - 02-17-2021, 02:57 PM
RE: Question for the book scanners - EdS2 - 02-18-2021, 08:49 AM
RE: Question for the book scanners - Eric Rechlin - 02-17-2021, 09:14 PM
RE: Question for the book scanners - robve - 02-17-2021, 09:19 PM


