Online sources of scanned book images
From DPCanadaWiki
What is Harvesting?
There are many online sources of scans suitable for processing through DPC. Content providers may download or "harvest" image scans from these websites, get copyright clearance for them, and then OCR/process them as they would their own scans. There are several coordinated scan harvesting efforts going on at DP, so check there before setting up your own harvesting program.
Requirements for Harvested Material
The requirements for harvested projects are the same as for any content going through DPC:
- The material scanned must be in the public domain so that copyright clearance can be obtained. Be careful that you do not harvest and illegally distribute any copyrighted or otherwise restricted material.
- All pages must be present. Users beware -- the quality control in some of these collections isn't the best, so before you start, make sure that scans are available for every page of the project. If you love a project, broken though it may be, please complete it with the aid of the Missing Pages Wiki before uploading it to DPC.
- High enough quality illustrations must be included for the post-processor. Some of the collections provide lovely high resolutions scans, so please include these in addition to the lower quality scans for the proofers at the time of project creation. (Please do not ask the post-processors to download these themselves -- this is your job!) If there are no high quality illustrations available to harvest, please consider choosing another project, or again completing the project with the aid of the Missing Pages Wiki before uploading. Though the text is often considered the more important part of these e-books, you may have trouble finding a post-processor for the project if the illustrations are not satisfactory.
