Projects needing OCR

From DPCanadaWiki

Jump to: navigation, search

See also: General information and guidelines on using The OCR Pool.

Template:Dpscans news

Contents


Using this page

Changes are now logged by the software, so you don't need to add that information at the end of this page. In the "Summary" field write what you previously placed in the change log (e.g. "Added 'Crime and Punishment'"). You do not need to add your user name, the software adds it automatically.

Click on the 'history' link in the gray menu bar to see the change log entries for this page.

Content Providers

  • Add an entry to the list of projects - clone one of the existing entries. Please be sure to mention the required OCR language, processing the scans might need (page splitting, cropping, renumbering, despeckling), and any other issues that might concern a potential OCR volunteer.
  • When a request has been fulfilled, delete the entries which have been fulfilled.

OCR volunteers

***NB: changes to these instructions on May 12, 2007 to aid tracking what's being done.*** 

To claim the project:

  • Update the project you wish to process with *Claimed by username* together with a date. Note that hitting the "signature" button, second from right, or typing four tildes, inserts your username and a timestamp, so is the most convenient way of doing this.
  • Send a PM to the Content Provider saying you are working on OCR for their project.
  • After the OCR is complete, PM the Content Provider to let them know the location of the new images and text files. Edit the Wiki entry to read *Done by username* and date.
  • If there are any problems (such as a missing page) please leave a note on the Wiki page.
  • It is the Project Manager's responsibility to remove the uploaded files from WillPM and remove the Wiki page entry.

If the project is CP-only, then you become the Project Manager once you claim it (so need to do the tidying up as well).

Projects needing OCR

NB The article is a sample only, from the DP-INT Wiki, showing how to style your request. It will be removed once OCR requests begin to accumulate.

ahkitj

Send a Private Message Uploaded to CPOnly: ahkitj_economist-18430902-png.zip running one volume now, others awaiting rescans Crb11 13:53, 28 August 2007 (PDT)

Update: This one's still up for grabs, but scan quality is bad as the microfilm scanner used only does 600dpi max. Could someone still have a go and please give some suggestions for better scan outputs? Contact me via private message if you're keen -- I'm not sure it's still uploaded, but can send a URL for it if required. J
Well, I've decided to start a series of projects involving the first year of The Economist! I may only do one issue, I may do just a few or the whole 1843 volume, but this is a start.
It started in 1843, and this is the 'first' issue -- it's actually the second, as there was a 'prospectus' issue in the August before this one was published.
ahkitj_economist-18430902-png.zip and ahkitj_economist-18430902-png_README in CPOnly will be the relevant files. Clearance key e-mail is in the ZIP file itself.
I'm actually interested in PMing this myself, but it's out of the scope of my academic research interests, so if someone would like to PM this for me as well, I would be most appreciative. So, to repeat, I'm happy to PM this, but wouldn't mind someone else doing so.
Thanks,
Jonathan.
--ngeru 03:40, 4 September 2006 (PDT)
Personal tools