Information about DocuBase Documents   

 


Supported Document Types

Types of documents currently accepted into DocuBase:
  • ASCII text
  • GIF
  • GZ files
  • HTML (HyperText Markup Language)
  • JPEG
  • PDF (Adobe Portable Document Format)
  • PostScript
  • PPT (PowerPoint)
  • Microsoft Word
  • TIFF (more information below)

    All document formats except TIFF can be stored on our server or on any Web server you specify. TIFF documents are processed on our computers (OCR and conversion to PDF). We store the TIFFs on our server while they are processed, and the resulting files are stored there as well.

    Scanned documents should be submitted in TIFF format. TIFFs must have a resolution between 300 and 600 dpi and may be color or black and white. (Black and white is much faster to process.) You may submit a single multipage TIFF or a set of individual TIFF files. If you submit a set of individual files, bundle them into one zip or tar file before transferring them to us. Make sure that extracting the zip or tar file yields the TIFFs themselves, not a directory with the TIFFs inside.
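
    The flat-archive requirement above can be met in several ways. The following is a minimal Python sketch, not an official DocuBase tool, that packs individual TIFF files into a zip with no enclosing directory. The function names pack_tiffs_flat and is_flat and the file names in the usage note are hypothetical, chosen only for illustration.

```python
import zipfile
from pathlib import Path


def pack_tiffs_flat(tiff_paths, archive_path):
    """Write the given TIFF files into a zip with no directory component."""
    seen = set()
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in map(Path, tiff_paths):
            if path.suffix.lower() not in {".tif", ".tiff"}:
                raise ValueError(f"not a TIFF file: {path}")
            if path.name in seen:
                # Two files with the same name would collide once flattened.
                raise ValueError(f"duplicate file name: {path.name}")
            seen.add(path.name)
            # arcname=path.name drops the parent directories, so the
            # archive extracts to bare TIFF files rather than a folder.
            zf.write(path, arcname=path.name)
    return archive_path


def is_flat(archive_path):
    """Return True if no archive member sits inside a directory."""
    with zipfile.ZipFile(archive_path) as zf:
        return all("/" not in name for name in zf.namelist())
```

    For example, pack_tiffs_flat(["scans/page1.tif", "scans/page2.tif"], "submission.zip") stores the two pages as page1.tif and page2.tif at the top level of the archive, and is_flat("submission.zip") can be used to check an archive before it is transferred.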

    You can store more than one format of the same document. For example, document 12 of the DigitalBooks collection can exist in both HTML and PDF format. Before you open the "add a new document" form, make sure you know the ID of the document to which you want to add a new format.


    Berkeley Natural History Museums | University of California | Berkeley

    Last updated: Jun 7, 2021