Skip to main content
OCLC Support

Learner guide: CONTENTdm Basic Skills 2 - Working with Text in CONTENTdm

Build a digital collection

  1. Add a collection.
  2. Configure the collection
  3. Create a project in the Project Client
  4. Configure the project
  5. Add digital items and/or objects
  6. Approve items/objects and index collection

Compound object types

  • Postcard - Front and back images of two-sided items (e.g., tickets, baseball cards).
  • Document - Sequential pages of a report, a journal, a photo album, etc.
  • Monograph - Retain the hierarchical structure of documents, including chapters, sections, and pages.
  • Picture Cube - Select up to six images to link together views of a 3-dimesional object.

Configure or reconfigure a collection for text

1: Metadata fields

basic-skills2-1.png

To edit metadata fields for the collection:

  1. Under the collections tab, click the fields link.
  2. View, add, edit, or delete metadata fields.
    • To add a new metadata field:
      • Click add fields.
      • Enter data for the new field.
      • Click Save changes to add new field.
    • To edit an existing metadata field:
      • Click edit next to the desired field to modify.
      • Edit the field.
      • Click Save changes to update.
    • To delete an existing metadata field:
      • Click delete next to the desired field to remove.
      • Click Yes to confirm or cancel.
  3. Use the move-to drop-down menu next to each field to reorder metadata fields.
  4. Index the collection to update changes to metadata fields. To index the collection:
    1. Click the items tab.
    2. Click index.
    3. Under Index scheduler, click index now.

2: Data type - Full text search

Only one metadata field per collection may have a data type of full text search.

basic-skills2-2.png

Compound objects - File organization

1: Postcards

  • Only two image files can be imported for postcards
  • Transcript files are optional - Can be one or two transcript files (.txt)
  • Root file names for images must match optional transcript files
  • All image files will be in one folder (or directory) and transcript fields in its own folder (or directory)
    basic-skills2-9.png
    Image file name Transcript file name
    paris1.jpg  
    paris2.jpg paris2.txt

2: Documents

  • Transcript files are optional—can be one or two transcript files (.txt)
  • Root file names for images must match optional transcript files
  • All image files will be in one folder (or directory) and transcript files in its own folder (or directory)
    basic-skills2-3.png
    Image file name Transcript file name
    bell1.tif bell1.txt
    bell2.tif bell2.txt
    bell3.tif bell3.txt

3: Monographs

  • Organize image files to reflect the hierarchy of the monograph (e.g., images of chapters or sections in its own folder/directory
  • All optional transcript files can be together in a separate folder/directory, but outside the hierarchy of the monograph
    basic-skills2-4.png

Project client: Import compound objects

1: Create a new project

basic-skills2-5.png

  1. Open your Project Client.
  2. Click the Project menu, then select New.
  3. Enter the URL for your library’s CONTENTdm server.
  4. Enter your login credentials to the server. Click Next.
  5. Select the collection to associate with this project. Click Next.
  6. Enter a name for the project.
  7. Click Finish to create the new project.

2: Configure the project

Use the Project Setting Manager to configure your project.

  1. Click the tab with the name of the current project.
  2. Click the Project menu.
  3. Select Project Settings Manager.
  4. Select the setting you wish to configure.
    • General settings: Contains information about the project and the collection to which it is contributing items; can export these setting for use in other projects
    • Metadata templates: Use to streamline adding metadata to items imported into the project
    • Metadata field types: Contains information about how the metadata fields are defined for the collection
    • Images & thumbnails: Use to modify the size and type of your display images; create watermarks, brands, and bands
    • Image rights: Use to configure the display of copyright information or indicate ownership of items in the collection; must be configured before importing items
    • Processing and OCR: Use if you want to create fully searchable text
    • Project options: Use to configure how upload items in the project to the collection on the server; use and setup the spell checker
    • Find in Collection: Useful for maintaining already-built collections

3: Add compound objects

Add digital files to the project

  1. Click the Add menu.
  2. Select Compound objects.

     Note: Use this to add multiple related files (e.g., pages in a document, audio files on a CD, postcards, etc.).

  3. Add using Compound Object Wizard. Click Add.
    basic-skills2-6.png
  4. Select the type of compound object to create. Click Next.
    basic-skills2-7.png
  5. Browse to folder/directory to locate the desired image folder to add
  6. Select the desired folder. Click Next.
  7. Select Yes to generate display images. Click Next.
  8. Select Page Information, including page names and location of transcripts. Click Next.
    basic-skills2-8.png
  9. Click Finish to add compound object.
  10. Click Close to dismiss the Summary Screen.

View and edit the digital objects after they have been imported to the project

  1. In the project spreadsheet, double-click to open a compound object.
  2. Edit the object and items metadata.
    • View Structure - Use the left-navigation to display and edit metadata for the object and each item in the compound object
    • View Thumbnails - Use the thumbnails to display and edit metadata for the object and each item in the compound object
    • View Spreadsheet - Display the object and each item in the compound object in a spreadsheet view for editing
  3. Click Save at the top of the editor to save changes.

Upload digital objects to the collection on the server

  1. In the project spreadsheet, select each object to upload (or use select all to select all objects).
  2. Click Upload for Approval at the top of the project spreadsheet.
  3. Fix any errors and select those items to Upload for Approval again.

Approve objects/items and index the collection

Approve items uploaded to the collection

  1. In the Project Client, click Administration menu.
  2. Select  approve.

     Note: This opens to the approval page in CONTENTdm Administration.

  3. (Optional) Can select to Approve all or Approve and Index All.
  4. Select individual items or select all to approve.
  5. Click go.

Index the collection

  1. In CONTENTdm Administration, click the items tab
  2. Click index.
  3. Under Index Scheduler section, click Index Now to index the collection immediately
    Or
  4. Schedule the index.
    • Select Once on to schedule the index process to run on a specified date and time.
    • Select Recurring at to specify day(s) and time for the index process to run more than once.

PDF files

PDF characteristics

  • Ideal file format for documents initially created as digital documents (e.g., dissertations)
  • Not ideal file format for scanned images because scanned items do not automatically contain embedded text
  • Adobe Acrobat Reader required to read PDF files

PDF files in CONTENTdm

  •  A PDF file can be treated as:
    • Single file
    • Compound object
  • An advantage to converting a PDF file to a compound object is that each page of the PDF becomes a page with its own metadata record.

Add PDF files

Add PDF as a single item

  1. Under Project Settings Manager, select Metadata templates. Configure PDF metadata template, as needed.
  2. To add a single PDF file:
    1. Click the Add menu.
    2. Select Item.
    3. Browse to folder/directory to locate the desired file to add.
    4. Select the desired file.
    5. Click Add to import the file into the project.
    6. Click Close to dismiss the Summary Screen.

Add PDF as a compound object

  1. Under Project Settings Manager, select Processing. In PDF File Conversion section, select convert PDF to compound objects.
  2. To add a single PDF file:
    1. Click the Add menu.
    2. Select Item.
    3. Browse to folder/directory to locate the desired file to add.
    4. Select the desired file.
    5. Click Add to import the file into the project.
    6. Click Close to dismiss the Summary Screen.

Test your knowledge

  1. What are the four types of compound object structures that CONTENTdm supports?
  2. How does CONTENTdm distinguish a monograph from a document?
  3. There is really only one required action that must be done as part of collection configuration to prepare the collection for transcript files. What is that step?
  4. When importing images and text as part of compound object, how does CONTENT match the image file to the appropriate text file? In other words, what does it match on?
  5. If you use the compound object wizard, it is not necessary for the files to be stored in a root directory.
    1. True
    2. False
  6. Do all PDF files contain embedded text?
    1. True
    2. False
  7. There are two options for importing PDF files in CONTENTdm. What are they?
  8. When viewing a compound object for which there is a transcript file, what is the advantage for the end user if the institution has chosen to use the OCR feature that is available through CONTENTdm?

 

Answer key
  1. CONTENTdm supports four compound objects types: document, monograph, picture cube, and postcard.
  2. Whereas a Document lets you create multiple sequential pages of a report, journal, photo album, or related image sets, a Monograph allows you to retain the hierarchical structure of documents, including sections, chapters, and pages.
  3. The administrator must define a Full Text Search field by editing the collection field properties in CONTENTdm Administration.
  4. The root file name. For example, for the files bell1.tiff and bell1.txt, bell1 is the root file name.
  5. b. False. If using directory structure, monographs require the files be stored in subdirectories within the root directory to create hierarchical organization.
  6. b. False. To check whether your PDF file has embedded text, save it as a .txt file. If the text file contains the text, then the PDF has embedded text.
  7. You can import the PDF file as a single file or convert the file into a compound object.
  8. Search terms will be highlighted in both the text and image file.

Supplemental information

Additional questions? Contact OCLC Support in your region.