CONTENTdm basic skills 2: Working with text in CONTENTdm 1:31 pm - 3:41 pm Thursday, February 29, 2024 | (UTC-05:00) Eastern Time (US & Canada) WEBVTT 1 Mindy Tran 00:33:45.560 --> 00:33:59.200 Well, I'd like to welcome everyone again to content DM basic skills, too in this session, we will be working with text in content DM, my name is Mindy Tran and I will be leading the session today. 2 Mindy Tran 00:34:00.920 --> 00:34:20.760 So with this session, we will be focusing basically on text, however, just a quick review since our first session building a collection, regardless of whether it is a collection of just images or a collection that contains textual material. 3 Mindy Tran 00:34:20.760 --> 00:34:41.240 Or a collection of only texture materials or a combination. You, there are six basic steps to building the collection, first off as you recall, you would need to have a collection added to your content DM server or content DM instance and then configure that collection for inst. 4 Mindy Tran 00:34:42.520 --> 00:35:01.720 Setting up metadata fields that the metadata schema for items that will be added to that collection, and then one of the things that you will do is using the project client to work behind the scenes to add these digital items, Digital objects to. 5 Mindy Tran 00:35:02.480 --> 00:35:22.200 Once it's been configured, you'll create a project that's associated with the collection that you're adding digital items or objects. You'll configure that project so that you can streamline your import into the project, adding your metadata, and then. 6 Mindy Tran 00:35:22.440 --> 00:35:43.320 Uploading it to the items to the collection for them for those items or objects to be approved and index, which will then make it available to the user out on your content DM website. Okay, so regardless of like I said, what, what type of material. 7 Mindy Tran 00:35:43.400 --> 00:35:55.360 Or what type, whether it's a simple images or texture material or what we'll be talking about today compound object, the steps are essentially the same. 8 Mindy Tran 00:35:56.760 --> 00:36:03.120 The way that you import and the way that you add digital items or objects. 9 Mindy Tran 00:36:05.080 --> 00:36:12.200 It just differs by the type of material, but the, the steps are essentially the same. 10 Mindy Tran 00:36:17.240 --> 00:36:33.880 So our agenda for today is either a new collection has been created for you. So you'll configure that collection to handle texture materials or you can reconfigure your existing collection to handle. 11 Mindy Tran 00:36:33.920 --> 00:36:54.360 Items or objects that contains textual materials, some collections, some libraries have collections that are exclusively texture materials such as PDF, for instance, some libraries have a collection where it may be about an individual, all the materials about. 12 Mindy Tran 00:36:54.360 --> 00:37:14.840 An individual and it may contain images. It may contain items or objects with texture materials. Okay, so we'll reconfigure a collection to handle texture materials. We'll talk about compound objects, the different type of compound objects that content DM handles P and then. 13 Mindy Tran 00:37:15.520 --> 00:37:33.240 But not least the texture materials that most libraries work with or create collections for PDF files. How does content DM handle that? How do you important and add that to your content DM collections. 14 Mindy Tran 00:37:43.640 --> 00:38:00.920 So compound objects are multiple images or scans that can be assigned to a single object within content DM creating what we call a compound object in this way, you can store and retrieve related. 15 Mindy Tran 00:38:01.560 --> 00:38:16.840 Including say the front and back of a postcard, a multi- page document and, and up to six views of multidimensional items such as a sculpture or a picture cube, for instance. 16 Mindy Tran 00:38:18.240 --> 00:38:38.680 Content DM labels. These objects as postcards, which are hold both the front and back images of two sided items content DM calls and postcard, but they could be tickets, They could be flyers or even baseball cards, anything that's too sided. Okay. 17 Mindy Tran 00:38:40.600 --> 00:38:59.160 Document, lets you create multiple sequential pages of a report, a journal, maybe a photo album or related image sets something like a CD, for instance, that contain twelve tracks. 18 Mindy Tran 00:39:01.080 --> 00:39:19.640 Or more, I'm, I'm just making, I'm just using twelve as, as a number that just came to mind monograph allows you to retain the hierarchical structure of documents, including if you, if that document or that monograph have sections ch. 19 Mindy Tran 00:39:20.440 --> 00:39:23.480 And pages to it. 20 Mindy Tran 00:39:25.400 --> 00:39:45.240 And then picture cube is the other compound object, which let's use select up to six images to link together. Views of a three dimensional object in this session, we will not cover importing picture cubes that is something outside of the session today and as far. 21 Mindy Tran 00:39:45.520 --> 00:39:56.720 We know we have not have many libraries use a picture cube as a compound object to import into content DM. 22 Mindy Tran 00:40:03.160 --> 00:40:23.640 So if you recall in getting started with content DM, you must ensure that the collection has been configured and defined properly before you add items to the collection. you can, like I said, mentioned earlier reconfigure a collections metadata schema and change the collections options as needed. 23 Mindy Tran 00:40:25.060 --> 00:40:45.540 Or content to the collection that may differ different types of materials or if it's the same type of material that there may be just a slight difference, it may be texture material, but with a slight variation or you need it to add an additional say metadata field, for instance. 24 Mindy Tran 00:40:47.540 --> 00:41:06.020 So looking first at the collection configuration, you will want to be able to edit the metadata that's attached to your digital items, even after they've been added to your collection, so let's put into practice a good workflow to be able to say retrieve and edit the items or objects in. 25 Mindy Tran 00:41:07.940 --> 00:41:26.500 Creating a collection adding items or objects to the collection is one thing, but there's also the maintenance side of it, which is something that we talk about in session three and so one of the good practice to do is how can you retrieve something such as a. 26 Mindy Tran 00:41:26.580 --> 00:41:46.980 Compound object that has multiple pages or multiple images attached to, as a one- one object instead of looking for one page at a time, it may not make that much of a difference using the project client to maintain your item. 27 Mindy Tran 00:41:47.020 --> 00:42:07.460 Or object, but it does make a difference if you use the content DM administration web module to do that, and that's something that we talk about in, in session three and so to put that into good practice, we can add a metadata field called Tag. You can call it. 28 Mindy Tran 00:42:08.780 --> 00:42:27.540 Compound pages, whatever you want to call it, that makes sense for your collection or for your library for the maintenance of that item or for that object in the future, we'll just use tag for our purpose today and another field. 29 Mindy Tran 00:42:29.220 --> 00:42:49.060 For the compound objects that we will be working with today, all have a company transcripts now for those transcript to be searchable, we need to edit or add a metadata field and make that data type full text search. The full text search. 30 Mindy Tran 00:42:50.340 --> 00:43:05.420 Feature in content, DM, makes it possible to search and view the transcript files. Only one metadata field per collection may have a data type of full tech search. 31 Mindy Tran 00:43:06.980 --> 00:43:27.300 So these are at least two fields that we will want to configure in advance of importing compound objects, most of which typically do have a company transcripts. There will be some configuring that we'll need to do with our projects in the project client when we get there. Okay. 32 Mindy Tran 00:43:30.020 --> 00:43:47.940 Remember also that templates are used to automatically create some of the metadata for us if you recall from session one where we went in and created a collection, we configured the collection and we spent time configuring our metadat. 33 Mindy Tran 00:43:48.140 --> 00:44:08.420 Template for that project, excuse me. And so we don't have to enter the same data for multiple items or objects, but if we do metadata metadata templates are a good way to help us do that. We'll want to edit the templates or con. 34 Mindy Tran 00:44:08.820 --> 00:44:28.900 New ones to choose for, you know, the compound objects that we are adding to the collection and if you recall metadata templates do not inherit information. So if you've set up a metadata template say for compound. 35 Mindy Tran 00:44:28.900 --> 00:44:42.180 Objects, so you also elect it to use the image template, then you will need to set up both templates to add the same type of information to the record. Okay. 36 Mindy Tran 00:44:48.740 --> 00:45:04.940 So that's what the full tech search really does is allows the end user to be able to do a search out on the end user website and for it to search the transcript and highlight that term. 37 Mindy Tran 00:45:10.500 --> 00:45:26.500 Not going to spend a lot of time here just want to mention content DM provides an extension that enables the project client to generate file transcripts using optical character recognition or OCR, and this allows the. 38 Mindy Tran 00:45:26.740 --> 00:45:46.980 Characters in an image file to be searched for compound objects. The extension provides an option to create a PDF of the entire compound object for ease of printing. Now the accuracy of this though is dependent upon the quality of your scan. 39 Mindy Tran 00:45:47.620 --> 00:46:07.460 The quality of the original document that is being scanned whether the characters being recognized are type written whether they're a computer generated whether they're hamprinted or cursive. So those font phase makes a lot of difference in how. 40 Mindy Tran 00:46:07.780 --> 00:46:27.940 And how accurate OCR works on that image. Okay, so the type font phase for the type written or computer generated text that OCR can be performed on includes JPEG images, JPEG, two thousand PNG gift and TIFF files. 41 Mindy Tran 00:46:29.220 --> 00:46:48.380 Something to, to keep in mind, but that is an additional license on top of what you already have for content DM and that's something not all libraries choose to use or choose to have. 42 Mindy Tran 00:46:54.820 --> 00:47:14.020 Now, before we actually go in and start importing any compound object, one of the important things about you importing items importing objects whether it's compound object or single, especially compound object. 43 Mindy Tran 00:47:14.100 --> 00:47:34.500 The most important thing before you open up the project client and start working is your file organization for compound objects. There are several options for importing compound objects. So for this training, we will focus exclusively on using the compound OBJ. 44 Mindy Tran 00:47:34.740 --> 00:47:54.980 Wizard, when you're using the wizard, all compound objects must be stored in a main directory or folder the main directory will have subdirectories or subfolders, However you want to think about that for the actual image files and for the optional trans. 45 Mindy Tran 00:47:55.420 --> 00:48:15.380 Files if you are also using custom thumbnails or you've, chosen not to have content DM, create your access image and you have these files as well. You can have a separate sub- directory or folder for those thumbnails. 46 Mindy Tran 00:48:16.740 --> 00:48:35.940 Like the organization of your files, The files are equally important file names are equally important for the success of your import, The root file names for your images image files must match the transcript file if they're accompany. 47 Mindy Tran 00:48:36.060 --> 00:48:39.620 Transcript files for each of the image. 48 Mindy Tran 00:48:41.100 --> 00:49:02.180 You see in this slide that we have Bell one dot TIFF, which is the image file and then we have Bell one dot text, which is the optional transcript file. You may choose not to have a transcript file for the bell one dot TIFF and the corresponding text there won't be a corre. 49 Mindy Tran 00:49:03.020 --> 00:49:05.940 Text file for that. 50 Mindy Tran 00:49:09.220 --> 00:49:24.540 All of the tip files are in the same folder under you can name it image file name. We just sense. my files are tiff, I created a folder called TIFF and all of the image files are stored there. Okay. 51 Mindy Tran 00:49:25.860 --> 00:49:46.340 And my text files are all stored under a file, a folder called text, with the same corresponding file name, except the extension is not PXT for text. This particular structure is what you would expect to see for postcards and. 52 Mindy Tran 00:49:46.420 --> 00:50:02.340 Documents now the order of import is ascending alpha numeric order. So you want to determine your file names accordingly, page titles are used as navigation for end users. 53 Mindy Tran 00:50:05.540 --> 00:50:24.100 So libraries may decide not to use Bell One dot TIFF, they may decide to use page one dot TIFF, which is fine too, but you can also make the page titles change that after the fact after you've. 54 Mindy Tran 00:50:24.740 --> 00:50:27.020 Into the project client. 55 Mindy Tran 00:50:30.920 --> 00:50:50.080 Monographs because they do have a hierarchy are a bit more complex. a monograph is a compound object with more than one level of items monographs are similar to documents, but their pages are organized in. 56 Mindy Tran 00:50:50.200 --> 00:50:56.400 Hierarchy as in the case of a book with chapters or a report with sections. 57 Mindy Tran 00:50:59.040 --> 00:51:18.760 Content DM has two ways of establishing directory structure. They're mimicking the existing directory structure or using a tab delimited text file, which we will discuss in session three, if using directory structure, monograph. 58 Mindy Tran 00:51:19.040 --> 00:51:39.360 Requires that the files be stored in sub- directories or subfolders within the main directory to create the hierarchy of the organization. Content DM supports up to nine levels of hierarchy within a monograph. I have yet to see. 59 Mindy Tran 00:51:39.360 --> 00:51:46.360 A library or monograph that goes bey more than two or three levels. 60 Mindy Tran 00:51:48.320 --> 00:52:08.800 Text files associated with your monograph must be stored in another separate directory or folder and saved as dot text files with the root file names that corresponds to the image file names. So if you have, this is your file name. 61 Mindy Tran 00:52:09.040 --> 00:52:29.280 If there is a text file, so we have our transcripts all in one folder here. So our book is called History of Ohio Canals under that we have a subfolder that contains the directory for each chapter. 62 Mindy Tran 00:52:29.920 --> 00:52:49.760 Including any forwards or cover pages or introductory or preface. So on and so forth here, and then everything under chapter one, everything under Chapter Two is here Chapter three, for instance, all the transcripts though it's in one single folder doesn't, and. 63 Mindy Tran 00:52:49.840 --> 00:52:56.840 It's not separated out by directory, like the images, right? So. 64 Mindy Tran 00:52:59.360 --> 00:53:18.560 Content DM always imports all of the files in the root directory first and then imports the subdirectories. So in this example, shown on the screen, the root directory is called History of Ohio Canals, which is this one right here and. 65 Mindy Tran 00:53:18.880 --> 00:53:39.040 First item to appear in your, in our imported monograph will be the file shown below the sub- directory zero- zero one underscore covers zero, zero two underscore inside cover and so forth. After the individual files are imported, then the subdirectories or chapters will be imported. 66 Mindy Tran 00:53:41.600 --> 00:53:59.520 Okay, now in this example, shown on the screen, the door, the folder, zero, two underscore chapter two or the chapter two here, which is the chapter one is imported and chapter two, you will notice that. 67 Mindy Tran 00:54:00.160 --> 00:54:20.000 It contains files and another sub- directory or another subfolder, if you want to think about it called Section one content DM will import the file, Zero one underscore page, fifteen all the way to eighteen first, and then it goes back and then to import everything under the SE. 68 Mindy Tran 00:54:21.600 --> 00:54:24.360 Folder, okay. 69 Mindy Tran 00:54:27.040 --> 00:54:28.920 Make sense so far. 70 Mindy Tran 00:54:30.240 --> 00:54:51.360 The order of import again is always ascending alpha numeric order in these examples numeric prefixes help to establish the order of import. So, because you are telling the project line, you want these images to be import in a very specific order. So all of these. 71 Mindy Tran 00:54:52.360 --> 00:55:10.080 All of these prefix zero- zero zero one underscore cover is forcing the project client to import it in the order that you want these pages to appear in the monograph in the, in this compound object. 72 Mindy Tran 00:55:12.480 --> 00:55:32.320 If using a tab delivant text file to determine structure, the files are stored in a single folder and we will cover that in session three. Okay, so this is all of your organization for monographs for the images, The transcripts all just. 73 Mindy Tran 00:55:32.320 --> 00:55:52.520 Disappear in a single folder under the transcripts or text, whatever folder you name it, and we'll have the name exactly the same as this. So if there is, like I said, it will be zero, zero zero one underscore cover dot text zero, zero, zero two underscore inside cover dot text. So on and so forth. 74 Mindy Tran 00:56:02.400 --> 00:56:18.400 Now, before I demonstrate this process live, let's review the process to build a compound object first. you'll want to make sure that the file organization is correct. You want to make sure that you can access all the files and that the file names and directory stru. 75 Mindy Tran 00:56:18.840 --> 00:56:30.040 Is ready for import next? You'll want to make sure that the collection and your project are set up for full text, if you have searchable transcript files. 76 Mindy Tran 00:56:32.480 --> 00:56:50.720 One of the very quick way to know that you may not have a full tech search field in your collection that you're adding your compound object to that contains transcript. Is that when you get here and the transcript area is grayed out. 77 Mindy Tran 00:56:51.680 --> 00:57:12.160 Okay, and that's an indication that could be that the project, the collection that you are associating this project with is an incorrect one or the collection administrator did not add a full text search metadat. 78 Mindy Tran 00:57:12.440 --> 00:57:23.040 Field to the collection. So if that is the case, you'll need to contact the, your content, a collection administrator and ask. 79 Mindy Tran 00:57:26.880 --> 00:57:34.320 For the individual to check the collection to make sure that there is a full tech search metadata field. 80 Mindy Tran 00:57:36.480 --> 00:57:56.320 Once you start up the compound wizard in the project client, you will be prompted to identify the location of the image files and then if you have successfully configured your collection so that one metadata element has full tax as it's data type. You'll be prompted to identify the location of the trans. 81 Mindy Tran 00:57:57.160 --> 00:58:01.960 Also if it's grayed out, you know, that it hasn't been configured. 82 Mindy Tran 00:58:07.200 --> 00:58:24.480 So let me go ahead and switch my screen here. I'm gonna switch over to my browser and when you are in content DM administration, you'll go to collections and make sure that the collection. 83 Mindy Tran 00:58:24.560 --> 00:58:44.960 That you want to configure is the one that appears here as the current collection, if you have more than one collection, if you are the collection administrator, if you have more than one collection that you are responsible for, you wanna make sure that the collection that you want to configure is the one that appears here as the current collection. Other. 84 Mindy Tran 00:58:45.320 --> 00:58:53.760 You can click the drop down here to select the right one and make sure once you make that selection to click change. 85 Mindy Tran 00:58:55.840 --> 00:59:15.520 Some, sometimes if you're in a hurry, you may select from the drop down and then for CLI forget to click change and that won't change the current collection, it'll just stay with whatever collection that was there first, and you may have made a, an incorrect configuration. 86 Mindy Tran 00:59:16.960 --> 00:59:23.840 So, in this case I already have this collection, I need to add per. 87 Mindy Tran 00:59:25.960 --> 00:59:45.760 My profile, I need to add a couple of metadata fields to the existing schema. So I'm gonna go to the fields section here and we already see that we have eighteen fields. I'm gonna go ahead and add another field and I'll call. 88 Mindy Tran 00:59:45.920 --> 01:00:06.240 This, as I said, a tag field, we'll just call it tag will map will, it's not gonna be mapped to win core element. It is a data type text only. There's no, yes, I'm gonna make this searchable because I want to be able to search the tag to bring back. 89 Mindy Tran 01:00:07.520 --> 01:00:11.320 Which is something that we want to think about for the future. 90 Mindy Tran 01:00:13.280 --> 01:00:33.120 Yes, I want it hidden from the public because this is something only for internal use is not something that the public needs to know about, and no, it's not a required field because it may not other objects may not need it, or it may be a combined collection where I don't want this. 91 Mindy Tran 01:00:34.260 --> 01:00:55.300 Created for the sake of having one, if it's just a simple image. Okay, there's no control vocabulary to this. So I'll go ahead and just save it. Another field that I want to add and here this is, will be up to each individual library. Some libraries have a separate field. 92 Mindy Tran 01:00:55.380 --> 01:01:15.780 Create a new field call it transcript. They, some libraries decide to use an existing field, like description or an existing field that they know, they will never use and just, and, and just rename it as something else, for instance, we'll go ahead and just add a new field just to make it eas. 93 Mindy Tran 01:01:16.420 --> 01:01:23.580 And I will just call this transcript as like I've done before, and I will map this. 94 Mindy Tran 01:01:25.380 --> 01:01:45.860 To none, it's, there's no Dublin core equivalent. However, my data type isn't just tax. I want this to be full text. I want this to be searchable. So we'll add this as a full tech search data type. Okay, we don't need to show the. 95 Mindy Tran 01:01:45.900 --> 01:02:06.340 Large field there will be on the end user interface. Now there is a separate tab for transcripts and so we don't have to worry about the show large field. We searchable, we don't want to hide it from the public because we want the user and users to be able. 96 Mindy Tran 01:02:06.540 --> 01:02:25.460 See it the transcript. It is not required. Not every I object will have a transcript and so we don't want to make it required and just have something random added there by the project client operator and so we'll go ahead and save changes. 97 Mindy Tran 01:02:26.820 --> 01:02:47.940 And we are done adding our metadata and so since we are done configuring or reconfiguring our existing collection, now we are ready to open up the project client and this time I'm gonna go ahead and create a new project. Let's see handle. 98 Mindy Tran 01:02:48.380 --> 01:03:08.420 Materials, I'm, I may be working in the same collection, but I decided that I want to keep my simple images separate from my compound objects and texture materials. Okay, so I don't have to continually go back and reconfigure my metadata templates if I was importing. 99 Mindy Tran 01:03:09.180 --> 01:03:26.660 I'll just keep those metadata templates associated with the project for images and have the metadata templates configured for compound objects and texture materials. So I'll go ahead and just create a new project. 100 Mindy Tran 01:03:28.340 --> 01:03:34.180 My servers already there. I just need to go in and add my username and password. 101 Mindy Tran 01:03:35.980 --> 01:03:44.820 I need to select my collection and it is this one since I have so many and I will call this. 102 Mindy Tran 01:03:54.500 --> 01:03:56.380 And click finish. 103 Mindy Tran 01:03:58.340 --> 01:04:04.980 So it went out to the server and picked up anything new that's out there. 104 Mindy Tran 01:04:06.660 --> 01:04:26.500 Like new fields that have been added to the metadata so before we start importing compound object, one of the things that we want to set up as before is, say our metadata templates for these compound object. So because it's a new project, the compound OBJ. 105 Mindy Tran 01:04:28.500 --> 01:04:46.980 Because this is a new project, excuse me, the debt metadata templates are new, so we will reconfigure these, most of my. I am not going to have any select any specific image ti files. I will just use the generic image template. 106 Mindy Tran 01:04:47.100 --> 01:05:06.820 To be applicable for regardless of what type of image file? I'm importing whether they're TIFF JPEG or PNG, et cetera and for these, I will edit so that the publisher will always show as OCLC training. 107 Mindy Tran 01:05:09.380 --> 01:05:10.700 For the. 108 Mindy Tran 01:05:17.700 --> 01:05:23.940 Relation is Craven family collection. 109 Mindy Tran 01:05:26.660 --> 01:05:29.980 I want the source to be automatically. 110 Mindy Tran 01:05:31.780 --> 01:05:52.260 The file name to just automatically extract it and put in this field for me, so I don't have to do it myself. Same thing for the catalogger, My username to be extracted and if I scroll down just a little bit, you'll notice that there's a tag and the transcript field, those two new metadata fields that was added by the. 111 Mindy Tran 01:05:52.660 --> 01:06:13.380 Administrator for this Craven family, January, twenty- twenty- four collection. So that's one template. Now if I want that same information to be added to say, a compound, the compound object, which is the description for. 112 Mindy Tran 01:06:14.660 --> 01:06:33.740 Object the image template that metadata the metadata that we've added there or configured there will only be added to each individual image files that's being added in or imported in. So if we want that same information to. 113 Mindy Tran 01:06:36.180 --> 01:06:54.300 Or to be added to the compound object to describing that entire object that we're adding. We also need to configure the compound object template, right? Because I'm gonna select to use it. I'll go ahead and edit again for this one. 114 Mindy Tran 01:06:58.180 --> 01:07:07.740 I didn't spell that right source. I will want it to extract the file name and put it there. the same thing with catalogger. 115 Mindy Tran 01:07:10.340 --> 01:07:14.700 And same thing with relations. 116 Mindy Tran 01:07:16.740 --> 01:07:17.860 Okay. 117 Mindy Tran 01:07:20.580 --> 01:07:24.260 Actually, for the source, I'm gonna say. 118 Mindy Tran 01:07:33.380 --> 01:07:51.940 We'll just leave that as file name. That's okay and say, okay, we've selected our templates we've already configured our templates say, Okay, and now we are ready to go over to our compound object project here and start importing compound objects. 119 Mindy Tran 01:07:52.580 --> 01:07:59.100 Let's say that we want, we have a postcard that we want to import. 120 Mindy Tran 01:08:01.660 --> 01:08:22.020 To do that, we want to add a compound object using the common task on the left hand side, or we go up to add and say, compound object whichever works for you or use the keyboard shortcut. Once we're here, we want to use the compound object wizard. like I said, there's multiple ways. 121 Mindy Tran 01:08:22.180 --> 01:08:42.460 To add compound objects, we will just be using the wizard for class today. We're gonna go ahead and click ad here and the wizard will walk us through this process. We are adding a POSTCARD, so we're gonna choose that. 122 Mindy Tran 01:08:43.299 --> 01:09:02.980 This compound object is not defined by a tab delunus, so we're not using a tab delimited today. Next, where is the file of images that we want to import. So we'll browse out and point the project. 123 Mindy Tran 01:09:03.100 --> 01:09:23.020 Client to our postcard, we are adding the bell postcard. So I'm gonna open this up. There are two folders I'm going to import the images folder. There are two images select the entire folder that contains all of the images say, okay. 124 Mindy Tran 01:09:23.500 --> 01:09:37.540 Next, yes, I want content DM to generate display image for me. I don't want to do it myself next now do I want to use the file names as titles. 125 Mindy Tran 01:09:39.500 --> 01:09:59.940 Do I want to relabel it or, you know, like page one page, two, for instance, if you've spent the time to label your files and you want to use that as the title, as well as the navigation on the end user side, you don't have just use the file names. 126 Mindy Tran 01:10:00.020 --> 01:10:20.420 Ask your title and ask, and that's, and that's it transcript. Do you have a company transcripts? No, then you can leave it at. No, but if you do, then you want to select that and point the wizard to where your transcript folder is. So for me. 127 Mindy Tran 01:10:20.420 --> 01:10:24.980 It is under bell Postcard and it's this text file here. 128 Mindy Tran 01:10:26.180 --> 01:10:46.660 There's a one file only. We'll talk about why there's only one file in just a little bit here say, okay, next it's making sure this is a summary of what the wizard will be importing into the project client two postcard item. 129 Mindy Tran 01:10:47.700 --> 01:11:08.180 And there's a transcript file in this folder. Okay, I'm gonna say finished and it tells me that it's added to postcard two pages for the postcard transcript file. It didn't find the bell post- one dot text. Well, that's okay because. 130 Mindy Tran 01:11:08.860 --> 01:11:28.660 Cards, sometimes they're the front side is only an image, so there's no written words or anything. So there is no transcript file associated with that image, and that is perfectly fine. There's only a bell post- two dot tax because this other side there's written text and it's been transcribed into. 131 Mindy Tran 01:11:28.860 --> 01:11:36.420 A transcript file and so there is that accompanying transcript for the second image. 132 Mindy Tran 01:11:38.900 --> 01:11:58.100 Close it and now your file that is here or your compound object is staged and it's ready for you to finish and add it imported into this project. You can continue to use the pro. 133 Mindy Tran 01:11:58.100 --> 01:12:18.580 Compound object wizard to add more postcard. So if you have ten or twelve postcards before you click this finish, you can go through that same process again to bring in to this staging area. What I call the staging area additional postcards before you. 134 Mindy Tran 01:12:22.420 --> 01:12:39.060 Now also at this screen, you can change the title of the compound object because the image, the file, the folder on your- wherever your image files are it, the, my folder says Gift images, I can. 135 Mindy Tran 01:12:39.100 --> 01:12:56.620 Change this now if I know that this is the bell postcard I can change the title of this postcard to say, bell postcard at this point, or I can change it later once it's been imported in either way, you can do. 136 Mindy Tran 01:12:58.260 --> 01:13:17.980 For the object at the object level itself. Okay, now if I don't want to click finish yet, I can bring in another postcard same process add like I said, postcards, no, it's not using tab delimited next this time. I am bringing in. 137 Mindy Tran 01:13:20.020 --> 01:13:38.580 The paris Postcard and I'm going to bring in the tip images that's associated with them. Okay, so I'm gonna select those, this folder say, okay, next notice, it's the same process. I'm gonna use. 138 Mindy Tran 01:13:39.220 --> 01:13:58.940 Name as the title. I am going to browse out to a different folder for the transcript because it's a different postcard. There's that text file here. Okay, next, yes, just confirm that those are the correct path. 139 Mindy Tran 01:13:59.060 --> 01:14:19.540 Next and same message, yes, pairs one dot text doesn't exist. There's only images there's no transcript associated with the image file, close it, and now you have two postcards in there and let's say I'm ready to do that. 140 Mindy Tran 01:14:19.580 --> 01:14:40.020 Click finish if you have a lot of them, probably, it's time to go grab a cup of coffee once it finished adding it into the project client to this project that you just created for compound object, come back from your coffee break and you have the images when you click close here. 141 Mindy Tran 01:14:40.220 --> 01:15:00.500 Dismiss this message. Now you have the two compound objects to begin working on them or ten or fifty, however many that you're bringing in. Okay, you'll notice the same project spreadsheet, but a little bit different than the image file because this is a compound ob. 142 Mindy Tran 01:15:01.260 --> 01:15:07.140 Only the object level of each compound object is shown in this spreadsheet. 143 Mindy Tran 01:15:08.820 --> 01:15:28.140 So you will notice that the bell postcard is one, the Paris Postcard is the second one, but you only see the object description for this postcard, so it's the metadata that describes the entire object. 144 Mindy Tran 01:15:29.940 --> 01:15:49.780 It requires a subject here, but the individual page level, the front image and the back image aren't showing here. So, in order to see that you need to click to open this up in its own tab and you will notice gonna move this. 145 Mindy Tran 01:15:49.980 --> 01:16:10.260 In a little bit here on the left hand side, you're viewing the structure of this compound object. So what it's showing here is again the object level metadata, there is just the bellpost card. That's the title that I made the change to in the staging area. I can. 146 Mindy Tran 01:16:11.540 --> 01:16:14.860 The title, again, if I want to hear. 147 Mindy Tran 01:16:16.020 --> 01:16:36.500 Changing the title in the metadata is independent of changing the title in the structure of the compound object. So if I decide that I'm taking the word postcard out of this title here, notice, it only change the. 148 Mindy Tran 01:16:39.220 --> 01:16:56.980 For this, the post, the structure of this postcard, but the metadata associated with this compound object, what the user is seeing like under describing this object stays the same, the title is still Bill postcard. Okay, if you want. 149 Mindy Tran 01:16:58.060 --> 01:17:17.460 You click it so that it appears like that, and then you change it back. Okay, if you want to edit the metadata describing the front page of this bell postcard, you click that, and it shows you, this is the title. 150 Mindy Tran 01:17:17.660 --> 01:17:24.740 Of Bellpost one. so I can change here to say capitalize it. 151 Mindy Tran 01:17:30.260 --> 01:17:46.260 Okay, and if I click in the subject, I can click on it and it steps me into the source for graphics material that have been chosen as the subject of controlled vocabulary for this. 152 Mindy Tran 01:17:46.660 --> 01:17:50.700 So maybe for this one, I can say this is travel. 153 Mindy Tran 01:17:53.940 --> 01:18:10.540 And assign that everything else looks good, Source relation. I don't have anything else to add the tag. You'll notice, we won't worry about this right now, cause I have one other thing to show you now Bell. 154 Mindy Tran 01:18:13.780 --> 01:18:16.540 Maybe another subject to add. 155 Mindy Tran 01:18:20.900 --> 01:18:38.740 Is Aircraft is the other one that I want to add here. That's the image. Okay, bell, post- two may be a different subject. It again, it may be travel notice that with bell post- two, let's add this. 156 Mindy Tran 01:18:41.300 --> 01:18:59.220 As travel here for the subject, but you will notice that the transcript field now has the TR, the company transcript that's for side two. Okay, so have I double clicked just so that you can see here. 157 Mindy Tran 01:19:00.020 --> 01:19:12.900 Transcript file that was associated with it. It's been brought in and put in the transcript field for this object on Bellpost on side two. 158 Mindy Tran 01:19:15.220 --> 01:19:17.700 Let's change this. 159 Mindy Tran 01:19:25.460 --> 01:19:44.660 You will notice that as I'm changing the metadata here it doesn't change the structure. Now this is, you clicking and editing one item at a time or one page at a time. You also, if you click on the view structure, have the option. 160 Mindy Tran 01:19:45.940 --> 01:20:05.100 This compound object this postcard in what we call the spreadsheet view for just this compound object, you will notice that anything above this dark gray line, that is the object level metadata. This describes the entire object. 161 Mindy Tran 01:20:05.780 --> 01:20:11.500 Anything below that or the individual page level metadata each page. 162 Mindy Tran 01:20:15.620 --> 01:20:33.300 Describes each side of the postcard. Okay, now if I know that this has the same, I'll call, I can just type in the same subject there notice that everything else is pretty much EQU. 163 Mindy Tran 01:20:34.300 --> 01:20:53.780 Looks the same don't have to worry about that if let's say I, one of the things I forgot is that this postcard was actually dated nineteen eighty for instance, I can type that in here in the project. 164 Mindy Tran 01:20:54.780 --> 01:21:01.540 Away and then come back and right, click and say, I want to feel all or feel down from this point. 165 Mindy Tran 01:21:03.740 --> 01:21:23.580 So this is a way for you to edit a compound object that has a lot of pages so that you can use the same spreadsheet view to view the entire compound object in the project client be able to add large quantities of metadata using those. 166 Mindy Tran 01:21:27.420 --> 01:21:44.060 Tools for, of the spreadsheet view that you would use in, we introduce in session one where you can use to edit multiple image items like your ad metadata for multiple images at a time. 167 Mindy Tran 01:21:44.060 --> 01:22:04.540 So using in the spreadsheet view and then if all looks good, you want to save it and close it, and it takes you back to the spreadsheet view that contains the two compound object and the same thing, if you edit the Paris Postcard, doub. 168 Mindy Tran 01:22:05.300 --> 01:22:18.660 Shows you the structure view and if here instead of tiff images you want to change this to Paris postcard you can do that and again, change the structure of this to say. 169 Mindy Tran 01:22:28.220 --> 01:22:40.020 Now one of the things that I had forgotten to do for the bell postcard, which I will do here. So let's say for this subject, let's go ahead and. 170 Mindy Tran 01:22:46.780 --> 01:22:49.500 We'll call this voy just around the world. 171 Mindy Tran 01:22:57.020 --> 01:23:16.220 And same thing with Paris two. So there is a transcript file with side two, so it's here. I'm gonna leave it as. I'm not gonna change the structure of this. I'll leave the title exactly the same. I'll view this in a spreadsheet view again, and. 172 Mindy Tran 01:23:16.220 --> 01:23:28.500 Since there's already a subject here and the subject will be the same for side one and side two or Paris one in Paris two, I'll just take advantage and feel all from this point. Okay. 173 Mindy Tran 01:23:30.300 --> 01:23:33.060 Now the thing that I've forgot. 174 Mindy Tran 01:23:34.780 --> 01:23:37.020 I do not like. 175 Mindy Tran 01:23:43.100 --> 01:24:03.580 Sorry about that. Notice that in the tag we didn't add anything and I had talked about making sure that you have a tag, that would be helpful when you try to search for item for objects, not in the project client. 176 Mindy Tran 01:24:03.740 --> 01:24:08.820 But out, and when you're using content DM administration to. 177 Mindy Tran 01:24:10.020 --> 01:24:29.300 To maintain objects that made something useful so that you can bring back and we'll see how that happens in session three, if you don't have a tag or a way to bring all of the, not just the object level, but all of the page level. 178 Mindy Tran 01:24:30.700 --> 01:24:50.340 To, in order to edit the page level metadata, it will be very difficult and especially when you, you're collections continue to grow and then if you need to go back and make a change, you may, if you don't have a tag to be able to pull all of those page level. 179 Mindy Tran 01:24:51.580 --> 01:25:11.420 Metadata together so that you can make the edits, it may become a little bit unwilly to find each individual page of that, or you may have to wait until you have access to the project client in order to edit that compound object. Sometimes you may not have that. 180 Mindy Tran 01:25:12.180 --> 01:25:31.900 Maybe working at home and you don't have the project line installed on your home laptop for instance, and you may need to make a change immediately. Okay, so let's say that this is the paris Postcard, so we can call this postcard underscor score pairs if I. 181 Mindy Tran 01:25:32.180 --> 01:25:52.380 Just name it. Something like that. Anything that you want that you will be able to search for, and I'm just gonna add this to all of the individual pages. Okay, something will see in our session tomorrow. I'll leave the other post. 182 Mindy Tran 01:25:53.020 --> 01:26:12.860 Is so that when we go to talk about in session, three and search for these items to maintain in the future, why having a tag is very useful for working in the content DM administration, web module. 183 Mindy Tran 01:26:13.500 --> 01:26:17.380 I'll go ahead and save it, close it and. 184 Mindy Tran 01:26:20.780 --> 01:26:24.300 Controlled vocabulary, not found interesting. 185 Mindy Tran 01:26:26.300 --> 01:26:28.980 Nope, it disappeared finally. 186 Mindy Tran 01:26:30.860 --> 01:26:32.900 That was in the. 187 Mindy Tran 01:26:49.980 --> 01:26:54.380 Is it no longer a part of it. Okay. 188 Mindy Tran 01:27:00.220 --> 01:27:16.220 I'll go ahead and select all and upload it for approval. So we spent a lot of time talking about postcards, but it's really the editing, the metadata for that. So let's say now that we got our postcard, a. 189 Mindy Tran 01:27:16.580 --> 01:27:36.700 And upload it to the collection. what if we have a document, say a letter, for instance, same process we already have our file organization. So all we need to do is add a compound object using the wizard again, but this time we're adding a document, so if we're adding. 190 Mindy Tran 01:27:38.020 --> 01:27:57.180 There's no hierarchy associated with this, and so that's, we're using the object type document, not defined by a tab delimited text file. Next our letter is just point the wizard to where the images for our EDG. 191 Mindy Tran 01:27:57.340 --> 01:28:17.660 Craven letter is right here. All of our images there is four pages to this letter say, okay, next we do want content DM to generate display image. Next we want to use the file names. Now my file names weren't very helpful. It just says. 192 Mindy Tran 01:28:17.940 --> 01:28:38.140 Tr- underscore, see something or other. So, in that case, I want as it's importing this letter in to change it to say, either page one or letter one up to you. I'm gonna say I'm gonna say page one, right? And begin. 193 Mindy Tran 01:28:38.220 --> 01:28:58.620 With one I have transcripts for these. I'll go ahead and browse out again to the Edgar Craven letter folder to the and select the folder text notice this time I have the text files for all four images say. Okay. 194 Mindy Tran 01:29:00.540 --> 01:29:19.100 Next, this is the path for the images. This is the path for the transcripts and so click finish. it's adding four pages. No other warning close, and there is my letter. Okay, so. 195 Mindy Tran 01:29:19.100 --> 01:29:27.900 I know what this is. I'm gonna change the title here to Edgar Craveen letter. Okay. 196 Mindy Tran 01:29:29.340 --> 01:29:49.820 If I have more documents to add go through the compound wizard, exactly like we did with the postcard to add more and then when you have all of your documents, they finish and with documents, if you have a lot of pages, it may take a lot longer to import in than postcards. 197 Mindy Tran 01:29:50.860 --> 01:29:56.020 So there's only one close and there it is. 198 Mindy Tran 01:30:01.980 --> 01:30:07.460 Notice that this is the object level. So there is no transcript associated with it. 199 Mindy Tran 01:30:09.660 --> 01:30:12.580 If I open this item up. 200 Mindy Tran 01:30:14.820 --> 01:30:30.660 Object up excuse me. So there it is. The EDGAR Craven letter at the object level, if I click the individual page level, there's the corresponding transcript for page one corresponding transcript for page two page, three. 201 Mindy Tran 01:30:32.060 --> 01:30:39.900 And you may need to fix some of this, depending on your file. 202 Mindy Tran 01:30:45.500 --> 01:30:48.820 And you can just do that. Open it up. 203 Mindy Tran 01:31:19.380 --> 01:31:22.260 If we have a subject same idea. 204 Mindy Tran 01:31:38.580 --> 01:31:48.740 And then we can view the project spreadsheet in the exact same way if you have the same subject, you can use the field all or fill down to do that. 205 Mindy Tran 01:31:50.100 --> 01:31:55.940 And if we have the tag, we want to call this. 206 Mindy Tran 01:32:03.540 --> 01:32:12.660 And feel it all, you may have other information that you want to add here. That's fine. 207 Mindy Tran 01:32:13.780 --> 01:32:22.140 If you don't save it, if it's done close it upload it for approval off, it goes. 208 Mindy Tran 01:32:27.220 --> 01:32:32.220 If you have a monograph, same. 209 Mindy Tran 01:32:33.700 --> 01:32:48.940 Same process add a compound object. Have the wizard walk you through it monograph is what we're adding this time we already have our file organized next. 210 Mindy Tran 01:32:50.900 --> 01:33:10.740 My compound object is a book called Covington in Cincinnati. So I already have the folder created with all the books here. All the chapters. I'm choosing this folder that contains any individual files as well. 211 Mindy Tran 01:33:10.980 --> 01:33:12.100 Any. 212 Mindy Tran 01:33:13.940 --> 01:33:33.780 You'll notice that under each of the chapters that's the page. Okay, that's the image file. Now, let me open up the transcript folder for you to see notice the transcript file is exactly the same as the image file only. 213 Mindy Tran 01:33:33.780 --> 01:33:47.820 The extension is different A- and it doesn't have to be the transcript file as I mentioned earlier does not have to be in any sort of structure. It can be just in one single folder. 214 Mindy Tran 01:33:50.420 --> 01:34:10.260 So we are importing first images under Covington, Cincinnati, next, yes, content DM generate display image. Now since we have spent the time to set up our FIL. 215 Mindy Tran 01:34:12.540 --> 01:34:30.740 File names as a way for navigation, we want to use the file names as titles. The other thing is that if you recall the file names contain zero prefix zero one underscore, zero two underscore, what we use that in order to force the project client. 216 Mindy Tran 01:34:31.380 --> 01:34:51.220 Import the files in a specific order, however, we want the project client once it's imported in to ignore any information before the underscore because that becomes irrelevant to the nav to our navigation structure, we use it to force the project client. 217 Mindy Tran 01:34:51.260 --> 01:35:11.700 To bring our compound object in a very specific order, the order that we want, and then once that's done, we tell it to discard it, and that's what this checking this option ignore information before underscore does, okay, we have transcripts, so we're gonna go and point it out. 218 Mindy Tran 01:35:11.860 --> 01:35:32.820 To the transcript folder for this Covington in Cincinnati Book, there it is. Okay, next those are the correct paths finished. There are six pages which is correct and then closed, there's our monograph. 219 Mindy Tran 01:35:32.820 --> 01:35:45.940 We have more monograph we walk through the same process and when we have all all of our monographs that we want to bring in to work with in this project, quick finish to begin importing them. 220 Mindy Tran 01:35:49.460 --> 01:36:08.020 Okay, here is our book. Now I'm gonna open it up and you're gonna see the structure is a little bit different. This is not the same as the document notice the hierarchy first off there is the object level metadata and then if I click. 221 Mindy Tran 01:36:08.100 --> 01:36:28.500 Chapter, if I click the plus sign here it opens up title page here it is. There's the source of it, but notice that the zero one underscore is gone from the title pit title, as well as gone from the structure title same thing with, and. 222 Mindy Tran 01:36:28.620 --> 01:36:48.980 The transcript, the correct transcript files been added to this verso, same thing, Chapter two description page, four adverts advertisement. Okay, so the structure is forcing, so these pages appear. 223 Mindy Tran 01:36:49.100 --> 01:36:57.780 Each of these chapters and the associated transcripts went with these correct images. 224 Mindy Tran 01:37:01.780 --> 01:37:11.740 Even though there's a hierarchy here if you go to the spreadsheet view, it looks exactly like a document. 225 Mindy Tran 01:37:13.940 --> 01:37:16.180 You edit in the same way. 226 Mindy Tran 01:37:19.700 --> 01:37:22.460 You enter the tag in the same way. 227 Mindy Tran 01:37:32.540 --> 01:37:34.900 Build down everything. 228 Mindy Tran 01:37:38.900 --> 01:37:40.260 I think. 229 Mindy Tran 01:37:42.300 --> 01:37:46.740 The subject is bridges and I can fill all this way. 230 Mindy Tran 01:37:49.780 --> 01:37:55.660 Save it close, it upload it for approval and indexing. 231 Mindy Tran 01:38:00.660 --> 01:38:17.380 And then to do that, we go back out to our content DM administration out on our server out here. Go to items the items tab, Make sure you're in the right collection. 232 Mindy Tran 01:38:17.940 --> 01:38:38.340 And then go to approve, you'll notice that there are four pending items which is right? And then from here I can just approve an index. All if I needed to do, right away or just approve and the indexing may happen overnight at your library, if it's already scheduled. 233 Mindy Tran 01:38:38.420 --> 01:38:56.180 Something that we talked about in session one. Okay, since I don't have anything scheduled and I wanted to show right away, I'll just go approve and index all while this is taking so long to do, I want to show you a collection that is already. 234 Mindy Tran 01:38:57.900 --> 01:39:00.260 Has these compound objects. 235 Mindy Tran 01:39:07.220 --> 01:39:27.700 So first, let's look at the postcard. Here's the paris Postcard. So this is kind of what it will look like if I open this up, this is the navigation on the right hand side here. This is the navigation, if you recall in the project client, you saw the structure on the left. This is. 236 Mindy Tran 01:39:27.740 --> 01:39:37.780 What the end user is seeing and using to navigate from image one to image two by clicking this navigation. 237 Mindy Tran 01:39:39.860 --> 01:39:59.700 The postcard image. The first image shows here and the object description. This is the object level description that describes the entire postcard front and back. What is it? That's what's showing the item description describes only this page only. 238 Mindy Tran 01:40:01.740 --> 01:40:19.580 Paris one, this image. Any, there may be descriptions about it. Some, there are text here, but no one transcribed this. So there isn't an accompanying transcript notice. You don't see a transcript now if I move to my second image, the second side. 239 Mindy Tran 01:40:20.180 --> 01:40:30.460 You will notice that there's same thing, object description item description for pairs two. Okay. 240 Mindy Tran 01:40:31.700 --> 01:40:52.180 And then there's a transcript tab here that I can expand because there is an accompanying transcript with this second side, That's why this transcript appears if an image does not have an accompanying transcript when the user navigates to that within the content. 241 Mindy Tran 01:40:52.420 --> 01:40:57.900 Viewer here there won't be a transcript field that shows up. Okay. 242 Mindy Tran 01:41:00.460 --> 01:41:15.020 And one of the things is that the full tech search, so if I user did a search like this, and it will say that there's one result found. 243 Mindy Tran 01:41:17.420 --> 01:41:29.180 Two and there is that highlight if I go to the transcript for, and it highlights the word in the transcript for this image. 244 Mindy Tran 01:41:38.860 --> 01:41:56.140 Documents if I click on that same thing as a postcard. It's really no page one letter one notice that instead of it saying LTR underscore co- something we. 245 Mindy Tran 01:41:56.140 --> 01:42:07.620 We change it to letter one or page one. So now here it is navigating to it, and because this document has. 246 Mindy Tran 01:42:10.220 --> 01:42:20.820 A transcript that companies each image doesn't matter where you navigate or the user navigates to each image has a transcript field. 247 Mindy Tran 01:42:24.340 --> 01:42:32.460 Searching full tech searching for this is the same if I search for David, it tells me it finds two. 248 Mindy Tran 01:42:34.540 --> 01:42:35.660 So. 249 Mindy Tran 01:42:40.300 --> 01:42:45.380 In this page, two here David here and David here. Okay. 250 Mindy Tran 01:42:46.740 --> 01:42:56.900 Looks very similar to postcard except that postcard can contain only two pages, whereas documents can contain more. 251 Mindy Tran 01:43:00.140 --> 01:43:02.340 Now let's look at our. 252 Mindy Tran 01:43:03.820 --> 01:43:23.820 Covington and Cincinnati. So this is what it would look like for the end user. So there is that navigation that we saw on the left hand side each, it tells the user that there's a chapter one chapter two and there's an adverts chapter, if the user clicks in it. 253 Mindy Tran 01:43:23.820 --> 01:43:35.340 And goes to the first page. It displays the first page second page and then they can expand to see chapter two and chapter three or adverts. 254 Mindy Tran 01:43:37.260 --> 01:43:50.900 However, the, it's the same, it shows the first image with the company. Transcript object description describes the entire monograph item description describes the title page. 255 Mindy Tran 01:43:52.100 --> 01:44:03.340 Two, for instance is title page Versal. So it describes the object and then the second page of that chapter. 256 Mindy Tran 01:44:12.500 --> 01:44:14.700 Any questions about this? 257 Mindy Tran 01:44:15.660 --> 01:44:36.140 So that's what you do that. What we did using the project line on the back end side. This is what the end user see on the content DM site for your libraries public content DM site and how they would view it in the content DM viewer. 258 Mindy Tran 01:44:43.180 --> 01:44:45.300 Any questions about. 259 Mindy Tran 01:44:46.420 --> 01:44:49.220 Anything that we've covered so far. 260 Mindy Tran 01:45:01.100 --> 01:45:03.620 So let's talk about PDF files. 261 Mindy Tran 01:45:10.060 --> 01:45:26.820 Now these are ideal for materials that are initially created in a digital form because these will have embedded tax and so. 262 Mindy Tran 01:45:28.620 --> 01:45:48.460 Documents like dissertations. city council minutes are good examples of these converted into PDF files if your PDF file was created from a born digital document, like a Microsoft Word, for instance, it will almo. 263 Mindy Tran 01:45:48.820 --> 01:45:52.300 Always have embedded text. Okay. 264 Mindy Tran 01:45:53.580 --> 01:46:14.060 PDF files are not efficient nor does it provide an optimal end user experience for scanned images, books, maps or newspaper because an item that has been scanned does not automatically contain embedded text if your PDF file was created from SC. 265 Mindy Tran 01:46:14.740 --> 01:46:33.820 Images it does because it doesn't have embedded text. You would, in order for embedded text to happen or you would have to take that additional step to ocr the image and then add that text to the PDF file. Okay. 266 Mindy Tran 01:46:34.540 --> 01:46:55.020 The other consider the other thing that it's not optimal with, for scanned images for PDF is that they can be very large and it's very slow for online viewing for your users. Okay, regardless of whether the PDF. 267 Mindy Tran 01:46:55.340 --> 01:47:10.100 Is a born digital or whether it came from scanned image files. You will need to have a Adobe reader installed on the computers in order to read these files. 268 Mindy Tran 01:47:15.500 --> 01:47:32.140 A single PDF file can contain many pages regardless of the number of pages it is considered a single file and it is uploaded as a single file in content DM, you can import mult. 269 Mindy Tran 01:47:32.500 --> 01:47:52.620 PDF files using, you know, like you're importing multiple simple images depending on how your collection is configured multiple page. PDF files can be added to your collection to be viewed as just a single item or have the project client converted. 270 Mindy Tran 01:47:52.700 --> 01:47:57.580 At the time that you imported to a compound object. 271 Mindy Tran 01:47:59.020 --> 01:48:19.500 When multiple page PDF files are converted to compound objects, each page of the PDF becomes a page with its own metadata record and each page is, can be navigated using the feature of the compound object viewer and we'll take a look at the difference of that on the end US. 272 Mindy Tran 01:48:19.780 --> 01:48:28.300 Side depending on how on whether it's a single file or whether you've con- want it converted into a compound object. 273 Mindy Tran 01:48:36.780 --> 01:48:47.940 Questions about that before we go out and take a look here. there isn't, once you convert your PDF files the way to bring them in. 274 Mindy Tran 01:48:48.940 --> 01:49:07.340 Is the same? So I'm gonna switch over to my project client here and before we bring in a compo, a PDF file, one of the things we want to do is set up a me say metadata template for this. 275 Mindy Tran 01:49:08.820 --> 01:49:29.260 Metadata template for this PDF. So again, we'll go to project metadata templates and for the PDF files, I'm gonna select that, and for this one, same thing I want the publisher to say OCLC training, I. 276 Mindy Tran 01:49:29.340 --> 01:49:33.580 Want the source to. 277 Mindy Tran 01:49:37.580 --> 01:49:41.500 Have the file name. I want the relation. 278 Mindy Tran 01:49:48.460 --> 01:50:08.300 Cataloger is still the same. Okay, so we just want just to make it simplify our workflow and make it more efficient for us to add materials. The other thing that I want to show you is right now we are under. 279 Mindy Tran 01:50:08.340 --> 01:50:23.980 Processing here we are not converting our multipage PDF as compound objects. We are keeping it as a single file. Okay, so, but we will revisit this in a little bit. 280 Mindy Tran 01:50:25.580 --> 01:50:46.060 So, in order to add a PDF file, you just say add item like you're adding a simple image, go out and point the project client to where your PDF file is mine is on my desktop. I have a folder. 281 Mindy Tran 01:50:46.060 --> 01:50:49.540 Of all of them, I am going to. 282 Mindy Tran 01:50:50.900 --> 01:50:56.060 The content DM basic skills, too and add it. 283 Mindy Tran 01:51:00.780 --> 01:51:06.500 Add only one item. Okay, there it is. This whole entire. 284 Mindy Tran 01:51:09.480 --> 01:51:29.840 And there's a transcript for this and this is a single item. So if I open it up, it's just one, all of my transcript for this is all in because it was created from a digital. 285 Mindy Tran 01:51:30.720 --> 01:51:50.440 From Powerpoint, for instance, so there are embedded text and so that embedded text has been extracted and put into a transcript folder form a transcript field for me and all eighteen slides that text that's on all eight. 286 Mindy Tran 01:51:52.360 --> 01:51:56.160 Been added into a single field. Okay. 287 Mindy Tran 01:51:57.520 --> 01:51:59.680 I'm just gonna add a. 288 Mindy Tran 01:52:07.760 --> 01:52:14.240 Save it close it and upload it for approval. 289 Mindy Tran 01:52:18.600 --> 01:52:34.560 If I have multiple PDF files that I want to add, I can add multiple items and I can import the entire PDF folder with everything in there. 290 Mindy Tran 01:52:36.520 --> 01:52:51.600 Okay, so that's how you can import if you have ten twenty- thirty- PDF files all in a single folder, you can import that entire folder, bring it all in and work on it. 291 Mindy Tran 01:52:53.680 --> 01:53:13.640 The project client without having to do it one at a time, but if you are bringing it in and you are bringing in as a single file, now, what, if I don't want to bring it in as a single file, what if I want to bring in a PDF file because it is so. 292 Mindy Tran 01:53:14.280 --> 01:53:18.000 For instance, or I want each. 293 Mindy Tran 01:53:20.040 --> 01:53:39.080 Page, I want that PDF file multipage PDF file to be converted to a compound object. I can have that done what I will need to do is I want to go to Project project settings for this project. I want to go to processing. 294 Mindy Tran 01:53:39.880 --> 01:53:53.920 And I want to have this project client this project, specifically convert my multipage PDF to a compound object as it's importing it in. 295 Mindy Tran 01:53:55.880 --> 01:53:58.440 That's all I need to do. 296 Mindy Tran 01:53:59.720 --> 01:54:06.760 But you need to remember that what you're processing is, and what you want to do. Maybe you may have. 297 Mindy Tran 01:54:08.040 --> 01:54:28.520 There is some guidelines or best practices that your library to say, a certain number of pages before it's converted to a compound object or some such right, say, okay, the process is the same you want to add item. You want a browse and this. 298 Mindy Tran 01:54:28.800 --> 01:54:29.920 Time. 299 Mindy Tran 01:54:31.160 --> 01:54:34.680 I want, let's say I'll bring in. 300 Mindy Tran 01:54:36.200 --> 01:54:37.800 This one. 301 Mindy Tran 01:54:39.400 --> 01:54:59.240 Open it, and I'm gonna go ahead and add it. It's gonna take a little bit longer because as it's bringing in this PDF file behind the scenes, it's also converting that PDF file into a compound object, a document type compound object. 302 Mindy Tran 01:55:01.800 --> 01:55:17.320 There is no hierarchy. It is just, you know, like a linear report or anything, but it's converting it into a compound object. However, content DM treats it as a single file. 303 Mindy Tran 01:55:22.280 --> 01:55:25.720 Okay, and it takes a long time. I happen to choose. 304 Mindy Tran 01:55:28.040 --> 01:55:38.440 A file that has seventy- something pages. I think now that I remember, but while the project client is doing that. 305 Mindy Tran 01:55:40.200 --> 01:56:00.040 I am going to go out and show you what this looks like for the end user. So think of it as a compound object document type. So as the end user, once you've add this in, if you have a. 306 Mindy Tran 01:56:00.880 --> 01:56:10.040 A PDF file that is brought in as just a single file like this geneology research guide here. 307 Mindy Tran 01:56:11.560 --> 01:56:13.960 Notice the entire. 308 Mindy Tran 01:56:17.960 --> 01:56:23.080 PDF file all of the transcript is in one. 309 Mindy Tran 01:56:24.440 --> 01:56:44.840 The whole thing is described as an item description, You the user as a user, you can't view individual pages if it is a big file of hundreds of pages in that PDF, you will have to click this to view it out open up your. 310 Mindy Tran 01:56:45.240 --> 01:57:04.680 Reader to view all of the pages outside of the content DM object viewer. Okay, and that's the way to be able to view all six pages of this PDF geneology research, however. 311 Mindy Tran 01:57:06.600 --> 01:57:25.800 If the PDF was brought in and converted to a compound object, like this African American research Guide, it will act like a document compound object. So you still as a user, get to decide if you want to view this entire. 312 Mindy Tran 01:57:27.080 --> 01:57:29.240 Out in the Adobe reader. 313 Mindy Tran 01:57:32.200 --> 01:57:33.320 Or. 314 Mindy Tran 01:57:34.800 --> 01:57:55.240 You can just view it one page at a time and not have to go anywhere else, not have to open anything specific. You can just open it one page at a time still navigate through it one page at a time here. The other reason why some libraries may decide to. 315 Mindy Tran 01:57:56.560 --> 01:58:15.120 Convert their PDF into a compound object finally is because there are how many are there, there are seventy- eight like, I remembered each page has its own transcript. 316 Mindy Tran 01:58:15.720 --> 01:58:23.560 If you have there is a field, a character limitation to the. 317 Mindy Tran 01:58:25.320 --> 01:58:45.160 Transcript field, I believe it's a hundred and twenty- eight thousand characters. So if you have a PDF, that is this large that may be more than a hundred and twenty- eight thousand characters, not all of the text will fit in one fi. 318 Mindy Tran 01:58:45.840 --> 01:59:05.640 So there may be cut off at some point so that's a consideration of why the library may decide to convert a, a big PDF file into a compound object so that each so that there isn't. 319 Mindy Tran 01:59:05.800 --> 01:59:26.120 The transcript field isn't cut off the text is not cut off because of the limitation. Okay, but it's just up to you how you bring in if you bring this in, it's like a compound object. You can view this as a spreadsheet. You can edit this, like. 320 Mindy Tran 01:59:26.200 --> 01:59:44.680 Spreadsheet, each page level will need a subject because subject is required here you will need to add the publisher. You will need to add the relation to each of these pages, but you will notice that each page has its own. 321 Mindy Tran 01:59:45.960 --> 01:59:50.320 Transcript own metadata and you can. 322 Mindy Tran 01:59:52.360 --> 01:59:56.200 And you can add metadata to each page. 323 Mindy Tran 02:00:00.080 --> 02:00:09.880 Individually subject heading subject may be different for each page, depending on what is here That will be up to you. 324 Mindy Tran 02:00:16.680 --> 02:00:18.720 Questions about this. 325 Mindy Tran 02:00:23.080 --> 02:00:36.440 I apologize for showing you the end user side first before I showed you what the project client does with a multipage PDF file since it took so long for it to import. 326 Mindy Tran 02:00:37.800 --> 02:00:58.280 But this is what you can look forward to, for multipage PDF file that's been converted to a compound object, if you change your mind from this point, any PDF that you bring in will automatically be con into this project will automatically be converted to a compound object. 327 Mindy Tran 02:00:58.320 --> 02:01:11.000 If you no longer want that to be the case, you always have to go back to that project setting processing and change it back to do, not convert. Okay. 328 Mindy Tran 02:01:24.560 --> 02:01:30.720 Questions about this about anything that we've covered in the session today. 329 Mindy Tran 02:01:35.440 --> 02:01:38.080 It's okay if you don't. 330 Mindy Tran 02:01:43.120 --> 02:01:59.760 I have many scanned Pdfs to upload without OCR. Can you bring in a scanned PDF without OCR then add the transcript to the field. Yes, if you have transcripts, you can, there is a way to do that. 331 Mindy Tran 02:02:01.680 --> 02:02:20.240 I can look for the documentation to point you through to how to do that. If not, I do have instructions, you can bring the... let me very see if you quickly show you how to do that if you have a transcript folder. 332 Mindy Tran 02:02:20.440 --> 02:02:27.120 Or transcript somewhere that you put a transcript folder. 333 Mindy Tran 02:02:28.640 --> 02:02:40.120 Yeah, so OCR won't work that may, that makes sense? Yes, you can always bring in a transcript file after the fact. So this one is not a good one. 334 Mindy Tran 02:02:41.360 --> 02:02:46.920 Let me see if I... let's just bring in an image very quickly, so I can. 335 Mindy Tran 02:02:51.600 --> 02:02:54.040 I'm just gonna add a very. 336 Mindy Tran 02:03:04.400 --> 02:03:21.040 I'm just gonna bring in an image without any transcript. So let's say you have this PDF file that's scanned when you open it up. One of the things that you can do in the transcript field is right click and say, add text file and then. 337 Mindy Tran 02:03:21.200 --> 02:03:40.440 It will ask you where does that trans browse out and point to where the transcript file exists? So that's one way of doing it, and that's doing it one at a time. Another way is to utilize your project metadata template. 338 Mindy Tran 02:03:41.520 --> 02:03:44.320 Is, so under metadata templates. 339 Mindy Tran 02:03:46.640 --> 02:04:06.480 In your PDF file template, if you click on edit in the transcript field, you can point it to import from a directory import. So if you have your PDF files in a transcript folder, somewhere you. 340 Mindy Tran 02:04:08.160 --> 02:04:20.440 Directory import and browse out to point it to that folder that contains your transcript for the scanned Pdfs. Okay. 341 Mindy Tran 02:04:25.680 --> 02:04:42.280 So that's a couple of ways to do that, but yes, to answer your question, you can bring in a trans a transcripts to the field after you've import the, the PDF, the scans, Pdfs in. Okay. 342 Mindy Tran 02:04:42.960 --> 02:04:44.760 Any other questions? 343 Mindy Tran 02:04:52.560 --> 02:04:53.880 Great question. 344 Mindy Tran 02:04:56.400 --> 02:05:15.600 You can always contact OCLC support in your region, if you have questions, once you begin to add Textra materials in compound objects in and questions, arise always know that you can contact OCLC support, if you're in US or in Canada, you can also call the tow fre. 345 Mindy Tran 02:05:15.600 --> 02:05:21.280 Number to reach one of our support team members. Okay. 346 Mindy Tran 02:05:23.920 --> 02:05:43.120 I want to thank you for spending this time with me. I know that we probably went through in a little bit more detail. It takes a little bit longer because we're adding compound objects and PDF files converted into compound objects. So things tend to take a little bit longer, especially with the index. 347 Mindy Tran 02:05:43.600 --> 02:06:02.960 So hopefully with moving around and having a collection of that already exists with these items to show you what the end user sees is helpful and makes a class move a little bit faster than waiting around for these compound objects to index. 348 Mindy Tran 02:06:04.360 --> 02:06:24.080 I'll be online here for a few more minutes if you have other questions for me. Otherwise feel free to exit out at any time and when you do, you'll be redirected to an evaluation for this class. I'd like to ask for just another minute of your time to provide us with feedback for the session. 349 Mindy Tran 02:06:24.280 --> 02:06:42.440 Very much appreciated for those of you who will be in session, three. I will see you next time for those of you who can't make it. I'll see you at other OCLC training session. Thank you again. And hope you all have a great rest of your day.