PSI:Capture Document Redaction
In this demo we'll go over the redaction capabilities of science. I capture four-point-oh. We've put a lot of focus into the redaction feature is we've just had a number of requests from service bureau partners legal customers and resellers that really like the redaction functionality especially with the overall concern in the industry about security and documents and and things like that so within the product inside capture. Obviously we're an advanced capture company. We can scan import documents. Do all different types of processing. There's a number of ways that we can perform the redaction process and we can do it manually or interactively just through drawing a box across a term that needs to be redacted so i'll go ahead and import just a generic document and give you an overview of what we can do. Let's bring in a few pages here of a tiff file. That's on the network and i'll go ahead and stop the import process and within the capture module. I have the capability to perform redaction. But what i'm going to do is is continue in the capture workflow and I've set up a quality assurance step now in the quality assurance view. I have a few things I can do. I can go ahead and open up my tree within my document and I have a visual representation of the entire document and then I can go ahead and throw it into thumbnail view and this is our document preparation interface. A lot of legal entities folks. Who want to do redaction utilize this to split documents. Insert replace pages but within this interface we also have the ability to do a manual redaction process. So i can go ahead and use my redaction toolbar to come in cover a particular term. It'll go ahead and preview it and show me the exact coverage that will have of the redaction and then i can choose from redacting to black white or white with a black border so. I can come in choose what I'd like to do the border thickness click OK. And then you'll see that it gives me a nice redaction in this case white box with a black border ok so from an interactive perspective.
Very simple very easy to to work with the product on in the Quality Assurance module. We've also got some auto processing capabilities so if we wanted to go ahead and process. OCR the document and then search for particular terms to redact. We can do that as well. And within the auto redaction interface. I can go ahead and choose which pages of the document. I'd like to do whether I want to do all pages whether i want to particular pick a particular section of the page to look for specific terms we have. OCR engines that we can choose from depending on how you want to process the documents and then I can come in and add my expression so in this example. I'm going to go ahead and look for the word river. This is a Mark Twain book that I imported so there's going to be a few mentions of that and utilizing what we call a regular expression. I'm having it look for a capitalized river or lowercase river. Okay and I can use all different types of combinations go through. I can even list multiple expressions. I can build a library okay in which i can store these so if you had particular ones that you worked on with in a case like maybe you wanted to look for a pattern of a social security you can build the centralized library that will look for email addresses. Phone numbers socials. All different types of information that you can read act okay so in this example. I went ahead and put my information entered it. And now i can. I can come in choose how. I want to redact click. Save and then process. Now this is going to take a minute because it's going to go through it's going to convert the document into a searchable document and then it's going to go through and search for those patterns i entered which are lowercase and uppercase river. Okay so what's going through the process. Now we do have some other options. One of the new features in 40 we just added is redaction based on a field. Okay so now what we have.
The ability to do is utilize index fields which can be either manually entered or auto populated to perform the redaction process and. I'll show you a bit of that at the end of the demo here. But it's chugging through these pages doing its conversion searching for the words and note that this process can be fully automated as well so what. I'll do when I'm done here is actually show you how an automated process can work so now if I open up my thumbnail view and I come into my document you can see that in my automated process. I had it go ahead and look for for river and I utilized the black box redaction just for a differentiator so really nice. Seamless tool goes through. Nice tight redaction and it makes it really easy for you to do that. That conversion and review the document to make sure that that everything was redacted that needed to be redacted. Okay now this is expression based redaction we also have in the product what we call a form or an overlay redaction. So let's say that you had 10,000 medical forms that always have the social in the upper right corner and maybe a tax. ID in the lower left corner you can actually perform the redaction process using zones so you can come in. You can preload a document and then draw where you want the redaction to occur. So i'll go ahead and navigate. Oops let's go ahead and navigate into my folder here and i'll bring up a sample. Go ahead and grab one of my generic documents and i'll go ahead and import this item and then what i can do if if there's a repetitive area that I want blacked out. I can go ahead and draw the area that I'd like blacked out. I can give it a name. Let's say upper right. Go ahead and save it and then. I can run it on all my pages and in this case we'll go ahead and redact two white with a black border so you can see what it looks like and then I'll click process and there you go so now if we take a look at our thumbnail you can see that all the pages now have that redacted zone okay so three ways.
I've shown you interactive redaction with the toolbar automated redaction where we search the text and look for specific patterns. And then we've got the ability to do a zone or a template redaction when you've got information that occurs the same location in the page. Alright so let's go ahead and necks out of this. I've shown you interactive options here and and note when I click done or close batch. I have the ability to output redacted and non redacted copy to any of the 50 back-end systems that we support everything from sharepoint to world. Docs to laserfiche you. Name it we can even just kick them out to network folder okay so a lot of different options that you have with the product now. Along with being able to run site capture interactively during the redaction process. We also have the ability to run it in an automated fashion. And let's go ahead and grab that same document. Go ahead and grab that. Mark Twain book and grab my generics here. I'll go ahead and copy it and then i'm going to drop it into what we call an auto import folder so now either at the desktop or from a scanning copier and maybe a fax drop directory. I have the ability to drop in a file and the application will watch this folder. Pick up any image file that gets dropped in an auto process process it from a redaction perspective so you can see that the product is cute. It up pulled it into its system and it's starting to search the document. OCR it and then in this example. Okay you can see it's gone into the workflow and let's go ahead and look at at my output and. I perform this before. I won't have your weight that that file I dropped in there was fairly big so what I'll do is is come into an output directory that I've set up and I've got my law firm directory and you can see here that I've got redacted an original copy ok so I can output non redacted and redacted copy to any of the backend systems that we support so it gives you a great way to to have both that private and public document that you'd like to share within a folder structure or a back-end document management system that you'd like to use now.
The final redaction methodology we can use is what's called index field based redaction. Let's go ahead and import the same document and then. I'll show you how we can have an operator or an end user. Actually enter information into fields for very simple redaction workflow. Okay so let's go ahead and bring in this same document and we'll just bring in a few pages here now. I've added a workflow step here. Called an index step and typically an index step is utilized to enter information about the document and then migrated into the backend system so you have searchable fields or you can do custom file naming in this particular example. You can see that. I've set up these fields as an interactive redaction field so I can type the word river maybe river with with lowercase and then maybe we look for the word mark okay now when I move into my key way I've set it up to automatically take those terms go through the document and then automatically redact all right now interactively this works but we also have the ability need to populate those fields in a number of ways okay we could do a database lookup and bring back the terms that need to be redacted. Okay we could throw a barcode cover sheet on here with the terms that need to be redacted scan it and perform an automated process all right so if you look here you can see that we've automatically redacted again and we've got all our Terms blacked out so this is just a basic primer. Note that you can take this redaction functionality to too much higher. Heights everything from dropping in a CSV file with the document that contains the redaction terms. You need which really lends itself nicely to integrating with other applications. So if you have any questions about side capture and how we can help you with redaction and document confidentiality. Go ahead and send a note to sales @ seijin com or you can call 949 916 7700 extension 230 thanks.
Very simple very easy to to work with the product on in the Quality Assurance module. We've also got some auto processing capabilities so if we wanted to go ahead and process. OCR the document and then search for particular terms to redact. We can do that as well. And within the auto redaction interface. I can go ahead and choose which pages of the document. I'd like to do whether I want to do all pages whether i want to particular pick a particular section of the page to look for specific terms we have. OCR engines that we can choose from depending on how you want to process the documents and then I can come in and add my expression so in this example. I'm going to go ahead and look for the word river. This is a Mark Twain book that I imported so there's going to be a few mentions of that and utilizing what we call a regular expression. I'm having it look for a capitalized river or lowercase river. Okay and I can use all different types of combinations go through. I can even list multiple expressions. I can build a library okay in which i can store these so if you had particular ones that you worked on with in a case like maybe you wanted to look for a pattern of a social security you can build the centralized library that will look for email addresses. Phone numbers socials. All different types of information that you can read act okay so in this example. I went ahead and put my information entered it. And now i can. I can come in choose how. I want to redact click. Save and then process. Now this is going to take a minute because it's going to go through it's going to convert the document into a searchable document and then it's going to go through and search for those patterns i entered which are lowercase and uppercase river. Okay so what's going through the process. Now we do have some other options. One of the new features in 40 we just added is redaction based on a field. Okay so now what we have.
The ability to do is utilize index fields which can be either manually entered or auto populated to perform the redaction process and. I'll show you a bit of that at the end of the demo here. But it's chugging through these pages doing its conversion searching for the words and note that this process can be fully automated as well so what. I'll do when I'm done here is actually show you how an automated process can work so now if I open up my thumbnail view and I come into my document you can see that in my automated process. I had it go ahead and look for for river and I utilized the black box redaction just for a differentiator so really nice. Seamless tool goes through. Nice tight redaction and it makes it really easy for you to do that. That conversion and review the document to make sure that that everything was redacted that needed to be redacted. Okay now this is expression based redaction we also have in the product what we call a form or an overlay redaction. So let's say that you had 10,000 medical forms that always have the social in the upper right corner and maybe a tax. ID in the lower left corner you can actually perform the redaction process using zones so you can come in. You can preload a document and then draw where you want the redaction to occur. So i'll go ahead and navigate. Oops let's go ahead and navigate into my folder here and i'll bring up a sample. Go ahead and grab one of my generic documents and i'll go ahead and import this item and then what i can do if if there's a repetitive area that I want blacked out. I can go ahead and draw the area that I'd like blacked out. I can give it a name. Let's say upper right. Go ahead and save it and then. I can run it on all my pages and in this case we'll go ahead and redact two white with a black border so you can see what it looks like and then I'll click process and there you go so now if we take a look at our thumbnail you can see that all the pages now have that redacted zone okay so three ways.
I've shown you interactive redaction with the toolbar automated redaction where we search the text and look for specific patterns. And then we've got the ability to do a zone or a template redaction when you've got information that occurs the same location in the page. Alright so let's go ahead and necks out of this. I've shown you interactive options here and and note when I click done or close batch. I have the ability to output redacted and non redacted copy to any of the 50 back-end systems that we support everything from sharepoint to world. Docs to laserfiche you. Name it we can even just kick them out to network folder okay so a lot of different options that you have with the product now. Along with being able to run site capture interactively during the redaction process. We also have the ability to run it in an automated fashion. And let's go ahead and grab that same document. Go ahead and grab that. Mark Twain book and grab my generics here. I'll go ahead and copy it and then i'm going to drop it into what we call an auto import folder so now either at the desktop or from a scanning copier and maybe a fax drop directory. I have the ability to drop in a file and the application will watch this folder. Pick up any image file that gets dropped in an auto process process it from a redaction perspective so you can see that the product is cute. It up pulled it into its system and it's starting to search the document. OCR it and then in this example. Okay you can see it's gone into the workflow and let's go ahead and look at at my output and. I perform this before. I won't have your weight that that file I dropped in there was fairly big so what I'll do is is come into an output directory that I've set up and I've got my law firm directory and you can see here that I've got redacted an original copy ok so I can output non redacted and redacted copy to any of the backend systems that we support so it gives you a great way to to have both that private and public document that you'd like to share within a folder structure or a back-end document management system that you'd like to use now.
The final redaction methodology we can use is what's called index field based redaction. Let's go ahead and import the same document and then. I'll show you how we can have an operator or an end user. Actually enter information into fields for very simple redaction workflow. Okay so let's go ahead and bring in this same document and we'll just bring in a few pages here now. I've added a workflow step here. Called an index step and typically an index step is utilized to enter information about the document and then migrated into the backend system so you have searchable fields or you can do custom file naming in this particular example. You can see that. I've set up these fields as an interactive redaction field so I can type the word river maybe river with with lowercase and then maybe we look for the word mark okay now when I move into my key way I've set it up to automatically take those terms go through the document and then automatically redact all right now interactively this works but we also have the ability need to populate those fields in a number of ways okay we could do a database lookup and bring back the terms that need to be redacted. Okay we could throw a barcode cover sheet on here with the terms that need to be redacted scan it and perform an automated process all right so if you look here you can see that we've automatically redacted again and we've got all our Terms blacked out so this is just a basic primer. Note that you can take this redaction functionality to too much higher. Heights everything from dropping in a CSV file with the document that contains the redaction terms. You need which really lends itself nicely to integrating with other applications. So if you have any questions about side capture and how we can help you with redaction and document confidentiality. Go ahead and send a note to sales @ seijin com or you can call 949 916 7700 extension 230 thanks.