Textract

 
* [[Character Recognition]]
 
 
* [[Image Retrieval / Object Detection]]
 
* [[Natural Language Processing (NLP)]]
 
* [http://aws.amazon.com/textract/ Textract | Amazon]
 
  

Revision as of 06:17, 9 July 2019


Amazon Textract is a service that automatically extracts text and data from scanned documents. It also identifies the contents of fields in forms and the information stored in tables.
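
To make the form-field extraction concrete, here is a minimal sketch of how key-value pairs might be read out of a Textract-style response. It assumes the response shape returned by the `analyze_document` call with `FeatureTypes=["FORMS"]`: a list of `Blocks`, where `KEY_VALUE_SET` blocks link to their value through a `VALUE` relationship and to their words through `CHILD` relationships. The helper names `block_text` and `extract_key_values` and the `SAMPLE_RESPONSE` data are hypothetical and hand-written for illustration; they are not real service output.

```python
# Sketch: pair form keys with their values in a Textract-style response.
# SAMPLE_RESPONSE is hand-written illustrative data, not real service output.


def block_text(block, blocks_by_id):
    """Join the text of the WORD blocks listed as CHILD of `block`."""
    words = []
    for rel in block.get("Relationships", []):
        if rel["Type"] == "CHILD":
            for child_id in rel["Ids"]:
                child = blocks_by_id[child_id]
                if child["BlockType"] == "WORD":
                    words.append(child["Text"])
    return " ".join(words)


def extract_key_values(response):
    """Return {key text: value text} for every KEY block in the response."""
    blocks_by_id = {b["Id"]: b for b in response["Blocks"]}
    pairs = {}
    for block in response["Blocks"]:
        is_key = (block["BlockType"] == "KEY_VALUE_SET"
                  and "KEY" in block.get("EntityTypes", []))
        if not is_key:
            continue
        key = block_text(block, blocks_by_id)
        value_parts = []
        # A KEY block points at its VALUE block(s) via a VALUE relationship.
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                for value_id in rel["Ids"]:
                    value_parts.append(
                        block_text(blocks_by_id[value_id], blocks_by_id))
        pairs[key] = " ".join(value_parts)
    return pairs


SAMPLE_RESPONSE = {"Blocks": [
    {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
     "Relationships": [{"Type": "VALUE", "Ids": ["v1"]},
                       {"Type": "CHILD", "Ids": ["w1", "w2"]}]},
    {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
     "Relationships": [{"Type": "CHILD", "Ids": ["w3", "w4"]}]},
    {"Id": "w1", "BlockType": "WORD", "Text": "Applicant"},
    {"Id": "w2", "BlockType": "WORD", "Text": "Name:"},
    {"Id": "w3", "BlockType": "WORD", "Text": "John"},
    {"Id": "w4", "BlockType": "WORD", "Text": "Doe"},
]}

print(extract_key_values(SAMPLE_RESPONSE))  # {'Applicant Name:': 'John Doe'}
```

In a real workflow the `response` argument would come from the service rather than from a hand-built dictionary; the parsing logic is kept separate so it can be tried without AWS credentials.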

  1. Create smart search indexes - Extract structured data from documents and create a smart index to allow you to search through millions of financial statements quickly. For example, a mortgage company could use Amazon Textract to process millions of scanned loan applications in a matter of hours and have the extracted data indexed in Amazon Elasticsearch. This would allow them to create search experiences like “search for loan applications where applicant name is John Doe,” or “search contracts where the interest rate is 2 percent.”
  2. Build automated document processing workflows - Amazon Textract can provide the inputs required to automatically process forms without human intervention. For example, banks can automate loan applications using Amazon Textract. The information contained in the document could be used to initiate all of the background and credit checks needed to approve the loan, so that customers get an instant decision on their application rather than waiting several days for manual review and validation.
  3. Maintain compliance in document archives - Because Amazon Textract identifies data types and form labels automatically, it’s easy to maintain compliance with information controls. For example, an insurer could use Amazon Textract to feed a workflow that automatically recognizes the key-value pairs that require protection and redacts personally identifiable information (PII) for review before claim forms are archived.
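
The search and redaction use cases above can be sketched in a few lines, assuming the form fields have already been extracted into plain key-value dictionaries. Everything here is hypothetical and for illustration only: the `PII_KEYS` set, the `normalize`, `redact`, and `search` helpers, and the sample documents. The in-memory linear scan is a stand-in for a real search index such as Amazon Elasticsearch, not a substitute for one.

```python
# Sketch: search extracted form fields and redact PII before archiving.
# PII_KEYS and the sample documents are hypothetical illustrative data.

PII_KEYS = {"applicant name", "ssn"}


def normalize(key):
    """Lower-case a form label and drop a trailing colon and whitespace."""
    return key.strip().rstrip(":").strip().lower()


def redact(fields, pii_keys=PII_KEYS):
    """Return a copy of `fields` with the values of PII keys masked."""
    return {k: ("[REDACTED]" if normalize(k) in pii_keys else v)
            for k, v in fields.items()}


def search(documents, key, value):
    """Return ids of documents whose field `key` equals `value` (case-insensitive)."""
    wanted_key = normalize(key)
    wanted_value = value.strip().lower()
    hits = []
    for doc_id, fields in documents.items():
        for k, v in fields.items():
            if normalize(k) == wanted_key and v.strip().lower() == wanted_value:
                hits.append(doc_id)
                break
    return hits


documents = {
    "loan-001": {"Applicant Name:": "John Doe", "Interest Rate:": "2%"},
    "loan-002": {"Applicant Name:": "Jane Roe", "Interest Rate:": "3%"},
}

# "Search for loan applications where applicant name is John Doe."
print(search(documents, "applicant name", "john doe"))  # ['loan-001']

# Mask protected fields before the document is archived.
print(redact(documents["loan-001"]))
# {'Applicant Name:': '[REDACTED]', 'Interest Rate:': '2%'}
```

Normalizing labels before comparing them is a deliberate choice: scanned forms rarely agree on capitalization or trailing punctuation, so matching on the raw label text would miss fields that a reviewer would consider identical.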