Skip to main content

Extract Text From PDF

Description

This activity extracts the text from a specified PDF file.

Design Time Configuration

At design time, you can configure the following properties:

  • PDF File Path – The PDF file path from where you want to extract the text.
  • Password – The password of the PDF file, if necessary
  • Page(s) to read – Use the options below to read the text within one or more pages.
OptionDescription
All    Choose this option to read the text from all the page(s).
SingleChoose this option to read the text from the specified page.
Range  Choose this option to read the text from the specified page(s) range.
  • From page - This option is visible when you choose Range in Page(s) to read. Specify the initial page to initiate the text's reading.
  • To page - This option is visible when you choose Range in Page(s) to read. Specify the concluding page number within the range for the text's reading.
  • Page number - This option is visible when you choose Single in Page(s) to read. Specify the specific page number for reading the text.
  • Result -  Returns the extracted text as a string from the specified PDF file.

Properties

Input

  • Password—Specify the file's password if password protected.

Misc

  • DisplayName – Add a display name to your activity.
  • Private – By default, activity will log the values of your properties inside your workflow. If private is selected, then it stops logging.

Optional

  • Continue On Error – It Specifies whether the automation should continue even when the activity throws an error. If True, the activity continues without throwing any exceptions. If False, the activity throws an exception. The default value is False.
note

Catches no error if this activity is present inside the Try-Catch block and the value of this property is True.

Example

Download Example

Correlation Scope