Extract Text From PDF
Description
This activity extracts the text from a specified PDF file.
Design Time Configuration
At design time, you can configure the following properties:
- PDF File Path – The PDF file path from where you want to extract the text.
- Password – The password of the PDF file, if necessary
- Page(s) to read – Use the options below to read the text within one or more pages.
Option | Description |
---|---|
All | Choose this option to read the text from all the page(s). |
Single | Choose this option to read the text from the specified page. |
Range | Choose this option to read the text from the specified page(s) range. |
- From page - This option is visible when you choose Range in Page(s) to read. Specify the initial page to initiate the text's reading.
- To page - This option is visible when you choose Range in Page(s) to read. Specify the concluding page number within the range for the text's reading.
- Page number - This option is visible when you choose Single in Page(s) to read. Specify the specific page number for reading the text.
- Result - Returns the extracted text as a string from the specified PDF file.
Properties
Input
- Password—Specify the file's password if password protected.
Misc
- DisplayName – Add a display name to your activity.
- Private – By default, activity will log the values of your properties inside your workflow. If private is selected, then it stops logging.
Optional
- Continue On Error – It Specifies whether the automation should continue even when the activity throws an error. If True, the activity continues without throwing any exceptions. If False, the activity throws an exception. The default value is False.
note
Catches no error if this activity is present inside the Try-Catch block and the value of this property is True.
Example
Download Example