
OCR Data Types

This page describes the object types defined and used by the OCR activities.

BoundingRect

Properties

| Property | Type | Description |
| --- | --- | --- |
| X1 | Int32 | X-coordinate of the top-left corner |
| Y1 | Int32 | Y-coordinate of the top-left corner |
| X2 | Int32 | X-coordinate of the bottom-right corner |
| Y2 | Int32 | Y-coordinate of the bottom-right corner |
| Width | Int32 | Width of the bounding box. Either (X2, Y2) or Width and Height are required to define the bounding box. |
| Height | Int32 | Height of the bounding box. Either (X2, Y2) or Width and Height are required to define the bounding box. |
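
The either/or rule for the bottom-right corner can be sketched as follows. This is a minimal illustration, not the activity's own implementation: `BoundingRectSketch` and `Resolve` are hypothetical names, and the sketch assumes that Width equals X2 minus X1 and Height equals Y2 minus Y1 (the product may use a different convention, such as inclusive pixel counts). It uses top-level statements and a record struct, so it needs C# 10 or later.

```csharp
using System;

// Usage: both calls describe the same 100 x 40 box.
var fromCorners = BoundingRectSketch.Resolve(10, 20, x2: 110, y2: 60, width: null, height: null);
var fromSize = BoundingRectSketch.Resolve(10, 20, x2: null, y2: null, width: 100, height: 40);
Console.WriteLine(fromCorners == fromSize);

// Hypothetical stand-in for BoundingRect; not the type shipped with the OCR activities.
public readonly record struct BoundingRectSketch(int X1, int Y1, int X2, int Y2, int Width, int Height)
{
    // Fills in whichever half of the definition was not supplied.
    // Assumption: Width = X2 - X1 and Height = Y2 - Y1.
    public static BoundingRectSketch Resolve(int x1, int y1, int? x2, int? y2, int? width, int? height)
    {
        if (x2.HasValue && y2.HasValue)
            return new BoundingRectSketch(x1, y1, x2.Value, y2.Value, x2.Value - x1, y2.Value - y1);

        if (width.HasValue && height.HasValue)
            return new BoundingRectSketch(x1, y1, x1 + width.Value, y1 + height.Value, width.Value, height.Value);

        throw new ArgumentException("Provide either (x2, y2) or (width, height).");
    }
}
```

Resolving both forms to one fully populated value keeps later code from having to check which pair of properties was set.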

TextResult

Properties

| Property | Type | Description |
| --- | --- | --- |
| Label | String | Display name of the region |
| Text | String | Data extracted from the input region |
| Description | String | Description of the region |
| Confidence | Float | Confidence value of the extracted data |
| Rectangle | BoundingRect | Bounding box of the region |
| DataType | TypeCode | Data type of the extracted data |
| Format | String | Regular expression used to format the data |
| OcrParameters | OcrParameters | OCR parameter properties that control the extraction capabilities of the OCR engine |
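
As an illustration of how the Confidence and Format properties might be used together, the sketch below keeps only results that meet a minimum confidence and whose Text matches the Format pattern. Several things here are assumptions: `TextResultSketch` and `TextResultChecks` are hypothetical names, Confidence is assumed to be on a 0 to 1 scale, Format is treated as a validation pattern (one plausible reading of the description), and the sample values are invented for the example.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;

// Invented sample data, for illustration only.
var results = new List<TextResultSketch>
{
    new("InvoiceNo", "INV-00123", 0.97f, @"^INV-\d+$"),
    new("Total", "1O4.50", 0.58f, @"^\d+\.\d{2}$"), // letter O misread for zero
};

foreach (var r in TextResultChecks.Accepted(results, minConfidence: 0.80f))
    Console.WriteLine($"{r.Label}: {r.Text}");

// Hypothetical, trimmed-down stand-in for TextResult.
public record TextResultSketch(string Label, string Text, float Confidence, string Format);

public static class TextResultChecks
{
    // Keeps results at or above minConfidence whose Text matches Format.
    // An empty Format is treated as "no pattern to check" (an assumption).
    public static IEnumerable<TextResultSketch> Accepted(
        IEnumerable<TextResultSketch> results, float minConfidence) =>
        results.Where(r => r.Confidence >= minConfidence
                        && (string.IsNullOrEmpty(r.Format) || Regex.IsMatch(r.Text, r.Format)));
}
```

With the sample data, only the InvoiceNo entry passes both checks.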

DocumentQuery

Properties

| Property | Type | Description |
| --- | --- | --- |
| Name | String | Name of the document for which the document query is created |
| ImageResizePercentage | Int32 | Percentage by which to rescale the image |
| DocumentSelectionParams | String | Keyword used to choose the document |
| PageQueries | List of PageQuery | List of PageQuery objects with the page number, clip region, and other information on the extracted data |

PageQuery

Properties

| Property | Type | Description |
| --- | --- | --- |
| PageNumber | Int32 | Page number to which the extracted data belongs |
| TitlePatterns | List of String | List of title patterns on the page |
| Tolerance | FuzzyStringComparisionTolerance | Tolerance used when fuzzy-matching the search string against text on the page |
| CustomPercentMatch | Double | Matching percentage value |
| TitleClippingRegion | BoundingBox | Clipping region of the title |
| MatchAllTitlePattern | bool | Whether all of the specified title patterns must match |

PageInfo

Properties

| Property | Type | Description |
| --- | --- | --- |
| ImagePath | String | Path of the image |
| PageNumber | Int32 | Page number |
| Header | String | Header of the page |
| FileName | String | File name of the page |
| List of TextResult | String | List of TextResult objects containing the text and other information about the data extracted from the page |
| Description | String | Detailed description of the page |
| FilePath | String | File path of the rasterized image |
| ImageResizePercentage | Int32 | Percentage by which to rescale the image |

TypeCode

enum TypeCode
{
    Empty = 0,
    Object = 1,
    DBNull = 2,
    Boolean = 3,
    Char = 4,
    SByte = 5,
    Byte = 6,
    Int16 = 7,
    UInt16 = 8,
    Int32 = 9,
    UInt32 = 10,
    Int64 = 11,
    UInt64 = 12,
    Single = 13,
    Double = 14,
    Decimal = 15,
    DateTime = 16,
    String = 18
}
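
The members and numeric values listed above match the .NET `System.TypeCode` enumeration, including the unused value 17. Assuming DataType is that .NET type, extracted text could be converted to a typed value with `Convert.ChangeType`, as in this minimal sketch. `TypedText.Parse` is a hypothetical helper, and the invariant culture is an assumption; OCR output in another locale would need a matching `CultureInfo`.

```csharp
using System;
using System.Globalization;

object amount = TypedText.Parse("104.50", TypeCode.Decimal);
object count = TypedText.Parse("42", TypeCode.Int32);
Console.WriteLine($"{amount.GetType().Name}, {count.GetType().Name}");

// Hypothetical helper; not part of the OCR activities.
public static class TypedText
{
    // Converts OCR text to the .NET type named by dataType.
    // Throws FormatException when the text cannot be parsed as that type.
    public static object Parse(string text, TypeCode dataType) =>
        Convert.ChangeType(text, dataType, CultureInfo.InvariantCulture);
}
```

Because OCR text often contains misread characters, callers would typically wrap the conversion in a try/catch for `FormatException`.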

FuzzyStringComparisionTolerance

enum FuzzyStringComparisionTolerance
{
    Strong = 0,
    Normal = 1,
    Weak = 2,
    Custom = 3
}
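
To show how a tolerance level could drive title matching, the sketch below maps each level to a minimum similarity percentage and compares two strings by Levenshtein edit distance. It is a hedged illustration only: the thresholds (90, 75, 50) are invented, the use of CustomPercentMatch for the Custom level is an assumption, the similarity measure is a stand-in for whatever the OCR activities actually use, and `TitleMatcher` is a hypothetical name.

```csharp
using System;

Console.WriteLine(TitleMatcher.IsMatch("Invoice", "Invo1ce", FuzzyStringComparisionTolerance.Normal));

enum FuzzyStringComparisionTolerance { Strong = 0, Normal = 1, Weak = 2, Custom = 3 }

// Hypothetical helper; not part of the OCR activities.
static class TitleMatcher
{
    // Invented thresholds, in percent. Custom defers to the caller's value.
    static double Threshold(FuzzyStringComparisionTolerance tolerance, double customPercentMatch) =>
        tolerance switch
        {
            FuzzyStringComparisionTolerance.Strong => 90,
            FuzzyStringComparisionTolerance.Normal => 75,
            FuzzyStringComparisionTolerance.Weak => 50,
            FuzzyStringComparisionTolerance.Custom => customPercentMatch,
            _ => throw new ArgumentOutOfRangeException(nameof(tolerance)),
        };

    // Similarity as 100 * (1 - editDistance / longerLength).
    public static double SimilarityPercent(string a, string b)
    {
        if (a.Length == 0 && b.Length == 0) return 100;

        // Two-row Levenshtein distance.
        var prev = new int[b.Length + 1];
        var curr = new int[b.Length + 1];
        for (int j = 0; j <= b.Length; j++) prev[j] = j;

        for (int i = 1; i <= a.Length; i++)
        {
            curr[0] = i;
            for (int j = 1; j <= b.Length; j++)
            {
                int cost = a[i - 1] == b[j - 1] ? 0 : 1;
                curr[j] = Math.Min(Math.Min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            (prev, curr) = (curr, prev);
        }

        int distance = prev[b.Length];
        return 100.0 * (1 - (double)distance / Math.Max(a.Length, b.Length));
    }

    public static bool IsMatch(string pattern, string candidate,
        FuzzyStringComparisionTolerance tolerance, double customPercentMatch = 0) =>
        SimilarityPercent(pattern, candidate) >= Threshold(tolerance, customPercentMatch);
}
```

In this sketch "Invoice" and "Invo1ce" differ by one substitution in seven characters, roughly 85.7 percent similar, so the pair matches under the assumed Normal threshold but not under the assumed Strong one.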