Machine Learning Data Types
This page describes all the Object types defined and used by Machine Learning Activities.
DateParserInfo
Properties
Property | Type | Description |
---|---|---|
Date | DateTime | The parsed and modified Date value |
Confidence | Double | The confidence of the model in the range 0 to 1 |
NumberParserInfo
Properties
Property | Type | Description |
---|---|---|
Number | long | The parsed and modified Numeric value |
Confidence | Double | The confidence of the model in the range 0 to 1 |
DocumentInfo
Properties
Property | Type | Description |
---|---|---|
DocumentTypes | Dictionary<String, Float> | List of Document Types and the corresponding confidence |
Domain | String | The domain to which this document belongs |
SentimentInfo
Properties
Property | Type | Description |
---|---|---|
Sentiment | String | Contains one of the following values: Positive, Negative, or Neutral. |
Text | String | The input text. |
Confidence | Double | The confidence of the model in the range 0 to 1 |
LanguageDetectionResult
It represents language detection results obtained from cognitive services.
Properties
Property | Type | Description |
---|---|---|
Language | String | Language name. |
LanguageCode | String | Shortcode of detected Language in ISO 639-1 format. Refer to List of ISO 639-1 codes |
ConfidenceScore | Int32 | Score between 0 and 100, indicating the confidence for detecting this language. |
Entity
It represents an entity identified by cognitive services.
Properties
Property | Type | Description |
---|---|---|
Text | String | Text of the entity. |
ConfidenceScore | Int32 | Score between 0 to 100, indicating the confidence for detecting this entity. |
Length | Int32 | Length of the entity text. |
Offset | Int32 | Starting position of the entity text within the document text. |
NamedEntity
It represents a named entity identified by cognitive services.
Properties
Property | Type | Description |
---|---|---|
Text | String | Text of the entity. |
Category | String | Category of the identified entity. |
SubCategory | String | Subcategory of the entity. |
ConfidenceScore | Int32 | Score between 0 to 100, indicating the confidence for detecting this entity. |
Length | Int32 | Length of the entity text. |
Offset | Int32 | Starting position of the entity text within the document text. |
BoundingBox
Represents an abstract rectangle that surrounds elements on a document or form. It is used as a reference point for object detection.
Each point coordinate represents a corner of the box. For Image files, coordinates are in pixels. For PDF files, coordinates are in inches
Properties
Property | Type | Description |
---|---|---|
TopLeft | PointF | Contains top left coordinates of the bounding box. |
TopRight | PointF | Contains top right coordinates of the bounding box. |
BottomLeft | PointF | Contains bottom left coordinates of the bounding box. |
BottomRight | PointF | Contains bottom right coordinates of the bounding box. |
CheckMark
Represents a check mark (selection mark) detected in a form
Properties
Property | Type | Description |
---|---|---|
Text | String | Text associated with the check box (if any) |
ConfidenceScore | Float | Confidence score of this detection in the range of 0 to 100 |
PageNumber | Integer | Page number of detected check mark |
CheckState | Boolean | Indicates whether the SelectionMark was selected or unselected |
Box | BoundingBox | Box coordinates of the check mark detection |
FormText
Represents a word inside a line or cell element inside a table cell
Properties
Property | Type | Description |
---|---|---|
PageNumber | Integer | Page number of element |
ConfidenceScore | Float | Confidence score of the element detection in range 0 to 100 |
Text | String | Text detected in the element |
Box | BoundingBox | Box coordinates of the element |
FormTableCell
Represents a detected table cell in the form
Properties
Property | Type | Description |
---|---|---|
Text | String | Text in the table cell |
ConfidenceScore | Float | Confidence score of the table cell detection in range 0 to 100 |
Box | BoundingBox | Box coordinates of the table cell |
PageNumber | Integer | Page number of detected table cell |
ColumnIndex | Integer | Index of the column containing the table cell |
RowIndex | Integer | Index of the row containing the table cell |
ColumnSpan | Integer | Column span of the table cell |
RowSpan | Integer | Row span of the table cell |
HeaderCell | Boolean | Indicates whether the table cell is in header row |
FooterCell | Boolean | Indicates whether the table cell is in footer row |
CellElements | FormText | Listing of elements inside the table cell |
FormTable
Represents a detected table in a form
Properties
Property | Type | Description |
---|---|---|
PageNumber | Integer | Page number of detected table |
Box | BoundingBox | Box coordinates of the table |
ConfidenceScore | Float | Confidence score of the table detection in range 0 to 100 |
RowCount | Integer | Number of rows in the table |
ColumnCount | Integer | Number of columns in the table |
Cells | FormTableCell | Listing of all cells inside the table |
FormLine
Represents a detected line of text inside a form
Properties
Property | Type | Description |
---|---|---|
Handwritten | Boolean | Indicates whether the line was handwritten or not |
Words | FormText | Listing of words inside the line |
PageNumber | Integer | Page number of the detected line |
ConfidenceScore | Float | Confidence score of the line detection in the range between 0 to 100 |
Text | String | Detected text in the line |
Box | BoundingBox | Box coordinates of the line |
FormPage
Represents a page of a form
Properties
Property | Type | Description |
---|---|---|
Height | Float | Height of the detected page |
Width | Float | Width of the detected page |
PageNumber | Integer | Page number of the detected form page |
TextAngle | Float | Orientation of detected text in degrees. Value in the range between -180 to 180 |
Lines | FormLine | Listing of lines detected in the page |
Tables | FormTable | Listing of tables detected in the page |
CheckMarks | CheckMark | Listing of check marks (selection marks) detected on the page |
FormField
Represents a field inside the form
Properties
Property | Type | Description |
---|---|---|
ConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form field |
Label | String | Label text of the detected field |
LabelBox | BoundingBox | Box coordinates of the field label |
LabelPageNumber | Integer | Page number of the field label |
LabelElements | FormText | Elements of the field label |
Value | Object | Value of the identified field. Cast to its appropriate type |
ValueBox | BoundingBox | Box coordinates of the field value |
ValuePageNumber | Integer | Page number of the field value |
ValueElements | FormText | Elements of the field value |
ValueText | String | Textual value of the identified field |
HasValue | Boolean | Indicates whether a value is available for this form field |
FormFieldValue
Represents a field inside the form
Properties
Property | Type | Description |
---|---|---|
ConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form field |
Label | String | Label text of the detected field |
LabelBox | BoundingBox | Box coordinates of the field label |
LabelPageNumber | Integer | Page number of the field label |
LabelElements | FormText | Elements of the field label |
Value | T | Value of the identified field. Cast to its appropriate type |
ValueBox | BoundingBox | Box coordinates of the field value |
ValuePageNumber | Integer | Page number of the field value |
ValueElements | FormText | Elements of the field value |
ValueText | String | Textual value of the identified field |
Form
Represents the type of form
Properties
Property | Type | Description |
---|---|---|
FormType | FormType | Type of the detected form. Can be a BusinessCard, IdentityDocument, Invoice, or Receipt |
FormTypeConfidenceScore | Float | Confidence score in the range from 0 to 100 of the detected form type |
Pages | FormPage | Listing of pages identified in the form layout |
Fields | FormField | Dictionary of fields identified in the form. The field name is used as a key in this dictionary |
InvoiceLineItem
Represents a line item inside an invoice
Properties
Property | Type | Description |
---|---|---|
Text | String | Textual representation of the line item |
Amount | FormFieldValue Float | Amount detected in the line item |
Description | FormFieldValue String | Description of item detected in the line item |
Quantity | FormFieldValue Float | Quantity detected in the line item |
UnitPrice | FormFieldValue Float | Price per unit detected in the line item |
ProductCode | FormFieldValue String | Product code detected in the line item |
Unit | FormFieldValue String | Unit of the item. e.g., hours, nos, etc. |
Date | FormFieldValue DateTime | Date detected in the line item |
Tax | FormFieldValue Float | Tax amount detected in the line item. It could be represented as the actual amount, percentage, etc. |
ValueAddedTax | FormFieldValue Float | VAT amount detected in the line item. The field name in Azure: VAT |
Fields | FormField | Dictionary of fields identified in the line item. The field name is used as a key in this dictionary |
Invoice
Represents an invoice form
Properties
Property | Type | Description |
---|---|---|
CustomerName | FormFieldValue String | Invoiced customer name |
CustomerId | FormFieldValue String | Customer reference id |
PurchaseOrder | FormFieldValue String | Purchase order reference number |
InvoiceId | FormFieldValue String | Id of the invoice. Sometimes referred to as InvoiceNumber |
InvoiceDate | FormFieldValue DateTime | Date when invoice was issued |
DueDate | FormFieldValue DateTime | Date payment of invoice is due |
VendorName | FormFieldValue String | Name of vendor associated with invoice |
VendorTaxId | FormFieldValue String | Taxpayer Id associated with the vendor |
VendorAddress | FormFieldValue String | Vendor mailing address |
VendorAddressRecipient | FormFieldValue String | Recipient name associated with vendor address |
CustomerTaxId | FormFieldValue String | Taxpayer Id associated with the customer |
CustomerAddress | FormFieldValue String | Customer mailing address |
CustomerAddressRecipient | FormFieldValue String | Recipient name associated with customer address |
BillingAddress | FormFieldValue String | Explicit billing address for customer |
BillingAddressRecipient | FormFieldValue String | Recipient name associated with billing address |
ShippingAddress | FormFieldValue String | Shipping address of customer specified in invoice |
ShippingAddressRecipient | FormFieldValue String | Recipient name associated with shipping address |
ServiceAddress | FormFieldValue String | Explicit service address of customer associated with invoice |
ServiceAddressRecipient | FormFieldValue String | Recipient name associated with service address |
RemittanceAddress | FormFieldValue String | Explicit remittance/payment address associated with customer |
RemittanceAddressRecipient | FormFieldValue String | Recipient name associated with remittance address |
PaymentTerm | FormFieldValue String | Terms of payment specified in invoice |
ServiceStartDate | FormFieldValue DateTime | Starting date of provided service |
ServiceEndDate | FormFieldValue DateTime | End date of provided service |
AmountDue | FormFieldValue Float | Total amount due |
SubTotal | FormFieldValue Float | Subtotal identified in invoice |
PreviousUnpaidBalance | FormFieldValue Float | Previous unpaid balance amount identified in invoice |
InvoiceTotal | FormFieldValue Float | Total charges associated with the invoice |
TotalTax | FormFieldValue Float | Total tax amount identified in invoice |
TotalVAT | FormFieldValue Float | Total Value Added Tax amount identified in invoice |
LineItems | InvoiceLineItem | Listing of invoice line items identified in invoice |
FormType | FormType | Type of detected form. Can be an Invoice, Receipt, BusinessCard, or IdentityCard |
FormTypeConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form type |
Pages | FormPage | Listing of pages identified in invoice |
Fields | FormField | Dictionary of fields identified in the invoice. The form field name is used as a key in the dictionary |
BusinessCard
Represents the model of a business card
Properties
Property | Type | Description |
---|---|---|
ContactNames | FormFieldValue List of String | Contact Name(s) |
FirstName | FormFieldValue String | First (given) name of contact. |
LastName | FormFieldValue String | Last (family) name of contact |
CompanyNames | FormFieldValue List of String | Company Name(s) of contact |
DepartmentNames | FormFieldValue List of String | Department Name(s)/Organization Name(s) of contact |
JobTitles | FormFieldValue List of String | Job Title(s) of contact |
EmailAddresses | FormFieldValue List of String | Email Address(es) of contact |
Websites | FormFieldValue List of String | Website(s) of contact |
Addresses | FormFieldValue List of String | Address(es) of contact |
MobilePhones | FormFieldValue List of String | Mobile Phone Number(s) of contact |
WorkPhones | FormFieldValue List of String | Work Phone Number(s) of contact |
OtherPhones | FormFieldValue List of String | Other Phone Number(s) of contact |
Faxes | FormFieldValue List of String | Fax Number(s) of contact |
FormType | FormType | Type of detected form. Can be an Invoice, Receipt, BusinessCard, or IdentityCard |
FormTypeConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form type |
Pages | FormPage | Listing of pages identified in business card |
Fields | FormField | Dictionary of fields identified in the business card. The form field name is used as a key in the dictionary |
IdentityDocument
Represents an identity document like a driver's license, passport, etc
Properties
Property | Type | Description |
---|---|---|
IdentityDocumentType | FormFieldValue String | Type of identity document. Possible values are a driver's license and passport. |
FirstName | FormFieldValue String | First (given) name from document. |
LastName | FormFieldValue String | Last (family) name from document |
MiddleName | FormFieldValue String | Middle name from document |
Suffix | FormFieldValue String | Suffix of name from document |
DocumentNumber | FormFieldValue String | Identity document number |
DateOfBirth | FormFieldValue DateTime | Date of birth extracted from identity document |
DateOfIssue | FormFieldValue DateTime | Issue date extracted from identity document |
DateOfExpiration | FormFieldValue DateTime | Expiry date extracted from identity document |
PlaceOfBirth | FormFieldValue String | Place of birth extracted from document |
CountryCode | FormFieldValue String | ISO 3166 country code |
NationalityCode | FormFieldValue String | ISO 3166 country code of nationality extracted from passport document |
Sex | FormFieldValue String | Person sex extracted from document. Possible values include M, F, and X |
MachineReadableZone | FormFieldValue String | MRZ extracted from passport document. Two lines of 44 characters each |
Region | FormFieldValue String | Region extracted from identity document. (Driver's license) |
Address | FormFieldValue String | Person Address extracted from identity document. (Driver's license) |
AddressCity | FormFieldValue String | Person Address city extracted from identity document. (Driver's license) |
AddressState | FormFieldValue String | Person Address state code extracted from identity document. (Driver's license) |
AddressStateName | FormFieldValue String | Person Address state name extracted from identity document. (Driver's license) |
AddressZipCode | FormFieldValue String | Person Address zip code extracted from identity document. (Driver's license) |
AddressCounty | FormFieldValue String | Person Address county extracted from identity document. (Driver's license) |
LicenseClass | FormFieldValue String | License class extracted from identity document. (Driver's license) |
Restrictions | FormFieldValue String | Restrictions extracted from identity document. (Driver's license) |
Endorsements | FormFieldValue String | Endorsements extracted from identity document. (Driver's license) |
Veteran | FormFieldValue String | Veteran information extracted from identity document. (Driver's license) |
FormType | FormType | Type of detected form. Can be an Invoice, Receipt, BusinessCard, or IdentityCard |
FormTypeConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form type |
Pages | FormPage | Listing of pages identified in identity document |
Fields | FormField | Dictionary of fields identified in identity document. The form field name is used as a key in the dictionary |
ReceiptLineItem
Represents a line item inside a sales receipt
Properties
Property | Type | Description |
---|---|---|
ItemName | FormFieldValue String | Item name/Item description |
Quantity | FormFieldValue Float | Quantity detected in the line item |
UnitPrice | FormFieldValue Float | Price per unit detected in the line item |
TotalPrice | FormFieldValue Float | Total price of receipt line item |
Receipt
Represents a sales receipt
Properties
Property | Type | Description |
---|---|---|
ReceiptType | FormFieldValue String | Type of sales receipt. e.g., itemized, hotel, etc. |
MerchantName | FormFieldValue String | Merchant name identified in sales receipt |
MerchantPhoneNumber | FormFieldValue String | Merchant phone number identified in sales receipt |
MerchantAddress | FormFieldValue String | Merchant address identified in sales receipt |
TransactionDate | FormFieldValue DateTime | Date when the receipt was issued |
TransactionTime | FormFieldValue TimeSpan | Time when the receipt was issued |
Total | FormFieldValue Float | Transaction total of the sales receipt |
Subtotal | FormFieldValue Float | Subtotal of the sales receipt (normally before tax is applied) |
Tax | FormFieldValue Float | Total tax on receipt |
Tip | FormFieldValue Float | Tip indicated in the sales receipt |
LineItems | ReceiptLineItem | Listing of receipt line items identified in sales receipt |
FormType | FormType | Type of detected form. Can be an Invoice, Receipt, BusinessCard, or IdentityCard |
FormTypeConfidenceScore | Float | Confidence score in range 0 to 100 of the detected form type |
Pages | FormPage | Listing of pages identified in receipt |
Fields | FormField | Dictionary of fields identified in receipt. The form field name is used as a key in the dictionary |
DetectedObject
It represents the information about the detected object by cognitive services.
Properties
Property | Type | Description |
---|---|---|
ObjectName | String | Language name. |
ConfidenceScore | Double | Score between 0 and 100 indicating the confidence for the detected object. |
TranslatedText
It represents translated text information obtained from cognitive services.
Properties
Property | Type | Description |
---|---|---|
Language | String | Language name. |
LanguageCode | String | Shortcode of language in ISO 639-1 format. Refer to List of ISO 639-1 codes |
Text | String | Translated text. |