Skip to main content

Media Data Types

This page describes all the Object types defined and used by Media activities.

Transcript

Properties

PropertyTypeDescription
LanguageStringThe language of the audio.
TextStringThe text extracted from the audio.
SegmentsList<Segment>Details segments extracted in audio transcription.

Segment

Properties

PropertyTypeDescription
IdInt32The unique identifier of the segment.
SeekInt32The position in the audio file where the segment begins.
StartDoubleThe segment's start time in the audio file. It provides a specific timestamp indicating when the segment begins.
EndDoubleThe end time of the segment in the audio file. It provides a timestamp indicating when the segment ends.
TextStringthe transcribed text corresponding to the spoken content within the segment. Converts the spoken words into written text during audio transcription, and this property stores that textual representation.
TokensList<Int32>Details about transcription.
TemperatureDoubleThe intensity or significance of the speech content within the segment, which could help prioritize or categorize segments during transcription.
AvgLogprobDoubleThe average logarithmic probability associated with the transcription of the segment.
CompressionRatioDoubleThe degree to which it compresses the audio data.
NoSpeechProbDoubleThe probability of absence of speech within the segment.
ConfidenceDoubleconfidence level or certainty associated with the transcription of the segment. A numerical value indicates the degree of confidence in the accuracy of the transcribed text.
WordsList<Word>A list of individual words extracted from the transcribed text.

Word

Properties

PropertyTypeDescription
TextStringThe text representation of an individual word within the transcribed segment.
StartDoubleThe start time of the Word within the audio segment.
EndDoubleThe end time of the Word within the audio segment.
ConfidenceDoublethe confidence level or certainty associated with the transcription of the Word. A numerical value indicates the degree of confidence in the accuracy of transcribing that specific Word.

TranscriptSearchResult

Properties

PropertyTypeDescription
Search TextStringThe text for the time slots were extracted.
SegmentsList<FoundSegment>The segments found for the given text.

FoundSegment

Properties

PropertyTypeDescription
IdInt32The unique identifier of the segment.
SeekInt32The position in the audio file where the segment begins.
StartDoubleThe segment's start time in the audio file. It provides a specific timestamp indicating when the segment begins.
EndDoubleThe end time of the segment in the audio file. It provides a timestamp indicating when the segment ends.
TextStringthe transcribed text corresponding to the spoken content within the segment. The spoken words it converts into written text during audio transcription, and this property stores that textual representation.
WordsList<FoundWord>A list of individual words extracted from the transcribed text.

FoundWord

Properties

PropertyTypeDescription
TextStringThe text representation of an individual word within the transcribed segment.
StartDoubleThe start time of the Word within the audio segment.
EndDoubleThe end time of the Word within the audio segment.