public class ParsedDocumentMetadata extends Object
| Modifier and Type | Field and Description |
|---|---|
| String | DocumentCulture: An RFC 3066 code that represents the cultural context of the document regarding formatting of numbers, dates, character symbols, etc. |
| String | DocumentLanguage: An ISO 639-1 code that represents the primary language of the parsed text. |
| java.time.LocalDate | DocumentLastModified: The last-revised/last-modified date that was provided for the document. |
| String | ParserSettings: The full parser settings that were used during parsing. |
| String | PlainText: The plain text that was used for parsing. |
| List<String> | SovrenSignature: A digital signature used to ensure there is no tampering between parsing and indexing. |
| Constructor and Description |
|---|
| ParsedDocumentMetadata() |
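public String PlainText
The plain text that was used for parsing.

public String DocumentLanguage
An ISO 639-1 code that represents the primary language of the parsed text. The value may be iv (invariant/unknown). Note that the two-letter ISO codes reported by the Parser, such as zh for Chinese, do not differentiate between language variants, such as Mandarin and Cantonese.

public String DocumentCulture
An RFC 3066 code that represents the cultural context of the document regarding formatting of numbers, dates, character symbols, etc., for example en-US for US English. Note, however, that culture can be set independently of language and country to achieve fine-tuned cultural control over parsing, so if you use this value you should not assume that it always matches the language and country.

public String ParserSettings
The full parser settings that were used during parsing.

public java.time.LocalDate DocumentLastModified
The last-revised/last-modified date that was provided for the document.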
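
The notes on DocumentLanguage and DocumentCulture can be illustrated with a minimal sketch. It assumes you have already read the two String fields from a ParsedDocumentMetadata instance; the class MetadataExample and its helper methods are hypothetical names introduced here for illustration and are not part of the SDK. Only java.util.Locale from the standard library is used.

```java
import java.util.Locale;

// Hypothetical helpers (not part of the SDK) that operate on the raw
// String values of DocumentLanguage and DocumentCulture.
class MetadataExample {

    // True when the language code is present and is not "iv"
    // (invariant/unknown).
    static boolean hasKnownLanguage(String documentLanguage) {
        return documentLanguage != null
                && !documentLanguage.isEmpty()
                && !"iv".equalsIgnoreCase(documentLanguage);
    }

    // Converts a culture tag such as "en-US" into a java.util.Locale.
    // Falls back to Locale.ROOT when no culture is available.
    static Locale cultureLocale(String documentCulture) {
        if (documentCulture == null || documentCulture.isEmpty()) {
            return Locale.ROOT;
        }
        return Locale.forLanguageTag(documentCulture);
    }

    // Culture can be set independently of language, so compare the two
    // explicitly instead of assuming they agree.
    static boolean cultureMatchesLanguage(String documentLanguage,
                                          String documentCulture) {
        return hasKnownLanguage(documentLanguage)
                && cultureLocale(documentCulture).getLanguage()
                        .equalsIgnoreCase(documentLanguage);
    }

    public static void main(String[] args) {
        // Example values only: a Chinese document parsed with a US culture.
        String language = "zh";
        String culture = "en-US";
        System.out.println(hasKnownLanguage(language));
        System.out.println(cultureLocale(culture).getCountry());
        System.out.println(cultureMatchesLanguage(language, culture));
    }
}
```

In real code you would pass meta.DocumentLanguage and meta.DocumentCulture to these helpers. Keeping the comparison in its own method makes the mismatch case explicit, which is the situation the DocumentCulture note warns about.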
Copyright © 2021. All rights reserved.