ocrModelEnumRequired
"google-vision": string"deepseek-ocr": string"amazon-textract": string"azure-vision": string"openai": string"huggingface": string"gemini": string"native": stringDefault value of this property is "native"
languageEnumOptional
"eng": string"spa": string"fra": string"deu": string"ita": string"por": string"rus": string"chi_sim": string"jpn": string"kor": string"ara": stringDefault value of this property is "eng"
preserveFormattingbooleanOptional
Default value of this property is true
extractImagesbooleanOptional
Default value of this property is false
outputFormatEnumOptional
"json": string"text": string"markdown": stringDefault value of this property is "json"
pageRangestringOptional
Default value of this property is "all"
googleVisionApiKeystringOptional
deepseekApiKeystringOptional
awsAccessKeyIdstringOptional
awsSecretAccessKeystringOptional
awsRegionstringOptional
Default value of this property is "us-east-1"
azureEndpointstringOptional
azureApiKeystringOptional
openaiApiKeystringOptional
huggingfaceApiKeystringOptional
geminiApiKeystringOptional