Connect
Route
.bmp
, .gif
, .heic
, .jpeg
, .jpg
, .pdf
, .png
, .tiff
, and .webp
.Transform
Header
, Footer
, Title
, NarrativeText
, Table
, Image
, and many more. Each document is wrapped in extensive metadata so you can understand languages, file types, sources, hierarchies, and much more.Chunk
Enrich
Embed
Persist
Source Connectors
Destination Connectors
Workflow
Jobs
.pdf
, .pptx
, and .tiff
..docx
files that have page metadata, Unstructured calculates the number of pages based on that metadata.