ZeroxPDFLoader
Overview​
ZeroxPDFLoader is a document loader that leverages the Zerox library. Zerox converts PDF documents into images, processes them using a vision-capable language model, and generates a structured Markdown representation. This loader allows for asynchronous operations and provides page-level document extraction.
Integration details​
| Class | Package | Local | Serializable | JS support |
|---|---|---|---|---|
| ZeroxPDFLoader | langchain_community | ❌ | ❌ | ❌ |
Loader features​
| Source | Document Lazy Loading | Native Async Support |
|---|---|---|
| ZeroxPDFLoader |