split node uses VLM to identify document boundaries in multi-document PDFs and categorize each document by type.
Basic Usage
Configuration Options
Options Reference
| Option | Type | Default | Description |
|---|---|---|---|
provider | VLMProvider | Required | VLM provider for document analysis |
schemas | Record<string, object> | Required | Map of document types to schemas |
schemaRef | string | - | Reference to schema registry |
includeOther | boolean | true | Include documents that don’t match any type |
consensus | ConsensusConfig | - | Multi-run voting |
reasoning | object | - | Extended reasoning options |
Output: SplitDocument[]
The split node outputs an array ofSplitDocument objects:
Processing Split Documents
With forEach
Process each split document with its corresponding schema:With Conditional Routing
Route to different processing flows:Schema Registry
Use registered schemas instead of inline:schemas property:
Handling “Other” Documents
By default, documents that don’t match any defined type are categorized as “other”:includeOther: false to exclude unrecognized documents:
Extended Reasoning
Enable for complex document analysis:Example: Insurance Document Bundle
Next Steps
combine
Merge split document results
Flows
Learn about forEach and conditional