The Docx File node is a document loader that allows you to load and process data from DOCX files. It’s part of the Document Loaders category and is designed to extract text content from Microsoft Word documents (.docx files).
Description: The DOCX file(s) to be loaded and processed.
Text Splitter (optional)
Label: Text Splitter
Name: textSplitter
Type: TextSplitter
Description: An optional text splitter to break the document into smaller chunks.
Additional Metadata (optional)
Label: Additional Metadata
Name: metadata
Type: json
Description: Additional metadata to be added to the extracted documents.
Omit Metadata Keys (optional)
Label: Omit Metadata Keys
Name: omitMetadataKeys
Type: string
Description: A comma-separated list of metadata keys to omit from the default set. Use * to omit all metadata keys except those specified in the Additional Metadata field.
The node outputs an array of IDocument objects, each representing a chunk or the entire content of the processed DOCX file(s), along with associated metadata.