The Apify Website Content Crawler is a document loader node that uses Apify’s Website Content Crawler to extract content from websites. It’s designed to crawl web pages, extract their content, and return it as a collection of documents that can be used in various natural language processing tasks.
pageContent
: The extracted text content from the webpage.metadata
: An object containing metadata about the document, including the source URL and any additional metadata specified in the input.