The Puppeteer Web Scraper is a document loader node that uses Puppeteer to load and extract data from web pages. It can scrape single pages or crawl multiple pages, and offers various options for customizing the scraping process.
URL (required)
Text Splitter (optional)
Get Relative Links Method (optional)
Get Relative Links Limit (optional)
Wait Until (optional)
Wait for selector to load (optional)
Additional Metadata (optional)
Omit Metadata Keys (optional)
The node outputs an array of IDocument objects, each representing a scraped webpage with its content and metadata.