Find All PDFs and Documents on Any Website
Instantly extract links to every PDF, Word document, Excel spreadsheet, PowerPoint presentation, and other downloadable file on any web page.
No account needed for your first scan · Results in seconds
PDFs
All .pdf links — research papers, brochures, whitepapers, product manuals and more.
Spreadsheets & Slides
.xlsx, .xls, .csv, .pptx and .ppt files — data exports, presentations, reports.
Word documents
.docx, .doc, .odt and other word processing formats — contracts, forms, guidelines.
How PDF & Document Extractor works
Enter any URL
Paste the URL of any webpage — a company homepage, resource library, government site or any page that may link to documents.
SmartScan parses all anchor links and file references
Every <a href> and file reference is inspected for document extensions — PDF, DOCX, XLS, PPTX, and more.
Download a clean list of every document URL
Results are exported as docs.csv with URL and file type columns, and included in the full JSON result.
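As an illustration of the steps above, here is a minimal Python sketch of the anchor-inspection idea. It is not SmartScan's actual implementation: the names DOC_EXTENSIONS, file_type, DocLinkParser and find_doc_links are hypothetical, and the extension list is simply the set of formats named on this page.

```python
from html.parser import HTMLParser
from posixpath import splitext
from urllib.parse import urlparse

# Assumed extension list, taken from the formats named on this page.
DOC_EXTENSIONS = {
    ".pdf", ".docx", ".doc", ".xls", ".xlsx", ".csv",
    ".pptx", ".ppt", ".odp", ".ods", ".odt",
}


def file_type(href):
    """Return the lowercase extension without the dot, or None if not a document."""
    # Look only at the URL path so query strings and fragments are ignored.
    ext = splitext(urlparse(href).path)[1].lower()
    return ext[1:] if ext in DOC_EXTENSIONS else None


class DocLinkParser(HTMLParser):
    """Collects href values from <a> tags that point to document files."""

    def __init__(self):
        super().__init__()
        self.doc_links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")  # None when the attribute is missing or empty
        if href and file_type(href):
            self.doc_links.append(href)


def find_doc_links(html):
    """Return document hrefs in the order they appear in the HTML."""
    parser = DocLinkParser()
    parser.feed(html)
    return parser.doc_links
```

Matching on the parsed URL path rather than the raw string means a link such as /files/report.PDF?v=2 is still recognized as a PDF.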
All formats
Detects PDF, DOCX, DOC, XLS, XLSX, CSV, PPTX, PPT, ODP, ODS, ODT and more.
Absolute URLs
Resolves relative paths to full absolute URLs so every link is immediately usable.
CSV + JSON export
Document list exported as docs.csv (with URL and file type) and included in JSON result.
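The absolute-URL and CSV export features can be sketched with the Python standard library. This is an illustrative sketch only: the function names and the column headers url and file_type are assumptions, since the page states that docs.csv has URL and file type columns but not their exact names.

```python
import csv
import io
from posixpath import splitext
from urllib.parse import urljoin, urlparse


def to_rows(page_url, hrefs):
    """Resolve each href against the page URL and pair it with its file type."""
    rows = []
    for href in hrefs:
        absolute = urljoin(page_url, href)  # relative paths become full URLs
        ext = splitext(urlparse(absolute).path)[1].lower().lstrip(".")
        rows.append({"url": absolute, "file_type": ext})
    return rows


def write_docs_csv(rows, out):
    """Write rows to a file-like object; the header names are assumed."""
    writer = csv.DictWriter(out, fieldnames=["url", "file_type"])
    writer.writeheader()
    writer.writerows(rows)


def docs_csv_text(rows):
    """Return the CSV content as a string, handy for previews."""
    buffer = io.StringIO()
    write_docs_csv(rows, buffer)
    return buffer.getvalue()
```

To produce a file on disk, open it with open("docs.csv", "w", newline="") and pass the handle to write_docs_csv, so the csv module controls the line endings.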
Popular use cases
Find all published reports, whitepapers and data on a target website quickly
Discover all downloadable assets on your own site — useful for SEO and compliance
See what documentation, pricing sheets and case studies competitors publish
Collect links to a prospect company's filings, reports or product specs automatically
Frequently asked questions
- What document types does it detect?
- PDF, DOCX, DOC, XLS, XLSX, CSV, PPTX, PPT, ODP, ODS, ODT and any other linked file with a recognizable document extension.
- Does it download the documents?
- No — SmartScan extracts the document URLs, not the files themselves. You get a clean list of links to download at your convenience.
- Does it crawl the whole site for documents?
- By default it scans a single page. Enable Site Crawl mode to follow internal links and find documents across the entire website.
- Can it find documents on JavaScript-rendered pages?
- Yes. Switch to Dynamic rendering mode and SmartScan will fully render the page before extracting — catching document links injected by JavaScript.
- Is this tool free?
- Yes. SmartScan is free: your first scan needs no account, and registering gives you 1,000 scans per month. No credit card required.
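For readers curious how a site crawl of the kind described in the FAQ might follow internal links, here is a rough Python sketch. It is not how SmartScan is implemented: crawl_for_docs, HrefParser, the fetch callback and the max_pages limit are all hypothetical, and fetching is left to the caller so the example stays independent of any HTTP library.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse

# Assumed extension list, taken from the formats named on this page.
DOC_EXTENSIONS = (
    ".pdf", ".docx", ".doc", ".xls", ".xlsx", ".csv",
    ".pptx", ".ppt", ".odp", ".ods", ".odt",
)


class HrefParser(HTMLParser):
    """Collects every non-empty href found on <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        href = dict(attrs).get("href")
        if tag == "a" and href:
            self.hrefs.append(href)


def crawl_for_docs(start_url, fetch, max_pages=50):
    """Breadth-first crawl of same-host pages; returns sorted document URLs.

    `fetch` is a caller-supplied function mapping a URL to its HTML, or None.
    """
    host = urlparse(start_url).netloc
    queue, seen, docs = deque([start_url]), {start_url}, set()
    pages_scanned = 0
    while queue and pages_scanned < max_pages:
        page_url = queue.popleft()
        html = fetch(page_url)
        pages_scanned += 1
        if html is None:
            continue
        parser = HrefParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve to an absolute URL and drop any #fragment.
            url = urldefrag(urljoin(page_url, href)).url
            parsed = urlparse(url)
            if parsed.path.lower().endswith(DOC_EXTENSIONS):
                docs.add(url)
            elif parsed.netloc == host and url not in seen:
                seen.add(url)  # internal page: queue it for scanning
                queue.append(url)
    return sorted(docs)
```

Only links on the same host as the start URL are followed, and the page limit keeps the crawl bounded. Document links are recorded even when they point to another host.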
Extract all documents from any website
Free — 1,000 scans/month. No credit card required.