Free Tool

Find All PDFs and Documents on Any Website

Instantly extract links to every PDF, Word document, Excel spreadsheet, PowerPoint presentation, and other downloadable file on any web page.

No account needed for your first scan · Results in seconds

📄

PDFs

All .pdf links — research papers, brochures, whitepapers, product manuals and more.

📊

Spreadsheets & Slides

.xlsx, .xls, .csv, .pptx and .ppt files — data exports, presentations, reports.

📝

Word documents

.docx, .doc, .odt and other word processing formats — contracts, forms, guidelines.

How PDF & Document Extractor works

1

Enter any URL

Paste the URL of any webpage — a company homepage, resource library, government site or any page that may link to documents.

2

SmartScan parses all anchor links and file references

Every <a href> and file reference is inspected for document extensions — PDF, DOCX, XLS, PPTX, and more.

3

Download a clean list of every document URL

Results are exported as docs.csv with URL and file type columns, and included in the full JSON result.
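
The parsing step above can be approximated in a few lines of Python. This is a minimal sketch using only the standard library, not SmartScan's actual implementation: the names `DOC_EXTENSIONS`, `file_type`, `DocLinkParser` and `extract_doc_links` are hypothetical, and the extension set is simply copied from the formats listed on this page.

```python
from html.parser import HTMLParser
from pathlib import PurePosixPath
from typing import List, Optional, Tuple
from urllib.parse import urlparse

# Assumed extension list, taken from the formats named on this page.
DOC_EXTENSIONS = {
    "pdf", "docx", "doc", "xls", "xlsx", "csv",
    "pptx", "ppt", "odp", "ods", "odt",
}


def file_type(url: str) -> Optional[str]:
    """Return the lowercase document extension of the URL path, or None."""
    # Only the path is inspected, so query strings such as ?v=2 are ignored.
    suffix = PurePosixPath(urlparse(url).path).suffix.lstrip(".").lower()
    return suffix if suffix in DOC_EXTENSIONS else None


class DocLinkParser(HTMLParser):
    """Collects (href, file_type) pairs from <a href> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.docs: List[Tuple[str, str]] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        kind = file_type(href) if href else None
        if kind:
            self.docs.append((href, kind))


def extract_doc_links(html: str) -> List[Tuple[str, str]]:
    """Parse an HTML string and return the document links it contains."""
    parser = DocLinkParser()
    parser.feed(html)
    return parser.docs
```

Calling `extract_doc_links(html)` on a page's HTML returns the hrefs exactly as written in the markup, paired with their detected type. Links without a recognized document extension are skipped.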

🎯

All formats

Detects PDF, DOCX, DOC, XLS, XLSX, CSV, PPTX, PPT, ODP, ODS, ODT and more.

🌐

Absolute URLs

Resolves relative paths to full absolute URLs so every link is immediately usable.

📋

CSV + JSON export

Document list exported as docs.csv (with URL and file type) and included in JSON result.
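
To illustrate the Absolute URLs and CSV + JSON export features, here is a short Python sketch of how relative links might be resolved and serialized. It is an assumption-laden example rather than the tool's real output format: the column names `url` and `file_type`, the JSON key `docs`, and all function names are hypothetical.

```python
import csv
import io
import json
from urllib.parse import urljoin


def to_absolute(page_url, hrefs):
    """Resolve each href against the URL of the page it was found on."""
    return [urljoin(page_url, href) for href in hrefs]


def docs_to_csv(rows):
    """Serialize (url, file_type) rows as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["url", "file_type"])  # assumed column names
    writer.writerows(rows)
    return buffer.getvalue()


def docs_to_json(rows):
    """Serialize the same rows as a JSON object with an assumed 'docs' key."""
    docs = [{"url": url, "file_type": kind} for url, kind in rows]
    return json.dumps({"docs": docs}, indent=2)
```

For example, resolving `../files/report.pdf` against `https://example.com/resources/index.html` yields `https://example.com/files/report.pdf`, while links that are already absolute pass through unchanged.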

Popular use cases

Research & OSINT
Find all published reports, whitepapers and data on a target website quickly
Content audit
Discover all downloadable assets on your own site — useful for SEO and compliance
Competitive research
See what documentation, pricing sheets and case studies competitors publish
Lead qualification
Collect links to prospect company filings, reports or product specs automatically

Frequently asked questions

What document types does it detect?
PDF, DOCX, DOC, XLS, XLSX, CSV, PPTX, PPT, ODP, ODS, ODT and any other linked file with a recognizable document extension.
Does it download the documents?
No — SmartScan extracts the document URLs, not the files themselves. You get a clean list of links to download at your convenience.
Does it crawl the whole site for documents?
By default it scans a single page. Enable Site Crawl mode to follow internal links and find documents across the entire website.
Can it find documents on JavaScript-rendered pages?
Yes. Switch to Dynamic rendering mode and SmartScan will fully render the page before extracting — catching document links injected by JavaScript.
Is this tool free?
Yes. SmartScan is free: your first scan needs no account, and a free registered account includes 1,000 scans/month. No credit card required.
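
For the Site Crawl question above, the idea of "following internal links" can be sketched in Python as a filter that keeps only links on the same host as the starting page. This is one plausible approach, not a description of how SmartScan decides what to crawl; the function name `internal_links` and the same-host rule are assumptions.

```python
from urllib.parse import urldefrag, urljoin, urlparse


def internal_links(page_url, hrefs):
    """Return absolute, de-duplicated http(s) links on page_url's host."""
    host = urlparse(page_url).netloc.lower()
    seen = set()
    result = []
    for href in hrefs:
        # Resolve relative paths, then drop any #fragment so that
        # /about and /about#team count as the same page.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, tel: and similar
        if parsed.netloc.lower() != host or absolute in seen:
            continue  # skip external hosts and repeats
        seen.add(absolute)
        result.append(absolute)
    return result
```

A crawler built on this would scan each returned URL for document links in turn, keeping a visited set so that no page is fetched twice.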

Extract all documents from any website

Free — 1,000 scans/month. No credit card required.

Create Free Account →
Try Without Account

Related Tools

🔗 Link Extractor: extract all internal and external links
🌐 Website Scraper: full data extraction
📋 Bulk Extractor: scan multiple URLs at once