What fields does it extract?

Title, author name, publish date, last modified date, article tags, categories, publisher name, excerpt, word count, estimated reading time, and canonical URL.

Does it work on all article formats?

It works best on standard blog posts, news articles and content pages that use structured data or common meta patterns. Custom CMS implementations may have lower accuracy.

Can I extract article data from multiple pages?

Yes — use Site Crawl or Bulk Scan mode to extract article metadata from multiple blog posts or news articles in one job.

Yes — SmartScan is free. Register for 1,000 scans/month. No credit card required.

Free Tool

किसी भी पेज से लेख का मेटाडेटा निकालें

Automatically pull structured metadata from any article or blog post — title, author, publication date, last modified, tags, word count and estimated reading time.

No account needed for your first scan · Results in seconds

✍️

लेखक & तिथियां

Extracts author name, publish date and last modified date from JSON-LD, Open Graph, meta tags and HTML semantic markup.

🏷️

टैग & कैटेगरी

Finds article tags, categories and keywords declared in structured data or visible on the page.

📊

शब्द संख्या & पढ़ने का समय

Calculates the word count of the main content and estimates reading time at 200 words per minute.

Article Info Extractor कैसे काम करता है

Enter any article or blog post URL

Paste the link to any blog post, news article or content page you want to analyze.

SmartScan extracts metadata from multiple sources

Checks JSON-LD structured data, Open Graph meta tags and semantic HTML elements to find the most accurate article information.

Download structured article data as JSON

Get all article metadata — title, author, dates, tags, word count, reading time — in a clean JSON file.

🎯

मल्टी-सोर्स एक्सट्रैक्शन

Checks JSON-LD (Article, NewsArticle, BlogPosting), Open Graph meta, Twitter Card meta, and HTML semantic elements — whichever has the best data.

🌐

प्रकाशन जानकारी

Extracts publisher name and logo if available in structured data — useful for source attribution and content audits.

📋

स्ट्रक्चर्ड JSON आउटपुट

All article data saved as content/article.json — title, author, dates, tags, word count, reading time, excerpt.

अक्सर पूछे जाने वाले सवाल

यह कौन से फील्ड निकालता है?: शीर्षक, लेखक, प्रकाशन तिथि, संशोधन तिथि, टैग, कैटेगरी, प्रकाशक, अंश, शब्द संख्या, अनुमानित पढ़ने का समय और कैनोनिकल URL।
How does it find the author and date?: SmartScan checks multiple sources in priority order: JSON-LD schema (Article/NewsArticle/BlogPosting), Open Graph meta tags, Twitter Card, and semantic HTML elements like <time> and byline patterns.
क्या यह सभी आर्टिकल फॉर्मेट पर काम करता है?: स्टैंडर्ड ब्लॉग पोस्ट, न्यूज़ आर्टिकल और स्ट्रक्चर्ड डेटा वाले पेजों पर सबसे अच्छा काम करता है। कस्टम CMS में कम सटीकता हो सकती है।
क्या मैं कई पेजों से आर्टिकल डेटा निकाल सकता हूँ?: हाँ — Site Crawl या Bulk Scan मोड से कई ब्लॉग पोस्ट या आर्टिकल से मेटाडेटा निकालें।
क्या यह टूल मुफ्त है?: हाँ — SmartScan मुफ्त है। 1,000 स्कैन/माह के लिए रजिस्टर करें। कोई क्रेडिट कार्ड नहीं चाहिए।

किसी भी वेबपेज से आर्टिकल मेटाडेटा निकालें

Free — 1,000 scans/month. No credit card required.

Create Free Account → Try Without Account