Skip to content

Latest commit

 

History

History
17 lines (14 loc) · 870 Bytes

README.md

File metadata and controls

17 lines (14 loc) · 870 Bytes

WP-Extractor - WordPress Blog Post Extractor in Json Format

WPExtractor is a python-based tool specifically made for Aritificial Intelligence based projects to make dataset. This helps to collect data from blog which can be used to train bot in many useful way.

Features

  • Automatically extract all posts from a WordPress website within seconds.
  • Saves the data in Json file in the directory for you.
  • Easily understable json format to make your life easier :D
  • Responsive developers. Just make an issue, we'll fix it for you :)

Usage

Usage:
python main.py -u https://csrockers.in
By default, it will fetch posts from the website. To fetch pages, use the following.
python main.py -u https://fulltimehosting.net --pages

Credits

Manal Shaikh & Somil Gumber.