WPExtractor is a python-based tool specifically made for Aritificial Intelligence based projects to make dataset. This helps to collect data from blog which can be used to train bot in many useful way.
- Automatically extract all posts from a WordPress website within seconds.
- Saves the data in Json file in the directory for you.
- Easily understable json format to make your life easier :D
- Responsive developers. Just make an issue, we'll fix it for you :)
Usage:
python main.py -u https://csrockers.in
By default, it will fetch posts from the website. To fetch pages, use the following.
python main.py -u https://fulltimehosting.net --pages
Manal Shaikh & Somil Gumber.