Skip to content

WPExtractor is a python-based tool specifically made for Aritificial Intelligence based projects to make dataset. This helps to collect data from blog which can be used to train bot in many useful way.

License

Notifications You must be signed in to change notification settings

SomilGumber/wpextractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WP-Extractor - WordPress Blog Post Extractor in Json Format

WPExtractor is a python-based tool specifically made for Aritificial Intelligence based projects to make dataset. This helps to collect data from blog which can be used to train bot in many useful way.

Features

  • Automatically extract all posts from a WordPress website within seconds.
  • Saves the data in Json file in the directory for you.
  • Easily understable json format to make your life easier :D
  • Responsive developers. Just make an issue, we'll fix it for you :)

Usage

Usage:
python main.py -u https://csrockers.in
By default, it will fetch posts from the website. To fetch pages, use the following.
python main.py -u https://fulltimehosting.net --pages

Credits

Manal Shaikh & Somil Gumber.

About

WPExtractor is a python-based tool specifically made for Aritificial Intelligence based projects to make dataset. This helps to collect data from blog which can be used to train bot in many useful way.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages