DevProjects - Web scraper to get news article content from Washington Post

Tech/framework used

Built with Python, using the following packages:

  1. For scraping data: BeautifulSoup4 (bs4), requests
  2. For task processing: sys, os, itertools (Python standard library)
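As a rough illustration of how requests and BS4 fit together for scraping, the sketch below downloads a page and pulls out a headline and paragraph text. It is not the project's actual code: the function names `fetch_html` and `extract_article`, the User-Agent header, and the assumption that the headline is in the first `<h1>` and the body in `<p>` tags are all hypothetical.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url: str) -> str:
    """Download a page and return its HTML; raises on HTTP errors."""
    # Assumption: a browser-like User-Agent; some sites reject the default one.
    headers = {"User-Agent": "Mozilla/5.0"}
    response = requests.get(url, headers=headers, timeout=10)
    response.raise_for_status()
    return response.text


def extract_article(html: str) -> dict:
    """Return the headline and the non-empty paragraph texts found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    # Assumption: the headline is the first <h1> and the body lives in <p> tags.
    heading = soup.find("h1")
    title = heading.get_text(strip=True) if heading else ""
    paragraphs = [p.get_text(" ", strip=True) for p in soup.find_all("p")]
    # Drop paragraphs that contain no text.
    paragraphs = [text for text in paragraphs if text]
    return {"title": title, "paragraphs": paragraphs}
```

Keeping the download and the parsing in separate functions means the parsing step can be tried on a saved HTML string without touching the network.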

How to install and use

  1. Clone the git project from your terminal
  2. Install Python 3, BeautifulSoup4 and requests (for example: pip install beautifulsoup4 requests)
  3. Change to the directory where you cloned the project (using the cd command), then run the script with an article URL. Sample command: web_scraper.py https://www.washingtonpost.com/technology/2020/09/25/privacy-check-blacklight/

Sample news URLs:

  1. https://www.washingtonpost.com/technology/2020/09/25/privacy-check-blacklight/
  2. https://www.washingtonpost.com/technology/2022/11/23/mars-rover-rock-samples/
  3. https://www.washingtonpost.com/technology/2022/11/24/twitter-musk-reverses-suspensions/
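Since the script takes the article URL as its only command-line argument, the argument handling might look something like the sketch below. This is an assumption about the approach, not the contents of web_scraper.py; the function name `parse_url_argument` and the validation rules are hypothetical.

```python
import sys
from urllib.parse import urlparse


def parse_url_argument(argv):
    """Return the article URL from argv, or None if the arguments are invalid."""
    # argv[0] is the script name, so exactly one extra argument is expected.
    if len(argv) != 2:
        print("Usage: web_scraper.py <article-url>", file=sys.stderr)
        return None
    parsed = urlparse(argv[1])
    # Accept only absolute http(s) URLs that include a host name.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        print(f"Not a valid URL: {argv[1]}", file=sys.stderr)
        return None
    return argv[1]
```

In a script, this would typically be called as `parse_url_argument(sys.argv)`, with the program exiting early if the result is None.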

License

No license