Skip to content

Mario8433/Washington-Post-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

DevProjects - Web scraper to get news article content from Washington Post

Tech/framework used

Built with Python with those frameworks and packages :

  1. for scraping data -> BS4, requests
  2. for process tasks -> sys, os, itertools

How to Install and use

  1. Clone the git project in your terminal
  2. Install python3 and beautifulSoup4
  3. Then run command in your terminal (remember to change to your download directory using cd command) Sample command: web_scraper.py https://www.washingtonpost.com/technology/2020/09/25/privacy-check-blacklight/

sample news url includes https://www.washingtonpost.com/technology/2020/09/25/privacy-check-blacklight/ https://www.washingtonpost.com/technology/2022/11/23/mars-rover-rock-samples/ https://www.washingtonpost.com/technology/2022/11/24/twitter-musk-reverses-suspensions/

License

No licence

About

My first python repository omg :) for practice only

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages