Lockbit scraper fixed (now uses playwright) #74 #89

biligonzales · 2022-02-18T17:26:58Z

Describe the changes

Lockbit 2.0 now uses a ddos protection mechanism hence the regular http get method is no longer working.

As a workaround I have implemented the playwright Microsoft library which behaves as if a proper browser did the request.

Summary of the changes:

lockbit.py: replaced the use of requests by playwright
requirements.txt: added playwright
Dockerfile: added playwright chromium support as well as required libraries.

I have also upgraded at the top of the Dockerfile from python3.9-buster to python3.10-bullseye.

Related issue(s)

It fixes Issue #74

Note that the scraping engine for lockbit has been left untouched as it is still perfectly working. Only the web page retrieval method has been altered.

How was it tested?

docker-compose build app
docker-compose up --abort-on-container-exit
Checked that Lockbit entries have been inserted into the database

captainGeech42 · 2022-02-20T03:10:17Z

Hey @biligonzales, thanks for working on this.

I've got a large refactor/fix collection from an anonymous contributor who maintains a private fork that I'll be merging in this weekend (just waiting to hear back from them on something), that I believe covers this and your other PR.

Once I get that merged in and assess what is still an issue, I will let you know here.

Regardless, I appreciate the PRs, thank you! Will be in touch.

ocbrollingpaper · 2022-04-14T23:16:47Z

@captainGeech42 is this coming or nah? Asking because I have fully refactored scrapers and rest of the stuff to be ran in real-time without need for cronjobs

EDIT:
I also have go version

captainGeech42 · 2022-04-21T03:15:49Z

@ocbrollingpaper I guess not (was waiting to receive it from 3p, never came), feel free to PR those. Thank you!

biligonzales added 2 commits February 18, 2022 18:11

Lockbit scraper fixed (now uses playwright) captainGeech42#74

d2513c1

Lockbit: fixing url extracted and last scraped value

2bb264f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lockbit scraper fixed (now uses playwright) #74 #89

Lockbit scraper fixed (now uses playwright) #74 #89

biligonzales commented Feb 18, 2022

captainGeech42 commented Feb 20, 2022

ocbrollingpaper commented Apr 14, 2022 •

edited

Loading

captainGeech42 commented Apr 21, 2022 •

edited

Loading

Lockbit scraper fixed (now uses playwright) #74 #89

Are you sure you want to change the base?

Lockbit scraper fixed (now uses playwright) #74 #89

Conversation

biligonzales commented Feb 18, 2022

Describe the changes

Related issue(s)

How was it tested?

captainGeech42 commented Feb 20, 2022

ocbrollingpaper commented Apr 14, 2022 • edited Loading

captainGeech42 commented Apr 21, 2022 • edited Loading

ocbrollingpaper commented Apr 14, 2022 •

edited

Loading

captainGeech42 commented Apr 21, 2022 •

edited

Loading