Get the imformantion of producer and publisher of movies from 2010 to 2017
MTime_company
Based on python3.5 and selenium, first need to install:
- selenium
- phantomjs
- mongodb
Key skills that I have learned selenium phantomjs mongodb regular expression error handling
Take Let the bullets fly(让子弹飞)as an example.
First get the search page and get the id of this movie.
Second use the id to enter the page of companies.
Store the imformation of companies into mongodb. The example is as follow.