Crawl data from Traveloka.com for hotels, coaches, and plane trips using Selenium

hhtrieu0108/Crawl_Traveloka

Traveloka Data Crawling Project Overview

This project systematically gathers data from Traveloka using Selenium. The extracted information is organized into three structured dataframes: Hotel, Coach, and PlaneTrip.

Project Objective

The project scrapes Traveloka's data and transforms it into a structured format suitable for subsequent analysis or integration into applications.

Methodologies

  • Selenium: Automates web browser interactions to scrape data from Traveloka.
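As a rough illustration of the Selenium approach, the sketch below opens a page and collects the visible text of every element matching a CSS selector. It is not the repository's actual code: the function names `build_driver` and `collect_texts`, the headless Chrome setup, and the 10-second implicit wait are all assumptions, and any selector passed in has to be found by inspecting the live Traveloka page.

```python
def build_driver(headless=True):
    """Create a Chrome driver (hypothetical helper, not from the repository).

    Selenium is imported lazily so collect_texts stays importable without it.
    """
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    if headless:
        options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)
    driver.implicitly_wait(10)  # assumed timeout: wait up to 10 s for elements
    return driver


def collect_texts(driver, url, css_selector):
    """Load `url` and return the non-empty, stripped text of matching elements."""
    driver.get(url)
    # "css selector" is the string value of Selenium's By.CSS_SELECTOR constant.
    elements = driver.find_elements("css selector", css_selector)
    texts = [element.text.strip() for element in elements]
    return [text for text in texts if text]
```

A typical run would call `build_driver()`, pass the driver to `collect_texts` with a search-results URL and a selector, and call `driver.quit()` in a `finally` block so the browser is always closed.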

Data Extraction Details

The extracted data will encompass the following aspects:

Hotel Data:

  • hotel_names: Name of the hotel
  • location: Hotel location
  • price: Hotel price
  • score_hotels: Hotel rating score
  • number_rating: Number of hotel ratings
  • star_number: Hotel star rating
  • received_time: Check-in time
  • giveback_time: Check-out time
  • description: Hotel description
  • hotel_link: Link to the hotel
  • id: Hotel identifier
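The hotel fields above could be modeled as one record per hotel and then turned into a dataframe. This is a minimal sketch under assumptions: the `HotelRecord` class, the helper names, and the column types (for example integer prices and optional scores) are illustrative guesses, since the README lists only the field names.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class HotelRecord:
    """One scraped hotel; field names follow the list above, types are assumed."""
    hotel_names: str
    location: str
    price: Optional[int]           # assumed: whole currency units
    score_hotels: Optional[float]  # assumed: numeric rating score
    number_rating: Optional[int]
    star_number: Optional[int]
    received_time: str             # check-in time, kept as scraped text
    giveback_time: str             # check-out time, kept as scraped text
    description: str
    hotel_link: str
    id: str


def hotels_to_rows(records):
    """Convert records to plain dicts, one per dataframe row."""
    return [asdict(record) for record in records]


def hotels_to_dataframe(records):
    """Build a pandas DataFrame; pandas is imported lazily as an optional dependency."""
    import pandas as pd

    return pd.DataFrame(hotels_to_rows(records))
```

Keeping the records as dataclasses until the last step makes it easy to validate or clean individual fields before the dataframe is built.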

Coach Data:

  • brand: Coach brand
  • price: Coach price
  • number_of_seat: Number of seats available
  • start_time: Departure time
  • start_day: Departure day
  • end_day: Arrival day
  • end_time: Arrival time
  • trip_time: Total trip duration
  • take_place: Departure location
  • destination: Destination
  • location: Destination location
  • id: Coach identifier
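Because the coach table stores both the departure and arrival day and time alongside trip_time, the duration can be recomputed as a consistency check. The sketch below assumes days are formatted as `DD-MM-YYYY` and times as `HH:MM`; the actual formats on Traveloka may differ, and `trip_duration` is a hypothetical helper, not part of the repository.

```python
from datetime import datetime, timedelta

DAY_FORMAT = "%d-%m-%Y"  # assumed format of start_day / end_day
TIME_FORMAT = "%H:%M"    # assumed format of start_time / end_time


def trip_duration(start_day, start_time, end_day, end_time):
    """Return the time between departure and arrival as a timedelta."""
    layout = f"{DAY_FORMAT} {TIME_FORMAT}"
    departure = datetime.strptime(f"{start_day} {start_time}", layout)
    arrival = datetime.strptime(f"{end_day} {end_time}", layout)
    if arrival < departure:
        raise ValueError("arrival is earlier than departure")
    return arrival - departure


# An overnight trip: departs 22:30 and arrives 05:00 the next day.
overnight = trip_duration("01-03-2024", "22:30", "02-03-2024", "05:00")
```

Comparing this value with the scraped trip_time is one way to catch rows where a day or time was parsed incorrectly.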

PlaneTrip Data:

  • brand: Airline brand
  • price: Airline ticket price
  • start_time: Departure time
  • start_day: Departure day
  • end_day: Arrival day
  • end_time: Arrival time
  • trip_time: Total trip duration
  • take_place: Departure location
  • destination: Destination
  • id: Plane trip identifier
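All three tables carry a price field, which is typically scraped as display text rather than a number. The following sketch shows one way to normalize it, assuming prices are whole amounts with thousands separators and a currency label (for example `1.234.567 VND`); `parse_price` is a hypothetical helper, and prices with decimal fractions would need different handling.

```python
import re


def parse_price(text):
    """Return the price as an int, or None when the text contains no digits.

    Assumes whole-number prices, so every non-digit character
    (separators, spaces, currency labels) is simply dropped.
    """
    digits = re.sub(r"\D", "", text)
    return int(digits) if digits else None
```

For instance, `parse_price("1.234.567 VND")` yields the integer 1234567, while an empty or digit-free string yields `None` so missing prices stay distinguishable from zero.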

Contact Information

For inquiries, suggestions, or collaboration opportunities, please don't hesitate to get in touch. Your feedback and insights are invaluable for continuous improvement.
