Skip to content

Source code for processing insurance price transparency data

License

Notifications You must be signed in to change notification settings

cch0/price-transparency-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Price Transparency Data

This repository contains source code for the blog post series titled A Practical Take On Processing Price Transparency Data


In Part A, we talk about downloading source machine readable data file using AWS Lambda running Python script.

In Part B, we talk about using Polars script to pre-process the source data and store data in Parquet format in S3.

In Part C, we talk about using both PySpark and Polars scripts to produce denormalized data partitioned by billing_code and store final data in Parquet format in S3.

About

Source code for processing insurance price transparency data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages