Analysis run by Snakefile using this config file. See the README in the top directory for details.
Here is the rule graph of the computational workflow:
Here is the Markdown output of each notebook in the workflow:
-
Get codon-variant-table from here.
-
Count variants and then aggregate counts to create to create variant counts file.
-
Analyze sequencing counts to cells ratio; this prints a list of any samples where this ratio too low. Also creates a CSV with the sequencing counts, number of sorted cells, and ratios for all samples.
-
Escape fractions for mutations and homologs; creating mutation escape fraction file and homolog escape fraction file.
-
Call sites of strong escape, and write to a CSV file.
-
Plot escape profiles.
-
Map escape profiles to
*.pdb
files using this notebook -
Count mutations in GISAID RBD sequences to create this counts file.
-
Make supplementary data files, which are here. These include
dms-view
input files.