"That's a beautiful collection of public data you have here, it would be a shame if someone made it searchable." -- paraphrased
This repo provides examples that use sourmash to build FracMinHash sketches and search ~485,000 public metagenomes in the SRA with them in real time, using mastiff.
Click on the binder button below, and then select "Run... Run all cells", or hit the fast-forward button.
Deposit sequences of interest in sequences/
. Then run:
snakemake -j 1 --use-conda
and look in mastiff_out/
.