Skip to content

Commit

Permalink
Prepare to release optimus v1 (#177)
Browse files Browse the repository at this point in the history
* Prepare to release Optimus v1

* Add changelog.md

* Finish the skeleton of the changelog.

* Remove the pointer to the google docs for Optimus.

* Update README.txt

* Add diagram for Optimus.

* Address comments.

* Add the pointer to the acceptance report.
  • Loading branch information
rexwangcc authored Mar 27, 2019
1 parent 944b496 commit e3ed43f
Show file tree
Hide file tree
Showing 5 changed files with 49 additions and 4 deletions.
33 changes: 33 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
# Skylab Change Log

---
## Optimus

### Optimus_v1.0.0

- The first major version release for the Optimus pipeline.

---
## Smart-seq2

### smartseq2_v2.2.0

### smartseq2_v2.1.0

### smartseq2_v2.0.0
- Add SS2 group qc metrics and zarr output files.

### smartseq2_v1.0.0
- The first major version release for the Smart-seq2 pipeline.

---
## Cellranger

### cellranger_v1.0.2
- Add max_retries runtime parameter.

### cellranger_v1.0.1
- Increase disk space.

### cellranger_v1.0.0
- The first major version release for the Cellranger pipeline.
9 changes: 6 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,13 @@
# skylab
Secondary analysis pipelines for the Human Cell Atlas.

[![Snyk Vulnerabilities for GitHub Repo (Specific Manifest)](https://img.shields.io/snyk/vulnerabilities/github/HumanCellAtlas/skylab/docker/cellranger/requirements.txt.svg?label=Snyk%20Scripts%20Vulnerabilities&logo=Snyk)](https://snyk.io/test/github/HumanCellAtlas/skylab?targetFile=docker/cellranger/requirements.txt)

[![GitHub Release](https://img.shields.io/github/release-pre/HumanCellAtlas/skylab.svg?label=Latest%20Release&style=flat-square&colorB=green)](https://github.com/HumanCellAtlas/skylab/releases)
[![License](https://img.shields.io/github/license/HumanCellAtlas/skylab.svg?style=flat-square)](https://github.com/HumanCellAtlas/skylab/blob/master/LICENSE)
[![Snyk Vulnerabilities for GitHub Repo (Specific Manifest)](https://img.shields.io/snyk/vulnerabilities/github/HumanCellAtlas/skylab/docker/cellranger/requirements.txt.svg?style=flat-square&label=Snyk%20Scripts%20Vulnerabilities&logo=Snyk)](https://snyk.io/test/github/HumanCellAtlas/skylab?targetFile=docker/cellranger/requirements.txt)

## Pipelines
- [cellranger](https://github.com/HumanCellAtlas/skylab/tree/master/pipelines/cellranger) secondary analysis pipeline
- [Optimus](https://github.com/HumanCellAtlas/skylab/tree/master/pipelines/optimus) secondary analysis pipeline
- [Smart-seq2](https://github.com/HumanCellAtlas/skylab/tree/master/pipelines/smartseq2_single_sample) secondary analysis pipeline

## How to run pipelines from skylab
Expand All @@ -17,7 +20,7 @@ For now, use `git clone [email protected]:HumanCellAtlas/skylab.git` and run the pi

Tasks on Cromwell may be run on what are known as "preemptible" machines to reduce costs by a significant amount. The catch with preemptible machines is that they may be "preempted" at any given moment--as in, google may shut down the task to re-use the resources.

Many tasks are set to automatically be `preemptible = 3`, aka they will be run on preemptible instances for up to 3 instances of preemption, after which it will be run on a non-preemptible machine. The Optimus pipeline in particular also has a `preemptible` input that overrides the default preemptible parameter on all tasks run through optimus. This option may be set to 0 to run the entire workflow without using preemptible machines.
Many tasks are set to automatically be `preemptible = 3`, aka they will be run on preemptible instances for up to 3 instances of preemption, after which it will be run on a non-preemptible machine. This option may be set to 0 by passing a task-level input to the workflow (i.e. Optimus.StarAlign.preemptible), causing the task to be run without using preemptible machines. This option is especially useful for long-running tasks, which have a chance to take a very long time to run due to being preempted multiple times.

### maxRetries

Expand Down
2 changes: 1 addition & 1 deletion pipelines/optimus/Optimus.wdl
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ workflow Optimus {
description: "The optimus 3' pipeline processes 10x genomics sequencing data based on the v2 chemistry. It corrects cell barcodes and UMIs, aligns reads, marks duplicates, and returns data as alignments in BAM format and as counts in sparse matrix exchange format."
}
# version of this pipeline
String version = "optimus_v0.4.0"
String version = "optimus_v1.0.0"

# Sequencing data inputs
Array[File] r1_fastq
Expand Down
Binary file added pipelines/optimus/Optimus_diagram.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
9 changes: 9 additions & 0 deletions pipelines/optimus/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
# Optimus Pipeline

![Diagram](Optimus_diagram.png)

The Optimus pipeline is a pipeline for processing 3' single-cell expression data generated with the 10X Genomic V2 assay, developed by the Data Coordination Platform (DCP) of the Human Cell Atlas (HCA) Project.

Optimus is a alignment and transcriptome quantification pipeline. Optimus corrects Cell Barcodes (CBs) and Unique Molecular Identifiers (UMIs), aligns reads to the genome, generates an expression count matrix in a UMI-aware manner, detects empty droplets, calculates summary statistics for genes and cells, and returns outputs in BAM and Zarr file formats. Special care is taken to keep all reads that may be useful to the downstream user, such as unaligned reads or reads with uncorrectable barcodes. This design provides flexibility to the downstream user and allows for alternative filtering or leveraging the data for novel methodological development.

More details can be found in the [Optimus Acceptance Report](https://docs.google.com/document/d/158ba_xQM9AYyu8VcLWsIvSoEYps6PQhgddTr9H0BFmY/edit)

0 comments on commit e3ed43f

Please sign in to comment.