Roadmap

This document sets out the high-level tasks which the vg development team hopes to accomplish in the next few versions of vg and beyond.

By Time

These are the things we hope to achieve on several planning horizons:

Next 3 Months

Research topics: accurate long read Giraffe scaling at Q20: How???

Next 6 Months

Next Year

Running Projects

These are things we are working on, with no particular delivery date goal.

Use of MCMC techniques in the genotyper with multipath alignments

Wishlist

These are things we would like to do eventually.

Alignment
- Adoption of the multipath alignment paradigm as the default
- Graph-to-graph mapping (Xian)
Variant Calling
- Implementation of an HHGA-like machine learning based variant caller
- Integration of variant calling and assembly polishing processes
- Prune the zoo of TraversalFinders, and expose the useful ones to Python
Visualization
- Browser-free tube map
- Better tube map handling of edge cases
  - No haplotypes on a node
  - Starting on a rare haplotype
Infrastructure
- Destructively modernize and unify IO
  - Eliminate VPKG framing if possible in favor of magic numbers everywhere
    - Resolve ensuing questions about GAM format
      - Just use GAF?
    - Handle things like GFA that need to manually sniff
  - Just save from the object; no more save_handle_graph
  - Magic format registration for libvgio magic numbers for loading
  - Depend on libvgio in libbdsg to do the IO there and pick the right handle graph implementation
- Replace Protobuf internal formats with faster ones
- Revision of ID assignment logic to allow deterministic node breaking
- Accept gzipped GFA if practical (can't mmap)
- Improved HandleGraph API
  - Abstract away node boundaries
  - View all sequence as C++17 string_views instead of sequence-owning strings
  - O(1) reverse complement DNAStringView
- CMake-ify the main vg build
- Eliminate old systems and their associated submodules, or factor them out into their own projects
  - vg vectorize could be its own project
    - Update vg vectorize to modern, system Vowpal Wabbit
    - Or pull it out into its own submodule and remove Vowpal Wabbit dependency from vg
  - Eliminate RocksDB from vg; everybody using vg map uses GCSA indexes now.
  - vg genotype
  - vg srpe
- More cross-language support
  - Interoperate with Rust handle graph users/providers
  - Interoperate with Java handle graph users/providers

Start here

vg Manpage

Build VG (or use it in Docker)

File Formats

VG Roadmap

Provide feedback

Saved searches

Use saved searches to filter your results more quickly