ZigZag PoRep #77

Closed
13 of 23 tasks
porcuquine opened this issue Jul 3, 2018 · 3 comments

porcuquine commented Jul 3, 2018

ZigZag PoRep

The one true proof of replication (for now and through initial release, we think).

Current strong focus is on a viable ZigZag PoRep which meets both scaling and security requirements. This will require many optimizations, but ultimately boils down to squeezing better performance (less CPU and less RAM per GiB replicated) and supporting very large sectors.

This issue will serve as a roadmap for that work. The high-level outline below should be broken out into sub-issues. Those working on these issues need to see this work as high-priority, with an emphasis on fast iteration.

Our goal now is to implement a fully viable Proof-of-Replication. This work falls into two broad categories:

Correctness

There is one item in the list below ('Implement challenge derivation correctly') which needs to be updated to reflect current needs (TODO: @porcuquine). However, this is an easy change and a low priority for the short-term push. #404 is also a hanging thread but does not block the work below.

Performance

PoRep performance relates to our ability to simultaneously meet security and scaling requirements. Good news: we have now run a secure Proof-of-Replication (i.e. sufficient challenges with full parameters). For the moment, I omit most supporting documentation, but that is being worked on in one work stream below. Together, this work represents one (and for now our best) answer to the question posed in #157.

We now need to overcome two sequential hurdles:

  • Meet scaling requirements with respect to proof size. Based on data from our secure PoRep run, we estimate this means we need to replicate and prove a 64GiB sector. On a machine with enough RAM, that should not be an issue. However, 'should' does not always predict reality. Larger sectors #522 may address the proof-size problem, but we should be ready to route around any subsequent obstacles.
  • Meet scaling requirements with respect to CPU time. This will be harder because, as it turns out, the time required to generate our circuit proofs is significant enough that it creates even greater pressure than proof size does. We will tackle this in several ways:
    • Maximize replicable sector size by minimizing required memory. See issue [TODO: copy bullets into issue @porcuquine]
    • Minimize CPU time of replication. See issue [TODO: copy bullets into new issue @porcuquine]
      • Optimize a minimal-parallelism path with minimal CPU usage (also focused on memory efficiency). [TODO: @porcuquine make issue, coalesced with above]
    • Sealing with Blake2s (was: Hybrid Merkle Trees) #531: Pending calculations verifying a projected solution to the performance equation, implement a hybrid hashing solution for Merkle trees. The idea is to allow a tunable public parameter (a tree depth) specifying the portion of the tree which should use Blake2s instead of Pedersen hashes. This will allow us to trade proof size for speed. This is exactly the lever we need to take advantage of our surplus proof-size budget and apply it to CPU-time performance. (A minimal sketch of the idea follows this list.)
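
A minimal sketch of the hybrid-tree idea, in Rust. The names and the split direction (Blake2s near the leaves, Pedersen near the root) are assumptions for illustration, not the #531 design; the bullet above only fixes that a tunable depth parameter marks which portion of the tree uses which hash.

```rust
// Sketch only, not the #531 implementation.
type Node = [u8; 32];

/// Builds a Merkle root, switching hash functions at a tunable depth.
/// `blake2s_depth` is the (public) number of levels, counted from the leaves,
/// hashed with Blake2s (fast natively, more constraints in-circuit); the
/// remaining levels use Pedersen (slow natively, cheap in-circuit).
fn build_tree<F, G>(leaves: &[Node], blake2s_depth: usize, blake2s: F, pedersen: G) -> Node
where
    F: Fn(&Node, &Node) -> Node,
    G: Fn(&Node, &Node) -> Node,
{
    assert!(!leaves.is_empty() && leaves.len().is_power_of_two());
    let mut level: Vec<Node> = leaves.to_vec();
    let mut depth = 0;
    while level.len() > 1 {
        level = level
            .chunks(2)
            .map(|pair| {
                // Levels below the threshold use the cheap native hash;
                // the rest use the circuit-friendly hash.
                if depth < blake2s_depth {
                    blake2s(&pair[0], &pair[1])
                } else {
                    pedersen(&pair[0], &pair[1])
                }
            })
            .collect();
        depth += 1;
    }
    level[0]
}
```

Raising `blake2s_depth` cuts native hashing time at the cost of larger circuit proofs; lowering it does the opposite, which is the proof-size/CPU-time trade described above.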

In support of these optimizations, we need to have (and start using) a better way to measure and track change.
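
One hedged illustration of what "measure and track change" could look like, assuming we standardize on criterion benchmarks; `replicate_stub` is a hypothetical stand-in for the real replication entry point:

```rust
use criterion::{black_box, criterion_group, criterion_main, Criterion};

// Hypothetical stand-in for the real replication entry point under test.
fn replicate_stub(data: &[u8]) -> Vec<u8> {
    data.iter().map(|b| b.wrapping_add(1)).collect()
}

fn bench_replication(c: &mut Criterion) {
    // Small fixed input here; real runs would sweep sector sizes and record
    // CPU time and peak memory per GiB replicated.
    let data = vec![0u8; 1024];
    c.bench_function("replicate_1KiB", |b| b.iter(|| replicate_stub(black_box(&data))));
}

criterion_group!(benches, bench_replication);
criterion_main!(benches);
```

Running this per commit and comparing results over time would give us the tracking this paragraph asks for.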

Because the above (non-exhaustive) list includes multiple overlapping and potentially conflicting optimizations, we need a better configuration story, as sketched in #501. Once implemented, optimizations should integrate with it, ensuring the go-filecoin defaults allow the nightly and user devnets to continue functioning, while also making it easy to configure benchmarks (especially the ZigZag 'example') for the properties required both to acquire data along the way and, ultimately, to run a first fully secure, fully scalable proof of replication.
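
As a hedged illustration only (not the #501 design; all names and values here are hypothetical), a single settings struct could carry the tunable knobs, with defaults safe for the devnets and explicit overrides for benchmarks:

```rust
/// Hypothetical configuration sketch; field names and defaults are illustrative only.
#[derive(Debug, Clone)]
pub struct PorepConfig {
    /// Sector size in bytes (the scaling target above is 64 GiB).
    pub sector_size: u64,
    /// Number of ZigZag layers.
    pub layers: usize,
    /// Challenges per layer.
    pub challenge_count: usize,
    /// Levels (from the leaves) hashed with Blake2s instead of Pedersen.
    pub blake2s_depth: usize,
}

impl Default for PorepConfig {
    fn default() -> Self {
        // Placeholder values: defaults must keep the go-filecoin nightly and
        // user devnets working; benchmarks override them explicitly.
        PorepConfig {
            sector_size: 1 << 30,
            layers: 10,
            challenge_count: 20,
            blake2s_depth: 0,
        }
    }
}
```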

The trajectory of this work needs to be plotted and adjusted through ongoing benchmarks and experimental runs. In support of this tactical work, and as the first step in a larger project, we will deliver a presentation of key data in a form that supports our calculations. This should eventually serve as documentation of the relationships among many key Filecoin parameters, as well as relate them to hardware requirements. [TODO: @porcuquine make placeholder cryptolab epic].

#477


Historical work, with a few stragglers:

  • Factor Layers trait out of extant layered_drgporep implementation.
  • Introduce 'expander' component to graphs.
  • Implement zigzag graph toggle.
  • Add ZigZagDrgPorep, implementing Layers.
  • Implement Circuit
  • Make zigzag_test_compound pass: for some reason verification fails, though the inputs seem correct for the generated circuit proof.
  • Multiple challenges
    • Add support for multiple challenges.
    • Add challenge-derivation method. (Stub exists in API; see the sketch after this list.)
    • Implement challenge derivation correctly.
    • Only generate replica-parent Merkle proofs for 50% of challenges.
  • Don't prove data inclusion except on first layer. (Prove identity to previous layers, replicated data.) Not applicable with challenges varying per layer.
  • Add vector commit for each layer's CommR.
  • Integrate generated parameter cache with CircleCI caching.
  • Generate proof while replicating. Can't do this.
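
For the challenge-derivation items above, a hedged sketch of the usual approach (illustrative only; the API stub, not this code, defines the project's actual scheme): derive each challenge deterministically from the replica commitment and a seed, so prover and verifier agree on the challenged leaves.

```rust
/// Illustrative challenge derivation: hash (comm_r || seed || counter) and
/// reduce modulo the number of leaves. `hash` would be a cryptographic hash
/// such as Blake2s in practice.
fn derive_challenges<H>(comm_r: &[u8], seed: &[u8], leaves: usize, count: usize, hash: H) -> Vec<usize>
where
    H: Fn(&[u8]) -> [u8; 32],
{
    (0..count)
        .map(|i| {
            let mut preimage = Vec::with_capacity(comm_r.len() + seed.len() + 8);
            preimage.extend_from_slice(comm_r);
            preimage.extend_from_slice(seed);
            preimage.extend_from_slice(&(i as u64).to_le_bytes());
            let digest = hash(&preimage);
            // Take the first 8 digest bytes as a little-endian integer; a real
            // implementation should also avoid modulo bias.
            let mut word = [0u8; 8];
            word.copy_from_slice(&digest[..8]);
            (u64::from_le_bytes(word) as usize) % leaves
        })
        .collect()
}
```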

@porcuquine porcuquine added this to the Sprint 17 milestone Jul 3, 2018
@porcuquine porcuquine mentioned this issue Jul 6, 2018
@porcuquine porcuquine added the Epic label Jul 6, 2018
@porcuquine porcuquine removed this from the Sprint 17 milestone Jul 11, 2018

nicola commented Dec 3, 2018

Re-using this Epic to track the ZigZag work to be done for stage 3; added some issues to it.

@dignifiedquire

no more zigzag for now


jon-chuang commented Apr 29, 2020

Hi, I would like to ask whether ZigZag PoRep was rolled back because it was unsuitable for PoSt due to the SEAL stacking attack. If so, I am still interested in it as a means of PoRep on network-controlled data without PoSt-based elections, etc. Is there anyone I can contact regarding this?

I have many questions, especially about how long the lockout time can be made, and whether it can be made arbitrarily long by stacking many layers. If one could make timeouts take 1 hour, say by stacking 1 million layers, with an initial sealing time of, say, 6 hours, that would be ideal.

I see, @porcuquine, that in your blog post one can achieve a 24-hour timeout. This would be amazing for us. Can I ask if this still holds? Further, what is the initial sealing time? I expect it would be several days then?
