Add method to compile geb to boolean circuits. #105
Conversation
update to main
merge to master
Wow, at first glance this looks brilliant -- a clear model of exactly what VampIR circuits can do is something I've always wanted in Geb (but I don't know VampIR well enough to write such a model). Without having looked at the code yet, I have one question relating to #89 -- have you thought about what an extension to the BITC which includes VampIR's higher-order functions might look like? Is that even the right place for me to be thinking about where to plug in that support?

Oh, and a similar question to my previous one: would it (as with my previous question, at some point in the future after this PR is in, of course, not adding more work to this PR) make sense to extend the BITC with explicit use of VampIR's constraints? Or is there a different layer at which that should be done, or perhaps does it make sense for layers above VampIR, such as Geb, simply to express all constraints implicitly through computation?

And to test my understanding: is it the case that the BITC is effectively abstracting the underlying multivariate VampIR compilation so that layers above (such as Geb's compilation of
On higher-order functions in VampIR:

On constraints:

On "test my understanding":
src/geb/trans.lisp
```lisp
(defmethod to-bitc ((obj <substobj>))
  (typecase-of substobj obj
    (so0 0)
    (so1 0)
    (coprod (+ 1 (max (to-bitc (mcar obj)) (to-bitc (mcadr obj)))))
    (prod   (+ (to-bitc (mcar obj)) (to-bitc (mcadr obj))))
    (otherwise (subclass-responsibility obj))))
```
Just a question: if we see a `<substobj>` in a morphism slot then it's the identity; is this saved at all? I know for things that take objects this is correct. For example, `(comp so1 so1)` is just `identity . identity` for the type `so1`.
I don't know what you mean by "saved". `to-bitc`, when applied to a substobj, just calculates the bit-width required to store the object (in other words, it converts a GEB object into a BITC object). It's generally used when calculating the bitwidth of a domain. Although, it may not be necessary; `(to-bitc (dom X))` should be the same as `(dom (to-bitc X))`, etc.
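To illustrate the bit-width calculation described above, here is a hypothetical Python model of what `to-bitc` computes on objects (the tagged-tuple encoding is my own invention for the sketch, not part of the actual codebase):

```python
# Hypothetical model of the bit-width calculation `to-bitc` performs on
# GEB objects.  Objects are modeled as tagged tuples:
#   ("so0",) / ("so1",)  -- 0 bits (at most one inhabitant)
#   ("coprod", a, b)     -- 1 tag bit + enough bits for the wider branch
#   ("prod", a, b)       -- widths add
def bit_width(obj):
    tag = obj[0]
    if tag in ("so0", "so1"):
        return 0
    if tag == "coprod":
        return 1 + max(bit_width(obj[1]), bit_width(obj[2]))
    if tag == "prod":
        return bit_width(obj[1]) + bit_width(obj[2])
    raise ValueError(f"unknown object tag: {tag}")

# bool = so1 + so1 needs 1 bit; bool x bool needs 2.
boolean = ("coprod", ("so1",), ("so1",))
assert bit_width(boolean) == 1
assert bit_width(("prod", boolean, boolean)) == 2
```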
I wasn't aware that objects become identities in geb morphism slots. I did not write any code with that in mind.
might be good to update the code with that if possible
Alright, I think I've fixed that.
Thanks for the replies. I have some follow-up questions:
Are you referring to this comment?
In light of "the only constraint VampIR has is equality checking between field elements": does that mean that any VampIR constraint can be precisely modeled as a category-theoretic equalizer?
Oops, right, sorry, I shouldn't have said
I forgot one more potential-BITC-extension-related question: once we have an initial port of #101 to Lisp to address #61 , which I think will at first reuse the same code that we're currently using in Geb to compile
To me the net code changes look great; would you please squash (and then, if there's a separation that would improve clarity, break up again) the commits in preparation for merging, to get rid of the merge commits and bug fixes to new code (if there are any bug fixes to pre-existing code, then those should be separate commits, but I haven't spotted any)?

Actually, @AHartNtkn , you don't need to worry about this; @mariari offered to take care of it (thanks!).
Yes
I don't know what you have in mind when you say "modeled as a category-theoretic equalizer". An equalizer in GEB would just be a finite set, and wouldn't produce any constraints when compiled into VampIR. Additionally, equalities between constants would presumably produce somewhat trivial equalizers.
I don't really understand what that implementation of natural numbers is actually doing. Is it just creating a type that represents integers mod n, plus the existing arithmetic on it? If I have that right, it does make sense to modify BITC into a new category to support these. This category would have lists of numbers as objects (representing vectors of finite sets, with the number representing the size of the set in that slot). Most of the existing operations would be the same, but there would be constants for any particular number.

The arithmetic operations that GEB uses could then be incorporated as bespoke morphisms added manually to the category. `+` would go from [n, n] to [n], `<` would go from [n, m] to 2, etc. Performance-wise, it will be just as efficient as current GEB. The benefits come from not having to use them to perform non-arithmetic data structure manipulations.

If we implemented these operations in BITC (which we could do through an encoding of bitstrings as a recursive type), that would likely be similar in efficiency to using custom operations specifically when range checks are necessary, that is, for `%` and `<`. It would be WAY, WAY less efficient for operations that don't require range checks, namely `+`, `*`, `/`, and `-`. Then again, if you're modding everything anyway, that would likely negate any performance benefits. If you want to do a lot of mixed mods, then it might actually be better to encode everything.

But for ZK applications, you'd want to fix a single mod; then most things become much more efficient. If you do that, then I suppose the vectors would represent vectors of sets of size p, where p is the size of the field of your arithmetic circuit. Allowing modding for anything else would require range checks everywhere, so you might as well encode them. This is a bit of a design question; it depends on what you expect to happen.
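The generalized category sketched above (objects as lists of moduli, with bespoke arithmetic morphisms) can be modeled concretely. This is a hypothetical illustration of the idea, not code from the PR; all names are my own:

```python
# Hypothetical model of the proposed generalization: an object is a list
# of moduli [n0, n1, ...], and a point of that object is a vector where
# slot i holds a number below the i-th modulus.

def valid(obj, vec):
    """Check that `vec` is a point of the object `obj`."""
    return len(vec) == len(obj) and all(0 <= v < n for v, n in zip(vec, obj))

def add_mod(n):
    """Bespoke morphism [n, n] -> [n]: addition mod n."""
    return lambda vec: [(vec[0] + vec[1]) % n]

def less_than():
    """Bespoke morphism [n, m] -> [2]: comparison, producing a bit."""
    return lambda vec: [1 if vec[0] < vec[1] else 0]

plus7 = add_mod(7)
assert valid([7, 7], [5, 4])
assert plus7([5, 4]) == [2]        # (5 + 4) mod 7
assert less_than()([3, 9]) == [1]
```

The point of the sketch: `add_mod` is atomic (one field operation when n is the circuit's intrinsic modulus), whereas any other modulus would force a bit-level encoding with range checks.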
I suppose there could be both: efficient, special `+`, `*`, `/`, and `-` morphisms for p, and less efficient ones for every other mod. I suppose that would be my recommendation: bespoke `+`, `*`, `/`, and `-` for the field size, and implementing everything else as an encoded morphism.
I can't seem to force-push to the branch, so I've made a new branch with the changes; please review that for the actual code.
Judging the big O of

See 108
Sorry, that was a very under-specified question. Here's what I mean. You've given us the

When we add constraints to a circuit (in particular, for example, if we were to add some representation of constraints to

If that much is true, then this is my question: For any possible set of constraints that we could add to the circuit, is it the case that we could find some natural number
Yes, that's right.

This is interesting -- I was just about to ask how the finite-field aspect of circuits works. Does this mean that boolean circuits can implement a client-chosen fixed global modulus for the entire circuit, and reasonably efficiently? (Whereas using different moduli in different places would require the client to write manual

Yes, I do believe that would be the case.

No, that's not what I'm saying. Rather, arithmetic circuits come equipped with an intrinsic modulus chosen by the client, and that one modulus can be done efficiently, but any other can be done about as efficiently as possible using an encoded representation, which isn't very efficient. BITC, as it stands, can't implement any mod efficiently (relative to the intrinsic mod of the circuit), but if we wanted to compile to what arithmetic circuits can do efficiently, that would require additional morphisms which are not encoded; but they could only exist for one mod. For other mods, we might as well just encode them instead of having them available as atomic morphisms; any additional efficiency would be rather meager, as we would need to decompose everything into bits anyway for range checks.

Aha, I see. So I think I gather that providing a modulus and parameterizing
Adds category of boolean circuits (BITC). The objects in this category are just natural numbers representing the type of bit vectors of a certain length. There are nine methods for constructing morphisms:

- `(compose x y)`, which composes morphisms x and y.
- `(ident n)`, which is the identity on n.
- `(fork n)`, which maps n onto 2*n by copying its inputs.
- `(parallel x y)`, which (if x : a -> b and y : c -> d) will be a morphism from a + c -> b + d, running x and y on subvectors.
- `(swap n m)`, which maps n + m onto m + n by swapping.
- `one`, which represents the map from 0 onto 1 producing a vector with only 1 in it.
- `zero`, which represents the map from 0 onto 1 producing a vector with only 0 in it.
- `(drop n)`, which represents the unique morphism from n to 0.
- `(branch x y)`, which (if x : a -> b and y : a -> b) maps 1+a to b by splitting on the first bit to decide which morphism to apply to the remaining bits.

There are other ways to formulate this category, but I think this particular formulation is quite convenient.
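The nine constructors above can be given a toy semantics as functions on lists of bits. This is a hypothetical Python model for intuition only (in particular, `parallel` here takes the split point `a` as an explicit extra argument, and `branch` is assumed to select its first argument when the selector bit is 0 -- neither detail is specified by the PR):

```python
# Toy semantics for the BITC constructors: a morphism n -> m is modeled
# as a function from a list of n bits to a list of m bits.

def compose(x, y):     # assumed composition order: apply y first, then x
    return lambda bits: x(bits[:len(bits)])[:] if False else x(y(bits))

def ident(n):
    return lambda bits: bits

def fork(n):           # n -> 2n, duplicating the input
    return lambda bits: bits + bits

def parallel(x, a, y): # x : a -> b, y : c -> d, giving a+c -> b+d
    return lambda bits: x(bits[:a]) + y(bits[a:])

def swap(n, m):        # n+m -> m+n
    return lambda bits: bits[n:] + bits[:n]

one  = lambda bits: [1]   # 0 -> 1, emitting the bit 1
zero = lambda bits: [0]   # 0 -> 1, emitting the bit 0

def drop(n):           # unique morphism n -> 0
    return lambda bits: []

def branch(x, y):      # 1+a -> b; assumed: first bit 0 picks x, 1 picks y
    return lambda bits: x(bits[1:]) if bits[0] == 0 else y(bits[1:])

# A 1-bit multiplexer built from the constructors: the selector bit
# chooses which of the two payload bits survives.
mux = branch(parallel(ident(1), 1, drop(1)),   # selector 0: keep first bit
             parallel(drop(1), 1, ident(1)))   # selector 1: keep second bit
assert mux([0, 1, 0]) == [1]
assert mux([1, 1, 0]) == [0]
```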
I've implemented a `to-bitc` function in geb.trans which translates geb objects and morphisms into bitc objects and morphisms. Additionally, I've implemented a `to-vampir` function in bitc.trans which translates a bitc morphism into a VampIR morphism.

I'm not sure what else is needed, but the core implementation is done for now. In the future, this should be extended to a category whose objects represent vectors over some finite set other than the booleans. The reason I didn't do that here is because coproducts are only binary and there aren't finite sets beyond so0 and so1, so bitvectors are quite natural and using anything else would require post-hoc optimization, but future versions of geb may want more.

Also, I'd like to know what, if any, performance benefits this gives over the univariate polynomial formulation. I didn't test that.