-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support CelOptions.maxRegexProgramSize(int)
to limit RE2 program size
#545
Comments
One small note: CEL-Java does not yet have a program planner, so the exception will have to be thrown during evaluation. |
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
copybara-service bot
pushed a commit
that referenced
this issue
Jan 11, 2025
Addresses #545 PiperOrigin-RevId: 714298569
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Feature request checklist
Change
To make CEL environment setup consistent across CEL implementations, I propose to add
CelOptions.maxRegexProgramSize(int)
to CEL-Java. This option should work similar toInterpreterOptions.regex_max_program_size
in CEL-Cpp (see nuances below).RE2 program size should be verified when the CEL program is created from the AST (AFAIK this is how cel-cpp works).
Nuances
The program size represents a very approximate measure of a regexp's "cost". There are no guarantees on the implementation details or claims about the properties of the program size (except "larger numbers are more expensive than smaller numbers").
Currently the program size is the same as the number of instructions of the regex program. However, the number of instructions in a regex depends on the concrete RE2 implementation. The implication of using the number of instructions as the program size is that:
Important
There's no guarantee that RE2 program size has the exact same value in CPP, Go and Java.
We should communicate this in the docs.
For example:
Context
Unlike the canonical C++ RE2 implementation, re2j (Java RE2 port) didn't expose the program size in public APIs. To address this inconsistency I made google/re2j#180, and it was recently merged.
When the next re2j version is released, we'll be able to determine the program size using
Pattern.programSize()
.Example
Some other known regex program sizes for java can be found in re2j's
PatternTest.java
.Related
Pattern.programSize()
andMatcher.programSize()
re2j#180CC @l46kok, @TristonianJones
The text was updated successfully, but these errors were encountered: