Correctness Testing 2.0 #12

fredjoonpark · 2024-12-19T00:52:25Z

Replaced our deprecated correctness testing suite with a new approach to correctness testing.

Added a new class, Mockmetheus, in ./correctness. It mocks the remote operations of a Prometheus client using snappy-encoded payloads.
Previously, our "test" cases covered functions and certain operators that involve client-side logic, which goes beyond remote operations. The suite now focuses exclusively on the correctness of remote read and write operations.
Added an optional flag, FRESH_TSDB, for running tests against a fresh TSDB. The tests are designed to work either way; the flag defaults to false for convenience.

Running tests

Go to ./correctness and bring up containers with docker compose up -d
Run tests with docker exec -it mockmetheus pytest

Introduced test cases for easy reference:

Remote read on initialization
Remote read with a nonexistent (DNE) metric
Remote write with an empty timeseries
Remote write with no labels
Remote write with no samples
Remote write & read with valid data
Remote write & read with multiple metrics
Remote write & read with multiple samples
Remote write & read with label matchers

Additional notes:

generate_test_run_id() - helps achieve test case independence by adding an extra label to our read/write requests.
The stub (prom_pb2.py) is generated using: protoc --python_out=. prom.proto
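
To illustrate the extra-label idea from the first note, here is a minimal sketch. It is not the code from this PR: the UUID-based ID, the label name `test_run_id`, and the helper `with_test_run_label` are all assumptions made for illustration.

```python
import uuid

# Hypothetical label name; the real suite may use a different one.
TEST_RUN_LABEL = "test_run_id"


def generate_test_run_id() -> str:
    """Return a unique identifier for one test case (assumed to be UUID-based)."""
    return uuid.uuid4().hex


def with_test_run_label(labels: dict, run_id: str) -> dict:
    """Return a copy of `labels` with the run ID attached as an extra label."""
    tagged = dict(labels)
    tagged[TEST_RUN_LABEL] = run_id
    return tagged
```

A test would write its series with the tagged labels and then read back with an equality matcher on the same label, so data left behind by other tests in a shared TSDB does not affect its assertions.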

Mockmetheus (pic for reference):

trevorbonas · 2024-12-20T21:12:29Z

correctness/mockmetheus.py

+        """
+        Parses a Prometheus duration string and returns the duration in milliseconds.
+        """
+        pattern = r'(?:(\d+)h)?(?:(\d+)m)?(?:(\d+)s)?$'


Durations can be negative, for example, -5h. This pattern wouldn't match the entirety of a negative duration.

Prometheus shows a negative time duration as one possible value in their documentation:

1s # Equivalent to 1.
2m # Equivalent to 120.
1ms # Equivalent to 0.001.
-2h # Equivalent to -7200.
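
One possible way to accept a leading sign, sketched under assumptions: the function name `parse_duration_ms` is hypothetical, and the units are limited to the h/m/s handled by the pattern in the diff (so `1ms` is still rejected here).

```python
import re

# Same h/m/s units as the pattern under review, plus an optional leading "-".
_DURATION_RE = re.compile(r'(-)?(?:(\d+)h)?(?:(\d+)m)?(?:(\d+)s)?')


def parse_duration_ms(text: str) -> int:
    """Parse a duration such as '1h30m' or '-5h' into milliseconds."""
    match = _DURATION_RE.fullmatch(text)
    # Every unit is optional in the regex, so reject inputs with no unit at all
    # (for example "" or a bare "-").
    if match is None or all(match.group(i) is None for i in (2, 3, 4)):
        raise ValueError(f"invalid duration: {text!r}")
    sign = -1 if match.group(1) else 1
    hours, minutes, seconds = (int(g) if g else 0 for g in match.groups()[1:])
    return sign * ((hours * 60 + minutes) * 60 + seconds) * 1000
```

Using `fullmatch` rather than a trailing `$` means the sign and every unit must account for the whole string, so a partial match on the tail of `-5h` can no longer slip through.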

trevorbonas · 2024-12-20T22:46:18Z

correctness/test_correctness.py

+    assert(write_response == 200)
+
+    # ensures data is ingested into Timestream
+    time.sleep(1)


For this and the other sleeps, can network latency have any impact on whether data is finished ingesting into Timestream?

Technically yes, but I think a one-second sleep is reasonable given what we can expect from their docs ("write-to-read latency is in the sub-second range."). Cross-region requests might add latency, though, so perhaps we can make the sleep time configurable?

Making it configurable is a good idea. I think using a named constant, like SLEEP_TIME, would be good enough.
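
A minimal sketch of that suggestion, assuming the value may optionally be overridden through an environment variable. The variable name `TEST_SLEEP_TIME` and the helpers `read_sleep_time` and `wait_for_ingestion` are hypothetical, not names from this PR.

```python
import os
import time

DEFAULT_SLEEP_TIME = 1.0  # seconds; matches the time.sleep(1) in the diff


def read_sleep_time(env=os.environ) -> float:
    """Return the sleep time in seconds, allowing an optional override."""
    # TEST_SLEEP_TIME is an assumed variable name, used here for illustration.
    value = env.get("TEST_SLEEP_TIME")
    return float(value) if value else DEFAULT_SLEEP_TIME


# Named constant the tests can share instead of repeating a literal.
SLEEP_TIME = read_sleep_time()


def wait_for_ingestion(sleep_time: float = SLEEP_TIME) -> None:
    """Pause so that written data has time to become readable."""
    time.sleep(sleep_time)
```

Each `time.sleep(1)` in the tests would then become `wait_for_ingestion()`, and a cross-region run could raise the value without touching the test code.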

trevorbonas

Looks good to me once negative durations are supported and the test sleep time can be set with a named constant.

forestmvey · 2024-12-24T16:48:46Z

correctness/mockmetheus.py

@@ -0,0 +1,289 @@
+import os


Add License header for new files

forestmvey · 2024-12-24T16:59:57Z

correctness/README.md

+1. Bring up containers with `docker compose up -d`
+2. Run tests with `docker exec -it mockmetheus pytest`
+
+### Notes


Add a cleanup section.

forestmvey · 2024-12-24T17:00:08Z

correctness/README.md

-1. Run the following command to execute the correctness tests:
-`go test -v ./correctness`
+1. Bring up containers with `docker compose up -d`
+2. Run tests with `docker exec -it mockmetheus pytest`


Can we run individual tests?

yep! pytest test_correctness.py::test_write_no_data

forestmvey · 2024-12-24T17:04:32Z

correctness/docker-compose.yml

+    working_dir: /app
+    command: ["sleep", "infinity"]
+    environment:
+      AWS_ACCESS_KEY_ID: "XXXXXXXXX"


These values should be read from an env_file that is added to .gitignore. Storing credentials in tracked files is an easy way to commit secrets.
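
With an env_file, the values still reach the container as ordinary environment variables, so the test setup can check for them instead of relying on hardcoded values. A minimal sketch of such a fail-fast check follows; the helper name `missing_credentials` is hypothetical, and the variable list assumes the standard AWS access key pair.

```python
import os

# Standard AWS credential variables; extend if the setup needs more.
REQUIRED_VARS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")


def missing_credentials(env=os.environ) -> list:
    """Return the names of required variables that are unset or empty."""
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

A session-level fixture could call this once and abort with a clear message when the list is non-empty, which is easier to diagnose than a failed request later in the run.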

forestmvey · 2024-12-24T18:55:42Z

correctness/README.md

-2. Execute the following command to save the docker image as a compressed file and update the `version` appropriately:
-`docker save timestream-prometheus-connector-docker | gzip > timestream-prometheus-connector-docker-image-<version>.tar.gz`
+1. Ensure your docker version is >= `20.10.0`, or you have `docker-compose` installed
+2. Update `docker-compose.yml` in this directory with your AWS credentials

 ## How to execute tests


Can we add a section on how to run the tests without Docker?

forestmvey · 2024-12-24T19:32:21Z

correctness/mockmetheus.py

+It uses the same protobuf definitions as Prometheus to construct snappy-encoded
+payloads for both remote-read and remote-write requests and responses.
+"""
+class Mockmetheus:


What is the motivation for using Python rather than Go? We are doing a lot of regex and custom parsing here for Prometheus requests rather than using Prometheus libraries.

For example we can use the github.com/prometheus/prometheus/promql/parser library for parsing Prometheus query strings. Using Prometheus libraries would also have the added benefit of being able to upgrade the dependencies for functional changes / bugs, and not mixing Go and Python without warrant.

> We are doing a lot of regex and custom parsing here for Prometheus requests rather than using Prometheus libraries.

The primary goal of Mockmetheus is to control the remote read/write operations that a Prometheus client would typically perform. While our regex may not exactly match what Prometheus uses, we can ensure its accuracy through our test setups and the controlled environment we maintain. Our current regex and parsing approach is straightforward and should suffice unless there are specific edge cases or additional pre/post-processing steps that the Prometheus parser handles. If you think there are test cases we might be missing (cases that their parser can handle), please let me know and I will incorporate those tests to verify compatibility with our existing setup.

> not mixing Go and Python without warrant.

What's wrong with having Go and Python in one repo? The separation is clear: our connector simply exposes two REST API endpoints, and how we test against these endpoints should not matter as long as we can also verify results through those same endpoints. In other words, the choice of language for correctness testing does not impact the functionality of the connector itself. As long as we can effectively test and verify the endpoints, the underlying language used for tests should be flexible. However, if consistency is a priority or if there are specific reasons to standardize on Go, I'm open to rewriting the tests in Go.

Although we can use any language for this implementation, we need some reason for language interop. If there is no benefit or clear purpose here for using Python, I would prefer we implement in Go. There is no need to write the parsing logic when we have the native parsing libraries available. Here is a small Go program which can parse a Prometheus query string, rather than introducing the required logic ourselves:

    package main

    import (
        "fmt"
        "log"

        "github.com/prometheus/prometheus/promql/parser"
    )

    func main() {
        query := `scrape_duration_seconds{}[5d]`
        node, err := parser.ParseExpr(query)
        if err != nil {
            log.Fatalf("Error parsing query: %v", err)
        }
        PrintQuery(node)
    }

    func PrintQuery(node parser.Node) {
        switch n := node.(type) {
        case *parser.MatrixSelector:
            fmt.Println("Metric name:", n.VectorSelector)
            fmt.Println("Range:", n.Range)
        default:
            fmt.Printf("Unknown query node type: %T\n", n)
        }
    }

forestmvey

I would like to resolve the best approach for using Go vs. Python.

correctness testing 2.0

16fc917

trevorbonas reviewed Dec 20, 2024

trevorbonas approved these changes Dec 23, 2024

forestmvey reviewed Dec 24, 2024

forestmvey requested changes Dec 24, 2024

fredjoonpark closed this Jan 2, 2025
