GitHub - ts-azure-services/stress-testing-oai-endpoints: A repo to test scaling OAI endpoints

This repo contains workflows and scripts to stress test Azure OpenAI endpoints, both individually and through an APIM (Azure API Management) instance. Most of the workflow steps are captured in the Makefile and should be followed sequentially. The tests use locust as a framework to simulate concurrent users, and also use batch approaches to hit the endpoints with load. All of this was tested on a Mac, though it should work equally well on any Unix-based machine. The scripts also rely on several command-line tools such as parallel, jq, grep and time. For reference, most of these tools (except the time utility) are available natively in the Azure Cloud Shell, which provides an easy deployment option.
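As a rough illustration of the batch approach described above, here is a minimal Python sketch that fires a set of requests concurrently and tallies the status codes and wall time. It is not taken from this repo's scripts: `run_batch`, `fake_send` and the default worker count are hypothetical, and `fake_send` stands in for a real call to an endpoint.

```python
import os
import time
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def run_batch(send, prompts, max_workers=None):
    """Send every prompt concurrently and summarize status codes and wall time.

    `send` is a hypothetical callable that takes one prompt and returns an
    HTTP status code. Swap in a real client call for the endpoint under test.
    """
    # Assumption: default to the CPU count, falling back to 4 if unknown.
    workers = max_workers or os.cpu_count() or 4
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        statuses = list(pool.map(send, prompts))
    elapsed = time.perf_counter() - start
    return {
        "requests": len(statuses),
        "statuses": Counter(statuses),
        "seconds": elapsed,
    }


def fake_send(prompt):
    """Stand-in for a real request: pretends long prompts get throttled."""
    time.sleep(0.01)
    return 429 if len(prompt) > 20 else 200


if __name__ == "__main__":
    prompts = ["hello", "a much longer prompt than the rest"] * 5
    print(run_batch(fake_send, prompts))
```

Counting 429 responses separately from 200s gives a quick view of how much of the load was throttled rather than served.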

Random Notes

- For the Azure OpenAI endpoints, this repo uses the Global Standard deployment of gpt-4o-mini. This can be customized as needed.
- Setup of Application Insights or Log Analytics is not included in the workflow.
- Most of the "batch" workflows have no logic to implement a backoff period. Limits should be understood in light of the capacity of the endpoint and/or the logic in place (e.g. with APIM) to handle throttling.
- On a Mac, consider using btop (available through Homebrew) to monitor CPU and memory usage. To determine the number of CPUs, run: sysctl -n hw.ncpu.
- Future build:
  - Inclusion of text embedding endpoints to test Azure Search workflows.
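Since the batch workflows carry no backoff logic, the following Python sketch shows one way a retry with exponential backoff and jitter could be layered on top. It is an assumption for illustration only, not part of this repo: `call_with_backoff`, its parameters and the zero-argument `send` callable are all hypothetical names.

```python
import random
import time


def call_with_backoff(send, max_retries=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call `send()` until it stops returning 429 or retries run out.

    `send` is a hypothetical zero-argument callable returning an HTTP status
    code. Returns a tuple of (final_status, attempts_made).
    """
    for attempt in range(max_retries + 1):
        status = send()
        if status != 429 or attempt == max_retries:
            return status, attempt + 1
        # Exponential delay: base, 2*base, 4*base, ... capped at max_delay.
        delay = min(max_delay, base_delay * (2 ** attempt))
        # Full jitter spreads retries out so clients do not retry in lockstep.
        sleep(random.uniform(0, delay))


if __name__ == "__main__":
    # Simulated endpoint: throttles twice, then succeeds.
    responses = iter([429, 429, 200])
    print(call_with_backoff(lambda: next(responses), base_delay=0.01))
```

A real client might prefer a server-provided retry hint (such as a Retry-After header) over a computed delay when one is present; the sketch ignores that for brevity.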

References

The APIM tooling leverages this great repo for setup and for the custom APIM policy.

Repository contents: locust-tests, multi-input, setup, static-input, with-apim, .gitignore, LICENSE, Makefile, README.md, requirements.txt
