Expose additional beaker caching backends #15349

claudiofr · 2023-01-21T19:15:35Z

Convert beaker mulled resolution, citations and biotools service caches to use the database rather than the file system.

Introduce new config parameters for the beaker cache database url, schema name and cache table name.

Introduce logic to default values of a parameter to values of another parameter. i.e. default cache database url to database_connection. Metadata for this was placed in code. Should probably be placed eventually in config file which would require a change to the config_schema.yml.
Introduced a new unit tests, test_mulled_resolution_cache_db.py, test_citations_db.py
Did not introduce a new test for the biotools_service beaker cache because I could not find a place in the code where it was used.

How to test the changes?

(Select all options that apply)

I've included appropriate automated tests.
This is a refactoring of components with existing test coverage.
Instructions for manual testing are as follows:

License

I agree to license these and all my past contributions to the core galaxy codebase under the MIT license.

claudiofr · 2023-01-21T20:32:29Z

I just looked at why my new test, test_resolution_cache_db.py failed in my fork's workflow. Apparently, the latest Beaker package 1.12.0 was installed there, whereas I did my local development with 1.11.0. It works fine in 1.11.0 but when I upgraded to 1.12.0 it fails. I need to work out why this is so.

claudiofr · 2023-01-22T19:35:35Z

I created a simple test case demonstrating the issue in Beaker 1.12.0 and working in 1.11.0 and opened an issue with it on the Beaker github page. In the meantime, can we revert the main code base back to Beaker 1.11.0 because who knows how long it will take them to fix this.

mvdbeek · 2023-01-31T20:24:58Z

It looks like the database connection isn't released on app shutdown, is there a hook in beaker to do that ? I think that's why the integration tests are eventually failing.

claudiofr · 2023-02-01T16:52:38Z

There is no explicit hook. I looked through the Beaker source code and it looks like they are releasing connections because they are using connections inside a with block. Also, there seems to be one sqlalchemy engine per process as there is a class level dictionary that mains a separate engine for each unique db url/cache table name combination. I'm guessing this is an artefact of how the integration tests are implemented. Each process would have 3 engines in a class level dictionary for each of the 3 caches in galaxy, mulled_resolution, citations, and biotools_service. Each engine by default will have a connection pool of 5 connections. So that could be up to 15 open connections per process. How exactly are the integration tests implemented? Are there multiple threads of execution within each process each running concurrent tests? If so, this could result in all 5 connections in each pool being opened at the same time.This would mean up to 15 connections per process. Are there multiple concurrent processes running tests? If so there could be up to 15 open connections per process. From the error I'm assuming the integration tests are running in postgress rather than sqlite.What are the max number of connections set at for postgres? I could try setting the connection pool size to a very small number just for the integration tests to limit the number of connections.Alternatively, we could up the max allowed postgress connections for the integration tests.This would require a new config parameter for the beaker caches. On Tuesday, January 31, 2023 at 03:25:10 PM EST, Marius van den Beek ***@***.***> wrote: It looks like the database connection isn't released on app shutdown, is there a hook in beaker to do that ? I think that's why the integration tests are eventually failing. — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

claudiofr · 2023-02-01T20:05:14Z

A simple thing to try is to use the same table for all 3 caches. This can be specified in the galaxy.yml.This will cut down on the max number of connections by 2/3's. What galaxy.yml is used by the integration tests? On Wednesday, February 1, 2023 at 11:41:21 AM EST, Claudio Fratarcangeli ***@***.***> wrote: There is no explicit hook. I looked through the Beaker source code and it looks like they are releasing connections because they are using connections inside a with block. Also, there seems to be one sqlalchemy engine per process as there is a class level dictionary that mains a separate engine for each unique db url/cache table name combination. I'm guessing this is an artefact of how the integration tests are implemented. Each process would have 3 engines in a class level dictionary for each of the 3 caches in galaxy, mulled_resolution, citations, and biotools_service. Each engine by default will have a connection pool of 5 connections. So that could be up to 15 open connections per process. How exactly are the integration tests implemented? Are there multiple threads of execution within each process each running concurrent tests? If so, this could result in all 5 connections in each pool being opened at the same time.This would mean up to 15 connections per process. Are there multiple concurrent processes running tests? If so there could be up to 15 open connections per process. From the error I'm assuming the integration tests are running in postgress rather than sqlite.What are the max number of connections set at for postgres? I could try setting the connection pool size to a very small number just for the integration tests to limit the number of connections.Alternatively, we could up the max allowed postgress connections for the integration tests.This would require a new config parameter for the beaker caches. On Tuesday, January 31, 2023 at 03:25:10 PM EST, Marius van den Beek ***@***.***> wrote: It looks like the database connection isn't released on app shutdown, is there a hook in beaker to do that ? I think that's why the integration tests are eventually failing. — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

mvdbeek · 2023-02-02T17:44:52Z

I don't think that will work, the integration tests call the app.shutdown hook that releases all resources in the process of shutting down the instance and starting a new one (

galaxy/lib/galaxy_test/driver/driver_util.py

Line 702 in c49abd3

self._app.shutdown()

). This all happens in the same process.

claudiofr · 2023-02-13T23:03:08Z

I changed the config_schema.yml setting the default cache table names for the 3 beaker caches, mulled resolution, citations, biotools, to the same value, beaker_cache rather than having separate tables. Because beaker creates a separate sqlalchemy engine and associated connection pool for each distinct table this will now create a single connection pool for all 3 caches rather than 3 connection pools. This in turn cuts down on the total number of potentially active connections by about 2/3's. As a consequence the integration tests are no longer failing.
Some of the other testing workflows are failing but as far as I can tell it has nothing to do with my changes.
Also support for database caches was broken in beaker v 1.12.0. I created an issue for it in the beaker repository and they fixed it in v 1.12.1. I have specified v 1.11.0 in the various requirements.txt files to play it safe.

mvdbeek · 2023-02-20T08:43:58Z

lib/galaxy/managers/citations.py

+            "cache.type": getattr(config, "citation_cache_type", "ext:database"),
            "cache.data_dir": getattr(config, "citation_cache_data_dir", None),
            "cache.lock_dir": getattr(config, "citation_cache_lock_dir", None),
+            "cache.url": getattr(config, "citation_cache_url", None),
+            "cache.table_name": getattr(config, "citation_cache_table_name", None),
+            "cache.schema_name": getattr(config, "citation_cache_schema_name", None),


I think this should still default to file, but I also think the getattr isn't necessary at all and predates a refactoring of the config instance. All config attributes are always set now, so

Suggested change

"cache.type": getattr(config, "citation_cache_type", "ext:database"),

"cache.data_dir": getattr(config, "citation_cache_data_dir", None),

"cache.lock_dir": getattr(config, "citation_cache_lock_dir", None),

"cache.url": getattr(config, "citation_cache_url", None),

"cache.table_name": getattr(config, "citation_cache_table_name", None),

"cache.schema_name": getattr(config, "citation_cache_schema_name", None),

"cache.type": config.citation_cache_type,

"cache.data_dir": config.citation_cache_data_dir,

"cache.lock_dir": config.citation_cache_lock_dir,

"cache.url": config.citation_cache_url,

"cache.table_name": config.citation_cache_table_name,

"cache.schema_name": config.citation_cache_schema_name,

should work too

mvdbeek · 2023-02-20T08:46:25Z

lib/galaxy/tools/biotools.py

+        "cache.type": getattr(config, "biotools_service_cache_type", "ext:database"),
        "cache.data_dir": getattr(config, "biotools_service_cache_data_dir", None),
        "cache.lock_dir": getattr(config, "biotools_service_cache_lock_dir", None),
+        "cache.url": getattr(config, "biotools_service_cache_url", config.database_connection),
+        "cache.table_name": getattr(config, "biotools_service_cache_table_name", None),
+        "cache.schema_name": getattr(config, "biotools_service_cache_schema_name", None),


Suggested change

"cache.type": getattr(config, "biotools_service_cache_type", "ext:database"),

"cache.data_dir": getattr(config, "biotools_service_cache_data_dir", None),

"cache.lock_dir": getattr(config, "biotools_service_cache_lock_dir", None),

"cache.url": getattr(config, "biotools_service_cache_url", config.database_connection),

"cache.table_name": getattr(config, "biotools_service_cache_table_name", None),

"cache.schema_name": getattr(config, "biotools_service_cache_schema_name", None),

"cache.type": config.biotools_service_cache_type,

"cache.data_dir": config.biotools_service_cache_data_dir,

"cache.lock_dir": config.biotools_service_cache_lock_dir,

"cache.url": config.biotools_service_cache_url,

"cache.table_name": config.biotools_service_cache_table_name,

"cache.schema_name": config.biotools_service_cache_schema_name,

mvdbeek · 2023-02-20T10:22:10Z

packages/tool_util/test-requirements.txt

+beaker==1.11.0 ; python_version >= "3.7" and python_version < "3.12"
+sqlalchemy==1.4.46 ; python_version >= "3.7" and python_version < "3.12"


The package requirements are ideally unpinned. It seems that 1.12.1 fixes your issue, so I think we should remove the pin here (ideally for sqlalchemy too, or if 2.0 fails, to pin it to sqlalchemy<=2)

As far as pinning package versions in requirements.txt is concerned this has always been a best practice in my experience. In fact, the main galaxy requirements.txt specifies specific versions or ranges of versions for all referenced packages. If you do not pin a referenced package to a specific version or a version range there is no way to guarantee that what gets deployed is the same thing that was tested. This is especially true for third party packages as we have seen with the bug introduced in Beaker v1.12.0. I would think that test-requirements.txt should be consistent with the main requirements.txt file.

There's typically a distinction being made between libraries and applications that are shipped to users/deployers. Galaxy as an application pins all dependencies hard (i.e to specific, tested versions, see https://github.com/galaxyproject/galaxy/blob/dev/lib/galaxy/dependencies/pinned-requirements.txt), while libraries should be pinned lightly, so that as the author of an application you have a good chance of being able to pick a set of compatible requirements.

And these packages here are published individually to pypi for re-use in other libraries or applications.

In addition to Marius' comment, the central list of compatible "light" pinnings for Galaxy Python dependencies is in pyproject.toml , and the various *requirements.txt files inside packages/ should mirror them, see my other comment.

mvdbeek · 2023-02-22T09:17:35Z

lib/galaxy/app.py

@@ -331,6 +331,8 @@ def _configure_toolbox(self):
            mulled_resolution_cache = CacheManager(**parse_cache_config_options(cache_opts)).get_cache(
                "mulled_resolution"
            )
+            # If using database cache clear cache table contents


It should be persistent across processes and restarts, what was the reason to change this ?

I guess I don't know the requirements for these caches. How long are entries expected to stay in the caches? I made this change based upon your comment in the meeting that you didn't want the default cache type to be database because you were concerned that data would accumulate in the cache table and not be cleaned out. I also assumed that in a production environment app restarts would be very infrequent. Is that not the case? So I figured this would be a good place to clean out the table to address your concern. I presume data accumulation would also be a problem with a file based cache. So why is there a concern specifically with the database cache?

Restarts can be frequent, for config updates, or because of resources limits, and they can be simultaneous or staggered, so I don't think this is safe.

It's not specific to the database cache, but it's safe to assume people know how to delete a file, while dropping the appropriate table requires a bit more knowledge. The lifetime is typically unlimited, but we may have to drop data occasionally, that for instance was necessary when admins upgraded python versions with incompatible pickle protocols.

That's a long answer to say that I would default to a file-based cache for simplicity reasons as you're doing now, and database is a good option for production deployments with kubernetes, where you can run out of (very low) file lock limits. If the cache needs to be cleared admins have to do this manually, and that's easier with files.

claudiofr · 2023-02-22T14:44:19Z

I had originally mentioned that the beaker docs say that the lock_dir is always required implying that using a database cache would not fix the galaxyproject/galaxy-helm#399 issue in which there were problems with the file system lock directory.
However, I looked at the beaker source code and it appears that when you use a database cache it does not use the file system for locking. Instead it relies on database locking. So it turns out that using a database cache should address issue 399 also.

mvdbeek · 2023-02-22T15:29:22Z

That's great news, thanks for checking!

packages/tool_util/test-requirements.txt

Rather than store cache data in the file system store cache data in the database. Partially addresses issue 15216. For now this was only changed for the mulled resolution cache. Introduced new config parameters for database url, cache table name. Introduced logic to default values of a parameter to values of another parameter. i.e. default cache database url to database_connection. Metadata for this was placed in code. Should probably be placed eventually in config file which would require a change to the config_schema.yml. Introduced a new unit test, test_mulled_resolution_cache_db.py.

ext:database. Rather than store cache data in the file system store cache data in the database. These are the last 2 beaker caches that need to be converted to the database as part of issue 15216. Introduce new config parameters for database url, cache table name. Introduce logic to default values of a parameter to values of another parameter. i.e. default cache database url to database_connection. Metadata for this was placed in code. Should probably be placed eventually in config file which would require a change to the config_schema.yml. Introduced a new unit test, test_citations_db.py Did not introduce a new test for the biotools_service beaker cache because I could not find a place in the code where it was used.

based beaker cache. Change test-requirements.txt to require v beaker 1.11.0 because latest version of beaker apparently introduced a bug that broke the ability to use database as a cache. Also fixed lint errors.

…eaker cache. mypy kept complaining that module 'galaxy' has no attribute 'config' on an 'from galaxy import config' statement. So I changed it to 'import galaxy.config'

…eaker cache.

…ased beaker cache.

…beaker cache.

All 3 beaker caches, mulled_resolution, citations, biotools service, now use the same default cache table, beaker_cache. This reduces the number of open db connections because beaker creates a separate sqlalchemy engine and associated connection pool for each distinct table and each connection pool opens up a certain number of connections. This was causing the integration tests to fail because the max postgress connections(100) were being exceeded.

Clear the caches in the event the cache type is database. This will delete any rows in the cache table that could be left over from a prior run of the app preventing an accumulation of stale data.

…shed yaml file

The test/unit/app directory tree has a test-requirements.txt file that brings in dependencies required by database utility modules imported by the unit test.

Co-authored-by: Nicola Soranzo <[email protected]>

mvdbeek · 2023-06-21T21:39:09Z

Thanks again @claudiofr, this is great work!

nuwang · 2024-02-14T05:36:08Z

Thanks for this fix @claudiofr. Does this mean we can remove this setting now? https://github.com/galaxyproject/galaxy-helm/pull/402/files#diff-86e6e8118f9d5ad6d181dd2e12c268062e9a66f5ef98bd0cc44b93661d08e9b2R412

claudiofr · 2024-02-14T14:26:34Z

You should be able to remove the setting if you set the mulled_resolution_cache_type parameter to ext:database because in this case it uses the database rather than the file system for caching or locking. See the other settings below that apply when you set the cache type to ext:database. Also keep in mind if you use ext:database that a new table with a default name of "beaker_cache" will be created and populated and it should probably be monitored and purged of old data periodically. # Mulled resolution caching. Mulled resolution uses external APIs of # quay.io, these requests are caching using this and the following # parameters #mulled_resolution_cache_type: file # Data directory used by beaker for caching mulled resolution # requests. # The value of this option will be resolved with respect to # <cache_dir>. #mulled_resolution_cache_data_dir: mulled/data # Lock directory used by beaker for caching mulled resolution # requests. # The value of this option will be resolved with respect to # <cache_dir>. #mulled_resolution_cache_lock_dir: mulled/locks # Seconds until the beaker cache is considered old and a new value is # created. #mulled_resolution_cache_expire: 3600 # When mulled_resolution_cache_type = ext:database, this is the url of # the database used by beaker for caching mulled resolution requests. # The application config code will set it to the value of # database_connection if this is not set. #mulled_resolution_cache_url: null # When mulled_resolution_cache_type = ext:database, this is the # database table name used by beaker for caching mulled resolution # requests. #mulled_resolution_cache_table_name: beaker_cache # When mulled_resolution_cache_type = ext:database, this is the # database schema name of the table used by beaker for caching mulled # resolution requests. #mulled_resolution_cache_schema_name: null

nuwang · 2024-02-14T16:15:59Z

Thanks for the clarification. Will discuss this with the Systems SIG and see how we can handle chart defaults going forward.

github-actions bot added area/admin area/documentation area/testing area/tool-framework labels Jan 21, 2023

github-actions bot added this to the 23.0 milestone Jan 21, 2023

dannon modified the milestones: 23.0, 23.1 Jan 22, 2023

martenson added the kind/feature label Feb 14, 2023

mvdbeek reviewed Feb 20, 2023

View reviewed changes

mvdbeek reviewed Feb 22, 2023

View reviewed changes

nsoranzo reviewed Feb 22, 2023

View reviewed changes

packages/tool_util/test-requirements.txt Outdated Show resolved Hide resolved

mvdbeek approved these changes Feb 28, 2023

View reviewed changes

nsoranzo reviewed Feb 28, 2023

View reviewed changes

packages/tool_util/test-requirements.txt Outdated Show resolved Hide resolved

claudiofr added 8 commits June 21, 2023 22:19

Fix testing errors after uploading changes to support a database

add56aa

based beaker cache. Change test-requirements.txt to require v beaker 1.11.0 because latest version of beaker apparently introduced a bug that broke the ability to use database as a cache. Also fixed lint errors.

Fix mypy errors after uploading changes to support a database based b…

9237953

…eaker cache. mypy kept complaining that module 'galaxy' has no attribute 'config' on an 'from galaxy import config' statement. So I changed it to 'import galaxy.config'

Fix lint errors after uploading changes to support a database based b…

1fa1b05

…eaker cache.

Fix formatting errors after uploading changes to support a database b…

2d2ef30

…ased beaker cache.

Fix isort errors after uploading changes to support a database based …

57e9502

…beaker cache.

claudiofr and others added 7 commits June 21, 2023 22:45

Change default beaker cache type to file for the 3 beaker caches.

2d3547a

Clear the beaker caches after creating them on app startup

fe63ba9

Clear the caches in the event the cache type is database. This will delete any rows in the cache table that could be left over from a prior run of the app preventing an accumulation of stale data.

Add new config parameters for citation database beaker cache in tool_…

17f22c2

…shed yaml file

Move test_resolution_cache_db.py to test/unit/app/tools directory

ae97462

The test/unit/app directory tree has a test-requirements.txt file that brings in dependencies required by database utility modules imported by the unit test.

Drop beaker pin

3d186cb

Co-authored-by: Nicola Soranzo <[email protected]>

Update galaxy config schema with newer version of options

f096dbf

Rebuild schema docs

3606c34

mvdbeek approved these changes Jun 21, 2023

View reviewed changes

mvdbeek changed the title ~~Db beaker~~ Expose additional beaker caching backends Jun 21, 2023

mvdbeek merged commit ffa8e76 into galaxyproject:dev Jun 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose additional beaker caching backends #15349

Expose additional beaker caching backends #15349

claudiofr commented Jan 21, 2023

claudiofr commented Jan 21, 2023

claudiofr commented Jan 22, 2023

mvdbeek commented Jan 31, 2023

claudiofr commented Feb 1, 2023 via email

claudiofr commented Feb 1, 2023 via email

mvdbeek commented Feb 2, 2023

claudiofr commented Feb 13, 2023

mvdbeek Feb 20, 2023 •

edited

Loading

mvdbeek Feb 20, 2023

mvdbeek Feb 20, 2023

claudiofr Feb 22, 2023

mvdbeek Feb 22, 2023 •

edited

Loading

nsoranzo Feb 22, 2023

mvdbeek Feb 22, 2023

claudiofr Feb 22, 2023

mvdbeek Feb 22, 2023

mvdbeek Feb 22, 2023

claudiofr commented Feb 22, 2023

mvdbeek commented Feb 22, 2023

mvdbeek commented Jun 21, 2023

nuwang commented Feb 14, 2024

claudiofr commented Feb 14, 2024 via email •

edited

Loading

nuwang commented Feb 14, 2024

		beaker==1.11.0 ; python_version >= "3.7" and python_version < "3.12"
		sqlalchemy==1.4.46 ; python_version >= "3.7" and python_version < "3.12"

Expose additional beaker caching backends #15349

Expose additional beaker caching backends #15349

Conversation

claudiofr commented Jan 21, 2023

How to test the changes?

License

claudiofr commented Jan 21, 2023

claudiofr commented Jan 22, 2023

mvdbeek commented Jan 31, 2023

claudiofr commented Feb 1, 2023 via email

claudiofr commented Feb 1, 2023 via email

mvdbeek commented Feb 2, 2023

claudiofr commented Feb 13, 2023

mvdbeek Feb 20, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mvdbeek Feb 22, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

claudiofr commented Feb 22, 2023

mvdbeek commented Feb 22, 2023

mvdbeek commented Jun 21, 2023

nuwang commented Feb 14, 2024

claudiofr commented Feb 14, 2024 via email • edited Loading

nuwang commented Feb 14, 2024

mvdbeek Feb 20, 2023 •

edited

Loading

mvdbeek Feb 22, 2023 •

edited

Loading

claudiofr commented Feb 14, 2024 via email •

edited

Loading