-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Testing of large files takes a long time #74
Comments
@ajanett Can you explain why the test needs so long? What are your suggestions? |
See issue #61. The ETS code is now visiting each and every feature instance and examining the geometry BLOB of each one. Yes, this can take a long time, even in a not so large GeoPackage. I don't like the time it is taking but I've looked at the implementation options and it currently is as good as it'll get given the ETS framework. All of the tests that are doing this into one code block in order to reduce the processing time as much as possible. However, the fact remains that each geometry BLOB is looked at now and this will take a LONG time. I've suggested allowing an option on ETS for geopackage to provide for a preliminary test looking only at a certain subset of each feature - say only testing the first N rows ... because in 90% of the time, if there is a problem it'll be in all of the feature instances... therefore do we really need to examine each feature instance all the time? Another option might be to provide a secondary ETS geopackage test that focuses only on the geometry tests since they do take so long. |
There are 2 tests we could remove (or comment out) that are likely taking the most time: 78 (which the requirement for 78 was removed), and 67 which is testing an extension requirement. This will only help a little bit, but since these two involve additional SQL statements, I expect removal of these would provide the maximum relief. It sure would be nice if TEAM Engine at least gave us some feedback that the tests are actually being run. The only other options are those I've listed previously:
|
I am in favour of creating an option (e.g. extended geometry validation) which activates full validation of geometries. |
@dstenger Please specify how you want this option added to the system. I do not know enough about the Team Engine to know what you need here. Once it is specified, I will attempt to implement it. |
I suggest to: |
A quick fix solves the problem temporarily: ab9eb29 However, a clean solution should be implemented as soon as possible. |
I believe the fix needs to change the initial SQL statement in featureGeometryEncodingTesting(), adding the LIMIT clause when we want to restrict the test to a certain number of records, because then we wouldn't be fetching an excessive number of records. |
This is currently assigned to me, but I am not aware of any action expected of me. Please revise and/or re-assign. |
As far as I see, ajanett#1 is not part of the master branch of this repository. We should check whether we want to migrate the code. |
This issue occurs with the current status of master branch.
Testing of large files takes a very long time.
E.g. with version 0.6, a GeoPackage file of size 700 MB can be tested in around 3 Minutes.
With current version of master branch, testing a GeoPackage file of size 700 MB takes more than 1 hour (I cancelled the test run after 1 hour).
What is the reason for this behaviour?
This makes testing of large files very difficult.
The text was updated successfully, but these errors were encountered: