Type-Api support and validation speedup #218

Pfeil · 2024-08-28T11:57:54Z

I replaced the guava cache with ~~an async~~ parallel (no real async in java) cache with higher performance. ~~Some tests do not succeed yet, because the details field is missing in some exception bodies. Not sure why it happens, but it is probably simple to fix and enough to do some speedup experiments.~~

This PR will last until we have a stable and significant performance gain (at least down to 25% or something) and have integrated all low-hanging fruits.

This requires some refactorings in very old parts of the Typed PID Maker, where I want to get rid of a lot of code.

coveralls · 2024-08-29T13:35:47Z

Pull Request Test Coverage Report for Build #421

Details

101 of 136 (74.26%) changed or added relevant lines in 4 files are covered.
3 unchanged lines in 2 files lost coverage.
Overall coverage decreased (-0.5%) to 71.805%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/main/java/edu/kit/datamanager/pit/domain/Operations.java	12	18	66.67%
src/main/java/edu/kit/datamanager/pit/pitservice/impl/TypingService.java	0	7	0.0%
src/main/java/edu/kit/datamanager/pit/typeregistry/impl/TypeRegistry.java	72	80	90.0%
src/main/java/edu/kit/datamanager/pit/pitservice/impl/EmbeddedStrictValidatorStrategy.java	17	31	54.84%

Files with Coverage Reduction	New Missed Lines	%
src/main/java/edu/kit/datamanager/pit/pitservice/impl/EmbeddedStrictValidatorStrategy.java	1	57.58%
src/main/java/edu/kit/datamanager/pit/pitservice/impl/TypingService.java	2	31.17%

Totals
Change from base Build #420:	-0.5%
Covered Lines:	871
Relevant Lines:	1213

💛 - Coveralls

# Conflicts: # build.gradle

…mas and structure

- support for records without profiles - support for records with multiple profiles - support for multiple profile attribute keys/types - support for additional attributes - in general, attribute validation and profile validation are now separate tasks

…em for validation

Pfeil · 2025-01-03T19:05:18Z

…experiments # Conflicts: # build.gradle # src/main/java/edu/kit/datamanager/pit/web/impl/TypingRESTResourceImpl.java

…current state of the APIs

…validation error to the user, not as a PidNotFoundError.

…l attributes, now default to true.

…cal pid system

Pfeil · 2025-01-08T17:05:57Z

Note: Something is wrong with the CI.

On all my local machines, the tests run, fail, and end. The CI seems to run in some infinite loop on PID creation, which is not reproducible.
Local tests and the CI use both Java 21
Because the tests do not end, I cannot get any error reports
It always fails in this test (last lines in output):

e.k.datamanager.pit.web.CustomPidsTest   : Started CustomPidsTest in 3.542 seconds (process running for 63.61)
CustomPidsTest > testCrateCustomPidWhenFeatureDisabled() STANDARD_OUT
    2025-01-08T17:16:36.800Z  INFO 4724 --- [    Test worker] e.k.d.p.web.impl.TypingRESTResourceImpl  : Creating PID

The test happens using the in-memory pid system. So it should not be an issue with external pid systems, except for validation. Validation needs external services.
The test uses a custom PID, but disables the feature. So we will try to find a PID which does not exist yet forever, if we would for some reason not find any.
It seems to loop/wait somewhere in the validation process.

Ideas to fix this:

Delete recently used GitHub Caches -> didnt work out
AFAIK there are limitations to the CI in PRs, maybe the CI of the main branch is being executed and does not work any more for some reason? I remember there were some security reasons. Maybe we need to update the main CI definitions?
make sure we run the CI with log level traces next time and see how far we get. We may need to add further traces then and loop...

Pfeil · 2025-01-10T17:47:41Z

I changed the async executor to be single threaded, and now it "hangs" here:

TypeApiTest > queryAttributeInfoOfSimpleType() STANDARD_OUT
    2025-01-10T17:41:57.609Z TRACE 2035 --- [    Test worker] e.k.d.pit.typeregistry.impl.TypeApi      : Querying attribute info for 21.T11148/b8457812905b83046284
    2025-01-10T17:41:57.611Z TRACE 2035 --- [pool-2-thread-1] e.k.d.pit.typeregistry.impl.TypeApi      : Loading attribute 21.T11148/b8457812905b83046284 to cache.
    2025-01-10T17:41:57.613Z TRACE 2035 --- [    Test worker] e.k.d.pit.typeregistry.impl.TypeApi      : Finished querying attribute info for 21.T11148/b8457812905b83046284

Which gives exactly no clue. But I seem to be able to reproduce the issue now locally: All tests I checked run infinitely. I think this is because the way "async" works in java it is not solvable using a single thread (blocking tasks). And this is again because I spawn new tasks from existing ones, wait for a task to finish here and there, etc.

Is this the same issue in the CI, though? If so, I guess it is the fault of the java implementation?

Turns out: No. The single thread executor is seemingly not able to interrupt tasks, which means you can quickly get into deadlocks depending on the complexity of your task. A task that spawns more tasks and need them befor finishing, running on the same executor, won't work. But this is what I currently do. This is why other issues appeared on both sides. But it had not directly something to do with the CI issue.

The solution was to move in the CI from OpenJDK "zulu" to "temurin". Temurin is also what we use in our docker container. The virtual thread executor of zulu seems to behave differently than the openJDK that homebrew provides. In any case, I am planning some additional changes after cleaning up my WIP mess:

use different executors for each cache
double check that there is no future that will implicitly be created with some default executor. Not sure if such a thing is possible, but I want to check if I explicitly defined the executor for each async task that I create. I believe that in my case, all futures come from the async caches, but I'll need to check.

Pfeil force-pushed the validation-speedup-experiments branch 7 times, most recently from 6bb28b6 to aad4408 Compare August 28, 2024 23:20

speedup: use fast, async cache

efb4bf5

Pfeil force-pushed the validation-speedup-experiments branch from aad4408 to efb4bf5 Compare August 29, 2024 12:53

speedup: use default work stealing executor for "async" cache

160bfe0

Pfeil force-pushed the validation-speedup-experiments branch 3 times, most recently from a38cdee to b5fba64 Compare August 29, 2024 15:02

speedup: use extra executors for validation and deserialization

f976bdd

Pfeil force-pushed the validation-speedup-experiments branch from b5fba64 to f976bdd Compare August 29, 2024 15:30

Pfeil mentioned this pull request Aug 30, 2024

Improvements on non-atomic value validation #179

Open

Pfeil added the maintenance Not a bug, but should be done. label Oct 11, 2024

Merge branch 'master' into validation-speedup-experiments

0d8ce40

# Conflicts: # build.gradle

This comment was marked as resolved.

Sign in to view

Pfeil self-assigned this Nov 8, 2024

chore: rename TypeRegistry to DtrTest, as it depends on dtr-test sche…

c7e6cbb

…mas and structure

Pfeil force-pushed the validation-speedup-experiments branch from 6e64ea4 to d4317c8 Compare November 16, 2024 00:49

Pfeil changed the title ~~Validation speedup experiments~~ Type-Api support and validation speedup Nov 16, 2024

Pfeil force-pushed the validation-speedup-experiments branch 3 times, most recently from 4b57404 to 6b07c6f Compare November 19, 2024 19:53

Pfeil added 3 commits November 20, 2024 18:48

feat: add main code base for type-api support

439e8b0

feat: more flexible validation

5ee3e4a

- support for records without profiles - support for records with multiple profiles - support for multiple profile attribute keys/types - support for additional attributes - in general, attribute validation and profile validation are now separate tasks

feat: use virtual threads for async execution

dabaa4d

This comment was marked as outdated.

Sign in to view

feat: read additionalAttributes allowed from profiles and consider th…

0695cc6

…em for validation

Pfeil force-pushed the validation-speedup-experiments branch from 530b249 to d0c231a Compare January 3, 2025 18:20

Pfeil added 3 commits January 7, 2025 10:34

fix: throw error on PID not found

8544909

fix: missing slash after type api endpoint

7938072

fix(tests): check assumptions in etag test setup

85db2a0

Pfeil force-pushed the validation-speedup-experiments branch from d0c231a to 85db2a0 Compare January 7, 2025 09:35

Pfeil added 11 commits January 7, 2025 10:56

Merge remote-tracking branch 'origin/master' into validation-speedup-…

19b030a

…experiments # Conflicts: # build.gradle # src/main/java/edu/kit/datamanager/pit/web/impl/TypingRESTResourceImpl.java

fix(tests): type api test for complex type now properly reflects the …

e7361f4

…current state of the APIs

test: some general profile attributes with type API

18a06ba

cleanup: remove unused domain classes

69b0d57

fix: apply json schema validation correctly

aaba94d

fix: a attribute which is not registered, shall being presented as a …

c4d1aa2

…validation error to the user, not as a PidNotFoundError.

cleanup: apply linter suggestions

24c5beb

fix(tests): profiles which do not specify their handling of additiona…

186e4a8

…l attributes, now default to true.

fix(tests): sanity check about the number of tests to execute with lo…

e61ba54

…cal pid system

cleanup: apply linter suggestions

4fd4543

docs: add note about why this test currently fails

04bff17

Pfeil force-pushed the validation-speedup-experiments branch 7 times, most recently from 0b8163a to 7349a0c Compare January 10, 2025 17:28

WIP: add tracing to debug the CI issue

865261f

Pfeil force-pushed the validation-speedup-experiments branch from 7349a0c to 865261f Compare January 10, 2025 18:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Type-Api support and validation speedup #218

Type-Api support and validation speedup #218

Pfeil commented Aug 28, 2024 •

edited

Loading

coveralls commented Aug 29, 2024 •

edited

Loading

This comment was marked as resolved.

This comment was marked as outdated.

Pfeil commented Jan 3, 2025 •

edited

Loading

Pfeil commented Jan 8, 2025 •

edited

Loading

Pfeil commented Jan 10, 2025 •

edited

Loading

Type-Api support and validation speedup #218

Are you sure you want to change the base?

Type-Api support and validation speedup #218

Conversation

Pfeil commented Aug 28, 2024 • edited Loading

coveralls commented Aug 29, 2024 • edited Loading

Pull Request Test Coverage Report for Build #421

Details

💛 - Coveralls

This comment was marked as resolved.

This comment was marked as outdated.

Pfeil commented Jan 3, 2025 • edited Loading

Pfeil commented Jan 8, 2025 • edited Loading

Pfeil commented Jan 10, 2025 • edited Loading

Pfeil commented Aug 28, 2024 •

edited

Loading

coveralls commented Aug 29, 2024 •

edited

Loading

Pfeil commented Jan 3, 2025 •

edited

Loading

Pfeil commented Jan 8, 2025 •

edited

Loading

Pfeil commented Jan 10, 2025 •

edited

Loading