Performance improvement on codec mapping #409

frdeboffles · 2021-05-21T22:24:59Z

Feature Request

While using this driver for fetching many rows from our table I noticed it was taking quite some time to build the Row objects. I hooked a profiler to the code and noticed it was spending quite some time for each row on the codec.canDecode method.
I wrote this little perf test to validate this finding:

  @Test
  public void perfTestDecode() {
    DefaultCodecs codecs = new DefaultCodecs(TEST);
    long t = System.currentTimeMillis();
    for (int i = 0; i < 100000; i++) {
      for (int c = 0; c < 20; c++) {
        assertThat(
            codecs.decode(
                TEST.buffer(4).writeInt(100), INT4.getObjectId(), FORMAT_BINARY, Integer.class))
            .isEqualTo(100);
        assertThat(
            codecs.decode(
                ByteBufUtils.encode(TEST, "100"), INT2.getObjectId(), FORMAT_TEXT, Short.class))
            .isEqualTo((short) 100);
        assertThat(
            codecs.decode(
                ByteBufUtils.encode(TEST, "test"),
                VARCHAR.getObjectId(),
                FORMAT_TEXT,
                String.class))
            .isEqualTo("test");
        assertThat(
            codecs.decode(
                ByteBufUtils.encode(TEST, "2018-11-04 15:35:00.847108"),
                TIMESTAMP.getObjectId(),
                FORMAT_TEXT,
                LocalDateTime.class))
            .isInstanceOf(LocalDateTime.class);
        assertThat(codecs.decode(ByteBufUtils.encode(TEST, "{100,200}"), INT2_ARRAY.getObjectId(), FORMAT_TEXT, Object.class)).isEqualTo(new Short[]{100, 200});
        assertThat(codecs.decode(ByteBufUtils.encode(TEST, "{100,200}"), INT4_ARRAY.getObjectId(), FORMAT_TEXT, Object.class)).isEqualTo(new Integer[]{100, 200});
      }
    }
    System.out.println("Run in " + (System.currentTimeMillis() - t) + "ms");
  }

Running this on my machine results in:

Run in 13231ms

NOTE That running the same example from 0.8.7.RELEASE results in
Run in 10389ms
So it seems like the decoding performance went down between this release and the current master (at commit 9c773c8)

Is your feature request related to a problem? Please describe

I would rather use this driver and webflux than going the web mvc route and jdbc.

Describe the solution you'd like

I think the class DefaultCodecs would benefits from some simple caching mechanism.
I did implement a simple caching (PR to come) and the results of the above test comes to:

Run in 7294ms

Describe alternatives you've considered

I haven't found alternatives

Teachability, Documentation, Adoption, Migration Strategy

NA

The text was updated successfully, but these errors were encountered:

…decoding performances. * Switched the codec list to a thread safe variant to avoid the synchronized blocks. Even though `CopyOnWriteArrayList` is not super performant it should work fine in this context where the list should not be frequently updated. * Switched `mockito-core` to `mockito-junit-jupiter` for Junit 5 support

mp911de · 2021-05-22T07:46:54Z

Thanks a lot for your suggestion. I had something like this on my mind since iterations aren't really efficient to determine a codec.

protyay · 2021-06-27T13:17:57Z

@mp911de Interested to work in this project. Are you accepting new contributors ?

mp911de · 2021-06-27T15:50:43Z

Sure. Feel free to pick an issue, submit a pull request or participate in issue discussions.

…decoding performances. * Switched the codec list to a thread safe variant to avoid the synchronized blocks. Even though `CopyOnWriteArrayList` is not super performant it should work fine in this context where the list should not be frequently updated. * Switched `mockito-core` to `mockito-junit-jupiter` for Junit 5 support

* Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis.

* Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not

* Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not * Enable unixDomainSocketTest IT only when running on Linux

* Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not * Disabled unixDomainSocketTest IT when running on Mac or Windows

…decoding performances. * Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Switched the codec list to a thread safe variant to avoid the synchronized blocks. Even though `CopyOnWriteArrayList` is not super performant it should work fine in this context where the list should not be frequently updated. * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not * Switched `mockito-core` to `mockito-junit-jupiter` for Junit 5 support * Disabled unixDomainSocketTest IT when running on Mac or Windows

* Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not * Disabled unixDomainSocketTest IT when running on Mac or Windows

Rename CodecFinder to CodecLookup. Rename default implementations to CachedCodecLookup and DefaultCodecLookup. Extract CodecMetadata interface and turn getFormats() into a default method. Refactor how CodecLookup obtains its actual codecs to prevent methods allowing to alter the internal state of the cache component through updateCodecs(…). The delegate is typically a CodecRegistry for iteration over the actual codecs. Reinstate socket tests on MacOS as sockets are supported on BSD via kqueue. Remove overly complex spy arrangements from tests. Refine tests. [resolves #410][#409] Signed-off-by: Mark Paluch <[email protected]>

…rmances. * Switched the codec list to a thread safe variant to avoid the synchronized blocks. Even though `CopyOnWriteArrayList` is not super performant it should work fine in this context where the list should not be frequently updated. * Switched `mockito-core` to `mockito-junit-jupiter` for Junit 5 support * Refactored the codec registry to use a CodecFinder (default to SPI definition in the classpath) * Provided 2 implementations of the codec finder, one without cache and another with cache * Added a build cache method that will attempt to fill the cache when the codecs are updated. This cannot covers all the cases like the nested arrays, therefore for those type the cache will be filled dynamically on per-request basis * Added microbenchmarks for codec encode and decode using the cache based implementation or not * Disabled unixDomainSocketTest IT when running on Windows [#444][resolves #409]

Rename CodecFinder to CodecLookup. Rename default implementations to CachedCodecLookup and DefaultCodecLookup. Extract CodecMetadata interface and turn getFormats() into a default method. Refactor how CodecLookup obtains its actual codecs to prevent methods allowing to alter the internal state of the cache component through updateCodecs(…). The delegate is typically a CodecRegistry for iteration over the actual codecs. Reinstate socket tests on MacOS as sockets are supported on BSD via kqueue. Remove overly complex spy arrangements from tests. Refine tests. [resolves #444][#409] Signed-off-by: Mark Paluch <[email protected]>

Fix benchmarks after polishing. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

Use constructor delegation in DefaultCodecs. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

Fix benchmarks after polishing. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

Use constructor delegation in DefaultCodecs. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

frdeboffles added the type: enhancement A general enhancement label May 21, 2021

frdeboffles mentioned this issue May 21, 2021

Introduce codec mapping caches #410

Closed

4 tasks

frdeboffles mentioned this issue Sep 8, 2021

Introduce codec mapping caches (0.8.x branch) #444

Closed

4 tasks

mp911de closed this as completed in 86a8fb5 Sep 22, 2021

mp911de linked a pull request Sep 22, 2021 that will close this issue

Introduce codec mapping caches (0.8.x branch) #444

Closed

4 tasks

mp911de added this to the 0.8.9.RELEASE milestone Sep 22, 2021

mp911de added a commit that referenced this issue Sep 22, 2021

Polishing.

ea35ae4

Fix benchmarks after polishing. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

mp911de added a commit that referenced this issue Sep 23, 2021

Polishing.

cb39285

Use constructor delegation in DefaultCodecs. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

mp911de added a commit that referenced this issue Sep 23, 2021

Polishing.

0b52b0f

Fix benchmarks after polishing. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

mp911de added a commit that referenced this issue Sep 23, 2021

Polishing.

0d18586

Use constructor delegation in DefaultCodecs. [#444][#409] Signed-off-by: Mark Paluch <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance improvement on codec mapping #409

Performance improvement on codec mapping #409

frdeboffles commented May 21, 2021

mp911de commented May 22, 2021

protyay commented Jun 27, 2021

mp911de commented Jun 27, 2021

Performance improvement on codec mapping #409

Performance improvement on codec mapping #409

Comments

frdeboffles commented May 21, 2021

Feature Request

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Describe alternatives you've considered

Teachability, Documentation, Adoption, Migration Strategy

mp911de commented May 22, 2021

protyay commented Jun 27, 2021

mp911de commented Jun 27, 2021