Performance overhead of ReactiveCassandraTemplate #1218

samueldlightfoot · 2022-02-11T11:35:40Z

As someone on the Gitter channel pointed out, there appears to be a significant performance overhead for query flows that use ReactiveCassandraTemplate. Notably, this includes the reactive @Repository classes.

The throughput test compared writes via ReactiveCassandraRepository::insert against writes via ReactiveCqlTemplate::execute(cql, args). Local testing gave the following results (writes per second):

ReactiveCassandraRepository::insert 4000 writes/s
ReactiveCqlTemplate::execute 7500 writes/s

As you can see, the repository path reaches only a little over half the throughput of the plain CQL path.

I have tried the test with prepared statements both enabled and disabled and it makes little difference. CPU profiling shows no hotspots for the repository inserts and no discernible difference in the overall profiles. Could it be the mapping layer that builds the statements? I will continue to dig into possibilities.

The test showing the throughput difference can be found here (credits to original author piddubnyi): https://github.com/samueldlightfoot/spring-data-cassandra-performnace

Here are the JProfiler snapshots for runs of both for anyone interested (I may be missing something in my analysis):
Spring Data Performance.zip

Entity:

@Table("snapshot")
@Value
@Builder
@EqualsAndHashCode(callSuper = false)
@RequiredArgsConstructor
public class SnapshotRecord {

    @PrimaryKeyColumn(ordinal = 0, type = PrimaryKeyType.PARTITIONED)
    long id;
    @PrimaryKeyColumn(ordinal = 1, type = PrimaryKeyType.PARTITIONED)
    short market;
    @PrimaryKeyColumn(ordinal = 3, type = PrimaryKeyType.CLUSTERED)
    Instant slot;

    double value;
}

Repository:

public interface SnapshotRepository extends ReactiveCrudRepository<SnapshotRecord, Long> {

    default Mono<Boolean> saveViaCql(ReactiveCqlOperations cqlOps, SnapshotRecord record) {
        return cqlOps.execute(
                "INSERT INTO snapshot (id, market,slot,value) VALUES (?,?,?,?) USING TIMESTAMP ?;",
                ps -> {
                    return ps.bind(
                            record.getId(),
                            record.getMarket(),
                            record.getSlot(),
                            record.getValue(),
                            record.getSlot().toEpochMilli() * 1000
                    );
                }
        );
    }
}

Runner:

Flux<SnapshotRecord> data = Flux.generate(Object::new, (state, sink) -> {
    ThreadLocalRandom random = ThreadLocalRandom.current();
    sink.next(
        new SnapshotRecord(
            random.nextLong(),
            (short) random.nextInt(),
            Clock.systemUTC().instant(),
            random.nextDouble()
        )
    );
    return state;
});
subscription = data
    // .flatMap((SnapshotRecord record) -> repository.saveViaCql(cqlOps, record), 512, 2048)
    .flatMap(repository::save, 512, 2048) // runs almost 2x slower than the commented-out line above
    .doOnNext(d -> success.incrementAndGet())
    .onErrorContinue((throwable, object) -> fail.incrementAndGet())
    .subscribe();

mp911de · 2022-02-11T13:12:23Z

After a first investigation, the main difference comes from the fact that CassandraTemplate does a lot of mapping work internally, while the CQL template is a thin layer above the driver. Even without actual Cassandra interaction, I get about 20000 inserts/sec (with Cassandra it's 10000 writes/sec), so the hotspot is likely somewhere in CassandraTemplate.

mp911de · 2022-02-11T14:14:33Z

Upon further investigation, it seems that Cassandra is the fastest write store that we currently support. Disabling the database interaction helped reveal a few things that never surfaced in other Spring Data modules, because there the actual database was slow enough to mask them.

We've identified a few things that we could optimize:

Caching of type information through annotation lookups (used typically to determine the Cassandra target type)
Caching of the CassandraColumnType
Avoid calling Spring's ConversionService (the assignability checks on our side that prevent calling the conversion service didn't consider primitives)
Reactive return type information caching (to decorate return types with Repository invocation listeners)
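As a rough illustration of the caching items in this list, here is a minimal sketch using only the JDK. It is not the Spring Data Cassandra implementation: the `ColumnType` annotation, the `Sample` class, and `ColumnTypeCacheSketch` are hypothetical names made up for this example. The idea is simply that a reflective annotation lookup is resolved once per field and then served from a `ConcurrentHashMap`.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: none of these names come from Spring Data Cassandra.
class ColumnTypeCacheSketch {

    // Hypothetical stand-in for an annotation that declares the target column type.
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.FIELD)
    @interface ColumnType {
        String value();
    }

    // Hypothetical entity: one annotated field, one plain field.
    static class Sample {
        @ColumnType("bigint")
        long id;
        double value;
    }

    // Negative results are cached too (as Optional.empty()), so fields
    // without the annotation are not inspected again either.
    private final Map<Field, Optional<String>> cache = new ConcurrentHashMap<>();

    // Counts how often the reflective lookup actually ran.
    final AtomicInteger lookups = new AtomicInteger();

    Optional<String> columnType(Field field) {
        return cache.computeIfAbsent(field, this::resolve);
    }

    private Optional<String> resolve(Field field) {
        lookups.incrementAndGet();
        ColumnType annotation = field.getAnnotation(ColumnType.class);
        return Optional.ofNullable(annotation).map(ColumnType::value);
    }

    // Helper that turns the checked reflection exception into an unchecked one.
    static Field field(Class<?> type, String name) {
        try {
            return type.getDeclaredField(name);
        } catch (NoSuchFieldException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        ColumnTypeCacheSketch cache = new ColumnTypeCacheSketch();
        Field id = field(Sample.class, "id");
        for (int i = 0; i < 1_000; i++) {
            cache.columnType(id); // only the first call hits reflection
        }
        System.out.println(cache.columnType(id).orElse("<none>")
                + ", reflective lookups: " + cache.lookups.get());
    }
}
```

On a hot write path the same field is inspected for every entity, so the per-call cost drops to a map lookup after the first insert.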

With these changes applied, I now get about 60000 inserts/sec without Cassandra. With Cassandra I get about 16000, which is close to the 18726 I get using plain CQL.

The performance overhead becomes much smaller. If you consider what CassandraTemplate gives you (entity callbacks, lifecycle events, statement creation, value conversion), an overhead of about 12% is much better than the previous 50%.
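For anyone who wants to redo the arithmetic, here is a small sketch that computes relative overhead as (baseline - measured) / baseline. The formula and the `OverheadSketch` name are my assumptions, not necessarily how the percentages in this thread were derived. The inputs are the figures quoted in this thread, which come from different machines and are partly rounded ("about 16000"), so the result only needs to land in the same ballpark as the quoted percentages.

```java
import java.util.Locale;

// Hypothetical helper for this thread's numbers; not part of any library.
class OverheadSketch {

    /** Relative throughput loss of {@code measured} versus {@code baseline}, in percent. */
    static double overheadPercent(double baselinePerSec, double measuredPerSec) {
        return (baselinePerSec - measuredPerSec) / baselinePerSec * 100.0;
    }

    public static void main(String[] args) {
        // Original report: repository insert (4000/s) vs plain CQL (7500/s).
        System.out.printf(Locale.ROOT, "before: %.1f%%%n", overheadPercent(7500, 4000));
        // After the optimizations: template (about 16000/s) vs plain CQL (18726/s).
        System.out.printf(Locale.ROOT, "after:  %.1f%%%n", overheadPercent(18726, 16000));
    }
}
```

With these inputs the overhead comes out at a bit under 50% before the changes and in the low to mid teens afterwards.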

mp911de · 2022-02-11T14:27:23Z

During the analysis of object allocations, a method became visible that constructs an INSERT from a collection of values. Because the builders are immutable, the construction seems rather expensive: the objects are created individually and not as a batch. Maybe an optimization for the query builders, @adutra?
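To make the allocation pattern concrete, here is a toy sketch of an immutable builder. It does not use or mirror the driver's query builder API: `ImmutableInsertSketch` and its methods are hypothetical, and the `allocations` counter exists only to make the difference visible. Adding values one at a time creates a new instance plus a copied list per value, while a batch method copies once.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.List;

// Hypothetical immutable builder; not the DataStax driver API.
class ImmutableInsertSketch {

    final String table;
    final List<String> columns;
    final int allocations; // how many copied instances led to this one

    private ImmutableInsertSketch(String table, List<String> columns, int allocations) {
        this.table = table;
        this.columns = columns;
        this.allocations = allocations;
    }

    static ImmutableInsertSketch into(String table) {
        return new ImmutableInsertSketch(table, Collections.emptyList(), 0);
    }

    // One new instance and one list copy per added column.
    ImmutableInsertSketch value(String column) {
        return values(Collections.singletonList(column));
    }

    // One new instance and one list copy for the whole batch.
    ImmutableInsertSketch values(Collection<String> added) {
        List<String> copy = new ArrayList<>(columns);
        copy.addAll(added);
        return new ImmutableInsertSketch(table, Collections.unmodifiableList(copy), allocations + 1);
    }

    String asCql() {
        String placeholders = String.join(", ", Collections.nCopies(columns.size(), "?"));
        return "INSERT INTO " + table + " (" + String.join(", ", columns) + ") VALUES (" + placeholders + ")";
    }

    public static void main(String[] args) {
        List<String> cols = Arrays.asList("id", "market", "slot", "value");

        ImmutableInsertSketch oneByOne = into("snapshot");
        for (String column : cols) {
            oneByOne = oneByOne.value(column); // copies the growing list every time
        }
        ImmutableInsertSketch batched = into("snapshot").values(cols);

        System.out.println(oneByOne.asCql());
        System.out.println("copies one-by-one: " + oneByOne.allocations
                + ", copies batched: " + batched.allocations);
    }
}
```

Both paths produce the same statement text. The one-by-one path copies once per column, so the garbage produced per INSERT grows with the number of columns in the entity.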

We now cache the outcome for column types and the AnnotatedType lookup by annotation, and bypass the conversion service by considering primitive type wrappers in the assignability check. Closes #1218

Use ClassUtils.isAssignableValue(…) instead of ClassUtils.resolvePrimitiveIfNecessary(target).isAssignableFrom(…). Closes #1218
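The primitive handling mentioned in these commit messages can be illustrated with plain JDK code. This is a sketch of the general idea only, assuming a check of the form `target.isAssignableFrom(value.getClass())`; the class and method names are hypothetical, and the actual code uses Spring's `ClassUtils` as the commit message says. A boxed `Long` is never assignable to `long.class` by that check, so a `long` property would fall through to the conversion service even though no conversion is needed.

```java
import java.util.Map;

// Hypothetical plain-JDK illustration; the real code relies on Spring's ClassUtils.
class AssignabilitySketch {

    private static final Map<Class<?>, Class<?>> WRAPPERS = Map.of(
            boolean.class, Boolean.class,
            byte.class, Byte.class,
            char.class, Character.class,
            short.class, Short.class,
            int.class, Integer.class,
            long.class, Long.class,
            float.class, Float.class,
            double.class, Double.class);

    // Ignores primitives: long.class.isAssignableFrom(Long.class) is false.
    static boolean naiveIsAssignable(Class<?> target, Object value) {
        return target.isAssignableFrom(value.getClass());
    }

    // Resolves a primitive target to its wrapper before checking.
    static boolean primitiveAwareIsAssignable(Class<?> target, Object value) {
        Class<?> resolved = target.isPrimitive() ? WRAPPERS.getOrDefault(target, target) : target;
        return resolved.isAssignableFrom(value.getClass());
    }

    public static void main(String[] args) {
        Object id = 42L; // boxed to java.lang.Long, as values read from an entity would be

        System.out.println("naive:           " + naiveIsAssignable(long.class, id));
        System.out.println("primitive-aware: " + primitiveAwareIsAssignable(long.class, id));
    }
}
```

The SnapshotRecord entity in this issue has `long`, `short`, and `double` properties, so three of its four values would miss the fast path under the naive check.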

spring-projects-issues added the status: waiting-for-triage An issue we've not yet triaged label Feb 11, 2022

samueldlightfoot changed the title ~~Performance overhead of Statement building~~ Performance overhead of ReactiveCassandraTemplate Feb 11, 2022

mp911de self-assigned this Feb 11, 2022

mp911de added type: enhancement A general enhancement and removed status: waiting-for-triage An issue we've not yet triaged labels Feb 11, 2022

mp911de mentioned this issue Feb 11, 2022

Performance improvements in ReactiveWrappers and ConvertingPropertyAccessor spring-projects/spring-data-commons#2546

Closed

mp911de added this to the 3.3.2 (2021.1.2) milestone Feb 11, 2022

mp911de closed this as completed in f855b58 Feb 11, 2022

mp911de added a commit that referenced this issue Feb 14, 2022

Polishing.

93f7c57

Use ClassUtils.isAssignableValue(…) instead of ClassUtils.resolvePrimitiveIfNecessary(target).isAssignableFrom(…). Closes #1218

mp911de added a commit that referenced this issue Feb 14, 2022

Polishing.

90d83a4

Use ClassUtils.isAssignableValue(…) instead of ClassUtils.resolvePrimitiveIfNecessary(target).isAssignableFrom(…). Closes #1218

mp911de added a commit that referenced this issue Feb 14, 2022

Polishing.

68ddb12

Use ClassUtils.isAssignableValue(…) instead of ClassUtils.resolvePrimitiveIfNecessary(target).isAssignableFrom(…). Closes #1218
