Use application name as default clientId #3001

notizklotz · 2024-01-24T13:09:07Z

Expected Behavior

If spring.application.name is defined, it should be used as the default Kafka clientId unless overridden by more specific configuration. I get the behaviour that I think would be a good default by defining this in my application:

spring.kafka.client-id=${spring.application.name}

Current Behavior

The default clientIds are "producer", "adminclient", etc.

Context

We operate fairly large shared Kafka clusters, and several important Kafka metrics and log messages include only the clientId, not the username. For producers in particular, this means we see many of our customers implicitly connecting with "producer-1" as the clientId, because there is no way to enforce a specific pattern on the Kafka server side. A more specific default would ease problem analysis on the server side. spring.application.name would be an ideal default because it is already used for similar purposes with other technologies.

I could have a look at creating a pull request for this, if this isn't something that has already been rejected in the past.

sobychacko · 2024-01-24T14:48:08Z

@notizklotz I think we should still preserve producer, consumer, admin client etc. in the client-id name to distinguish them, but we can attach the spring.application.name for shared cluster environments where there are multiple apps connecting to Kafka. A PR would be greatly welcomed!

artembilan · 2024-01-24T15:20:58Z

Note: spring.kafka.client-id is part of Spring Boot.
But Soby is right: this common property is out of scope here, and the API-specific ones are the ones to consider.

The "producer", "consumer" and "adminclient" names come from the bean names auto-configured for those specific Kafka clients.

We may indeed consider including spring.application.name in those properties.
For example, in DefaultKafkaProducerFactory.CloseSafeProducer:

this.clientId = factoryName + "." + id;

Somewhere around there we could resolve the application name with this.applicationContext.getEnvironment().getProperty("spring.application.name");

notizklotz · 2024-01-29T18:12:39Z

I started implementing a proof of concept and it seems to work out well, especially for the cases which currently have a very generic clientId (Consumer without Consumer Group, Producer, Admin Client).

I chose to put the application name before the type (myapp-consumer-1 instead of consumer-myapp-1) to differentiate it better from the defaults for a Consumer with a Consumer Group, which have the Consumer Group name after the type. It is also closer to the Kafka Streams defaults.

The following table with examples assumes spring.application.name=myapp

| Type | Before | After | Notes |
|---|---|---|---|
| consumer (with CG) | consumer-myconsumergroup-1 | consumer-myconsumergroup-1 | Unchanged. Consumer Group is part of default clientId generated by KafkaConsumer which is enough for identifying the client. |
| consumer (without CG) | consumer-null-1 | myapp-consumer-1 | |
| producer | producer-1 | myapp-producer-1 | |
| admin | adminclient-1 | myapp-admin-1 | |
| Streams | myapp-398ba268-efe9-4a74-a6e0-dcdc79412538-StreamThread-1-producer; myapp-ef4f985f-8ed7-48c4-8abc-bb51a354ab45-StreamThread-1-restore-consumer | myapp-398ba268-efe9-4a74-a6e0-dcdc79412538-StreamThread-1-producer; myapp-ef4f985f-8ed7-48c4-8abc-bb51a354ab45-StreamThread-1-restore-consumer | Unchanged. A Streams application id is always required by Kafka Streams and used in the default client ids. `spring.application.name` is already set as default Streams application id by Spring Boot. |
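The rules behind the table can be sketched in plain Java. This is only an illustration of the naming scheme described in this comment, not the PoC code itself: the class name `DefaultClientIdSketch`, the method `resolve`, and the explicit `sequence` argument are all hypothetical, and a `null` return stands for "leave the default to the Kafka client".

```java
class DefaultClientIdSketch {

    /**
     * Returns the client.id to configure, or null to leave the choice
     * to the Kafka client's own default (hypothetical helper).
     */
    static String resolve(String explicitClientId, String groupId,
            String applicationName, String type, int sequence) {
        // An explicitly configured clientId always wins.
        if (explicitClientId != null && !explicitClientId.isEmpty()) {
            return explicitClientId;
        }
        // Consumers with a group keep the existing default, which already
        // contains the group name (for example consumer-myconsumergroup-1).
        if ("consumer".equals(type) && groupId != null) {
            return null;
        }
        // Without an application name there is nothing to add.
        if (applicationName == null || applicationName.isEmpty()) {
            return null;
        }
        // Application name first, then the client type, then a sequence number.
        return applicationName + "-" + type + "-" + sequence;
    }

    public static void main(String[] args) {
        System.out.println(resolve(null, null, "myapp", "producer", 1));              // myapp-producer-1
        System.out.println(resolve(null, "myconsumergroup", "myapp", "consumer", 1)); // null
        System.out.println(resolve("orders-writer", null, "myapp", "producer", 1));   // orders-writer
    }
}
```

Passing the sequence number in as an argument keeps the sketch free of shared state; where that number would actually come from is left open here.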

artembilan · 2024-01-30T17:46:10Z

@notizklotz ,

Thank you for the investigation!

So, as we have discussed before, we have a plan.
The DefaultKafkaProducerFactory, DefaultKafkaConsumerFactory and KafkaAdmin need to be adjusted to incorporate the spring.application.name Environment property if clientId is not set explicitly in the target application.

Does that make sense?

notizklotz · 2024-01-30T18:02:27Z

@artembilan Yes, that makes sense :-) The code of my PoC is here: main...notizklotz:spring-kafka:GH-3001. I could work it into a proper PR within the next few weeks.

frosiere · 2024-02-07T12:57:20Z

Very nice proposal. I'm currently facing the same issue.

When deploying in a cloud environment like Kubernetes, the same client id may still refer to multiple instances of a producer, consumer and admin client.

So, to solve this issue, wouldn't it make sense to have a ClientIdResolver with a default implementation referring to what has been proposed above?

The resolver contract would be as follows:

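import java.util.Map;

import org.apache.kafka.clients.CommonClientConfigs;
import org.springframework.core.env.Environment;

public interface ClientIdResolver {

    // Returns the clientId for a client whose kind is named by the suffix.
    String resolve(Map<String, ?> config, String suffix);
}

public class DefaultClientIdResolver implements ClientIdResolver {

    private final Environment environment;

    public DefaultClientIdResolver(Environment environment) {
        this.environment = environment;
    }

    @Override
    public String resolve(Map<String, ?> config, String suffix) {
        final var applicationName = environment.getProperty("spring.application.name");
        // Keep an explicitly configured clientId; without an application name
        // there is nothing to build a default from, so return what is configured.
        if (applicationName == null || config.containsKey(CommonClientConfigs.CLIENT_ID_CONFIG)) {
            return (String) config.get(CommonClientConfigs.CLIENT_ID_CONFIG);
        }
        return String.join(".", applicationName, suffix);
    }
}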

The resolver would also avoid duplicating the clientId resolution across three different kinds of clients.

Users deploying in a cloud would be able to provide another implementation that adds the pod id or other relevant information, enabling correlation between the clientId and a specific instance of the producer, consumer or admin client.
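As a rough sketch of such an alternative implementation, assuming the ClientIdResolver contract proposed above: the class `InstanceAwareClientIdResolver` is hypothetical, the instance id is passed in rather than discovered, and the literal key "client.id" stands in for CommonClientConfigs.CLIENT_ID_CONFIG so the example compiles with the JDK alone.

```java
import java.util.Map;

// Contract as proposed in this comment, repeated so the sketch is self-contained.
interface ClientIdResolver {
    String resolve(Map<String, ?> config, String suffix);
}

// Hypothetical implementation that adds an instance id (for example a pod name).
class InstanceAwareClientIdResolver implements ClientIdResolver {

    // Literal value of CommonClientConfigs.CLIENT_ID_CONFIG.
    private static final String CLIENT_ID_KEY = "client.id";

    private final String applicationName;
    private final String instanceId;

    InstanceAwareClientIdResolver(String applicationName, String instanceId) {
        this.applicationName = applicationName;
        this.instanceId = instanceId;
    }

    @Override
    public String resolve(Map<String, ?> config, String suffix) {
        // An explicitly configured clientId is returned unchanged.
        Object explicit = config.get(CLIENT_ID_KEY);
        if (explicit != null) {
            return explicit.toString();
        }
        // Without an application name, leave the default to the Kafka client.
        if (applicationName == null) {
            return null;
        }
        // Fall back to the plain application-name form when no instance id is known.
        if (instanceId == null || instanceId.isEmpty()) {
            return String.join(".", applicationName, suffix);
        }
        return String.join(".", applicationName, instanceId, suffix);
    }

    public static void main(String[] args) {
        // HOSTNAME is an assumption; many container runtimes set it to the pod name.
        ClientIdResolver resolver =
                new InstanceAwareClientIdResolver("myapp", System.getenv("HOSTNAME"));
        System.out.println(resolver.resolve(Map.of(), "producer"));
    }
}
```

With an instance id of "pod-7", the resolver would produce "myapp.pod-7.producer" for a producer that has no explicit clientId.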

I hope the proposal makes sense and helps with clientId-based investigations when deploying in a cloud environment.

artembilan · 2024-02-07T14:14:55Z

As far as I know, you can use environment variable placeholders in those Spring Boot configuration properties, so there is no need for extra logic in the code: https://docs.spring.io/spring-boot/docs/current/reference/htmlsingle/#features.external-config.files.property-placeholders

frosiere · 2024-02-07T14:49:53Z

Fair point. The idea was also to factor out the code so the same logic is not duplicated in three different places, making the resolution easy to change. Thanks for the quick reply.

sobychacko · 2024-02-15T23:54:59Z

@notizklotz Please let us know when you can work on a PR. We are postponing this issue to the next milestone (3.2.0-M2).

notizklotz · 2024-02-19T14:12:31Z

> @notizklotz Please let us know when you can work on a PR. We are postponing this issue to the next milestone (3.2.0-M2).

@sobychacko I have the PR ready: #3048

Fixes: #GH-3001

* Use Spring Boot's `spring.application.name` property as part of the default clientIds for Consumers, Producers, and AdminClients. Helps with identifying problematic clients on the server side.
* Only use as a fallback if clientId wasn't specified explicitly
* Do not use for Consumers with a specified groupId because KafkaConsumer will use the groupId as clientId, which already is an identifiable default

sobychacko · 2024-03-11T19:39:33Z

Closed via ab5f0a1.
See the PR: #3048

notizklotz added status: waiting-for-triage type: enhancement labels Jan 24, 2024

artembilan added this to the 3.2.0-M1 milestone Jan 24, 2024

artembilan removed the status: waiting-for-triage label Jan 24, 2024

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Jan 29, 2024

spring-projectsGH-3001: make Spring Boot application name part of def…

1d9ddba

…ault clientIds

sobychacko modified the milestones: 3.2.0-M1, 3.2.0-M2 Feb 15, 2024

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Feb 19, 2024

Merge branch 'spring-projects:main' into spring-projectsGH-3001

96eed46

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Feb 19, 2024

spring-projectsGH-3001: application name as part of default clientIds

6663f94

Fixes: spring-projectsGH-3001

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Feb 19, 2024

spring-projectsGH-3001: application name as part of default clientIds

297c480

Fixes: #spring-projectsGH-3001

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Feb 19, 2024

spring-projectsGH-3001: default clientIds with application name

9eb3dbe

Fixes: #spring-projectsGH-3001

notizklotz mentioned this issue Feb 19, 2024

GH-3001: default clientIds with application name #3048

Merged

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 1, 2024

spring-projectsGH-3001: fix checkstyle errors

9b40ca4

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 1, 2024

spring-projectsGH-3001: add docs

3ed24ae

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 1, 2024

Merge remote-tracking branch 'origin/main' into spring-projectsGH-3001

e0a3c34

# Conflicts: # spring-kafka-docs/src/main/antora/modules/ROOT/pages/whats-new.adoc

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 11, 2024

spring-projectsGH-3001: fix checkstyle errors

5e0489e

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 11, 2024

spring-projectsGH-3001: add docs

2271724

notizklotz added a commit to notizklotz/spring-kafka that referenced this issue Mar 11, 2024

spring-projectsGH-3001: add section to reference documentation

be516a4

sobychacko closed this as completed Mar 11, 2024
