You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Complex Custom Plugin Monitors in Node Problem Detector: Seeking Advice on Scalable Approaches
We are currently implementing complex custom-plugin-monitors for Node Problem Detector (NPD). As our monitoring scripts grow in complexity, we're encountering challenges that may require a reassessment of our current approach. We'd like to seek community advice on best practices for handling advanced monitoring scenarios within the NPD framework.
Current Implementation:
Scripts are written in Python
Scripts are loaded into a ConfigMap and mounted into the NPD pod for execution
Increasing Complexity: Our scripts are becoming more sophisticated, requiring:
Breaking them into modules for better organization
Incorporating third-party libraries
Sharing functions between different monitors
Maintainability Concerns: As complexity grows, managing these scripts within ConfigMaps is becoming cumbersome
Potential Scope Creep: We're questioning whether we're extending beyond NPD's intended use case
Proposed Solutions:
Maintain Current Approach: Continue using ConfigMaps, but improve our build and deployment process to handle modular scripts and dependencies
Separate Monitoring Pods: Deploy complex "detectors" as separate pods/daemonsets, communicating with NPD over a simple protocol
Pros: Clear separation of concerns, easier management of dependencies
Cons: Increased resource usage, potential communication overhead
NPD Plugin System: Explore the possibility of enhancing NPD with a more robust plugin system that can handle complex, modular monitors
Questions for the Community:
What are the recommended best practices for implementing complex custom monitors in NPD?
Are there existing solutions or patterns within the Kubernetes ecosystem for handling advanced node problem detection scenarios?
We appreciate any insights, recommendations, or alternative approaches the community can provide to help us determine the most effective and maintainable solution for our use case.
The text was updated successfully, but these errors were encountered:
a solution to your configMap maintenance toil might simply be to build a custom NPD container image with your dependencies pip installed and your scripts copied into the image itself?
Complex Custom Plugin Monitors in Node Problem Detector: Seeking Advice on Scalable Approaches
We are currently implementing complex custom-plugin-monitors for Node Problem Detector (NPD). As our monitoring scripts grow in complexity, we're encountering challenges that may require a reassessment of our current approach. We'd like to seek community advice on best practices for handling advanced monitoring scenarios within the NPD framework.
Current Implementation:
Challenges:
Increasing Complexity: Our scripts are becoming more sophisticated, requiring:
Maintainability Concerns: As complexity grows, managing these scripts within ConfigMaps is becoming cumbersome
Potential Scope Creep: We're questioning whether we're extending beyond NPD's intended use case
Proposed Solutions:
Maintain Current Approach: Continue using ConfigMaps, but improve our build and deployment process to handle modular scripts and dependencies
Separate Monitoring Pods: Deploy complex "detectors" as separate pods/daemonsets, communicating with NPD over a simple protocol
NPD Plugin System: Explore the possibility of enhancing NPD with a more robust plugin system that can handle complex, modular monitors
Questions for the Community:
We appreciate any insights, recommendations, or alternative approaches the community can provide to help us determine the most effective and maintainable solution for our use case.
The text was updated successfully, but these errors were encountered: