Abstract:In order to achieve real-time monitoring of container clouds, host devices, and business systems, a cloud platform monitoring and alarm system based on Kafka and Kubernetes is designed. Docker containers are managed through Kubernetes, and Kafka receives device operation information from different hosts in different regions. The business system is monitored through probes. By setting the alarm association rules, redundant alarms are reduced, alarm fault detection capabilities are enhanced, and alarm accuracy is improved.