How to Connect Zadig Workflow to Grafana for Service Monitoring

Zadig seamlessly integrates with Grafana to achieve operation and maintenance automation and efficient monitoring 📈

cover.png

Service monitoring plays a crucial role in modern software operations and maintenance, ensuring the robust operation of systems. Grafana, as a comprehensive open-source monitoring tool, provides deep insights into multi-source data through its powerful data integration and visualization capabilities. Zadig, as a cloud-native DevOps platform, streamlines and self-serves business, configuration, and data change tasks in software delivery with its agile and scalable automated workflow engine.

By tightly integrating Zadig with Grafana’s monitoring capabilities, we can significantly improve the real-time response to system anomalies. This article will introduce how to achieve seamless integration between Zadig and Grafana, ensuring continuous monitoring after service changes, providing the operations team with greater flexibility and control, and ensuring the continuous stability of the system.

# Zadig Integration with Grafana

  1. Access Grafana, create a service account, set the role to Viewer, and then add and obtain the token.

01.png

02.png

03.png

04.png

  1. Access Zadig, go to 系统设置 -> 系统集成 -> 监控系统, and add the relevant information for Grafana.

05.png

06.png

# Configure Monitoring Tasks in the Workflow

In the production release workflow, add a Grafana 监测任务 and select the appropriate alert rule.

07.png

08.png

Explanation:

  • Monitoring Time: The time period during which the alert rule is monitored. If the result is abnormal within this time, the workflow monitoring task will fail

  • Alert Rules: You can select multiple pre-defined alert rules from Grafana

  • Failure Strategy: Choose different failure strategies based on different monitoring levels and scenarios

    • Immediate Failure on Single Monitoring Exception: If any monitor produces an abnormal result, the workflow monitoring task fails immediately
    • Failure After All Monitoring Completes: The workflow monitoring task fails only if an abnormal result appears after all monitors have completed

# Workflow Notifications and Alerts

Select the corresponding service and version to execute the workflow. When an exception is detected during the change process, the workflow will quickly provide feedback. Additionally, using the workflow notification (opens new window) capabilities, error results can be received in real-time via instant messaging (IM), email, or a self-built notification system to ensure that issues are promptly addressed.

09.png

10.png

11.png

# Summary

The strong integration between Zadig and Grafana greatly enhances the automation and response rate of service monitoring. This integration allows us to capture any impact of service changes in real-time and ensures that issues are identified and resolved promptly through automated alert mechanisms. It also enhances continuous supervision of service performance, improving the stability and reliability of services through automated processes and immediate alerts, ensuring smooth business operations, and significantly enhancing customer satisfaction.

Background Image

作为一名软件工程师,我们一直给各行各业写软件提升效率,但是软件工程本身却是非常低效,为什么市面上没有一个工具可以让研发团队不这么累,还能更好、更快地满足大客户的交付需求?我们是否能够打造一个面向开发者的交付平台呢?我们开源打造 Zadig 正是去满足这个愿望。

—— Zadig 创始人 Landy