使用Docker-compose快速搭建Zabbix7.0监控平台,请参考文档: Zabbix7.0 Docker-compose部署指南
Nacos(Dynamic Naming and Configuration Service)是阿里巴巴开源的服务注册与配置中心,主要提供两大核心功能:
作为微服务架构的核心组件,Nacos能有效解决服务拓扑动态变化带来的管理难题。
GET http://{nacos_url}/nacos/v1/ns/catalog/services
参数说明:
创建nacos_service_discovery.py实现双模式查询:
#!/usr/bin/python3
import requests
import json
import sys
nacos_url = "http://srebro.cn:8848"
namespace = "base-uat"
page_size = 1000
services_cache = []  # 全局缓存提升性能
def get_all_services():
    global services_cache
    if services_cache:
        return services_cache
        
    services = []
    page_no = 1
    while True:
        url = f"{nacos_url}/nacos/v1/ns/catalog/services?namespaceId={namespace}&pageNo={page_no}&pageSize={page_size}"
        try:
            resp = requests.get(url, timeout=10)
            resp.raise_for_status()
            data = resp.json()
            services.extend(data['serviceList'])
            if len(services) >= data.get('count', 0):
                break
            page_no += 1
        except Exception as e:
            print(f"Error: {e}")
            break
    services_cache = services
    return services
def main():
    if len(sys.argv) > 1:
        service_name = sys.argv[1]
        for service in get_all_services():
            if service["name"] == service_name:
                print(service.get("healthyInstanceCount", 0))
                return
        print(0)
        return
    
    discovery = {"data": []}
    for service in get_all_services():
        discovery["data"].append({
            "{#SERVICE_NAME}": service["name"],
            "{#HEALTHY_COUNT}": service.get("healthyInstanceCount", 0)
        })
    print(json.dumps(discovery))
if __name__ == "__main__":
    main()
$ python3 nacos_service_discovery.py
{
    "data":[
        {
            "{#SERVICE_NAME}":"srebro-app-alarm",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-oss",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-sys",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-meta",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-message",
            "{#HEALTHY_COUNT}":2
        },
        {
            "{#SERVICE_NAME}":"srebro-uc",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-org",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-workflow",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-platform-iot-master",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-app-risk",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-job",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-export",
            "{#HEALTHY_COUNT}":1
        },
        {
            "{#SERVICE_NAME}":"srebro-gateway",
            "{#HEALTHY_COUNT}":1
        }
    ]
}
$ python3 nacos_service_discovery.py srebro-app-alarm
1
在zabbix_agentd.conf中添加自定义监控项:【允许自定义KEY ,千万不能忘记 UnsafeUserParameters=1】
# 服务自动发现
UserParameter=nacos.enhanced.discovery,/usr/local/zabbix/bin/nacos_service_discovery.py
# 健康实例数查询
UserParameter=nacos.health.current[*],/usr/local/zabbix/bin/nacos_service_discovery.py "$1"
重启zabbix-agent服务
systemctl restart zabbix-agent
测试使用zabbix-get 获取数据
[root@localhost bin]# /usr/local/zabbix/bin/zabbix_get -s 172.22.33.201 -k "nacos.health.current[srebro-message]"
2

| 属性 | 值 | 
|---|---|
| 模板名称 | Nacos Services Health Monitor | 
| 模板组 | Templates | 

| 参数 | 配置值 | 
|---|---|
| 名称 | Discover Services with Healthy Count | 
| 类型 | Zabbix客户端主动式 | 
| 键值 | nacos.enhanced.discovery | 
| 更新间隔 | 1h | 

监控项原型配置项:
| 参数 | 配置值 | 
|---|---|
| 名称 | Health instances of | 
| 类型 | Zabbix客户端主动式 | 
| 键值 | nacos.health.current["{#SERVICE_NAME}"] | 
| 信息类型 | 数字(无正负) | 
| 更新间隔 | 1m | 

触发器配置参数:
| 参数 | 配置值 | 
|---|---|
| 名称 | 服务 {#SERVICE_NAME} 实例数 < 1 | 
| 表达式 | last(/Nacos Services Health Monitor/nacos.health.current["{#SERVICE_NAME}"])<1 | 
| 严重性 | 灾难 | 
| 描述 | 服务 {#SERVICE_NAME} 健康实例数降为0,可能导致服务不可用 | 
通过监测 → 最新数据查看监控数据:
