使用Docker-compose快速搭建Zabbix7.0监控平台,请参考文档: Zabbix7.0 Docker-compose部署指南
Nacos(Dynamic Naming and Configuration Service)是阿里巴巴开源的服务注册与配置中心,主要提供两大核心功能:
作为微服务架构的核心组件,Nacos能有效解决服务拓扑动态变化带来的管理难题。
GET http://{nacos_url}/nacos/v1/ns/catalog/services
参数说明:
创建nacos_service_discovery.py
实现双模式查询:
#!/usr/bin/python3
import requests
import json
import sys
nacos_url = "http://srebro.cn:8848"
namespace = "base-uat"
page_size = 1000
services_cache = [] # 全局缓存提升性能
def get_all_services():
global services_cache
if services_cache:
return services_cache
services = []
page_no = 1
while True:
url = f"{nacos_url}/nacos/v1/ns/catalog/services?namespaceId={namespace}&pageNo={page_no}&pageSize={page_size}"
try:
resp = requests.get(url, timeout=10)
resp.raise_for_status()
data = resp.json()
services.extend(data['serviceList'])
if len(services) >= data.get('count', 0):
break
page_no += 1
except Exception as e:
print(f"Error: {e}")
break
services_cache = services
return services
def main():
if len(sys.argv) > 1:
service_name = sys.argv[1]
for service in get_all_services():
if service["name"] == service_name:
print(service.get("healthyInstanceCount", 0))
return
print(0)
return
discovery = {"data": []}
for service in get_all_services():
discovery["data"].append({
"{#SERVICE_NAME}": service["name"],
"{#HEALTHY_COUNT}": service.get("healthyInstanceCount", 0)
})
print(json.dumps(discovery))
if __name__ == "__main__":
main()
$ python3 nacos_service_discovery.py
{
"data":[
{
"{#SERVICE_NAME}":"srebro-app-alarm",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-oss",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-sys",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-meta",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-message",
"{#HEALTHY_COUNT}":2
},
{
"{#SERVICE_NAME}":"srebro-uc",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-org",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-workflow",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-platform-iot-master",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-app-risk",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-job",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-export",
"{#HEALTHY_COUNT}":1
},
{
"{#SERVICE_NAME}":"srebro-gateway",
"{#HEALTHY_COUNT}":1
}
]
}
$ python3 nacos_service_discovery.py srebro-app-alarm
1
在zabbix_agentd.conf
中添加自定义监控项:【允许自定义KEY ,千万不能忘记 UnsafeUserParameters=1】
# 服务自动发现
UserParameter=nacos.enhanced.discovery,/usr/local/zabbix/bin/nacos_service_discovery.py
# 健康实例数查询
UserParameter=nacos.health.current[*],/usr/local/zabbix/bin/nacos_service_discovery.py "$1"
重启zabbix-agent
服务
systemctl restart zabbix-agent
测试使用zabbix-get
获取数据
[root@localhost bin]# /usr/local/zabbix/bin/zabbix_get -s 172.22.33.201 -k "nacos.health.current[srebro-message]"
2
属性 | 值 |
---|---|
模板名称 | Nacos Services Health Monitor |
模板组 | Templates |
参数 | 配置值 |
---|---|
名称 | Discover Services with Healthy Count |
类型 | Zabbix客户端主动式 |
键值 | nacos.enhanced.discovery |
更新间隔 | 1h |
监控项原型配置项:
参数 | 配置值 |
---|---|
名称 | Health instances of |
类型 | Zabbix客户端主动式 |
键值 | nacos.health.current["{#SERVICE_NAME}"] |
信息类型 | 数字(无正负) |
更新间隔 | 1m |
触发器配置参数:
参数 | 配置值 |
---|---|
名称 | 服务 {#SERVICE_NAME} 实例数 < 1 |
表达式 | last(/Nacos Services Health Monitor/nacos.health.current["{#SERVICE_NAME}"])<1 |
严重性 | 灾难 |
描述 | 服务 {#SERVICE_NAME} 健康实例数降为0,可能导致服务不可用 |
通过监测 → 最新数据
查看监控数据: