熱門搜索 Zabbix技術資料 Zabbix常見問、答討論成功案例 Zabbix交流區 Prometheus交流區

Prometheus技術分享——prometheus自定義告警規則解析和配置

2022/11/08 Prometheus技術資料 Prometheus prometheus告警 prometheus規則6984

上一期尊龍時凱君跟大家已經介紹了prometheus的安裝與配置，對于運維監控而言，除了監控展示以外，另一個重要的需求無疑就是告警了。良好的告警可以幫助運維人員及時的發現問題，處理問題并防范于未然，是運維工作中不可或缺的重要手段。本期尊龍時凱君將教大家如何prometheus自定義告警規則解析和配置。

1. 標準告警規則樣例以及各組件作用

代碼如下

groups:

– name: example

rules: – alert: HighErrorRate

expr: job:request_latency_seconds:mean5m{job=”myjob”} > 0.5

for: 10m

labels:

severity: page

annotations:

summary: High request latency description: description info

在告警規則文件中，我們可以將一組相關的規則設置定義在一個group下。在每一個group中我們可以定義多個告警規則(rule)。一條告警規則主要由以下幾部分組成： alert：告警規則的名稱。

expr：基于PromQL表達式告警觸發條件，用于計算是否有時間序列滿足該條件。

for：評估等待時間，可選參數。用于表示只有當觸發條件持續一段時間后才發送告警。在等待期間新產生告警的狀態為pending。 labels：自定義標簽，允許用戶指定要附加到告警上的一組附加標簽。

2. 模板化告警規則

一般來說，在告警規則文件的annotations中使用summary描述告警的概要信息，description用于描述告警的詳細信息。同時Alertmanager的UI也會根據這兩個標簽值，顯示告警信息。為了讓告警信息具有更好的可讀性，Prometheus支持模板化label和annotations的中標簽的值。通過
$ labels. 1

變量可以訪問當前告警實例中指定標簽的值。

$value 1

則可以獲取當前PromQL表達式計算的樣本值。

代碼如下

# To insert a firing element's label values: 2 {{ $labels. }} 3 # To insert the numeric expression value of the firing element: 4 {{ $value }}

例如，可以通過模板化優化summary以及description的內容的可讀性：

代碼如下：

groups: - name: example rules: # Alert for any instance that is unreachable for >5 minutes. - alert: InstanceDown expr: up == 0 for: 5m labels: severity: page annotations: summary: "Instance {{ $labels.instance }} down" description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes." # Alert for any instance that has a median request latency >1s. - alert: APIHighRequestLatency expr: api_http_request_latencies_second{quantile="0.5"} > 1 for: 10m annotations: summary: "High request latency on {{ $labels.instance }}" description: "{{ $labels.instance }} has a median request latency above 1s (current value: {{ $value }}s)"

3. 修改Prometheus配置文件prometheus.yml

rule_files: - /etc/prometheus/rules/*.rules

在目錄/etc/prometheus/rules/下創建告警文件hoststats-alert.rules內容如下：

代碼如下

groups: - name: hostStatsAlert rules: - alert: hostCpuUsageAlert expr: sum(avg without (cpu)(irate(node_cpu{mode!='idle'}[5m]))) by (instance) > 0.85 for: 1m labels: severity: page annotations: summary: "Instance {{ $labels.instance }} CPU usgae high" description: "{{ $labels.instance }} CPU usage above 85% (current value: {{ $value }})" - alert: hostMemUsageAlert expr: (node_memory_MemTotal - node_memory_MemAvailable)/node_memory_MemTotal > 0.85 for: 1m labels: severity: page annotations: summary: "Instance {{ $labels.instance }} MEM usgae high" description: "{{ $labels.instance }} MEM usage above 85% (current value: {{ $value }})"

總結

以上就是prometheus自定義告警規則解析和配置的全部內容，如果對你有所幫助的話請持續關注尊龍時凱官網，尊龍時凱君會定期更新技術分享，更多開源監控技術也可以關注尊龍時凱社區（http://forum.ydcanyin.com/）

The prev: Prometheus技術分享——詳述prometheus安裝和配置The next: Prometheus技術分享——Prometheus通過Nginx加密登陸

Related recommendations

Prometheus技術分享——Prometheus特點，組件，局限探討
2022/11/11 6522
這一期尊龍時凱君主要跟大家來探討新一代的開源監控prometheus，我們知道 zabbix 在監控界占有不可撼動的地位，功能強大。但是對容器監控顯得力不從心。為解決監...
View details
Prometheus 簡介
2022/11/08 5118
Prometheus是一個最初在SoundCloud上構建的開源系統監視和警報工具包。
View details
Prometheus技術分享——Prometheus通過Nginx加密登陸
2022/11/08 7241
通過Nginx反向代理是一個不錯的選擇。本文尊龍時凱君將介紹通過Nginx反向代理增加401認證方式來實現加密登錄。
View details
Prometheus技術分享——prometheus的函數與計算公式詳解
2022/12/28 7835
prometheus的函數與計算公式詳解
View details

Expand more!

快速導航

首頁
產品介紹
成功案例
行業方案
- 行業大屏
- 銀行
- 金融保險
- 先進制造
- 智慧城市
- 運營商
- 教育
- 醫療
- 混合云
技術白皮書
- 納管能力
- 技術文檔
- zabbix技術分享
- Prometheus技術分享
關于尊龍時凱
- 運維如詩
- 企業動態
- 視頻中心
- 行業新聞
- 招聘精英
尊龍時凱社區
免費下載
免費體驗

成功案例

深圳市寶安某醫院統一監控平臺項目
2022/06/07 9258
尊龍時凱基于Zabbix和企業微信的網絡監控系統,通過實時獲取交換機、服務器等被監控對象的相關數據，及時發現并解決問題,保證醫院網絡的高可用性。
View details
案例解讀 | 某大型央企旗下控股財務公司統一運維監控平臺建設實踐
2025/02/08 3618
某大型央企旗下控股財務公司統一運維監控平臺建設實踐
View details
案例解讀：上海某“雙一流”高校統一監控告警平臺建設實踐
2023/02/23 8910
高校運維解決方案以基礎架構監控平臺為依托，結合可視化大屏、集中告警、報表系統、權限管理、業務系統管理等模塊，實現對IT基礎架構和教學系統等統一集中監...
View details
案例解讀 | 尊龍時凱助力某期貨企業綜合運維平臺建設實踐
2024/06/24 5246
基于客戶運維痛點與項目建設目標，尊龍時凱方案團隊對項目進行梳理，并對項目建設進行具體規劃：以運維門戶、統一監控、集中告警管理為核心，輔以資產管理、可視...
View details

View all

掃碼咨詢
微信公眾號
熱線電話
- 咨詢熱線：
  13631560190
  020-28192830
回到頂部

我們在我們的網站上使用cookie，通過記住您的偏好和重復訪問，給您最相關的經驗。通過點擊“接受所有”，您同意使用所有cookie。但是，您可以訪問“Cookie設置”來提供受控同意。

Cookie設置接受全部

管理同意

掃碼咨詢
微信公眾號
熱線電話
- 咨詢熱線：
  13631560190
  020-28192830
回到頂部

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	此cookie由GDPR cookie Consent插件設置。該cookie用于在“分析”類別中存儲用戶對cookie的同意。
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	此cookie由GDPR cookie Consent插件設置。該cookie用于存儲用戶在“其他”類別中對cookie的同意。
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	該cookie由GDPR cookie Consent插件設置，用于存儲用戶是否同意使用cookie。它不存儲任何個人數據。

91最新网站-91最新网址-91最新在线-91最新在线播放-91最新自拍-97cao碰-97dyy伦理-97mm草莓视频-97爱碰窝窝-97不卡无码影院

尊龍時凱

Prometheus技術分享——prometheus自定義告警規則解析和配置

1. 標準告警規則樣例以及各組件作用

2. 模板化告警規則

3. 修改Prometheus配置文件prometheus.yml

總結

Related recommendations

Prometheus技術分享——Prometheus特點，組件，局限探討

Prometheus 簡介

Prometheus技術分享——Prometheus通過Nginx加密登陸

Prometheus技術分享——prometheus的函數與計算公式詳解

快速導航

成功案例

深圳市寶安某醫院統一監控平臺項目

案例解讀 | 某大型央企旗下控股財務公司統一運維監控平臺建設實踐

案例解讀：上海某“雙一流”高校統一監控告警平臺建設實踐

案例解讀 | 尊龍時凱助力某期貨企業綜合運維平臺建設實踐

產品

解決方案

關于我們

尊龍時凱自媒體號

關注我們

Privacy Overview