羅奕,陳粵



摘 要: 銀行為了在發生異常時能及時處理,往往會通過監控系統來實現對硬件、網絡、應用系統等的監控和報警。Nagios是一個開源且免費的計算機及網絡系統監控軟件,運行在Linux平臺上,能通過各種插件和SNMP協議,對設備、網絡及各種應用系統進行狀態監控。介紹了Nagios的工作原理和功能,以及在平安銀行成都分行的應用情況。具體應用實踐表明,利用Nagios構建集中監控系統效果非常顯著,為銀行的生產運維提供了有效的監控報警平臺。
關鍵詞: Nagios; 集中監控; SNMP; 報警
中圖分類號:TP319 文獻標志碼:B 文章編號:1006-8228(2013)06-30-04
Construction and application of Nagios-based centralized monitoring system in banks
Luo Yi1, Chen Yue2
(1. Medical information engineering college,Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 610075, China;
2. Ping An Bank Chengdu Branch)
Abstract: In order to deal with unexpected abnormal events in time, monitor or alert of devices, networks or applications are realized usually through monitoring systems in banks. Nagios is a free and open-source software running on Linux to monitor computer or networks status. It detects the devices, networks, and applications states by many plug-ins or SNMP protocol. Nagios working principle and primary functions are introduced in this paper, and the actual cases of Ping An Bank Chengdu Branch are analyzed. The practical examples show that constructing centralized monitoring system by using Nagios has good effects and is efficient for bank daily working tasks supporting.
Key words: Nagios; centralized monitoring; SNMP; fault alerting
0 引言
銀行科技部的管理人員最擔心在不知情的情況下發生異常突發事件,比如機房供電異常、設備硬件故障、應用進程終止、網絡通訊中斷等等,而且某些故障發生后,科技人員不能第一時間發現故障,直到出現明顯不良影響,才發現問題,采取補救措施,特別是遇到節假日,這種風險就更大。要使系統能正常穩定運行,管理員就必須時刻關注各個系統的硬件狀況、服務進程、網絡是否正常、CPU、內存使用率是否過高、數據庫可用空間、UPS負載是否合理等等。如果在沒有自動監控工具的幫助下,這些日常必須的檢查工作就需要由人工去做,這樣不僅效率低下,消耗大量的人力資源,而且容易發生漏查、錯查現象。
為改變這種被動局面,銀行往往會引進一些監控系統來實現自動監控功能,用計算機來代替人工進行日常檢查,并在一定的條件下自動報警。……