摘 要:在企業(yè)信息化建設(shè)過(guò)程中形成的數(shù)據(jù)孤島阻礙了企業(yè)信息化程度的進(jìn)一步提高,研究異構(gòu)數(shù)據(jù)集成模型,對(duì)于解決數(shù)據(jù)孤島問(wèn)題具有重要意義。通過(guò)應(yīng)用XML技術(shù)構(gòu)建數(shù)據(jù)集成中間實(shí)體,屏蔽各異構(gòu)數(shù)據(jù)之間的差異,從而形成統(tǒng)一數(shù)據(jù)視圖的方法,實(shí)現(xiàn)了一種異構(gòu)數(shù)據(jù)集成新模型XDIM,并將其應(yīng)用在青海大學(xué)綜合信息服務(wù)平臺(tái)中。該模型采用精簡(jiǎn)的算法,具有靈活高效的特點(diǎn),能夠有效解決多數(shù)據(jù)源環(huán)境下的異構(gòu)數(shù)據(jù)集成問(wèn)題。
關(guān)鍵詞:異構(gòu); XML; 數(shù)據(jù)集成;數(shù)據(jù)孤島
中圖分類(lèi)號(hào):TP311 文獻(xiàn)標(biāo)識(shí)碼:B
文章編號(hào):1004-373X(2010)12-0039-04
Research and Application of XML-based Heterogeneous Data Integration
MA Guo-cai, LIU Hai-xiong
(Qinghai University, Xining 810016, China)
Abstract:The model of heterogeneous data integration is researched to resolve the \"data island\" which is formed in the process of enterprise infomationization and blocks the further progress of enterprise infomationization. A XDIM model of heterogeneous data integration is achieved by constructing the middle data integration components to shield the defference among all the heterogeneous data and form a unified data view. It has been applied in the comprehensive information service platform of Qinghai University. With simple algorithm, the model possesses the flexibility and high-efficiency characteristics, and can effectivly solve the problem of heterogeneous data integration in the environment of multiple data source.
Keywords:heterogeneous data; XML; data integration; data island
0 引 言
隨著計(jì)算機(jī)技術(shù),特別是Internet 技術(shù)的迅猛發(fā)展,在許多行業(yè)、單位或機(jī)構(gòu)、部門(mén)內(nèi)部都逐步實(shí)現(xiàn)了業(yè)務(wù)、信息的計(jì)算機(jī)化管理。但是,各個(gè)行業(yè)、部門(mén)或機(jī)構(gòu)由于業(yè)務(wù)和功能歸屬不同,因此都是根據(jù)自身的需要,構(gòu)建了許多相互隔離的信息服務(wù)和管理系統(tǒng)。甚至在一個(gè)單位內(nèi)部各部門(mén)所采用的計(jì)算環(huán)境由不同平臺(tái)組成,而不是固守任何一個(gè)平臺(tái)。這樣隨著時(shí)間的推移和技術(shù)的進(jìn)步,這些由不同核心技術(shù)構(gòu)建的信息系統(tǒng)就像一個(gè)個(gè)“信息孤島” ,各自有著不同的處理對(duì)象、操作方法和專(zhuān)用客戶端,在各個(gè)環(huán)節(jié)之間存在著數(shù)據(jù)交流和部門(mén)協(xié)同的問(wèn)題[1]。“信息孤島”的存在不僅提高了企業(yè)維護(hù)數(shù)據(jù)的費(fèi)用,而且企業(yè)很難根據(jù)分散的數(shù)據(jù)做出正確的決策[2]。為了改善這種局面,同時(shí)在各個(gè)“信息孤島”之中共享和交換數(shù)據(jù),并且給企業(yè)用戶提供企業(yè)數(shù)據(jù)的集成視圖,從而根據(jù)集成之后的數(shù)據(jù)及時(shí)地調(diào)整業(yè)務(wù)策略,就必須考慮數(shù)據(jù)集成的問(wèn)題。……