趙凱 王華星 施娜



摘要 知識(shí)圖譜與自然語言處理技術(shù)和搜索技術(shù)的結(jié)合越來越廣,成為了近年知識(shí)服務(wù)領(lǐng)域研究的新熱點(diǎn)。目前知識(shí)圖譜在中醫(yī)藥領(lǐng)域的應(yīng)用主要集中在可視化分析,尚無能夠支持自然語言處理領(lǐng)域和知識(shí)服務(wù)領(lǐng)域的中醫(yī)知識(shí)圖譜。本研究使用了Neo4j圖數(shù)據(jù)庫構(gòu)建了基于《傷寒論》桂枝湯類方的小型知識(shí)圖譜,可以實(shí)現(xiàn)對(duì)桂枝湯類方的證、方、藥的可視化分析以及檢索等功能。研究結(jié)果證明了這種方法的可行性,并為今后將中醫(yī)類知識(shí)圖譜與深度學(xué)習(xí)技術(shù)相結(jié)合應(yīng)用的開發(fā)奠定了基礎(chǔ)。
關(guān)鍵詞 知識(shí)圖譜;圖數(shù)據(jù)庫;傷寒論;桂枝湯;自然語言處理;Neo4j;中醫(yī)類方;方證
Abstract In recent years,the increasingly wide combination of knowledge graph,natural language processing as well as search technique has become a new hotspot in the field of knowledge service.Nowadays,the application of knowledge graph in the field of traditional Chinese medicine(TCM)is mainly focused on visual analysis.There is still no TCM knowledge graph that can support the fields of natural language processing and knowledge service.In this paper,Neo4j graph database is used to construct a small knowledge graph based on Guizhi Decoction associated formulas in Treatise on Cold Damage,which can realize functions of visual analysis and searching on syndromes,formulas and medicines of Guizhi Decoction associated formulas.Results of the study prove the feasibility of this method,and lay the foundation for future development of the combination of TCM knowledge graph and deep learning technology.
Key Words Knowledge graph; graph database; Treatise on Cold Damage; Guizhi Decoction; Natural language processing; Neo4j; Chinese medicine formula; Formula and syndrome
中圖分類號(hào):R222 文獻(xiàn)標(biāo)識(shí)碼:A doi:10.3969/j.issn.1673-7202.2019.10.019
傳統(tǒng)AI技術(shù)如深度學(xué)習(xí),如果沒有預(yù)先標(biāo)定好的高質(zhì)量的大規(guī)模數(shù)據(jù)集,在面對(duì)錯(cuò)綜復(fù)雜的臨床醫(yī)學(xué)決策時(shí)往往也束手無策,這時(shí)候,來自現(xiàn)實(shí)世界的經(jīng)驗(yàn)和知識(shí)就顯得格外重要。各種機(jī)器學(xué)習(xí)算法雖然在數(shù)據(jù)的預(yù)測能力上很好,但是在描述能力上非常弱,而知識(shí)圖譜對(duì)于數(shù)據(jù)的描述能力非常強(qiáng)大,恰好填補(bǔ)了這部分的空白。知識(shí)圖譜在國內(nèi)還屬于一個(gè)比較新興的概念,是2012年由谷歌公司首次提出。知識(shí)圖譜本質(zhì)上是一種語義網(wǎng)絡(luò)的知識(shí)庫,是一種基于圖的數(shù)據(jù)結(jié)構(gòu),由節(jié)點(diǎn)和邊組成,主要用來描述真實(shí)世界中存在的各種實(shí)體概念以及之間的關(guān)系。在知識(shí)圖譜里,每個(gè)節(jié)點(diǎn)表示現(xiàn)實(shí)世界中存在的“實(shí)體”,每條邊為實(shí)體與實(shí)體之間的“關(guān)系”?!?br>