中文XML论坛 - 专业的XML技术讨论区--显示贴子

以文本方式查看主题

-  中文XML论坛 - 专业的XML技术讨论区  (http://bbs.xml.org.cn/index.asp)
--  『 Semantic Web(语义Web)/描述逻辑/本体』  (http://bbs.xml.org.cn/list.asp?boardid=2)
----  概念相似度 Similarity Measurement in Ontology  (http://bbs.xml.org.cn/dispbbs.asp?boardid=2&rootid=&id=22329)

--  作者：BenLin
--  发布时间：9/18/2005 10:05:00 PM

--  概念相似度 Similarity Measurement in Ontology
下面有人在讨论关于概念相似度的文章下载，可是没有人讨论怎么定义、计算概念相似度，我来抛个砖。

我所用的是Tanimoto度量，就是用相交的数目处于整体的数目。比如说，A概念的父节点有(K,L,M,N)，B概念的父节点有(O,P,M,N)，那么有(M,N)2个节点相同；总共有(A,B,K,L,M,N,O,P)8个节点，所以A与B的相似度为2/8=0.25。

希望大家说说大家所看到的计算方式。

--  作者：jpz6311whu
--  发布时间：9/18/2005 11:29:00 PM

--
One method is to measure the minimum edge count distance between C1 and C2 by the maximum depth D of a taxonomic hierarchy the only use hyponymy, is a , type links between concepts.
Sim(C1,C2)= -logP(C3)
   Sim (“mammal”,”reptile”) = -log P (“beast”) =-log P(6/12)= -log (0.5)
   Sim (“bird”,”beast”) =-log P (“animal”)= -log(8/12) = - log(0.75)
   Sim (“eagle”,”gull”) =-log P (“bird”)= -log(2/12) = - log(1/6)

--  作者：BenLin
--  发布时间：10/31/2005 11:44:00 PM

--
为什么要用负号? -log()?

我所想的是：相邻的两个concept，越具体（底层）就应该越相识；越抽象（上层）相识度就少一些。

总的说来，你的方法也是找到最小的共同祖先，用它来计算相似度。

--  作者：Jamsauce
--  发布时间：11/1/2005 3:51:00 PM

--
不错不错。

--  作者：jiexincao
--  发布时间：11/1/2005 6:56:00 PM

--

以下是引用jpz6311whu在2005-9-18 23:29:00的发言：
One method is to measure the minimum edge count distance between C1 and C2 by the maximum depth D of a taxonomic hierarchy the only use hyponymy, is a , type links between concepts.
Sim(C1,C2)= -logP(C3)
    Sim (“mammal”,”reptile”) = -log P (“beast”) =-log P(6/12)= -log (0.5)
    Sim (“bird”,”beast”) =-log P (“animal”)= -log(8/12) = - log(0.75)
    Sim (“eagle”,”gull”) =-log P (“bird”)= -log(2/12) = - log(1/6)

可以给个出处吗？先谢谢了！

--  作者：jpz6311whu
--  发布时间：11/1/2005 7:44:00 PM

--
Ontology-Driven Semantic Information Retrieval
                 Huang Hai, Wang Rujin, Huang He, Wei Yuanyuan, Xiao Bo
                           (huanghai@ustc.edu)
（Institute of Intelligent Machines of Chinese Academy of Science，Hefei 230031,China）

--  作者：jiexincao
--  发布时间：11/1/2005 8:05:00 PM

--
非常感谢！

W 3 C h i n a ( since 2003 ) 旗下站点
苏ICP备05006046号《全国人大常委会关于维护互联网安全的决定》《计算机信息网络国际联网安全保护管理办法》

62.500ms