以文本方式查看主题

-  中文XML论坛 - 专业的XML技术讨论区  (http://bbs.xml.org.cn/index.asp)
--  『 最新动态 & 业界新闻 』  (http://bbs.xml.org.cn/list.asp?boardid=35)
----  Yahoo研究人员宣称Semantic Web死了 (提醒:为避免误解此标题,请读完正文)  (http://bbs.xml.org.cn/dispbbs.asp?boardid=35&rootid=&id=47028)


--  作者:admin
--  发布时间:5/17/2007 3:38:00 PM

--  Yahoo研究人员宣称Semantic Web死了 (提醒:为避免误解此标题,请读完正文)

(admin注:本文作者Mor Naaman系Yahoo Research Berkeley研究人员,本文发表于他自己的Blog)

The Emerging-Semantics Web (”The Semantic Web is Dead”)

May 16, 2007 on 10:24 am | by Mor

Last week, I participated in a WWW2007 panel called “Multimedia Metadata Standards in a Semantic Web 3.0“, where I took the opportunity to declare the Semantic Web dead. As you can imagine, such a declaration in front of a crowd of semantic web researchers provoked many responses. While I believe panels should be provocative and entertaining, I also have specific reasons for why I went as far as calling the Semantic Web “dead”. Let me explain what I mean.

There is no way that we can engage the masses in annotating media with “semantic” labels. At best, we can get the people to annotate content (such as Flickr images or YouTube videos) with short text descriptions or tags. This works only because tags are simple; powerful (can be used for many tasks) and, in some systems, carefully engineered to match the user’s natural motivations. Our best hope is to be able to take this bottoms-up annotation, or folksonomy if you will, and try to assign some semantics to it later - Flickr’s Clustering is a great example, as well as Y!RB’s TagMaps and our upcoming SIGIR paper (”Towards Automatic Extraction of Event and Place Semantics from Flickr Tags”, available in pdf).

Now, developers are people too(tm). When Web2.0 developers have any need for a multimedia standard, they will choose something that is simple, powerful, and answers their immediate needs. Semantic Web? Dublin Core? MPEG-7? I don’t think so. The masses of developers have opted to use simple formats such as (partial list) RSS, Microformats, and recently, Flickr machine tags.

Flickr machine tags are, in essence, a bottom-up way to assign “lightweight semantics” to Flickr tags/images. Machine tags are an easy way for developers to create their own semantics and applications around the Flickr platform by using a triple-tag such as “upcoming:event=144945″ to signify that a photo was taken in the event ID 144945 in the Upcoming.org event database. Machine tags are an “emerging format” that will adapt to the needs of developers and will reflect the salient properties of the data - just like tags. Flickr cannot, and probably would not want to regularize or limit the usage of machine tags. Issues will arise - for example, since the namespace for machine tags is not pre-defined (a good thing!) we may have multiple names for the same semantics. Or, developers may use the same names for semantically different subjects.

As with regular tags, it will be up to us, researchers, to make sense of the machine tags via our “tools” - be it schema mapping, pattern extraction, semantic analysis, or any other technique (content anaylsis may even help in the case of multimedia). There is simply no hope in enforcing a complete set of semantics for media (or content) annotation on the web as a whole. Which led me to declare the [grand vision of the] semantic web dead.

Well, as I mentioned above, this claim engaged the audience quite a bit (a good thing!). Some of the panel responses claimed that the machine tags are, indeed, a success, or even an instance, of the semantic web. I mentioned Aaron Cope’s statement: “machine tags are not RDF but they can play RDF on TV”. I also mentioned Dan Catt’s quadruple-hop from machine tags through RDF to JSON using Triplr and Pipes (partially in support of the connection between machine tags and the semantic web). So, it doesn’t matter whether RDF is used or not (Dave Beckett, friend of the program, and the editor of RDF as well as the man behind Triplr was in the crowd and participated in the discussion). Others pointed out that the semantic web is very successful and useful for some domains (I agree).

To summarize and conclude, let me tone down my claim. The Semantic Web is not dead; but the grand vision of “A Semantic Web” will not be achieved, despite Tim’s noble efforts. Instead, we will see the Emerging-Semantics Web, derived from how people/developers use lightweight formats and tags on popular platforms such as Flickr and YouTube. As for the panel, I concluded with quotes from Nat Torkington’s rather amusing post about the future of the web - right on the spot, but in retrospect, maybe I should have spared the injured crowd from this additional insult at that point…

My slides from the panel are embedded below. I also encourage you to check out the slides of the other panelists, all smarter (and perhaps less combative) than me: Sean Bechhofer, Lynda Hardman, and John Smith. And thanks to Susanne Boll and Raphael Trochy for organizing!

点击开新窗口欣赏该FLASH动画![全屏欣赏]



--  作者:superc_7
--  发布时间:5/17/2007 4:01:00 PM

--  
这两年emergent semantic随着web2.0出现了
记得有一句话说的很好“web2.0 is doing the semantic web in a messy way”
所以个人感觉标签和语义网技术的结合还是非常有前途的领域,毕竟为了迎合大势所趋
--  作者:雪褥织影
--  发布时间:5/18/2007 9:33:00 AM

--  
传统领域的人对SW不感冒还是因为觉得SW太复杂,很多人觉得RDF都很复杂,不要再说什么OWL了,一个普通的网页开发者让他去学DL几乎无法想像

如果可以通过简单的方式让publish SW data变得很简单,而不要让开发者直接接触SW data本身,也许能更快地让人接受


--  作者:baojie
--  发布时间:5/23/2007 7:06:00 AM

--  
yahoo research是对SW不感冒

“如果可以通过简单的方式让publish SW data变得很简单,而不要让开发者直接接触SW data本身,也许能更快地让人接受”

其实没有想像的困难。就好像现在的Word是基于XML的,但是用户并不想需要知道XML


--  作者:雪褥织影
--  发布时间:5/23/2007 10:35:00 AM

--  
面向一个或有限个Schema的publish和browse可以比较容易地做到人性化,Tabulator做得就不错,Google最新的Experimental Search也有点这个意思
然而面向任意Schema就不那么容易了
不过这也未必是一个问题,也许根本就不需要一个通用的解决方案?
W 3 C h i n a ( since 2003 ) 旗 下 站 点
苏ICP备05006046号《全国人大常委会关于维护互联网安全的决定》《计算机信息网络国际联网安全保护管理办法》
64.453ms