以文本方式查看主题

-  中文XML论坛 - 专业的XML技术讨论区  (http://bbs.xml.org.cn/index.asp)
--  『 Semantic Web(语义Web)/描述逻辑/本体 』  (http://bbs.xml.org.cn/list.asp?boardid=2)
----  利好报道 - MIT Technology Review: A Smarter Web  (http://bbs.xml.org.cn/dispbbs.asp?boardid=2&rootid=&id=46993)


--  作者:zhaonix
--  发布时间:5/16/2007 9:52:00 PM

--  利好报道 - MIT Technology Review: A Smarter Web
原文较长,在http://www.technologyreview.com/Infotech/18306/page1/ 内容有点像Lassila和Hendler的《Embracing Web 3.0》,但似更谨慎。下面转贴其中讲目前SW应用的部分。其中提到了:
    曾任W3C Semantic Web Initiative leader的Eric Miller的咨询公司Zepheira;
    Tom Gruber等人的RealTravel
    U.K.-based Garlik公司
    神秘的Radar Networks公司(公司主页上写着:目前正出于秘密开发状态,07年内会推出产品)
    Skype创办人建立的P2P TV服务公司 Joost
    
从中可见,对于SW的应用前景,似有两种看法:
1. 只限于RDF和简单本体,且它们有不少用处(以RDF Database的形式,结合其它技术,在实现不同来源的数据集成方面)。如 Hendler的"It's really a realization that a little bit of Semantic Web stuff with what's called Web 2.0 is a tremendously powerful technology." 以及Joost的CTO Dirk-Willem van Gulik 的:"RDF [and the other rudimentary semantic technologies] solve meaningful problems, and it costs less than any other approach would," he says. "The entire remainder"--the more ambitious work with ontologies and artificial intelligence--"is completely academic."
2. 不仅RDF,而且基于推理等复杂的AI技术也会有实用价值。如:神秘的Radar Networks公司的创办人Nova Spivack 说的"First comes what I call the World Wide Database, making data accessible through queries, with no AI involved," Spivack says. "Step two is the intelligent Web, enabling software to process information more intelligently. That's what we're working on."

无论哪一种,上面几家都获得了风险投资。 论坛上的兄弟们加油了!愿早日找到亮点,争取由我们中的某些人在国内首先创办一家这样的web 3.0公司!
---------------------

《A Smarter Web》节选
Into the Real World

If Miller's sunset epiphany showed him the path forward, the community he represented was following similar routes. All around him, ideas that germinated for years in labs and research papers are beginning to take root in the marketplace.

But they're also being savagely pruned. Businesses, even Miller's Zepheira, are adopting the simplest Semantic Web tools while putting aside the more ambitious ones. Entrepreneurs are blending Web 2.0 features with Semantic Web data-­handling techniques. Indeed, if there is to be a Web 3.0, it is likely to include only a portion of the Semantic Web community's work, along with a healthy smattering of other technologies. "The thing being called Web 3.0 is an important subset of the Semantic Web vision," says Jim Hendler, professor of computer science at Rensselaer Polytechnic Institute, who was one of the initiative's pioneer theorists. "It's really a realization that a little bit of Semantic Web stuff with what's called Web 2.0 is a tremendously powerful technology."

Much of that technology is still invisible to consumers, as big companies internally apply the Semantic Web's efficient ways of organizing data. Miller's Zepheira, at least today, is focused on helping them with that job. Zepheira's pitch to companies is fairly simple, perhaps looking once again to Dewey's disorganized libraries. Businesses are awash in inaccessible data on intranets, in unconnected databases, even on employees' hard drives. For each of its clients, Zepheira aims to bring all that data into the light, code it using Semantic Web techniques, and connect it so that it becomes useful across the organization. In one case, that might mean linking Excel documents to payroll or customer databases, in another, connecting customer accounts to personalized information feeds. These disparate data sources would be tied together with RDF and other Semantic Web mechanisms that help computerized search tools find and filter information more efficiently.

One of the company's early clients is Citigroup. The banking giant's global head of capital markets and banking technology, Chris Augustin, is heading an initiative to use semantic technologies to organize and correlate information from diverse financial-data feeds. The goal is to help identify capital-market investment opportunities. "We are interested in providing our customers and traders with the latest information in the most relevant and timely manner to help them make the best decisions quickly," says Rachel Yager, the program director overseeing the effort.

Others are beginning to apply semantic techniques to consumer-focused businesses, varying widely in how deeply they draw from the Semantic Web's well.

The Los Altos, CA-based website RealTravel, created by chief executive Ken Leeder, AdForce founder Michael Tanne, and Semantic Web researcher Tom Gruber, offers an early example of what it will look like to mix Web 2.0 features like tagging and blogging with a semantic data-organization system. The U.K.-based Garlik, headed by former top executives of the British online bank Egg, uses an RDF-based database as part of a privacy service that keeps customers apprised of how much of their personal information is appearing online. "We think Garlik's technology gives them a really interesting technology advantage, but this is at a very early stage," says 3i's ­Waterhouse, whose venture firm helped fund Garlik. "Semantic technology is going to be a slow burn."

San Francisco-based Radar Networks, created by ­EarthWeb cofounder Nova Spivack and funded in part by Allen's Vulcan Capital, plans eventually to release a full development platform for commercial Semantic Web applications, and will begin to release collaboration and information­-sharing tools based on the techniques this year. Spivack himself has been part of the Semantic Web community for years, most recently working with DARPA and SRI International on a long-term project called CALO (Cognitive Agent that Learns and Organizes), which aims to help military analysts filter and analyze new data.

Radar Networks' tools will be based on familiar ideas such as sharing bookmarks, notes, and documents, but Spivack says that ordering and linking this data within the basic Semantic Web framework will help teams analyze their work more efficiently. He predicts that the mainstream Web will spend years assimilating these basic organization processes, using RDF and related tools, while the Semantic Web's more ambitious artificial-intelligence applications wait in the wings.

"First comes what I call the World Wide Database, making data accessible through queries, with no AI involved," Spivack says. "Step two is the intelligent Web, enabling software to process information more intelligently. That's what we're working on."

One of the highest-profile deployments of Semantic Web technology is courtesy of Joost, the closely watched Internet television startup formed by the creators of Skype and Kazaa. The company has moved extraordinarily quickly from last year's original conception, through software development and Byzantine negotiations with video content owners, into beta-testing of its customizable peer-to-peer TV software.

That would have been impossible if not for the Semantic Web's RDF techniques, which Joost chief technology officer Dirk-Willem van Gulik calls "XML on steroids." RDF allowed developers to write software without worrying about widely varying content-use restrictions or national regulations, all of which could be accommodated afterwards using RDF's Semantic Web linkages.

Joost's RDF infrastructure also means that users will have wide-ranging control over the service, van Gulik adds. People will be able to program their own virtual TV networks--if an advertiser wants its own "channel," say, or an environmental group wants to bring topical content to its members--by using the powerful search and filtering ­capacity inherent in the semantic ordering of data.

But van Gulik's admiration goes only so far. While he believes that the simpler elements of the Semantic Web will be essential to a huge range of online businesses, the rest he can do without. "RDF [and the other rudimentary semantic technologies] solve meaningful problems, and it costs less than any other approach would," he says. "The entire remainder"--the more ambitious work with ontologies and artificial intelligence--"is completely academic."



--  作者:superc_7
--  发布时间:5/17/2007 4:06:00 PM

--  
我觉得这两个应用方向都不是特别明朗吧
1、RDF和简单本体的本质性优势到底在哪,或者说为什么我不用成熟的数据库技术而选择尚未成熟的语义web技术?
2、复杂AI推理技术是否受相关领域,如AI领域研究的限制?而且目前来看这种应用只限于某些特定的专业领域,今后到底有多大市场?
无论是哪种情况,Tim最初设想的那种在万维网维度上实现高度自动化的推理的语义网似乎离我们越来越远了
--  作者:zhaonix
--  发布时间:5/17/2007 11:31:00 PM

--  
嗯,其实我也有类似的困惑。
    第1种还好些了,毕竟不同来源的数据中可能涉及了相同的东西,如果能把它们集成起来,说不定真可以产生某种奇妙的价值。但我跟你一样:感觉这里更需要的是传统的数据库技术而不是倚重复杂的逻辑推理的SW。——那种像RSS, FOAF一样简单的、仅仅列出一些概念、属性的本体,不就是关系数据库中的Entity-Relation模型吗? 要说SW带来的新东西,大概就是对URI的重视吧;另外,在SW的语境下,也促进了DB界对大规模、及其分散环境下的数据集成问题的研究兴趣,不知道对不对。

    第2种,即基于强表达能力的本体和复杂的AI方法、指望靠推理来发现隐含知识的,虽然似乎是这几年SW会议文章的主流,但我对其能否实用也很不看好。——AI推理系统,在封闭的环境下也没见有实用的东西推出(虽然FOL推理在50-60年代颇证明了一些数学定理),怎么能指望到了更复杂、更开放的web上会实用起来呢?是不是可以说:“电脑的归电脑,人脑的归人脑”。 但这种SW在一些数据、知识密集的领域如生物信息学的研究工作中,或许会有应用?

    《Embracing Web 3.0》一文中,Tim的在SA2001那篇名作中的两位共同作者在肯定了第1种的同时,也对第2种在将来走向应用持乐观态度;MIT Technology Review这篇好像只是一个记者的报道,报道了:1)现在有不少前述第1种SW的应用正在开发当中;2)第2种的应用前景,有人看好、有人不看好。  (不过这篇我也没看全:()
    毕业要紧,只好把死马当活马医、姑且相信第1种前景不错了:(  另外,SW研究中近几年冒出的第3种流派(我自己总结的)——social web——一开始就走实用路线,似乎挺有前途。只是不知道能不能从SW名下找到一些强大的技术(比如数据挖掘)、使它的tag系统发挥出比现在这种归类、检索更大的价值。


--  作者:superc_7
--  发布时间:5/18/2007 7:30:00 PM

--  
好像是Tim曾说过SW技术的优势之一就是数据集成,简洁的RDF模型确实有这方面的作用,但是对于这一点我体会也不是很深刻
至于对于Social Web的关注,很大程度上是跟这两年来Web2.0兴起相关的,去年ISWC很重要的一个主题就是Semantic Web和Social Web相结合
数据挖掘、复杂网络分析都是早有的东西了,在Social Web中还是很有用的
标签是这些年才出现的,它很类似与SW中所强调的元数据,个人认为这一点上双方还是有很大结合空间的
--  作者:iamwym
--  发布时间:5/19/2007 1:18:00 AM

--  
可能商业前景最好的就是数据集成了。
--  作者:baojie
--  发布时间:5/23/2007 7:00:00 AM

--  
把SW在wiki和blog上实现我看还是挺现实的
--  作者:iamwym
--  发布时间:5/24/2007 4:32:00 PM

--  
问题是WIKI和BLOG现在看起来和商业化的距离比较远啊
--  作者:baojie
--  发布时间:5/24/2007 7:34:00 PM

--  
hoho SW有哪个地方真正商业化了?
--  作者:nepmoon
--  发布时间:5/25/2007 4:02:00 PM

--  
怎么登陆ftp, 我经验值不够啊
--  作者:superc_7
--  发布时间:5/25/2007 10:49:00 PM

--  
wiki和blog等web2.0现在ms都没有成熟的商业模式,不过大家都看好他们,认为以后肯定会找到发财的道
semantic web那一套的问题在于,现在他们对传统数据库没有显示出来明显的比较优势
--  作者:hjx_221
--  发布时间:6/3/2007 10:40:00 PM

--  
商业前景最好的就是数据集成了
W 3 C h i n a ( since 2003 ) 旗 下 站 点
苏ICP备05006046号《全国人大常委会关于维护互联网安全的决定》《计算机信息网络国际联网安全保护管理办法》
82.031ms