新书推介:《语义网技术体系》
作者:瞿裕忠,胡伟,程龚
   >>中国XML论坛<<     W3CHINA.ORG讨论区     计算机科学论坛     SOAChina论坛     Blog     开放翻译计划     新浪微博  
 
  • 首页
  • 登录
  • 注册
  • 软件下载
  • 资料下载
  • 核心成员
  • 帮助
  •   Add to Google

    >> 最新的技术动态
    [返回] 中文XML论坛 - 专业的XML技术讨论区休息区『 最新动态 & 业界新闻 』 → 读写网采访Tim Berners-Lee第二部分:搜索引擎,用于数据的用户界面,Wolfram Alpha等等 查看新帖用户列表

      发表一个新主题  发表一个新投票  回复主题  (订阅本版) 您是本帖的第 17890 个阅读者浏览上一篇主题  刷新本主题   树形显示贴子 浏览下一篇主题
     * 贴子主题: 读写网采访Tim Berners-Lee第二部分:搜索引擎,用于数据的用户界面,Wolfram Alpha等等 举报  打印  推荐  IE收藏夹 
       本主题类别: Ontology Engineering | RDF/RDFS    
     admin 帅哥哟,离线,有人找我吗?
      
      
      
      威望:9
      头衔:W3China站长
      等级:计算机硕士学位(管理员)
      文章:5255
      积分:18406
      门派:W3CHINA.ORG
      注册:2003/10/5

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给admin发送一个短消息 把admin加入好友 查看admin的个人资料 搜索admin在『 最新动态 & 业界新闻 』的所有贴子 点击这里发送电邮给admin  访问admin的主页 引用回复这个贴子 回复这个贴子 查看admin的博客楼主
    发贴心情 读写网采访Tim Berners-Lee第二部分:搜索引擎,用于数据的用户界面,Wolfram Alpha等等

    ReadWriteWeb Interview With Tim Berners-Lee, Part 2: Search Engines, User Interfaces for Data, Wolfram Alpha, And More...
    from ReadWriteWeb by Richard MacManus

    按此在新窗口浏览图片In part 2 of my one-on-one interview with Tim Berners-Lee, we explore a variety of topics relating to Linked Data and the Semantic Web. If you missed it, in [URL=http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php]Part 1 of the interview[/URL] we covered the emergence of Linked Data and how it is being used now even by governments.

    In Part 2 we discuss: how previously reticent search engines like Google and Yahoo have begun to participate in the Semantic Web in 2009, user interfaces for browsing and using data, what Tim Berners-Lee thinks of new computational engine Wolfram Alpha, how e-commerce vendors are moving into the Linked Data world, and finally how the Internet of Things intersects with the Semantic Web.

    Sponsor
    [URL=http://d1.openx.org/ck.php?n=15658&cb=15658]按此在新窗口浏览图片[/URL]
    Semantic Web and Search Engines Like Google, Yahoo

    RWW: You've been talking about the Semantic Web for many years now. Generally the view is that Semantic Web is great in theory, but we're still [URL=http://www.readwriteweb.com/archives/rdf_semantic_web_apps.php]not seeing a large number of commercial web apps that use RDF[/URL] (we've seen a number of scientific or academic ones). However we have [URL=http://www.readwriteweb.com/archives/understanding_the_new_web_era_web_30_linked_data_s.php]begun to see some traction with RDFa[/URL] (embedding RDF metadata into XHTML Web content), for example [URL=http://www.readwriteweb.com/.../search_options_google_search_evolves.php]Google's Rich Snippets[/URL] and [URL=http://www.readwriteweb.com/archives/semtech_making_the_web_searchable_searchmonkey.php]Yahoo's SearchMonkey[/URL]. Has the takeup of RDFa taken you by surprise?

    TBL: Not really, but the takeup by the search engines is interesting. In a way I was happy to see that, it was a milestone for those things to come out of the search engines. The search engines had typically not been keen on the Semantic Web - maybe you could argue that their business is making order out of chaos, and they're actually happy with the chaos. And if you provide them with the order, they don't immediately see the use of it.

    "The search engines have not been keen on the Semantic Web [...] their business is making order out of chaos, and they're actually happy with the chaos."

    Also I think there was misunderstanding in the search engine industry that the Semantic Web meant metadata, and metadata meant keywords, and keywords don't work because people lie. Because traditionally in information retrieval systems, keywords haven't proven up to the task of finding stuff on the Web. One of the reasons is that people lie, the other is that they can't be bothered to enter keywords. So keywords have gotten a bad reputation, then metadata in general was tarred with this 'keywords don't work' brush. Because a lot of Semantic Web data included metadata, then people thought that with Semantic Web data -- again, that people will lie and won't have the time to produce it.

    按此在新窗口浏览图片
    Google rich snippets example; image credit: Matt Cutts

    Now I think there's a realization that when you're putting data online, that people are motivated NOT to lie. For example when your band is going to produce its next album, or when your band is going to play next downtown, you're motivated to put that information up there on the Semantic Web. There's an awful lot of cases when actually data is really important to people; and it's on the web anyway. So I think it's great that some of the search engine companies are starting to read RDFa.

    Does this mean that they [search engines] will start to absorb the whole RDF data model? If they do, then they will be able to start pulling all of the linked data cloud in.

    "The web of linked data and the web of documents actually connect in both directions, with links."

    Will they know what to do with it? Because when it's data in a very organized form, I think some people have been misunderstanding the Semantic Web as being something that tries to make a better search engine - i.e. when you type something into a little box. But of course the great thing about the Semantic Web is that you can query it, you can ask a complicated query of the Semantic Web, like a SQL query (we call it a SPARQL query), and that's such a different thing to be able to do. It really doesn't compare to a search engine.

    You've got search for text phrases on one side (which is a useful tool) and querying of the data on the other. I think that those things will connect together a lot.

    So I think people will search using a search text engine, and find a webpage. On the front of the webpage they'll find a link to some data, then they'll browse with a data browser, then they'll find a pattern which is really interesting, then they'll make their data system go and find all the things which are like that pattern (which is actually doing a query, but they'll not realize it), then they'll be in data mode with tables and doing statistical analysis, and in that statistical analysis they'll find an interesting object which has a home page, and they'll click on that, and go to a homepage and be back on the Web again.

    So the web of linked data and the web of documents actually connect in both directions, with links.
    User Interfaces for Semantic Content

    RWW: At the recent SemTech conference, [URL=http://www.readwriteweb.com/archives/the_state_of_the_market_in_semantic_technologies.php]Tom Tague of Thomson Reuters' Calais project suggested[/URL] that user interfaces for semantic content are key in getting more take-up. With that in mind, I wonder if you've seen some great interfaces or designs for semantic applications in recent months - if so which ones and why did they impress you?

    TBL: I think that whole area is very exciting at the moment. The only piece of hacking I've done over the past few years has been on a thing called [URL=http://www.w3.org/2005/ajar/tab]the Tabulator[/URL] [a data browser and editor], which is addressing exactly that. Partly because I wanted to be able to look at this data. And now there are lots of different ways that people need to be able to look at data. You need to be able to browse through it piece by piece, exploring the world of data. You need to be able to look for patterns of particular things that have happened. Because this is data, we need to be able to use all of the power that traditionally we've used for data. When I've pulled in my chosen data set, using a query, I want to be able to do [things like] maps, graphs, analysis, and statistical stuff.

    按此在新窗口浏览图片
    W3C Tabulator, a data browser/editor; Image credit: [URL=http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/publishing/]wiwiss.fu-berlin.de[/URL]

    So when you talk about user interfaces for this, it's really very very broad. Yes I think it's important. There's also the distinction we can make between the generic interfaces and the specific interfaces.

    There will always be specific interfaces; for example if you're looking at calendar data, there's nothing else like a calendar that understands weeks, months and years. If you're looking at a genome, it's good to have a genetics-specific user interface.

    "I want to be able to do maps, graphs, analysis, and statistical stuff."

    However you also need to be able to connect that data, through generic interfaces. So if my genome data was taken during an experiment which happened over a particular period, I need to be able to look at that in the calendar - so I can connect the genetics to the calendar.

    So one of the things I hope to see is domain-specific things for various different domains, and the generic user interfaces. And hopefully the generic interfaces will be able to tie together all of the domains.

    Next Page: Wolfram Alpha; e-Commerce and Linked Data
    Wolfram Alpha and Natural Language Interfaces

    RWW: An interesting new product was [URL=http://www.readwriteweb.com/archives/wolframalpha_our_first_impressions.php]launched this year[/URL] called [URL=http://www.wolframalpha.com/]Wolfram|Alpha[/URL], described as a 'computational knowledge engine.' It's kind of a mix between Google (search) and Wikipedia (knowledge), and its key attribute is that enables you to compute something. The founders think that 'computing' things on the fly is something we're going to see a lot of in future. What's your take on Wolfram|Alpha?

    TBL: There are two parts to that sort of technology. One of them is a sort of stilted natural language interface. We've seen those sort of natural language queries for years. Boris Katz [from W3C] created a system called [URL=http://start.csail.mit.edu/start-system.html]START[/URL] [a software system designed to answer questions that are posed to it in natural language]. I think with the Semantic Web out there, those sorts of interfaces are going to become important, very valuable, because people will be able to ask more complicated things. The search engine has traditionally been limited to just a phrase, but some of the search engines are now starting to realize that if they put data behind them and have computation engines, then you can ask things like 'what's this many pounds in dollars?' and so on. So yes, those interfaces will become important.

    "Those sorts of interfaces will become important [...] people will be able to ask more complicated things."

    Conversational interfaces have always been a really interesting avenue. We've had voice browser work in W3C, that has been an interesting alternative avenue. It's possible that as compute power goes up, we'll see a prolifieration of machines capable of doing voice. It'll move from the mainframe to being able to run on a laptop or your phone. As that happens, we'll get actual voice recognition and pattern natural language at the front end. That will perhaps be an important part of the Semantic Web.

    按此在新窗口浏览图片

    We talked before about what a great challenge the Semantic Web is going to be from a user interface point of view. Conversational interfaces are going to be part of [solving] that. Of course it's also going to be really valuable to have compositional interfaces - for the visually impaired and so on.

    Wolfram|Alpha is also a large curated database of data sets. Obviously I'm interested in the big data set which is out there, which is Linked Data. This everybody can connect to. I don't really know a lot about the internals of Wolfram|Alpha's data set. I don't know whether they're likely to put any of it out on the web as Linked Data - that might be an interesting addition. I imagine that quite a lot of it may have come from the web of Linked Data.
    e-Commerce and Linked Data

    RWW: There have been [URL=http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html]reports recently[/URL] that both Google and Yahoo will be supporting the Good Relations ontology and linked data for e-commerce. Companies such as Best Buy are already putting out product information in RDFa. What would be your advice to e-commerce vendors right now, to help them transition to this world of structured data on the Web. The same question could be asked across many verticals, but e-commerce seems like one area which has some momentum right now. Would you advise them just to put out their data as Linked Data?

    TBL: Yup! Certainly this year is the year to do it. I've been advising governments to do it and when you look at an enterprise, you find that a lot of the issues are the same. But when you put your data from government or enterprise out there, make sure you don't disturb existing ecosystems. Don't threaten those systems, because you've spent years building them up.

    Maybe there's an analogy with when the Web first started and the first bookshops went online. They were more or less a flyer, saying 'hey we have a great bookshop at 23 Main St, come on down!'. Let's say that a person named Joe owned one of these early online bookshops. If somebody had suggested to Joe that he should put his catalog online, Joe would've felt that that was very proprietary data. And he'd be worried that other bookshops would see where he was weak, so they'd be able to advertise themselves as filling that niche he's weak in.

    "When you put your data out there, make sure you don't disturb existing ecosystems."

    But when his competitors Fred and Albert put their catalogs online, then Joe can check which books people are browsing at Fred and Albert's websites. So Joe would [finally] be pursuaded to put his book catalog up online. But he doesn't put up the prices... until Albert and/or Fred does. And even if catalog and pricing is up there, nobody puts their stock levels online. And there was a period of time when nobody [i.e. online booksellers] had their stock levels up. But people got fed up with ordering stuff that wasn't in stock. So the first book shop to actually tell you about stock levels suddenly was then unbelievably attractive to its customers.

    So there's this syndrome of progressive competitive disclosure. This happens when people realize that if you're going to do business with somebody, if you're going to have your partners up and down the supply chain, really it's useful to check the data web - and life goes much more quickly and open.

    Best Buy may be what starts the ball rolling [among e-commerce vendors]. Now if I want to look out for what [products are] available, I can write a program to see what there is. If somebody wants to compete with Best Buy, to my program they'll be invisible unless they can get their data up in RDF. Doesn't matter whether they use RDFa or RDF XML, as long as it maps in a standard fashion to the RDF model, then they will be visible.

    Next Page: Internet of Things; Conclusion
    The Internet of Things

    RWW: I'm fascinated by how the Internet is becoming more and more integrated into the real world. For example the [URL=http://www.readwriteweb.com/archives/5_companies_building_the_internet_of_things.php]Internet of Things[/URL], where everyday objects become [URL=http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php]Internet connected via sensors[/URL]. Have you been following this trend closely too, and if so what impact do you think this will have on the Web in say 5 years time?

    TBL: It connects very much with Semantic Web [and] with linked data. With Linked Data you've got the ability to give a thing a URI. So I can give a URI to my phone, and I can say that's my phone in Linked Data. And also the company that made it can give a URI to the model of the phone. They can also put online all the specs of the phone, and then I can make a link to say that my phone is an example of that product. So now any system which is dealing with me and has access to that data will be able to figure out the sorts of things I can do with my phone, which actually is really valuable. Especially if the phone breaks.

    "The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last mile."

    The Semantic Web has already given URIs to things, and to types of things. When the things themselves have an RFID chip in them, then I think it's a very exciting world. One can take that RFID chip, go to the Internet and find out the data about the thing. Whether we'll be able to do that, whether the manufacturers will be open enough to allow me to turn data about the identifier of the thing into data about the thing, is yet to be seen. But it's a very exciting idea.

    按此在新窗口浏览图片
    [URL=http://www.pachube.com/]Pachube[/URL], an example of the Internet of Things (see [URL=http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php]ReadWriteWeb profile[/URL])

    Similarly, I'd like to be able to scan a barcode and get back nutritional information about what's in - for example - a can of food. But we don't have that yet. To get that sort of thing, which is very powerful, we need to build look-up systems, which allow you to translate an RFID code or a barcode into an HTTP address.

    The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last link - the last mile. Give the thing a notion of its own identity in the web.
    Conclusion

    RWW: The over-riding message in both [URL=http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php]Part 1[/URL] and 2 of our interview with Tim Berners-Lee, is for companies and organizations to make their data available online. Preferably as Linked Data, which uses a subset of Semantic Web technologies. But Berners-Lee noted, in Part 1 of our interview, that he'd even be happy with the data in CSV (comma separated values) format.

    It's clear that we've seen a lot of progress in linked data already in 2009. In upcoming posts on ReadWriteWeb, we'll continue to track this trend and explain how organizations can contribute their data.


       收藏   分享  
    顶(0)
      




    ----------------------------------------------

    -----------------------------------------------

    第十二章第一节《用ROR创建面向资源的服务》
    第十二章第二节《用Restlet创建面向资源的服务》
    第三章《REST式服务有什么不同》
    InfoQ SOA首席编辑胡键评《RESTful Web Services中文版》
    [InfoQ文章]解答有关REST的十点疑惑

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/7/10 10:17:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 最新动态 & 业界新闻 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客2
    发贴心情 
    访谈的第二部分,竟然会谈到搜索引擎,确实让我感到些许意外。
    Wolfram Alpha实质是一个电子商务网站,刚看到主题名中它和搜索引擎以及用户界面这样的术语并列出现时还以为是一种网络技术的名称呢!着实让我吃惊不小,又出来一个我不熟悉的新东西?!
    Hoo, Hoo, Hoo

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/7/11 11:38:00
     
     admin 帅哥哟,离线,有人找我吗?
      
      
      
      威望:9
      头衔:W3China站长
      等级:计算机硕士学位(管理员)
      文章:5255
      积分:18406
      门派:W3CHINA.ORG
      注册:2003/10/5

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给admin发送一个短消息 把admin加入好友 查看admin的个人资料 搜索admin在『 最新动态 & 业界新闻 』的所有贴子 点击这里发送电邮给admin  访问admin的主页 引用回复这个贴子 回复这个贴子 查看admin的博客3
    发贴心情 
    关于Wolfram Alpha,这里有个访谈:http://www.infoq.com/cn/articles/wolfram-alpha-interview

    ----------------------------------------------

    -----------------------------------------------

    第十二章第一节《用ROR创建面向资源的服务》
    第十二章第二节《用Restlet创建面向资源的服务》
    第三章《REST式服务有什么不同》
    InfoQ SOA首席编辑胡键评《RESTful Web Services中文版》
    [InfoQ文章]解答有关REST的十点疑惑

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/7/13 16:38:00
     
     Humphrey 帅哥哟,离线,有人找我吗?狮子座1981-7-23
      
      
      威望:1
      等级:研二(搞定了DL,再搞定F-Logic!)
      文章:937
      积分:5743
      门派:W3CHINA.ORG
      注册:2008/3/12

    姓名:(无权查看)
    城市:(无权查看)
    院校:(无权查看)
    给Humphrey发送一个短消息 把Humphrey加入好友 查看Humphrey的个人资料 搜索Humphrey在『 最新动态 & 业界新闻 』的所有贴子 引用回复这个贴子 回复这个贴子 查看Humphrey的博客4
    发贴心情 
    您亲自做的访谈哇!太厉害了!
    有机会了解这个“计算知识搜索引擎”Wolfram Alpha,一个没有使用语义网技术却采用了本体检索方式来给出精确结果的特殊工具。对于这样的结构而言计算所采用的规则(推理规则)就显得至关重要了,而这或许也就是它的核心所在吧。
    事实上,又来了一个叫做NKS的新技术名称。从它的主旨思想来看,NKS和我最初接触语义网应用时想象的语义网的作用是十分类似的,只是不知道它的前景会充满光明还是像NLP一般黯淡下来。
    最后,Wolfram虽然盈利模式尚未确定,但是摊子已经铺得很大了。希望它不会在财政方面出现问题而陷入窘境,毕竟有了它,本体和语义网的应用又进一步得到拓展,而我们则又有了一个可供参考的标杆。

    ----------------------------------------------
    鸿丰

    点击查看用户来源及管理<br>发贴IP:*.*.*.* 2009/7/13 17:22:00
     
     GoogleAdSense狮子座1981-7-23
      
      
      等级:大一新生
      文章:1
      积分:50
      门派:无门无派
      院校:未填写
      注册:2007-01-01
    给Google AdSense发送一个短消息 把Google AdSense加入好友 查看Google AdSense的个人资料 搜索Google AdSense在『 最新动态 & 业界新闻 』的所有贴子 访问Google AdSense的主页 引用回复这个贴子 回复这个贴子 查看Google AdSense的博客广告
    2024/12/11 22:56:17

    本主题贴数4,分页: [1]

    管理选项修改tag | 锁定 | 解锁 | 提升 | 删除 | 移动 | 固顶 | 总固顶 | 奖励 | 惩罚 | 发布公告
    W3C Contributing Supporter! W 3 C h i n a ( since 2003 ) 旗 下 站 点
    苏ICP备05006046号《全国人大常委会关于维护互联网安全的决定》《计算机信息网络国际联网安全保护管理办法》
    125.000ms