Everybody but your boss understands that a relational database isn't "searchable" in the usual sense - you have to explicitly identify keywords, maintain search tables, etc. Fortunately it's easy to do incremental updates of Lucene (http://jakarta.apache.org/lucene/docs/index.html) indexes via the Hibernate Lifecycle interface.
For each class we wish to make searchable, we start by providing a method that creates a Lucene Document describing the instance:

public class Example implements Lifecycle {

    /** hibernate id */
    private Long id;

    /** various fields we want to search */
    private String name;
    private String department;
    private String skills;

    ...

    /**
     * Return a Lucene Document that provides the searchable elements
     * of the object.
     */
    Document getDocument() {
        Document d = new Document();
        d.add(Field.Keyword("id", id.toString()));
        d.add(Field.Keyword("classname", this.getClass().getName()));
        d.add(Field.Keyword("name", name));
        if (department != null) {
            d.add(Field.Keyword("department", department));
        }
        if (skills != null) {
            d.add(Field.Unstored("skills", skills));
        }
        return d;
    }
The four standard Lucene field types are:

Keyword - Indexed and stored in the index verbatim. Suitable for URLs, dates, personal names, telephone numbers, etc. For this technique to work we must store the Hibernate ID as a keyword.
Text - Tokenized, indexed and stored in the index. This field can be searched, but since the original value is also stored you don't want to use it for large fields.
Unstored - Tokenized and indexed, but not stored in the index. Ideal for indexing large amounts of text that does not need to be retrieved in its original form, e.g. the bodies of web pages or PDF documents.
Unindexed - Stored in the index verbatim, but not searchable. These values are normally used to provide displayable text for search results.
Since we store the Hibernate ID as a keyword, we can immediately retrieve the full object from the database and don't need unindexed or text fields, although they may be useful for non-hibernated tools.
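To see why storing the id keyword is enough, here is a minimal search-and-load sketch. It assumes the Lucene 1.x search API and an open Hibernate Session; the class name, index path handling and the "skills" query field are illustrative, not part of the original article.

import java.io.File;
import net.sf.hibernate.Session;
import org.apache.lucene.analysis.StopAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.queryParser.QueryParser;
import org.apache.lucene.search.Hits;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;

public class ExampleSearch {

    /** Find objects whose "skills" field matches the query and load them via Hibernate. */
    public static void search(Session session, File idx, String queryString)
            throws Exception {
        IndexSearcher searcher = new IndexSearcher(idx.getPath());
        try {
            Query query = QueryParser.parse(queryString, "skills", new StopAnalyzer());
            Hits hits = searcher.search(query);
            for (int i = 0; i < hits.length(); i++) {
                Document d = hits.doc(i);
                // The stored keywords identify the row and the class to load it as.
                Long id = Long.valueOf(d.get("id"));
                Class clazz = Class.forName(d.get("classname"));
                Object result = session.load(clazz, id);
                // ... display or collect the result ...
            }
        } finally {
            searcher.close();
        }
    }
}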
We now need to provide Lifecycle methods that incrementally update the Lucene index.

public class Example implements Lifecycle {

    /** Directory containing the Lucene index */
    File idx;

    ...

    /**
     * Open a Lucene IndexWriter on the existing index.
     */
    protected IndexWriter getIndexWriter() throws IOException {
        // false: append to the existing index rather than creating a new one
        return new IndexWriter(idx, new StopAnalyzer(), false);
    }

    /**
     * Open a Lucene IndexReader.
     */
    protected IndexReader getIndexReader() throws IOException {
        return IndexReader.open(idx);
    }

    /**
     * Saving an object for the first time - add it to the Lucene
     * index.
     */
    public boolean onSave(Session s) throws CallbackException {
        try {
            IndexWriter writer = getIndexWriter();
            writer.addDocument(getDocument());
            writer.close();
        } catch (IOException e) {
            throw new CallbackException(e.getMessage());
        }
        return false;
    }

    /**
     * Updating an object - delete the old Document and reinsert it.
     */
    public boolean onUpdate(Session s) throws CallbackException {
        try {
            IndexReader reader = getIndexReader();
            reader.delete(new Term("id", id.toString()));
            reader.close();
            IndexWriter writer = getIndexWriter();
            writer.addDocument(getDocument());
            writer.close();
        } catch (IOException e) {
            throw new CallbackException(e.getMessage());
        }
        return false;
    }

    /**
     * Deleting an object - remove its Document from the index.
     */
    public boolean onDelete(Session s) throws CallbackException {
        try {
            IndexReader reader = getIndexReader();
            reader.delete(new Term("id", id.toString()));
            reader.close();
        } catch (IOException e) {
            throw new CallbackException(e.getMessage());
        }
        return false;
    }

    /**
     * Loading an object - we don't have to do anything here.
     */
    public void onLoad(Session s, Serializable id) {
    }
}
In practice these methods would often be put into a base class for all persistent objects, with the getIndexReader and getIndexWriter methods overridden if desired to provide disjoint indexes.
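A minimal sketch of such a base class, assuming a single shared index directory (the class name and index location are illustrative; subclasses would override getIndexDirectory() or the reader/writer methods for disjoint indexes):

import java.io.File;
import java.io.IOException;
import net.sf.hibernate.Lifecycle;
import org.apache.lucene.analysis.StopAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;

public abstract class SearchablePersistent implements Lifecycle {

    /** Default index directory shared by subclasses; override for a disjoint index. */
    protected File getIndexDirectory() {
        return new File("/var/lucene/index");   // hypothetical location
    }

    protected IndexWriter getIndexWriter() throws IOException {
        return new IndexWriter(getIndexDirectory(), new StopAnalyzer(), false);
    }

    protected IndexReader getIndexReader() throws IOException {
        return IndexReader.open(getIndexDirectory());
    }

    /** Each subclass describes its own searchable fields. */
    protected abstract Document getDocument();

    /** Each subclass exposes its Hibernate id for the "id" keyword. */
    protected abstract Long getId();

    // onSave(), onUpdate(), onDelete() and onLoad() from the Example class
    // above move here unchanged, written in terms of getDocument() and getId(),
    // so every persistent subclass gets indexing for free.
}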
Implementation of the user interface and search functionality is left as an exercise for the reader.
Gavin points out that Lifecycle.onUpdate() is only called on explicit calls to update(), not when Hibernate's dirty checking flushes a modified object, so with this approach you must remember to call update() everywhere an indexed object changes. That isn't hard to code, but it is murder to maintain: all it takes is one oversight and your index is out of sync with your database. A second solution, using an Interceptor, is discussed below.
Another approach is to update the Lucene index incrementally from an Interceptor. We begin by defining a new interface.

public interface Searchable {

    /**
     * Get a Lucene IndexWriter - can be different for each class,
     * allowing multiple indexes.
     */
    public IndexWriter getIndexWriter() throws IOException;

    /**
     * Get a Lucene IndexReader - must refer to the same directory as
     * getIndexWriter().
     */
    public IndexReader getIndexReader() throws IOException;

    /**
     * Get a Lucene Document describing our searchable content. The
     * keywords "id" and "classname" are reserved by our interceptor.
     */
    public Document getDocument();
}
Any persistent class that we want to make searchable simply implements these three methods.
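For instance, a persistent class could implement it like this (a sketch; the Employee class, its fields and the shared index location are illustrative, not part of the original article):

import java.io.File;
import java.io.IOException;
import org.apache.lucene.analysis.StopAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;

public class Employee implements Searchable {

    private Long id;
    private String name;
    private String skills;

    /** Single application-wide index directory - an assumption of this sketch. */
    private static final File IDX = new File("/var/lucene/index");

    public IndexWriter getIndexWriter() throws IOException {
        return new IndexWriter(IDX, new StopAnalyzer(), false);
    }

    public IndexReader getIndexReader() throws IOException {
        return IndexReader.open(IDX);
    }

    public Document getDocument() {
        Document d = new Document();
        // "id" and "classname" are added by the interceptor, not here.
        d.add(Field.Keyword("name", name));
        if (skills != null) {
            d.add(Field.Unstored("skills", skills));
        }
        return d;
    }

    // Hibernate getters and setters elided
}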
We now define the interceptor. Unlike the Lifecycle version, the interceptor itself adds the Hibernate id and classname keywords rather than leaving that to the target class.

public class LuceneInterceptor implements Interceptor, Serializable {
    /**
     * Drop an object from the Lucene index.
     */
    public void drop(Searchable entity, Long id) throws IOException {
        IndexReader reader = entity.getIndexReader();
        reader.delete(new Term("id", id.toString()));
        reader.close();
    }

    /**
     * Add an object to the Lucene index.
     */
    public void add(Searchable entity, Long id) throws IOException {
        Document doc = entity.getDocument();
        doc.add(Field.Keyword("id", id.toString()));
        doc.add(Field.Keyword("classname", entity.getClass().getName()));
        IndexWriter writer = entity.getIndexWriter();
        writer.addDocument(doc);
        writer.close();
    }
    /**
     * Method called when an existing record is updated.
     */
    public boolean onFlushDirty(
            Object entity,
            Serializable id,
            Object[] currentState,
            Object[] previousState,
            String[] propertyNames,
            Type[] types)
            throws CallbackException {
        if (entity instanceof Searchable) {
            if (id instanceof Long) {
                try {
                    drop((Searchable) entity, (Long) id);
                    add((Searchable) entity, (Long) id);
                } catch (IOException e) {
                    throw new CallbackException(e.getMessage());
                }
            } else {
                // unsupported id type - not indexed
            }
        }
        return false;
    }
    /**
     * Method called when a new record is saved.
     */
    public boolean onSave(
            Object entity,
            Serializable id,
            Object[] state,
            String[] propertyNames,
            Type[] types)
            throws CallbackException {
        if (entity instanceof Searchable) {
            if (id instanceof Long) {
                try {
                    add((Searchable) entity, (Long) id);
                } catch (IOException e) {
                    throw new CallbackException(e.getMessage());
                }
            } else {
                // unsupported id type - not indexed
            }
        }
        return false;
    }
    /**
     * Method called when an existing record is deleted. Note that
     * Interceptor.onDelete() returns void rather than boolean.
     */
    public void onDelete(
            Object entity,
            Serializable id,
            Object[] state,
            String[] propertyNames,
            Type[] types)
            throws CallbackException {
        if (entity instanceof Searchable) {
            if (id instanceof Long) {
                try {
                    drop((Searchable) entity, (Long) id);
                } catch (IOException e) {
                    throw new CallbackException(e.getMessage());
                }
            } else {
                // unsupported id type - not indexed
            }
        }
    }

    // rest of the Interceptor methods elided as they follow default behavior
}
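Finally, the interceptor has to be registered with Hibernate. A minimal sketch, assuming the Hibernate 2 style Configuration and SessionFactory APIs:

import net.sf.hibernate.Session;
import net.sf.hibernate.SessionFactory;
import net.sf.hibernate.cfg.Configuration;

public class LuceneInterceptorSetup {

    /** Register the interceptor globally when building the SessionFactory. */
    public static SessionFactory buildFactory() throws Exception {
        Configuration cfg = new Configuration().configure();
        cfg.setInterceptor(new LuceneInterceptor());
        return cfg.buildSessionFactory();
    }

    /** Alternatively, supply it per session if only some sessions should index. */
    public static Session openIndexedSession(SessionFactory factory) {
        return factory.openSession(new LuceneInterceptor());
    }
}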