基于Lucene的搜索引擎作者姓名:王旭专业班级:2010050704指导教师:涂德志摘要从1994年至今,万维网经过了二十年的飞速发展,当前的万维网数据规模到底有多大无从估量。随着网络信息资源的急剧增长,现如今,信息已经不再是一种稀缺的资源,我们的注意力反而变得稀缺了。人们越来越多地关注如何快速有效地从海量的网络信息中,抽取出潜在的、有价值的信息,使之有效地在管理和决策中发挥作用。搜索引擎提供了一种便捷的获取网络信息的途径,只要你能在电脑上打字,那么你就能通过“输入关键字+自行浏览”的用户交互方式快速查找到自己感兴趣的资源。目前Web搜索引擎(SearchEngine)技术正成为计算机科学界和信息产业界争相研究、开发的对象。搜索引擎是指互联网上一种提供用户查询的一类应用。通过人工目录整理或者是网络爬虫收集互联网上已经存在的网页,在用户输入查询词后,将相关网页迅速展现给用户。用户自行浏览后选择最合适期望的链接,进入查看。关键词:网络信息资源Web搜索引擎查询ABSTRACTSince1994,theWorldWideWebaftertwodecadesofrapiddevelopment,howmuchthecurrentsizeoftheWorldWideWebisincalculable.Withtherapidgrowthofnetworkinformationresources,nowadays,theinformationisnolongerascarceresource,however,ourattentionbecamescarce.moreandmoreconcernedabouthowquicklyandefficientlyfromthevastamountsofnetworkinformation,toextractpotentiallyvaluableinformationtoeffectivelyplayaroleinthemanagementanddecision-making.Searchenginesprovideaconvenientwaytoobtainnetworkinformation,aslongasyoucantypeonacomputer,thenyoucanthroughthemode:"keywords+browse",toquicklyfindtheresourcesyouareinterested.CurrentlyWebsearchengine(SearchEngine)technologyisbecomingthetargetcomputerscienceandinformationindustrycompeteondevelopment.SearchengineontheInternetreferstoamethodofprovidingauserqueriesaclassofapplications.SortingthroughartificialcatalogorwebcrawlerstocollectWebpagesontheInternetalreadyexist,aftertheuserentersthequerywords,therelevantpagesquicklypresentedtotheuser.Choosethemostappropriatelink,browsethedesiredpostintoview.Keywords:NetworkInformationResourcesWebSearchEngineConsult目录第1章前言.......................................................11.1搜索引擎的学术背景与实际意义.............................11.2国内外文献综述...........................................21.3课题来源及主要研究内容...................................2第2章相关技术介绍.................................................42.1JSP与Tomcat.............................................42.2SQLSever数据库.........................................42.3Ajax简介................................................52.4Lucene介绍..............................................5第3章搜索引擎原理.................................................83.1搜索引擎体系结构.........................................83.2搜索引擎主要模块功能介绍.................................93.2.1搜索器(Crawler)..................................103.2.2索引器(Indexer)..................................113.2.3检索器(Searcher)..................................123.2.4用户接口((UserInterface)..........................12第4章系统分析....................................................134.1需求分析................................................134.2系统可行性分析..........................................134.2.1社会可行性分析.................................134.2.2技术可行性分析.................................144.2.3经济可行性分析.................................14第5章总体设计....................................................155.1系统构架.........