文本操作 (text operation)
标引词 (indexing term)
倒排文档 (inverted file)
用户反馈 (user feedback)
检索评价 (retrieval evaluation)
查询语言 (query language)
用户界面 (user interface)
特别检索 (ad hoc retrieval)
用户需求档 (user profile)
语词加权 (term weighting)
数据检索 (data retrieval)
用户任务 (user task)
查全率 (R, Recall Ratio)
查准率 (P, Precision Ratio)
漏检率 (O, Omission Ratio)
误检率 (M, Miss Ratio)
用户负担 (user effort)
面向用户 (user-oriented)
扩展布尔模型 (extended Boolean model)
词干提取 (stemming)
参考测试集 (reference test collection)
扩展模式 (extended patterns)
容错查询 (searching allowing errors)
有序包含 (ordered inclusion)
无序包含 (unordered inclusion)
查询重构 (query reformulation)
查询扩展 (query expansion)
语词重新加权 (term reweighting)
用户相关反馈 (user relevance feedback)
描述性元数据 (descriptive metadata)
语义元数据 (semantic metadata)
信息检索 (Information Retrieval, IR)
受控词汇表 (controlled vocabulary)
文本压缩 (text compression)
压缩比 (compression ratio)
词汇分析 (lexical analysis)
排除停用词 (elimination of stopwords)
词汇表 (vocabulary)
事件表 (occurrence)
散列变换 (hashing)
查询语法树 (query syntax tree)
移位-或 (shift-or)
顺序检索 (sequential search)
并行计算 (parallel computing)
分布计算 (distributed computing)
算术逻辑单元 (arithmetic logic unit, ALU)
虚拟处理器 (virtual processor)
中介器 (broker)
数字图书馆 (Digital Library, DL)
信息存取任务 (information access tasks)
信息可视化 (information visualization)
上下文关键词 (keyword-in-context, KWIC)
半结构化数据 (semi-structured data)
动态服务器 (dynamic server)
档案库服务器 (archive server)
精确匹配 (exact match)
基于内容 (content-based)
收集器-标引器 (crawler-indexer)
机器人 (robots)
蜘蛛 (spiders)
查询界面 (the query interface)
响应界面 (the answer interface)
网页级别 (PageRank)
漫游Web (crawling the Web)
广度优先 (breadth-first)
深度优先 (depth-first)
专指性查询 (specific queries)
泛指性查询 (broad queries)
网络目录 (Web directories)
元搜索引擎 (metasearchers)
用户培训 (teaching the user)
软件代理 (software agents)
工程索引 (Engineering Index, EI)
从用户的角度 (from a user-centered perspective)
杜威十进分类法 (Dewey Decimal Classification)
结构化文本检索 (structured text retrieval, STR)
联机公共检索目录 (online public access catalog, OPAC)
多媒体信息检索 (Multimedia Information Retrieval, MIR)
从计算机学科的角度 (from a computer-science perspective)
文献逻辑表示(视图) (logical view of the document)
检索性能评价 (retrieval performance evaluation)
标准通用标记语言 (Standard Generalized Markup Language, SGML)
机读目录记录 (Machine-Readable Cataloging Record, MARC)
资源描述框架 (Resource Description Framework, RDF)
XML (eXtensible Markup Language, 可扩展标记语言)
HTML (HyperText Markup Language, 超文本标记语言)
分布式信息检索 (distributed information retrieval)
通过图像内容查询 (Query by Image Content, QBIC)
分布式结构 (distributed architecture)
集中式结构 (centralized architecture)
国会图书馆分类法 (Library of Congress Classification)
联机计算机图书馆中心 (Online Computer Library Center, OCLC)
数字图书馆创新项目 (Digital Libraries Initiative, DLI)
数字化对象标识符 (Digital Object Identifier, DOI)

1 Boolean retrieval 布尔检索
2 The term vocabulary and postings lists 词汇表和文档记录列表
3 Dictionaries and tolerant retrieval 字典和容错检索
4 Index construction 构建索引
5 Index compression 索引压缩
6 Scoring, term weighting, and the vector space model 评分、词条权重和向量空间模型
7 Computing scores in a complete search system 完整搜索系统中的评分计算
8 Evaluation in information retrieval 信息检索的评价
9 Relevance feedback and query expansion 相关反馈和查询扩展
10 XML retrieval XML检索
11 Probabilistic information retrieval 概率信息检索
12 Language models for information retrieval 信息检索中的语言模型
13 Text classification and Naive Bayes 文本分类和朴素贝叶斯
14 Vector space classification 向量空间分类
15 Support vector machines and machine learning on documents 支持向量机和文档机器学习
16 Flat clustering 扁平聚类
17 Hierarchical clustering 层次聚类
18 Matrix decompositions and latent semantic indexing 矩阵分解和潜在语义索引
19 Web search basics Web搜索基础
20 Web crawling and indexes 网页收集和索引
21 Link analysis 链接分析