- 浏览: 271642 次
- 性别:
- 来自: 北京
文章分类
- 全部博客 (161)
- 【**计划】 (2)
- 【**Core Java**】 (30)
- 【**JAVA EE】 (6)
- JDBC (3)
- Hibernate专题系列 (0)
- 【**OS】 (14)
- 【**架构设计/设计模式】 (11)
- 【Hadoop】 (3)
- 【**分布式】 (9)
- 模板 (1)
- C (2)
- 常用工具 (1)
- Oracle (2)
- 【Tips】 (3)
- 【数据库】 (2)
- 玩转Ubuntu (0)
- 【计算机网络/网络编程】 (7)
- 【**Search Engine】 (21)
- 【**专题**】 (6)
- 【**Python】 (10)
- XML (1)
- 【**Open Source Framework】 (1)
- 【高级主题】 (1)
- 【存储】 (3)
- 【笔试面试】 (2)
- 【**数据结构与算法设计】 (20)
- 【其他】 (3)
- 【编程练习】 (2)
- 【待完成】 (12)
- 【工作】 (6)
- 【软件研发】 (4)
- 【**多线程多进程编程】 (5)
- 【Web Service】 (1)
- 【表达式解析/JavaCC系列】 (5)
- 【缓存系统:Memcached】 (1)
- 【Java IO/NIO】 (5)
- 【JVM运行机制及内存管理】 (7)
最新评论
-
107x:
...
python list排序 -
yuzhu223:
...
【Python基础】Python的lambda函数与排序 -
Tonyguxu:
分析查询结果的打分小于11.query=1065800715* ...
lucene打分机制的研究 -
Tonyguxu:
query=139320661963.013709 = (MA ...
lucene打分机制的研究 -
Tonyguxu:
query=10658007150.6772446 = (MA ...
lucene打分机制的研究
http://wiki.apache.org/lucene-java/InformationRetrieval
Books
-
Introduction to Information Retrieval , Manning, Raghavan & Schütze, 2007
-
Managing Gigabytes [KevinBurton ] I can vouch for M.G. as I have a copy and it's a GREAT book. Should be called Managing Terabytes. Its not light reading by any means and you'll probably have to sit down with each chapter for a bit. ([DavidSpencer ] -- I agree, great thorough book, must have)
-
Modern Information Retrieval , Ricardo Baeza-Yates and Berthier Ribeiro-Neto, 1999
-
Foundations of Statistical Natural Language Processing , Chris Manning and Hinrich Schütze, 1999
- Readings in Information Retrieval
-
Mining the Web - Discovering Knowledge from Hypertext Data by Soumen Chakrabarti , Morgan-Kaufmann. A good book that covers all the aspects of web and text mining.
Web links
-
Introduction to Information Retrieval , Manning, Raghavan & Schütze, 2007 This book is available online in PDF form.
-
Big list of links to IR resources: http://www-csli.stanford.edu/~schuetze/information-retrieval.html
-
Inquery Query help [PaulElschot ] This engine has a more elaborated query language than Lucene. However, Lucene supports most of the mechanisms used by the Inquery operators. Recommended: the section on Query Operator Types that makes the distinction between Belief List Operators and Proximity List Operators.
-
Course on Information Retrieval [JoaquinDelgado ] A very solid (and free) online course on "intelligent information retrieval" with focus on practical issues prepared by Prof. Mooney (Univ. of Texas), a well known expert in Machine Learning and IR.
-
course notes from the Stanford course on IR: http://www.stanford.edu/class/cs276/handouts/lecture1.pdf to http://www.stanford.edu/class/cs276/handouts/lecture16.pdf
-
http://www.dcs.gla.ac.uk/~tombrosa/AIS/SMART-tutorial/weights.html
-
http://trec.nist.gov/pubs/trec3/t3_proceedings.html TREC (Text REtrieval Conference) 3 Proceedings, including Salton's paper on SMART.
-
http://www.soi.city.ac.uk/~ser/idf.html The Spärck Jones / Robertson IDF page. Karen Spärck Jones is the author of the paper which introduced IDF.
User Discussions on Java-User
http://www.gossamer-threads.com/lists/lucene/java-user/43042
发表评论
-
【Lucene】建索引核心类介绍
2012-06-08 17:28 1037IndexWriter 负责创建新索引或打开已有索引, ... -
优秀文章汇总
2012-05-08 18:48 723搜索引擎技术之概要预览 http://blog.csd ... -
【Lucene】lucene查询Query对象
2012-05-08 18:41 1380PrefixQuery 前缀查询。 如 test* 会匹配 ... -
【工作】日志检索结果的排序改进分析
2012-04-27 18:07 912下图是现在生产环境的部署图,索引文件分布在70-7 ... -
【Lucene】查询term后加上'*'对打分的影响
2012-04-25 18:14 2057BooleanWeight里sum ... -
lucene.search.Weight
2012-04-25 15:39 966org.apache.lucene.search Cl ... -
lucene.search.Similarity
2012-04-20 10:31 2514Similarity defines the componen ... -
lucene打分机制的研究
2012-04-22 17:46 5803提出问题 目前在查询时,会将得分小于1的查询结果过滤掉。 ... -
tokenize和tokenizer到底怎么翻译?
2012-03-28 10:32 3524在编写词法分析器(Lexer)或语法分析器(Parse ... -
【Lucene】更合理地使用Document和Field
2012-03-27 09:39 5397writer = ...; //#1 Prepared ... -
【Lucene】构建索引
2012-03-17 23:16 731Lucene索引的过程是什么? step1 收集待 ... -
信息检索类小程序
2012-03-17 00:37 8111.对四大名著txt实现索引和搜索功能 2. -
【Lucene】Scoring
2012-03-13 23:47 1137http://lucene.apache.org/core/o ... -
【Lucene】lucene的评分机制
2012-03-07 16:24 916测试环境里查询条件1065800714,为什么Score ... -
【Lucene】搜索的核心类简介
2012-03-05 18:48 1350注:Lucene版本为3.4 I ... -
【Lucene】How to make indexing faster
2012-02-16 14:54 780http://wiki.apache.org/lucene-j ... -
【Lucene】index包IndexWriter
2011-12-25 01:50 767Q1:IndexWriter作用是什么? Q2:索引过 ... -
【Lucene】store包SimpleFSDirectory
2011-12-24 23:43 778store包SimpleFSDirectory -
【Lucene】store包FSDirectory
2011-12-24 13:39 1399源码中涉及以下知识点: 1.java.security.Me ... -
【Lucene】store包Directory
2011-12-11 17:23 1277说明 lucene的版本是3.0.3 结构及类图 文件类 ...
相关推荐
Learning to Rank for Information Retrieval(LETOR) 是Microsoft的一个信息检索相关度排序的数据集,有 Supervised ranking Semi-supervised ranking Rank aggregation Listwise ranking 四种setting,提供了...
information retrieval 最经典教材 Introduction to Information Retrieval By Christopher D. Manning Stanford University Prabhakar Raghavan Yahoo! Research Hinrich Schutze University of Stuttgart
Introduction to Information Retrieval is the first textbook with a coherent treat- ment of classical and web information retrieval, including web search and the related areas of text classification ...
This book provides an overview of the important issues in information retrieval, and how those issues affect the design and implementation of search engines. Not every topic is covered at the same ...
Learning to Rank for Information Retrieval and Natural Language Processing
An Introduction to Information Retrieval An Introduction to Information Retrieval 增加页码,去除水印 2018 最新
一本介绍information retrieval的好书
Introduction to Information Retrieval.pdf
Introduction to Information Retrieval-2009
Introduction to Information Retrieval Christopher D. Manning Stanford University Prabhakar Raghavan Yahoo! Research
Information Retrieval for Music and Motion
information retrieval lecture2 ppt.
信息检索(Information Retrieval)是用户进行信息查询和获取的主要方式,是查找信息的方法和手段。狭义的信息检索仅指信息查询(Information Search)。即用户根据需要,采用一定的方法,借助检索工具,从信息集合...
Modern Information Retrieval:A Brief Overview
信息检索导论,Christopher D. Manning等著,cmu stanford都在用该书作为IR入门书籍。本文档是该书的习题答案,英文版,pdf格式。
本书由MSRA刘铁岩所写,介绍了排序学习的一些基本概念以及方法。排序学习指的是使用机器学习的方法来对网页进行排序,书中所讲的内容对于有机器学习背景的同学来说应该还是比较容易的。