Google搜索与Inter网的信息检索.ppt
文本预览下载声明
Google搜索与Inter网的信息检索 马志明 约有626,000项符合中国科学院数学与系统科学研究院的查询结果,以下是第1-100项。(搜索用时 0.45 秒) How can google make a ranking of 626,000 pages in 0.45 seconds? Nevanlinna Prize(2006)Jon Kleinberg Page Rank, the ranking system used by the Google search engine. Query independent content independent. using only the web graph structure Page Rank, the ranking system used by the Google search engine. Can a surfer jump from page 5 of site 1 to a page in site 2 ? Ranking Websites, a Probabilistic View Ying Bao, Gang Feng, Tie-Yan Liu, Zhi-Ming Ma, and Ying Wang n webs in N sites, Based on the above discussions, the direct approach of computing the AggregateRank ξ(α) is to accumulate PageRank values (denoted by PageRankSum). However, this approach is unfeasible because the computation of PageRank is not a trivial task when the number of web pages is as large as several billions. Therefore, Efficient computation becomes a significant problem . AggregateRank 1. Divide the n × n matrix into N × N blocks according to the N sites. Experiments In our experiments, the data corpus is the benchmark data for the Web track of TREC 2003 and 2004, which was crawled from the .gov domain in the year of 2002. It contains 1,247,753 webpages in total. From: pcchairs@sigir2008.confmaster Sent: Thursday, April 03, 2008 9:48 AMDear Yuting Liu, Bin Gao, Tie-Yan Liu, Ying Zhang, Zhiming Ma, Shuyuan He, Hang Li We are pleased to inform you that your paperTitle: BrowseRank: Letting Web Users Vote for Page Importancehas been accepted for oral presentation as a full paper and for publication as an eightpaper in the proceedings of the 31st Annual International ACM SIGIR Conference on Research Development on Information Retrieval. Congratulations!! learning to rank The goal of learning to rank is to construct a real-valued function that can generate a ranking on the documents a
显示全部