文档详情

互联网信息检索系统的研究与实现本科-学士学位论文.doc

发布:2017-08-15约字共58页下载文档
文本预览下载声明
毕业论文 互联网信息检索系统的研究与实现 摘 要 互联网信息检索系统(搜索引擎)是专门提供信息检索服务的平台,它将互联网上大量的网页数据采集到服务器,经过处理形成的信息数据库和索引数据库,实现对用户提出的各种信息检索作出响应。 本系统使用Microsoft Visual Studio 2005为主要开发工具,以Windows Server 2003操作系统为运行环境,主要实现了网页数据的抓取,网页数据存储,数据的索引,数据的检索,日志管理等功能。 本文对互联网信息检索统中几个关键技术的设计和实现进行了研究。从理论上对这些关键技术进行了详细的讨论,并完成了基于L互联网信息检索系统的实现。文章从下面几个方面进行了讨论: 首先,本文介绍搜索引擎的市场需求和研究状态。这一部分阐述了搜索引擎丰富的历史背景和客观的用户需求,自身的特点,以及人们对搜索引擎的关注程度。 其次,本文讨论了搜索引擎中基本结构、实现的理论基础和实现方法。这一部分研究了搜索引擎的关键技术,将中文分词技术、数据采集技术和数据索引技术有机的结合起来,并对全文检索引擎L进行分析和研究。 最后,详细描述了一个基于L的互联网信息检索系统的设计与实现。 关键词 搜索引擎;L;数据存储;信息检索 ABSTRACT Internet information retrieval system (search engine) is designed to provide a platform for information retrieval services.It will collect a lot of pages data on the Internet to the server,and processed form of the information database and index database.Made to achieve the user to respond to the various information retrieval. The system uses Microsoft Visual Studio 2005 as the main development tool, to run Windows Server 2003 operating system environment, the main achievement of the web crawl data, web data storage, data indexing, data retrieval, logging management and other functions. In this paper, several Internet information retrieval system design and implementation of key technologies were studied. Theory on these key technologies are discussed in detail, and completed the Internet information retrieval system based on L realization. The article discussed the following aspects: First of all, the article describes the search engine market demand and research status.This part discusses the search engine rich historical background and objective of the user requirements, its own characteristics, as well as people paid more attention to search engine. Secondly, the article discusses the basic structure of search engines, to achieve the theoretical basis and implementation methods. This part of the search engines key technology, Chinese word segmentation, data acquisition and data indexing technology combine organic, and full-text s
显示全部
相似文档