Design and Implementation of the Graduation Thesis Full-text Retrieval System with Lucene
|School||Ocean University of China|
|Course||Applied Computer Technology|
|Keywords||Thesis Paper Submission Full - text retrieval system Lucene|
Now, with the improvement of the university network environment, library automation conditions optimization, many of the university library has been or is being proceed Characteristic Database Construction work. Which the student thesis library building is also an important work of the Library. Student thesis are generally very focused combination of professional building with local characteristics and historical data as a resource and information it can provide important reference for the graduation design work for teachers and students, and therefore useful The value of building a database. Features database of construction must rely on certain software platform for most libraries, this software platform needs to be achieved through the purchase of products. Once selected, but this software generally can not be easily changed for each library to develop an independent and stable, flexible configuration and in the interests of library work requires retrieval system is very important a work. The full-text search the computer indexing process be retrieved through every word in the scanned article. The retrieval to create an index for each word in the document, the specified number of occurrences and location of the word in the article, when the user query, the search program to be retrieved according to the prior establishment of the index, and the result is fed back to the user's retrieval way. This paper analyzes the study and application of the current field of information retrieval, to study the characteristics of full-text retrieval system, the main algorithm, the theory of full-text search and full-text search trends and technology hotspot. The popular open source full-text search toolkit Lucene architecture and the main function modules were analyzed, parsing Lucene indexing algorithm, search algorithms, sorting algorithms principle. At the same time, in conjunction with the unit thesis library building practical, based on Lucene toolkit based on the thesis text retrieval system analysis and design, the remote submit papers, full-text search function to simplify the library staff collection of the student dissertation electronic document process, improve the work efficiency. The system the retrieval performance testing and application experiments, summed up the characteristics of the system, verify the indicators of full-text retrieval system, application of the standard to the library of full-text search system.