Research and Implement of the Computer-Aided Copy Detection System for Document |
|
Author | ZuoXiaoLong |
Tutor | PengXinGuang |
School | Taiyuan University of Technology |
Course | Applied Computer Technology |
Keywords | Copy Detection Text blocks Similarity ASP.NET |
CLC | TP311.52 |
Type | Master's thesis |
Year | 2008 |
Downloads | 142 |
Quotes | 1 |
With the increasingly rich network of digital resources and network environment on the change in the way people access information, digital documents at your fingertips, copy the document becomes easier. In recent years, academic plagiarism phenomenon often found in newspapers, on the Internet growing number of duplicate pages reduces the retrieval efficiency, a lot of inconvenience to the user. Copy the document detection technique is a digital document in order to prevent the proliferation of illegal copying and presented in intellectual property protection and has important applications in information retrieval, it can prevent the occurrence of plagiarism, the retrieval efficiency of the Internet in recent years, research in the field of data security hot spots. Document copy detection is to determine whether a given document plagiarism or copied on another one or more articles of the document content, plagiarism does not just mean intact copy, but also a shift of the original transformation, synonyms and rephrasing restatement and other means. This paper describes the development of the document copy detection technology background, basic concepts, research status, applications, and scientific significance. Followed by an analysis of existing detection system functions and features, and explore the build system needs XML, ASP.NET, ADO.NET, and SQL Server, and other related technology and its characteristics, proposed the establishment of the B / S three-tier structure of the document is copied concept of computer-aided detection system. Secondly, the paper copy of the document design computer-aided detection system architecture, as well as databases, user registration login module, the document upload module, document detection module, the system management module. The system uses SQL Server 2005 as the back-end database server, XML represents the document file, ASP.NET component ADO.NET to access the database, with Internet Information Server 5.1 as a Web server, a Web server written in C # using the relevant procedures, the client web browser to access the system. Again, the details of the specific function of each module of the system implementation, including user login register, upload files, and document detection, user documentation management, system's basic settings management, user management, document management system. Based on the above work, based on the realization of the sentence in the English document copying computer-aided detection system to provide users with online document copying computer-aided detection services. On this basis, a lot of system testing, the test proved that the system has a strong practicality.