Design and Implementation of Fault-Tolarance Test Platform for Distributed File System
|School||Harbin Institute of Technology|
|Course||Computer Science and Technology|
|Keywords||Fault injection Distributed File System Fault tolerance test System call hijacking Server failure|
With the rapid development of modern information technology , it is the capacity of the storage system , the performance requirements while also increasing emphasis on fault tolerance . As a distributed file system for storage system management core level , also no doubt need to have higher fault tolerance at the same time , the fault tolerance of a distributed file system test to ensure that the quality of the system also will become particularly important . In-depth study of traditional fault tolerance test , two shortcomings traditional distributed file system fault tolerance test , fault injection is mainly by hand , is not only a waste of time and manpower , but also increases the human error occurred probability , lead to lower test accuracy ; second is to consider only the failure of the server level , incomplete coverage . This improved automated distributed test platform is designed and implemented , will be distributed in each node of the target system fault injection tools , the workload simulation tools and system monitoring tools to integrate , test the platform host can complete test ; this platform is a multi-level fault injection , fault set to cover the server level , directory and file-level , can be injected into the server failure , and the directory file operation failed , such as the failure to improve the fault coverage . In this paper, the research project requires a distributed file system architecture , fault tolerance mechanisms , as well as traditional fault tolerance test indicators rely , in accordance with the characteristics of the target system , service continuity , foreign service delay time and service performance degradation than three indicators test target system fault tolerance . Platform tools complement each other , in the testing process , the requirements of the different fault injection tool for fault injection predictor monitoring tools will give appropriate monitoring results , the work load simulation tools can simulate the work load with the test , the tester can be provided in accordance with the monitoring result and the load operation , the final test results is given . In this paper, the development platform of the target file system tests to verify the availability of the test platform , and gives the test results and analysis .