FlatLFS:一种面向海量小文件处理优化的轻量级文件系统
付松龄
国防科技大学 计算机学院, 湖南 长沙 410073,slfu@nudt.edu.cn
廖湘科
国防科技大学 计算机学院, 湖南 长沙 410073
黄辰林
国防科技大学 计算机学院, 湖南 长沙 410073
王 蕾
国防科技大学 计算机学院, 湖南 长沙 410073
李姗姗
国防科技大学 计算机学院, 湖南 长沙 410073
摘要:

海量小文件的高效存储和访问是当前分布式文件系统面临的主要挑战之一。以GFS和HDFS为代表的分布式文件系统大多面向海量大文件的高效存储和访问设计,缺乏小文件处理的针对性优化,导致访问海量小文件时效率低下。针对分布式文件系统中海量小文件访问时的数据服务器优化问题,提出了一种采用扁平式数据存储方法的轻量级文件系统FlatLFS,取代传统文件系统对上层分布式文件系统提供数据存储和访问支持,提高了数据服务器处理小数据块时的I/O性能,从而提升了整个分布式文件系统的性能。实验表明,当数据块大小设定为1M时,FlatLFS的随机读性能分别比ext3、ext4、reiserfs高135%、112%和122%。

基金项目:

国家核高基重大专项(2012zx01040001)

FlatLFS: a lightweight file system for optimizing the performance of accessing massive small files
FU Songling
College of Computer, National University of Defense Technology, Changsha 410073, China,slfu@nudt.edu.cn
LIAO Xiangke
College of Computer, National University of Defense Technology, Changsha 410073, China
HUANG Chenlin
College of Computer, National University of Defense Technology, Changsha 410073, China
WANG Lei
College of Computer, National University of Defense Technology, Changsha 410073, China
LI Shanshan
College of Computer, National University of Defense Technology, Changsha 410073, China
Abstract:

The storage and access of massive small files are one of the challenges in the design of DFS(Distributed file system). Most of the DFSs, such as GFS and HDFS, are designed for handling massive big files. The performance of DFSs decreases greatly when accessing massive small files without special optimization for small files. This research focuses on the optimizing of the performance of data server in handling massive small files, and presents a Flat Lightweight File System called FlatLFS in which the user data are managed flat in disks. FlatLFS is supposed to substitute the traditional file system when accessing user data for upper DFSs. With the improvement of the performance of small data block processing on data servers by FlatLFS, the performance of the whole DFSs is greatly improved. The effectiveness of FlatLFS is proved with intensive experiments: when the size of data block is 1M, the performance of random read of FlatLFS is 135%, 112% and 122% higher than ext 3 ,ext4 and reiserfs respectively. 


【下载PDF全文】  
相似文献(共20条):关闭