210 likes | 288 Views
Reducing Fsck time using divide and conquer approach. Presented by: Kamaya Tapadiya Nidhi Ruhatiya Savita Kulkarni Teja Tamboli. Recent News… Main server for Linux kernel source kernel.org suffered file system corruption. It took over a week to repair the file system.
E N D
Reducing Fsck time using divide and conquer approach Presented by: • Kamaya Tapadiya • Nidhi Ruhatiya • Savita Kulkarni • Teja Tamboli
Recent News… • Main server for Linux kernel source kernel.org suffered file system corruption. • It took over a week to repair the file system.
Filesystems are vulnerable to hardware changes. • Disk capacity is growing exponentially. • Disk bandwidth is not increasing at the same pace. • Seek time is relatively much longer. • Per bit error rate is not improving as fast as disk capacity • is growing. • Increased I/O errors means frequent and longer • FileSystem Consistency checK (FSCK).
Fsck - Filesystem Consistency Check • Scandisk : windows :: Fsck : Linux • Repairs filesystem inconsistency caused by hardware • errors, software bugs or other data corruption. • It needs to traverse the entire filesystem looking for • errors and fix them.
Existing Filesystem • Single meta data block corruption needs the entire filesystem to be unmounted and checked. • During this process the data is completely unavailable.
Goals to achieve… • To reduce the fsck time. • To maintain the on disk data structures with no changes to the existing filesystem.
Proposed solution… • To divide the on disk filesystem format into small fault isolated domains – chunks. • Each chunk has a standalone set of metadata (block number space, allocation bitmaps, superblock). • Each chunk is individually repairable. • Namespace and disk space are still shared.
/root/dir/file.txt USER LEVEL KERNEL LEVEL /root/dir/file.txt Fsckable chunks Chunk 1 (metadata) Chunk 2 (metadata) Chunk 3 (metadata) Chunk 4 (metadata) root dir file.txt - - - - - - - - - - - - - - - - - - - - -Whole Device- - - - - - - - - - - - - - - - - - - - -
Implementation Stackable Filesystem Filesystem stacked above another base filesystem.
User Process System calls Mount() Ioctl() Error Codes System Call Interface USER LEVEL V F S KERNEL LEVEL Filesystem data Vnode Interface Error codes Stackable File System Filesystem data and operations Error codes Vnode Interface Lower File System
Implementation Issues 1.Handling Large Files Forward inode number Chunk A (/movie/cd1) Chunk B (/movie.cont/cd1.cont) Backward inode number
2. Handling Hardlinks • Hard link- a way of referring an inode with different file names. • Cross Chunk reference- source file is in one chunk and the target file needs to be created in other chunk. • In that case continuation inode for the parent directory of the target file is placed in the same chunk as the source file. • Making sure that the target file entry and the source file are in the same chunk.
3. Inode Number Management • First 4 bits indicates chunk number eg. for chunk 2 (0010) Chunkfs 0010 1001 0011 1100 0110 1111 1011 1100 Basefs 0000 1001 0011 1100 0110 1111 1011 1100
Partial Fsck Chunk File System Chunk 1 Chunk 2 Chunk 2 Chunk 3 Chunk 4 Fsck corruption
To Summarize… • Makes no changes to the on-disk filesystem layout. • Could be ported to support other filesystems with minimum changes to chunkfs. • Reduced downtime due to degraded mode of operation - • Parts of the filesystem can undergo isolated fsck, while others are online and serving requests. • Reduced fsck memory requirements.
References • Kernel Summit 2006 • The 2006 Linux Filesystem Workshop • http://lwn.net/Articles/190224/ • lkml.org/lkml/2006/6/9/476 • The Design Of Unix operating system -Maurice J. Bach • Understanding Linux Kernel -Daniel P. Bovet and Macro Cesati
Achievements… • Won second prize at Techkriti IIT- Kanpur for the event SOFTWARE CORNER. • Won second prize at Cummins College of Engineering for the event INNOVATION’07 in the category of system programming.
Reserved fields in inode: • Forward inode number is stored in Osd1.linux1.l_i_reserved1 • Backward inode number is stored in Osd2.linux2.l_i_reserved2 New flags used: • HAS_CONT_INODE (20 ’th BIT) • IS_CONT_INODE (21 ’th BIT)
Software engineering perspective • Implements the spiral model. • Modularity. • Regression testing. • Use of SVN for maintaining code.