• Home
  • About

Research IT

A journal of supporting research with IT.

Feeds:
Posts
Comments
« Not All Apps Are Fit for the Cloud | The Intelligent Enterprise Blog
Sun Data Center: 165,000 sq. ft. into 700 sq. ft., Reduces Power Usage by 1 Million kWh per Month »

What do you consider a “large file system” and are you worried about fsck times?

January 15, 2009 by Gary

Out of necessity, we have some file systems that are 25 TB.  As of 2009, we consider 25 TB a large file system and we are concerned about the potential downtime that may result if an fsck is needed.

Some storage vendors advertise that they can have single file systems that are hundreds of terabytes or even a petabyte.  Often, however, there is no mention of when or if fsck or similar operations would be needed and how long they take.

ZFS claims to eliminate the need for fsck and Chunkfs (ext2 enhancements from around 2006) claims to reduce fsck times by splitting the repair domain.  Further, “journaling file systems only speed fsck time in the case of a system crash, disconnected disk, or other interruptions in the middle of file system updates. They do not speed recovery in the case ‘real’ metadata corruption” (see third paragraph here).

1.) What do you consider a large file system? (What file system do you use for them?)
2.) Are you concerned about fsck times?  (Why or why not?)
3.) Can you predict fsck times based on some parameters (e.g., inodes used, disk size, etc.)?
4.) Any special cases related to fsck or similar operations for clustered file systems?

About these ads

Share this:

  • Digg

Like this:

Like Loading...

Posted in Uncategorized | Tagged file systems, fsck | 1 Comment

One Response

  1. on January 23, 2009 at 11:51 pm Gary Stiehr

    See http://www.linkedin.com/answers/technology/information-technology/information-storage/TCH_ITS_IST/399859-4231678 for some interesting replies to this question.



Comments are closed.

  • Tags

    cancer cloud clustering consolidation cooling creativity databases data center density energy efficiency file systems fsck genomics grid HPC innovation MySQL neuroscience power sequencing XD
  • Recent Posts

    • The Research IT Blog is Moving
    • Open Innovation in Cancer Research
    • A Human Genome Per Day? The Genome Center at Washington University Scales Up on Illumina Sequencers
    • Tidal Energy to Provide One Fifth of /Blue/ Data Center Power
    • Sun Data Center: 165,000 sq. ft. into 700 sq. ft., Reduces Power Usage by 1 Million kWh per Month
  • Archives

    • June 2009
    • February 2009
    • January 2009
    • July 2008
    • June 2008

Blog at WordPress.com.

Theme: MistyLook by WPThemes.


Follow

Get every new post delivered to your Inbox.

Powered by WordPress.com
%d bloggers like this: