So, by now, if you’re one of the 70,000 or so people testing Windows Home Server, you’ll be regularly backing up your home computers each night, and by and large, you’ll have seen that it’s a pretty seamless experience - once your home server has checked out what computers you have on your network, it just works.
That’s the point - it just works - WHS sucessfully hides a lot of pretty amazing technology from the user, to make using the home server as simple as possible. So, I thought it would be interesting to take a look at some of that technology and see if I can do any kind of decent job of explaining it to the everyday user. I’m calling these posts “Under the Hood“, and this may be the first and last if I can’t figure out the technology myself.
First up is Windows Home Server backup, or rather, one particular element of backup which provides a great benefit to the user.
Let’s say you have a two desktops and a laptop on your home network - the desktops each have a 250Gb drive, and the laptop and 80Gb drive. And let’s say they’re all half full. That’s 290Gb of storage to back up every night and hang on, I only have 750Gb of storage in my home server - it’s going to fill up in like three days!
How does WHS fit in so many daily backups of all my computers in a limited amount of space?
The answer’s actually pretty simple, thanks to a piece of smart thinking.
Each night, Windows Home Server checks the data on each of your machines and only backs up data that it hasn’t backed up before - i.e. if a file on your desktop hasn’t been changed since it was originally backed up, it doesn’t need to be backed up again, so that’s one way of saving space.
Better still, if you have the same data on multiple machines, WHS only stores one copy of that data, but registers that it belongs on each machine. So when it comes to restoring the files, WHS knows which machines to restore that data to.
Let me give you an example - Windows system files. The laptop I’m writing this post on is a Windows XP Pro machine. The Windows System folder (holding all of the operating system files) is 2.05 Gb. I definitely want this folder backed up in case something goes wrong with my laptop. But those same system files also exist on the XP Media Center machine I use upstairs in the den - they’re exactly the same operating system files. (XP Media Center is very much based on XP Pro. It just has a prettier dress and a few new dance moves). So WHS backs up the files once, but knows that they’re needed on both machines if I choose to restore either of them. Pretty smart, and saves storage space and backup time.
That’s why your first WHS backup often takes a while - the first backup of your first machine post-installation is literally backing up everything. Subsequently, data on your other machines is compared to the data backed up from the first machine, and if it’s the same, there’s no need to back it up - it’s already safely stored.
Hope you’re still with me - it’s time to get under the hood.
So, how does this all work? Windows Home Server uses a version of a technology known as Single Instance Storage. The general idea behind Single Instance Storage (or SIS as it’s known) is that which I’ve tried to bring to life above - keeping one copy of data that multiple computers share. SIS is used quite commonly in lots of backup solutions and other server products - email systems, file servers that sort of thing.
In the example above, I mentioned that WHS each night compares data it has already backed up (the Windows System files on my laptop) with data on other machines (the same files on my Media Center machine) and then takes a call on whether to back those files up too or not.
Windows Home Server doesn’t actually compare or even store the whole files themselves. It works with fragments of those files - called clusters. Your Windows Home Server comes with a built-in, custom-designed database which has two jobs:
1. Store fragments of your data (clusters)
2. Store metadata (data about data) that describes how to reassemble those fragments of data into an entire file system if required.
So for the first backup of the first machine, WHS examines every cluster on that machine, and generates a hash (a checksum generated from performing a calculation on the data itself) which is stored in the database. If another cluster is examined and is found to be the same (by comparing the hash values), then this is noted in the database, but the cluster is not stored again.
The following day, the same process occurs, but only new or changed clusters are copied to the database.
Working at this deeply granular level, using single instance storage means that Windows Home Server is massively efficient at only storing the data it really needs, whilst still guaranteeing that you can backup individual files, folders or indeed your entire computer should you wish.
Head hurting? Yep, mine too. Thanks to MVP Doug Knox and Charlie Kindel for providing a lot of the insight above. Their heads aren’t hurting as much as mine ![]()

June 3rd, 2007 at 3:01 pm
That info filled in the blanks at a “granular level,” but was easy to digest. Beautifully explained! Thanks!
June 5th, 2007 at 12:41 pm
G8 articel ,
love it thank you.
answered alot of my back up Questions.
Cant wait till RC1 release.
Iam currently a MS beta tester and missed the beta train
June 5th, 2007 at 12:47 pm
Great to know it was useful (and hopefully easy to read) - am working on the next “Under the Hood” as we speak, so keep a look out for it….
December 25th, 2007 at 4:25 pm
A good explanation of how WHS does its backup stuff. Unfortunately, I’m one of the few unlucky ones who is having problems with the WHS backup feature. When I try to backup my Windows XP Home desktop I get this message, right at the beginning of the backup :
“The backup failed because your computer was unable to create a volume snapshot. To resolve this problem, try running “chkdsk /f /r” on each of your hard drives.”
I’ve followed this instruction and run chkdsk a number of times and it reports my C drive as “clean”, but the failure message keeps appearing. I think it’s a problem with Windows XP on my desktop rather than WHS because my two laptops have been backed up by WHS with no problems.
So far I’ve been unable to track down any solutions to this problem, so if anyone can help me to get the “seamless” backup from WHS that nearly everyone else is experiencing, I would be truly grateful.
December 25th, 2007 at 7:21 pm
Mick,
Have you checked out this thread?
http://forums.microsoft.com/windowshomeserver/showpost.aspx?postid=1769752&siteid=50&sb=0&d=1&at=7&ft=11&tf=0&pageid=1
Terry
January 12th, 2008 at 10:44 pm
I currently have two Ready NAS devices - one went belly up after a firmware upgrade and currently is inoperable and I may have lost unique data, to early to tell. Supposedly the whole idea behind raid is to help prevent that, but alas - some I looking for a way to backup my NAS data.
Anyway, I have been looking at the WHS and wondring if it would be possible to back it up?
Would this scenerio work
Use WHS as my central server, it would contain ITUNES library, untique data, pc backup and Tivo “download” (I copy all my tivo programs to my NAS now because I lost a big hard disk in a tivo and lost all my programming)
THEN could I backup the WHS data to the two NAS for safekeeping using Norton Ghost? Basically the NAS devices would just be set to backup all the data on WHS.
Is that doable?
Thanks
February 20th, 2008 at 1:37 am
An interesting backup device for those who need to be able to take take their multimedia storage with them and display them to friends or clients is the Epson P-5000 Multimedia Storage Viewer which won “Best Storage Backup in Europe 2007” from the Technical Imaging Press Association (TIPA). Auto conversion and the ability to display and manipulate RAW images are handy features.
http://www.backtrackreviews.com/2008/02/19/epson-p-5000-and-p-3000-multimedia-storage-viewers/