Greetings All -
I have a situation I want to get your guidance on.
I have a Windows Home Server (WHS) which is a headless machine with 4 drive bays that recently hung. This particular machine is a network attached server that does not have a monitor nor keyboard/mouse directly attached. As a result, in this case, when the server became inaccessible via Remote Desktop Connection, I was left with very little information gathering capability. I could get a reply across the network to a "ping" request, but could otherwise not see into the machine at all.
It should be noted here that I have 2 backup copies of the data that was stored on the server, so I am not panicked about data loss, but would nonetheless like to restore the server as I have put many hours into configuration and refinement over the years and although I have backups, my last batch was made in September 2017, so I would lose some files that I would otherwise like to keep.
So, with little else to do, I did a hard shutdown, rebooted the server and hoped it would boot up and allow me to remotely connect to the machine. Well I did this twice but ultimately, I was never able to establish an RDC to the server.
I removed the drive (WHS Disk 0) and connected it directly to a bench machine that I have set up Lubuntu on for recovery operations. The failing drive is a 1.5TB WD Black with 2 partitions: SYS (21 GB NTFS) and DATA (~1.5 TB NTFS). I also directly connected a new 2.0 GB WD Black drive to the bench machine.
I used Gddrescue to clone the failing SYS drive to the new 2 GB drive and got "99.99% pct rescued" according to the gddrescue result summary screen after 5.5 hours.
The command line I used is below:
sudo ddrescue -d -f /dev/sdb /dev/sda /home/administrator/RecoveryLogs/WHS_SYS.log
This is where I would like your guidance.
One additional point is that when configuring my bench machine with Linux to attempt to rescue the failing SYS drive, the bench machine found my server SYS drive and started that OS ( I had not changed my boot order so it found the wrong drive to boot from). Inadvertently, I got the only glimpse into the failing SYS drive when I saw a message that read something to the effect that there was an error with the ntoskrnl.exe file and the boot process halted.
I didn't write the message down and I recognize the file, but it also explains why I want to do more than just clone the failing drive because I expect that corruption occurred that I would need to fix before attempting to reinstall the cloned drive into my server and try booting it up. I don't want to have to do another hard shutdown as WHS is temperamental by nature and the pooled drives it manages are very susceptible to corruption.
I appreciate any guidance you can offer. Let me know what information I've overlooked.
Cheers,
I have a situation I want to get your guidance on.
I have a Windows Home Server (WHS) which is a headless machine with 4 drive bays that recently hung. This particular machine is a network attached server that does not have a monitor nor keyboard/mouse directly attached. As a result, in this case, when the server became inaccessible via Remote Desktop Connection, I was left with very little information gathering capability. I could get a reply across the network to a "ping" request, but could otherwise not see into the machine at all.
It should be noted here that I have 2 backup copies of the data that was stored on the server, so I am not panicked about data loss, but would nonetheless like to restore the server as I have put many hours into configuration and refinement over the years and although I have backups, my last batch was made in September 2017, so I would lose some files that I would otherwise like to keep.
So, with little else to do, I did a hard shutdown, rebooted the server and hoped it would boot up and allow me to remotely connect to the machine. Well I did this twice but ultimately, I was never able to establish an RDC to the server.
I removed the drive (WHS Disk 0) and connected it directly to a bench machine that I have set up Lubuntu on for recovery operations. The failing drive is a 1.5TB WD Black with 2 partitions: SYS (21 GB NTFS) and DATA (~1.5 TB NTFS). I also directly connected a new 2.0 GB WD Black drive to the bench machine.
I used Gddrescue to clone the failing SYS drive to the new 2 GB drive and got "99.99% pct rescued" according to the gddrescue result summary screen after 5.5 hours.
The command line I used is below:
sudo ddrescue -d -f /dev/sdb /dev/sda /home/administrator/RecoveryLogs/WHS_SYS.log
This is where I would like your guidance.
- Should I run gddrescue again and specify the -r switch to attempt repeated reads of bad blocks? I had not done this on the first pass as I am under the impression that the first pass should be limited to just trying to get what you can with as little stress imparted to the source drive as possible.
- If I run it again, what switches should I specify?
- Despite reading multiple "guides" and posts, it isn't clear to me whether I should have created an image of my failing drive instead of cloning the drive to a new disk. Should I also make an image of the drive to yet another device? (does having the image give me more data recovery options than having the cloned disk?)
One additional point is that when configuring my bench machine with Linux to attempt to rescue the failing SYS drive, the bench machine found my server SYS drive and started that OS ( I had not changed my boot order so it found the wrong drive to boot from). Inadvertently, I got the only glimpse into the failing SYS drive when I saw a message that read something to the effect that there was an error with the ntoskrnl.exe file and the boot process halted.
I didn't write the message down and I recognize the file, but it also explains why I want to do more than just clone the failing drive because I expect that corruption occurred that I would need to fix before attempting to reinstall the cloned drive into my server and try booting it up. I don't want to have to do another hard shutdown as WHS is temperamental by nature and the pooled drives it manages are very susceptible to corruption.
I appreciate any guidance you can offer. Let me know what information I've overlooked.
Cheers,