Twenty years of history from old drives in the basement
A few days ago I was looking into using the WayBack Machine to retrieve content from my old websites in order to consolidate posts in one location. However, the WayBack machine was missing content and I had lost track of backups that may contain these old files.
I suspected this pile of drives in the basement may have the information I was looking for:
However, I don’t have hardware with SCSI, IDE, and SATA interfaces. Google and Amazon to the rescue, 2 days later this arrived:
After plugging the drives in, lsblk
displayed the partitions:
jemurray@kali:~/old_drive_data/drive3-sun$ lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
sda 8:0 0 238.5G 0 disk
├─sda1 8:1 0 237.5G 0 part /
├─sda2 8:2 0 1K 0 part
└─sda5 8:5 0 976M 0 part [SWAP]
sdb 8:16 0 8.5G 0 disk
├─sdb1 8:17 0 4.6G 0 part
├─sdb2 8:18 0 512.4M 0 part
├─sdb3 8:19 0 8.5G 0 part
└─sdb8 8:24 0 3.4G 0 part
I was shocked to find these drives were in working order. In fact, only one had issues retrieving data. For this drive, I used the GNU ddrescue
utility and let it run over night. It clinked, clattered, and made noises you would think had no chance of data recovery. Six hours later, 99.99% of the sectors were recovered:
jemurray@kali:~/old_drive_data/drive3-sun$ sudo ddrescue /dev/sdb1 ./sdb1.img sdb1mapfile
[sudo] password for jemurray:
GNU ddrescue 1.23
Press Ctrl-C to interrupt
Initial status (read from mapfile)
rescued: 11730 kB, tried: 0 B, bad-sector: 0 B, bad areas: 0
Current status
ipos: 4290 MB, non-trimmed: 0 B, current rate: 0 B/s
opos: 4290 MB, non-scraped: 0 B, average rate: 227 kB/s
non-tried: 0 B, bad-sector: 45056 B, error rate: 2 B/s
rescued: 4916 MB, bad areas: 11, run time: 5h 59m 38s
pct rescued: 99.99%, read errors: 101, remaining time: n/a
time since last successful read: 3h 24m 56s
Finished
This image is a UFS filesystem from my old Sun Ultra 10 workstation:
jemurray@kali:~/old_drive_data/drive3-sun$ file sdb1.img
sdb1.img: Unix Fast File system [v1] (big-endian), last mounted on /, last written at Thu Feb 23 13:33:55 2006, clean flag 253, number of blocks 4801608, number of data blocks 4724854, number of cylinder groups 109, block size 8192, fragment size 1024, minimum percentage of free blocks 1, rotational delay 0ms, disk rotational speed 90rps, TIME optimization
After ddrescue
finished, the image(s) mounted without any trouble:
jemurray@kali:~/old_drive_data/drive3-sun$ sudo mount ./sdb1.img /mnt
[sudo] password for jemurray:
mount: /mnt: WARNING: device write-protected, mounted read-only.
I can’t recommend Linux enough. It is able to mount NTFS, FAT, UFS, APFS, and of course EXT file systems. No additional drivers or software was necessary.
The files are all there:
jemurray@kali:~/old_drive_data/drive3-sun$ ls -al /mnt
total 50
drwxr-xr-x 26 root root 512 Feb 17 2006 .
drwxr-xr-x 18 root root 4096 Nov 24 04:33 ..
lrwxrwxrwx 1 root root 9 Nov 26 2005 bin -> ./usr/bin
drwxr-xr-x 2 root root 512 Nov 30 2005 cdrom
drwxr-xr-x 17 root sys 4096 Feb 17 2006 dev
drwxr-xr-x 7 root sys 512 Nov 29 2005 devices
drwxr-xr-x 78 root sys 5120 Feb 17 2006 etc
drwxr-xr-x 3 root sys 512 Jan 29 2006 export
dr-xr-xr-x 2 root root 512 Nov 26 2005 home
Most importantly, I found the website files I was looking for.
I believe this drive was part of an external backup system when I was using Linux as a desktop and hosting a webserver at XO:
jemurray@kali:~/old_drive_data/drive1/hk2/home/home/jemurray/public_html$ ls -al
total 22516
drwxr-xr-x 18 jemurray users 4096 Dec 31 2005 .
drwx-----x 22 jemurray users 4096 Feb 10 2006 ..
-rw-r--r-- 1 root root 3669129 Oct 13 2005 access_log
drwxr-xr-x 2 jemurray users 4096 Jan 9 2002 backup
drwxr-xr-x 2 jemurray users 4096 Jan 23 2005 bryce_mail
drwxr-xr-x 6 jemurray users 4096 Feb 18 2004 cellpix.org
drwx-----x 4 jemurray users 4096 Jan 7 2002 dev
-rwxr-xr-x 1 jemurray users 47017 May 10 2005 dnstop
-rw-r--r-- 1 root root 100524 Oct 8 2005 error_log
drwx------ 19 jemurray users 4096 Apr 6 2004 gallery
drwxr-xr-x 2 jemurray users 4096 May 31 2003 geoff
-rw-r--r-- 1 jemurray users 20 Sep 30 2004 index.html
-r-xr-xr-x 1 jemurray users 28672 Apr 8 2005 Keygen.exe
drwxr-xr-x 4 jemurray users 4096 Jan 6 2002 laptux
drwxr-xr-x 2 jemurray jemurray 4096 Dec 31 2005 logs
drwxr-xr-x 4 jemurray users 4096 Oct 6 2004 ManSec
drwxr-xr-x 3 jemurray jemurray 4096 Oct 4 2005 mdev
drwxr-xr-x 4 jemurray users 4096 Dec 16 2003 mus
drwxr-xr-x 21 jemurray users 4096 Jul 17 2003 OLDSITE
drwxr-xr-x 2 jemurray users 4096 Dec 13 2003 resume
drwxr-xr-x 3 jemurray jemurray 4096 Dec 31 2005 visualeyes
drwxr-sr-x 25 jemurray jemurray 4096 Feb 5 2006 zweck
drwxr-xr-x 13 jemurray users 4096 Jul 17 2003 zweck-old
In addition to multiple versions of my website, 20 years of old email, IRC logs, and many other snippets of history that I believed were long gone were all there. I feel like an archeologist on a historical dig.
The next step is to start reconstructing my old website articles and transferring them into markdown so Hugo can process them. Pandoc to the rescue here.