Archival repositories that intend to develop programmatic solutions for managing and preserving born-digital holdings will need to establish a dedicated computer workstation (and related devices) to support responsible capture, transfer, appraisal, and preservation steps. This brief examination provides baseline recommendations for implementing and equipping a dedicated Windows-based PC workstation.[1]
First Things First
Assessment is a critical first step before spending money to establish a workstation and supporting infrastructure. A survey of computer media in existing holdings will help identify media drives needed for a workstation. A comprehensive inventory of born-digital material will provide a baseline for quantifying immediate storage needs[2].
Partner with IT
Technical requirements should, to the degree possible, be coordinated with institutional information technology (IT) support. IT staff may be required to procure equipment, approve non-standard specifications (such as increased amounts of memory), locate obsolete hardware, install software, or establish dark archive storage.
Workstation as Quarantine
A born-digital workstation will preferably be exempted from other uses, and may even be isolated on the local network through firewall configurations, in order to minimize the risks from malware posed by both obsolete media and the Web. As OCLC notes, “the workstation serves the same purpose as the quarantine room many archives use for new acquisitions that have not yet been reviewed for mold, insects, etc.[3]” Archivists can create a quarantine-like environment by forbidding web browsing on the workstation or manually disconnecting the ethernet cable connecting the PC to the network when working with archival material. Alternatively, IT staff can configure a ‘grey’ (not quite ‘dark’) workstation that connects to archival network storage and allows the machine to receive regular technical updates, but prevents all other access, including access to the Web. This second option is more secure but less convenient, limiting an archivist’s ability to quickly refer to resources or documentation, or blocking online registration of licensed software.
Computer Hardware Specifications
As the profession shifts from focusing on its legacy media backlog to acquiring gigabyte and terabyte-scale collections, archivists will appreciate having a workstation with some horsepower. Computers with multi-core processors and increased memory will enable better system performance when transferring or processing large deposits of data[4]. Four to eight gigabytes of RAM is considered minimal; 16 gigabytes of RAM would better suit archivists working with gigabyte- and terabyte-scale collections. Depending on the specifications chosen, a PC in this range might cost somewhere between $500 and $1000. Archives that expect to be working with significant amounts of system-intensive formats like digital video might consider top-of-the-line equipment.
For setups that demand a significant number of peripheral disk drives, make sure the workstation has several USB ports, including front-facing ports for convenience.
Hardware Peripherals
Repositories with born-digital material on fixed legacy media will need disk drives in order to mount and read media. In certain cases, supporting devices will be needed to provide write-protection, or support interfacing between a modern workstation and an antiquated hard drive. The devices recommended below reflect the practical tools available based on the formats most frequently encountered by archivists; equipment should be selected based on local needs.
5.25” floppy disk drive: These can sometimes be found cheaply on Ebay (between $50 and $100), but archivists would be wise to query contacts in IT for retired hardware[5]. Such drives are also notoriously unreliable, which may necessitate acquiring more than one.
Floppy disk controller card: Floppy controllers can help newer machines read media connected via obsolete disk drives[6]. One commonly used device is the FC5025 floppy controller[7] from Device Side Data, which costs around $55. Documentation associated with the FC5025 recommends using it with a Teac FD-55GFR model 5.25” floppy disk drive.
3.5 inch disk drive: 3.5” floppy disk is one of the more ubiquitous formats encountered by archivists. USB-connected 3.5” floppy drives remain widely available at minimal cost ($20), and can be purchased new or used.
Optical media drive: Most PCs currently come packaged with internal optical media drives for reading CDs and DVDs, but archivists should closely monitor the ongoing obsolescence of optical media drives by computer manufacturers[8].
Memory card reader: USB Multi-format flash memory card readers can be purchased for as little as $10, and support reading a variety of flash drive formats such as SD, CompactFlash, and MicroSD, found in digital cameras and other devices.
Write-blocker: These devices allow an archivist to write from but not to the connected media, protecting files (and their metadata) from unintended alteration. Hardware write-blockers, which are connected via USB between the media and the workstation, are most practical[9]. A typical hardware write-blocker, like the Tableau T8-R2[10], will accommodate most needs at a cost of $300.
External hard drive: External hard drives are handy for transferring new acquisitions of born-digital material. A one terabyte hard drive can be purchased for around $100. It should be noted that external hard drives formatted for Windows will present problems when seeking to transfer files from an Apple computer, which might necessitate purchasing multiple hard drives and formatting them for use with different file systems.
Software
The variety of software that might be utilized in support of working with born-digital materials is too great to fully explore here[11], but a typical setup would include virus protection, forensic software, file characterization/appraisal tools, data manipulation applications (for working with metadata), and perhaps even emulation software[12]. Because digital archive solutions are fluid, archivists should plan for the likelihood that new tools will constantly be installed, updated, assessed, added to the evolving workflow, or removed from it. Ideally, the archivist working with these materials will have the necessary workstation administrative privileges to support this level of exploration.
Because many of the applications likely to be used are open source, it’s important to understand that system updates can affect the use of such tools. For example, DROID[13], a Java application developed by the National Archives to support batch appraisal and format identification, will not function with the latest release of Java, which might require a workstation to run concurrent versions of Java. Likewise 64-bit machines may need to be run as 32-bit in order to support certain emulation tools. Coordination with IT staff will help the archivist navigate these technical challenges.
Conclusion
A workstation built using the computer hardware and devices identified here should accommodate the needs of most born-digital archival programs, at a cost somewhere between $1200 and $1700.
A final caveat is in order: these recommendations have a shelf life. The theory underlying our work with born-digital material provides solid footing for the profession moving forward, but we should consider the tools and technologies we employ to be transient and in constant need of review and revision.