Unstuck in the Mud: Concrete Tasks for Forward Motion

Last time, we talked about the first steps we were taking to identify the workflows and procedures that need to be teased out and codified as official policies for Special Collections born-digital materials. Over the last month our group has been revising existing policies and drafting new ones where necessary. Given the title of the post, it’s important to point out that one way to prevent getting stuck is to accept that first drafts can be rough-and-ready. It’s too easy to get hung-up on word choice and formatting when writing formal policy language—the key is to just start writing and iron out the details later.

Another common sticking point for working groups is the existence of (real or perceived) external dependencies. “We can’t finish our work until they finish theirs!” The Born-Digital Working Group consists of two active subgroups, Tools and Policies/Procedures, supported by a less-active Administrative group for when higher-level support is needed. As we drafted policies, it became increasingly obvious that many decisions depended on input from the Tools group and on the larger institutional capacity for digital preservation. What file formats can we safely claim to preserve? What physical formats are we equipped to work with? The Tools group was facing a similar quandary: How could they firmly select tools without knowing the workflows (i.e. procedures) and the policy requirements that the tools need to meet?

Although the sub-groups were partly created to avoid the difficulty of finding large blocks of time compatible with diverse schedules, there was no helping it—the two groups had to spend some time together to solve these issues. With deadlines looming, we scheduled a joint meeting for April 1, informally dubbed “The Conclave.” No one was allowed to leave until we resolved the problems and had clear tasks to see us through to our All-Hands meeting in May.

Some of the topics we discussed included a debate over the creation of disk images and how we can clearly articulate the implications of this process to donors; the need to determine more precisely what file formats we will commit to migrating over time; and the identification of multiple work spaces for files that would ensure the appropriate levels of access. Perhaps the most helpful exercise was a detailed walkthrough of theBitCurator workflow, led by Porter Olsen and drawn on the whiteboard by Joshua Westgard. Doing this clearly illustrated where each group needed input from the other, allowing us to break off and tackle specific problems for 30 minutes before reconvening to wrap-up and assign concrete next steps.

workflows

One of the outcomes of this three-hour discussion was a realization that we need to more clearly define the goals of a Special Collections born-digital records program. Drawing on previous surveys of staff needs and expectations we have begun to concisely define the staff and system needs for working with these materials. We have also been coordinating more closely with Jennie Knies in the UMD Libraries’ Digital Stewardship unit to translate the collection requirements into technical requirements. This will be used by our IT developers to examine software and tools. The goal is to make sure that we have documented our full vision, regardless of technological limitations (or possibilities).  Our development team will then work to identify the best way to implement this within the context of our other requirements, and our existing systems.  We have created a spreadsheet that attempts to tie all of this information together, linking requirements, potential tools, workflow stage, policy, and priority.

spreadsheet

We will all come together as a group in May to see what we have accomplished over the course of the semester. Sometimes it seems like we have been spinning our wheels, but the Conclave helped pull us out of the mud and sharpened our focus and priorities for the future.

Born-Digital Working Group: Configuring FRED

Submitted by: Eric Cartier, April 5, 2013FRED at UMD Libraries

In mid-March, the Tools subgroup met FRED, our Forensic Recovery of Evidence Device. The subject lines we’ve shared since then (e.g., “tinkering with FRED today”) reflect the approach we’re taking:  careful, playful, open-minded. We marveled at all the ports, laid out and photographed the various cables and adapters included in the toolbox, and took turns at the keyboard. There was much to do before any imaging occurred, though.

We spoke at length about network security, viruses, connecting to the Internet, and safeguarding personally identifiable information, which we’re sure to obtain in future images we make. Porter noted that Digital Intelligence, the company that manufactures FRED, assumes that one will connect the machine to the Internet, while Josh played the devil’s advocate, acting Thomas Pynchon-paranoid. The immediate action we took at the conversation’s conclusion was to connect to the Internet via a USB network adapter to install Microsoft Security Essentials. Next we updated all the Windows, Adobe, and Java applications. A clean machine, we agreed, should be virus protected and fitted with all the latest software updates.

The FRED system has two drives, one of which is dual partitioned into Windows 7 Ultimate (64 bit) and Win98 DOS. This is the operating system environment we initially worked in, where we made other essential downloads including BitCurator and Oracle VM VirtualBox. Later, because BitCurator is native Linux, we chose to install SUSE Linux 12.1 on FRED’s empty DATA drive.

FRED accessories

Returning to Windows 7, the first device we connected to the UltraBay 3D Hardware Write-Blocker was Digital Stewardship’s 2 TB external hard drive, which contained images of some media from the Bill Bly Collection. Tableau Imager didn’t recognize it, nor did it register a 2 GB thumb drive that we inserted in the USB 3.0 port, although each device was visible on the list of the computer’s drives. Reading through the text-based instructions again, we discovered that the UltraBay has a power supply independent of the FRED tower (Digital Intelligence does not include diagrams or screenshots in its instructions), which, once turned on, allowed us to image the thumb drive. No matter which target directory we selected, however, the external hard drive repeatedly failed to image, due to lack of storage space. Tableau Imager offers EnCase E01 and Raw Disk dd imaging options, both of which are set to capture all the bits, so 2 TB was a bit much to ask of the machine.

Our progress configuring FRED has been fun and sometimes frustrating, but always steady. Over the next couple of months, our goal is to attempt to image every imaginable format on FRED and our BitCurator Digitization Workstation. Which system, with which software (BitCurator, Tableau Imager, FTK Imager), works most effectively? Learning what’s possible to accomplish with our equipment will be a beneficial exercise to complete before the arrival of our National Digital Stewardship Residency fellow in September.