The Library of Congress is pleased to announce the release of the 2016-2017 Recommended Formats Statement (http://www.loc.gov/preservation/resources/rfs/). The proliferation of ways in which works can be created and distributed is a challenge and an opportunity for the Library (and for all institutions and organizations which seek to build collections of creative works) and the Recommended Formats Statement is one way in which the Library seeks to meet the challenge and take full advantage of the opportunity. By providing guidance in the form of technical characteristics and metadata which best support the preservation and long-term access of digital works (and analog works as well), the Library hopes to encourage creators, vendors, archivists and librarians to use the recommended formats in order to further the creation, acquisition and preservation of creative works which will be available for the use of future generations at the Library of Congress and other cultural memory organizations.
The engagement with the Statement that the Library has seen from others has been extremely heartening. In response to interest in our work from representatives in the architectural community who see their design work imperiled by insufficient attention to digital preservation, we have updated the Statement to align more closely with developments in this field. Most importantly of all, we now include websites as a category of its own in the Statement. Websites are probably the largest field of digital expression available for creators today, yet most creators tend to take a passive role in ensuring the preservation and long-term access of their websites. By including websites in the Recommended Formats Statement, we hope to encourage website creators to engage more fully in digital preservation, as we aim to do with all the other forms of digital works included in the Statement, by making their websites more preservation-friendly.
The Library remains committed to acquiring and preserving digital works and to providing whatever support it can to other similarly committed stakeholders. We shall continue to build our collections with their preservation and long-term access firmly in mind; and we shall continue to engage with others in the community in efforts such as the Recommended Formats Statement. We encourage any and all feedback and comments (http://www.loc.gov/preservation/resources/rfs/contacts.html) others might have on the Statement that might make it more useful for both our needs and for the needs of anyone who might find it worthwhile in their own work. And we shall continue to engage in an annual review process to ensure that it meets the needs of all stakeholders in the preservation and long-term access of creative works.
What is the Public Broadcasting Preservation Scholarship?
The Public Broadcasting Preservation Scholarship will fund public media representatives from Louisiana Public Broadcasting, Wisconsin Public Television, Minnesota Public Radio, CUNY-TV, Howard University Television (WHUT), WYSO-FM, and Pacifica Radio Archives to participate in a week-long training event focused on digital preservation of public media.
The residencies will begin in July 2016 with a week-long immersion week in Boston, taught by leading experts in the field of audiovisual preservation. WGBH has launched a crowdfunding campaign to fund the Public Broadcasting Preservation Scholarship, in connection with the AAPB NDSR. The Scholarship will fund the host mentors to travel and participate in immersion week. You can find it here: igg.me/at/aapb-pbps
The scholarship would help host mentors gain and sharpen the skills that are needed to sustain digital preservation activities at beyond the term of the 10-month residency. This knowledge would improve their ability to preserve their at-risk materials for many years to come. As a supporter of the Public Broadcasting Preservation Scholarship, you could take us many steps closer to reaching our goal.
Public broadcasting stations have been on the front lines of history for more than 60 years. Help public media professionals gain the skills necessary to preserve this audiovisual historic record for posterity by supporting the American Archive of Public Broadcasting Public Broadcasting Preservation Scholarship.
The following blog post was written by Margaret Bresnahan from Minnesota Public Radio.
I’m writing to share the next installment in the American Archive success story. Thanks to the cataloging done during the American Archive inventory project, Minnesota Public Radio was able to identify about 900 MPR News stories covering the Hmong settlement in Minnesota, with recordings dating from 1975 to present day. This discovery led to a collaboration with the Minnesota Historical Society (MNHS), informing an exhibition/celebration that launches this month (March 2015), and it led to new broadcasts from the MPR News Room.
Marking the 40th anniversary of the first large-scale arrival of Hmong people in Minnesota, MPR News recently launched a Hmong collection page and broadcast a few news stories–all using archive recordings to tell the story of Hmong-Minnesotans. Two of our main collaborators in the News Room plan on continuing the coverage throughout the year, bringing more archive recordings on air and online. This is a wonderful example of the power of access. The inventory made it clear that these recordings existed and enabled this great use of archive material to tell a contemporary, ongoing story.
Here are some links to the archive usage, and more are to come:
The following is a guest post by Emily Halevy, Director of Media Management Sales at Crawford Media Services. In this blog post, Emily records her interview with Chip Stephenson, Crawford Project Manager, and David Braught, Crawford Logistics Coordinator. Crawford and the AAPB Project Team recently completed the American Archive of Public Broadcasting Digitization Project, funded by the Corporation for Public Broadcasting. Crawford’s role in the project was the coordination and digitization of approximately 40,000 hours of public broadcasting video and audio archival content, as well as the transcoding of approximately 20,000 born-digital files, contributed by more than 100 stations and organizations nationwide!
Now that the digitization is complete, the files will be preserved and made accessible as much as possible through the American Archive of Public Broadcasting, and the AAPB Project Team at WGBH and the Library of Congress is excited to begin working on these efforts. Continue reading below for an account of Crawford’s experience throughout the AAPB digitization project.
Happy New Year, Everyone! I’m delighted to be a guest blogger for the American Archive of Public Broadcasting, once again! As we come to the end of this migration project, I thought this time it would be fun to sit down with Chip Stephenson and David Braught and discuss some of the successes and challenges this project brought. It’s also a great time to reflect on the importance and value of the project as a whole.
Emily: What’s the first thing that comes to mind now that the project is over?
Chip: It’s over? What? We’ve been living it for over three years!
David: It’s hard to believe it’s over.
Chip: Well, it’s not quite over yet. We’re still wrapping- the engineers are finalizing data, project management is compiling spreadsheets and financials. But we’re almost there.
David: I’ve never worked on anything like this before- the logistics- everything.
Chip: Logistics of shipping, receiving, and accounting for all of the content. And then the amount of data, file configurations, bags, copying files for the individual stations. Over 125 different spreadsheets- audio, video, born digital, plus over 100 stations, which sometimes had multiple spreadsheets. It was more like 100 individual projects than one big project.
David: And every station had its own set of quirks to deal with.
Chip: Every station required multiple phone calls and emails to set things up. It’s an amazing project. The stations were all great to work with and they all had an amazing amount of work to do to make it happen. Some like New Jersey Network and University of Maryland had an incredible amount of content.
David: I’m sure the stations wanted to kill me with the number of emails about checking their files so we could delete them from our system.
Chip: Our engineers were amazing.
David: I can’t say enough good things about our engineers. Guy (Boyd) was able to adapt and push through data, JP (Lesperance) handling all of the born digital, Nathan (Lewis) re-transcoding every single proxy to meet the requirements for the Library of Congress, Herve (Bergeron) and Dr. Dave (Wolaver) switching out and repairing decks.
Chip: And don’t forget the thousands of tapes baked and repaired by Dr. Dave as well.
David: It really was a tremendous team effort.
Emily: We really do have a great team, don’t we? And we can’t leave out the migrators.
Chip: At the peak we had 3 audio migrators running 5 days a week, 24 hours a day. We had 5 video migrators digitizing content, with one pod running 5 days a week, 16 hours a day, and the other pod running 24 hours, 5 days a week. There were even many months running 7 days a week. There were also others just doing QC. And others handling born digital content, copying files into working storage, and then checking to be sure they worked and renaming and creation of the proxy file.
David: Haha! So what was the question again?
Emily: The question was “What’s the first thing that comes to mind now that’s it is over?”
David: Evidently everything! Haha!
Chip: You never understand the true complexity of the project until you look back and have time to reflect. Before the project even started, during a visit by Stephanie (Sapienza) and Caitlin (Hammer) from CPB, we were reviewing the process and we all started to realize how complex the overall project was going to be. Caitlin kept asking me, “How are you going to do this?” And my answer was “One station at a time.” Thinking about all of it at once was just overwhelming. So David and I sat down and thought about how we wanted to parse this project out. How do we want to think about this on a daily and weekly basis? So we came up with an operational spreadsheet, which then became two spreadsheets, which then became multiple spreadsheets. And there were times over the past year when we just took a deep breath and said, “Ok. 40 stations down, 60 to go.”
David: It was a constant balancing act. Nothing ended up being accurate in terms of tape counts. More audio, less video, double ¾”, which is more time consuming. We had to rearrange our thinking and the pods on a regular basis. And adjust accordingly.
Chip: But working with CPB, then the transfer of the host to WGBH went incredibly smooth. We had some discussions about what they thought and what we thought, but it was very easy moving through issues and problems as they came up.
David: And we always got great support from CPB and then WGBH.
Emily: What turned out to be the most challenging aspect of the project? (If you could name one thing.)
Chip: For me-
David: Oh! Born digital.
Chip: For me it was the born digital for a couple of reasons…
David: Well you take the issues we had with receiving the physical assets and multiply that times a million.
Emily: The born digital was one of the “orphan items” that wasn’t completely fleshed out when we got started.
Chip: We started the born digital about 8 months later than we’d hope and there were many more individual steps dealing with the stations and how they’d build their drive and name their files and create their spreadsheets. So we had to develop ways to review the file names and correct them to make them legal- spaces had to be replaced with underscores, no illegal characters, they all must have file extensions, etc. Then we had to combine GUIDs for the project with the individual station’s file name. When you do this with thousands and thousands and thousands of files, it becomes complex. And then we had to create proxy files for all of them. And the process you use to create a proxy of one file type might be different from another file type. And then all of the files needed to be QC’d and compared to the master file. Some stations, when they built their initial hard drives, had a large amount of bad files. Sometime up to 50% of the files were bad. And we had to give the stations time to rebuild. Remember the whole purpose of this project was to migrate, capture and acquire as many of these files as possible. Migrate as much as we could within the time frame we had to work with and that time frame was closing in on us.
Emily: Again- another area where we got great support from Casey and the American Archive team.
Were there any hurdles that turned out to be no big deal?
David: Just getting the content here.
Chip: In the beginning, logistics were slow. We were still trying to figure out the most efficient way to get stuff here.
David: And at the start the stations didn’t really know what they were getting into, but honestly, it went smooth.
Chip: We started to realize- let’s not worry about having too many tapes here, let’s worry about not having enough.
David: KQED for instance, they were ready to ship immediately. So we told Robert (Chehoski), “Alright, let’s bring it on!”
Chip: At one point, we had the equivalent of 65 pallets of assets in our crypt. And of course it was interesting shipping things from Alaska. But every single station helped us find a way to get their assets to us. And every single station, despite issues (time of year, reduction of staff, etc.) they all worked their butts off. They all worked really hard to pull, barcode, pack and ship their tapes to us and make this a success. Between dozens of Fed Ex shipments, three semi-truck runs across the country and an airline delivery, we managed to get everything here and under budget!
Emily: What did you learn from the project?
Chip: Efficiency. Efficiency. Efficiency. Rethink everything you do and realize there might be a better way to do something. And if it sounds like there might be, try it. When David and I sat down and put a plan together we realized quickly we were too rigid. We needed to be flexible. We had to find compromises throughout the project. There were many times we’d get off the phone with a station and say to each other, “How is this going to work?” We could not be afraid to come up with new solutions for the stations. We had to be receptive to their ideas, especially when it came to timing.
David: It didn’t do any good to stick to a timeline that wouldn’t work for them.
Chip: Initially, our idea was to do all the beta tapes together, then all the DVCPro tapes together, but we ended up digitizing several formats simultaneously.
David: Sometimes even 6 video tape formats simultaneously.
Chip: We had a few stations that had only one or two formats, but most of them had a little of everything.
David: Halfway through the project we realized we were dealing with 20 stations at one time- shipping tapes, migrating, moving data, shipping delivery drives, bagging and backing up file data, literally tracking upwards of 30 stations in a given period.
Chip: So being as flexible as possible was important, because no matter how well you thought you had it figured out, it changed on you. And, honestly, at first we fought it, but then we realized that it just wasn’t going to work. So stop fighting it. We had to maintain the flow of tapes required in order to meet the deadline, and being rigid was not going to get us there.
David: I don’t know if a day went by without asking Dr. Dave to switch out tape decks to accommodate our revised workflows.
Emily: What was your favorite “found” item from the project?
David: For me, it was the famous Akira Kurosawa footage. One of our migrators found that the tape label didn’t match the content. It was labeled as a cooking show, but turned out to be an interview with Kurosawa and George Lucas and Francis Ford Coppola. I was like, “Give me that tape!” It turned out to be a program that was thought lost for many years at the station.
Chip: For me, at one point it was all hands on deck, so I had to QC several hundred files. The content just happened to be all the history of New York City and Boston and The Revolutionary War. WNET had a whole series on the history of Manhattan dating back to the revolution. Growing up in that area, I knew a lot of the city’s history, but I never really knew the intricate history of Manhattan and the Bronx and Queens. I didn’t know that Wall Street really was a wall. I learned there’s a fence in Bowling Green Park, which still exists to this day, that was erected in 1770 to protect a statue of George III. The history in this collection is amazing. Meanwhile, I was supposed to be spending 2-3 minutes QC’ing these files and 20 minutes later I had to stop myself and get back to work!
David: That happened all the time!
Chip: The programming is so great! From arts and symphonies to theatricals, history- everything you can think of from all across the country.
Emily: Hence the “American Archive” project!
Chip: Now that the project is coming to an end, I’m just dealing with the data and the files. We did massive shipments out in October and November. It was amazing. The last truck run went up in first week of December. Right now we’re just pulling the little tidbits and reviewing everything and making sure we crossed all of our Ts and dotted all of our Is. We’re shipping out LTO tapes to the Library of Congress. And I’m a little sad it’s come to an end. On the other side, it’s a great sense of accomplishment. A year of planning and discussions. Two years of migration. Then changing all of the planning several times throughout. It all comes back to flexibility. Understanding you can’t be rigid.
The following is a guest post by Producer/Writer Elizabeth Deane.
Every Picture Tells a Story had its premiere in the Great Hall of the Library of Congress in February, 2014, at the launch of the American Archive of Public Broadcasting (AAPB).
Sound and images from six decades of public media filled that stately space, giving the audience a six-minute tip-of-the-iceberg glimpse at some of the treasures that will be part of the AAPB collection.
We’d made the film drawing mostly on media that had already been digitized by the AAPB — the first wave of stories that I had come to think of as locked away, imprisoned on ¾” videotape, VHS and Betacam tapes, ¼” audio tape, DVCPRO and more —the dreaded “obsolete formats” that can be such a barrier to access.
Few stations maintain playback machines for them any more, and the few in existence can be tricky to maintain and possibly risky to use; if they’re not working properly they can damage the footage, sometimes irrevocably.
Worse, as Every Picture points out, old videotapes can deteriorate, and the images are lost forever.
I found it heartening to know that even as the launch ceremony unfolded on that wintery day in Washington, trucks containing thousands of video and audio tapes from public stations all over the country were rolling towards Atlanta, where Crawford Media Services would create multiple digital versions of each tape — television and radio shows, raw footage, even outtakes and experiments — in science, natural history, drama, children’s programs, arts, education, history, local lore, news, and more — the entire broad and inspiring realm of public media programming.
Master copies will be kept safe for future generations at the Library of Congress, with access copies going to WGBH to be added to the growing AAPB database, and made available on a forthcoming website, when rights permit, to a national audience – researchers and scholars, filmmakers, educators, students, and kids of all ages. In addition, all of the digitized materials will be made available to researchers who visit WGBH and the Library’s Moving Image and Recorded Sound Research Centers.
The film is a celebration of the American Archive of Public Broadcasting at its moment of birth, just beginning to tap into its vast collection. “As of this posting close to a year later, all of it has been digitized,” says AAPB Project Manager Casey Davis. “But much of it came with only a brief description. Now we have the pleasure of watching and listening, so we can improve our records and make this remarkable collection more discoverable for all.”
Watch for the new AAPB website, set to launch with the first batch of records in April 2015, with video and audio to follow in October.