The University of Wisconsin Digital Collections Center (UWDCC) is committed to ensuring the long-term accessibility and sustainability of its collections. To that end, we have adopted a set of recommended formats for submitting, processing, distributing, and preserving digital resources. We encourage submitters to provide resources in these formats, and we will process, distribute, and preserve our own resources in them.
These recommended formats are derived from the Library of Congress Recommended Formats Statement 2019-2020.
Text submissions. Page layout formats:
If submitting plain text files: UTF-8 encoding
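As an illustration, a submitter could confirm that a plain text file is valid UTF-8 before sending it. The sketch below is only one way to do this; the function name `is_valid_utf8` is hypothetical and not part of any UWDCC tooling.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def file_is_valid_utf8(path: str) -> bool:
    """Read a file in binary mode and check its encoding."""
    with open(path, "rb") as handle:
        return is_valid_utf8(handle.read())
```

Note that a file containing only ASCII characters is also valid UTF-8, so it passes this check.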
UWDC follows the recommendations of the Federal Agencies Digital Guidelines Initiative (FADGI) for content digitized in-house. For image submissions from project partners, we request that files be no smaller than 1500 pixels on the longest side, at 300 dpi for non-textual content; we prefer 600 dpi for textual (bitonal) content. We prefer images submitted in color. We accept 16-bit and 8-bit depth, in a preferred color space of Adobe RGB. Currently we downsample to 8 bit, but we may change that and keep preservation masters in 16 bit. We also accept 8-bit grayscale images.
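The image requests above can be expressed as a simple pre-submission check. This is a minimal sketch under stated assumptions: the `ImageInfo` class and `check_image` function are hypothetical, the caller is assumed to already know each image's dimensions, resolution, and bit depth, and the 300 dpi figure is treated as a minimum for non-textual content.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ImageInfo:
    """Hypothetical container for values read from an image file."""
    width_px: int
    height_px: int
    dpi: int
    bit_depth: int          # bits per channel
    textual: bool = False   # True for textual (bitonal) content


def check_image(info: ImageInfo) -> List[str]:
    """Return a list of notes; an empty list means no issues were found."""
    notes = []
    if max(info.width_px, info.height_px) < 1500:
        notes.append("longest side is under 1500 pixels")
    if info.textual:
        if info.dpi < 600:
            notes.append("below the 600 dpi preferred for textual content")
    elif info.dpi < 300:
        notes.append("below the 300 dpi requested for non-textual content")
    if info.bit_depth not in (8, 16):
        notes.append("bit depth is neither 8 nor 16")
    return notes
```

For example, `check_image(ImageInfo(3000, 2000, 300, 8))` returns an empty list, while a 1200 by 800 pixel image produces a note about the longest side.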
These are the file formats we prefer, in descending order of preference:
We prefer the final production version of audio resources over pre-production versions; files in native sampling frequency rather than up-sampled frequency; uncompressed files rather than compressed; with embedded metadata rather than without (metadata TBD). If a file is compressed, we will only accept standard compression schemes.
We prefer the following formats in order of preference:
We prefer the final production version of video resources over pre-production versions, with the original production resolution and frame rate (e.g., 1080p24 or 720p60).
File-based, in order of preference:
The following is substantially copied from the Library of Congress Recommended Formats for websites.
Website creators can improve the archivability of web content by following best practices such as:
Further guidance that may be helpful to content creators is available in the Library of Congress Guide to Creating Preservable Websites (https://www.loc.gov/programs/web-archiving/for-site-owners/creating-preservable-websites/).
Websites are generally crawled and content is downloaded for packaging and preservation using a number of tools designed for this purpose.
We use the Web ARChive (WARC) format to package and preserve websites.
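To give a sense of what a WARC file contains, the sketch below assembles a single WARC 1.0 `resource` record by hand. It is illustrative only and does not describe the tools we actually use: the function name `build_warc_resource_record` is hypothetical, and real crawls are packaged by dedicated web archiving software that writes many records (requests, responses, metadata) per file.

```python
import uuid
from datetime import datetime, timezone


def build_warc_resource_record(target_uri: str, payload: bytes,
                               content_type: str) -> bytes:
    """Build one WARC 1.0 'resource' record as raw bytes."""
    timestamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    header_lines = [
        "WARC/1.0",                                    # version line
        "WARC-Type: resource",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",  # unique per record
        f"WARC-Date: {timestamp}",                     # UTC capture time
        f"WARC-Target-URI: {target_uri}",
        f"Content-Type: {content_type}",
        f"Content-Length: {len(payload)}",             # length of the block
    ]
    # Headers end with a blank line; the record ends with two CRLFs.
    header = "\r\n".join(header_lines).encode("utf-8") + b"\r\n\r\n"
    return header + payload + b"\r\n\r\n"


record = build_warc_resource_record(
    "https://example.org/", b"<html></html>", "text/html"
)
```

Each record pairs a small set of named header fields with the captured content, which is what lets a single WARC file hold an entire crawled site along with information about when and from where each item was captured.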