Methods Identify signed archives used Google, generic search terms weeded list down to 166 unique servers, 2804 archives Build a tool to autocheck based on 'extract-0.1' Download archives Check Post-process the data