When 22-year-old programmer Aaron Swartz decided last fall to help an open government activist amass a public and free copy of millions of federal court records, he didnâ€™t not expect heâ€™d end up with an FBI agent trying to stake out his house.
But thatâ€™s what happened, as Swartz found out this week when got his FBI file through a Freedom of Information Act request. A partially-redacted FBI report shows the feds mounted a serious investigation of Swartz for helping put public documents onto the public web.
The FBI ran Swartz through a full range of government databases starting in February, and drove by his home, after the U.S. court system told the feds heâ€™d pilfered some 18 million pages of documents worth $1.5 million dollars. Thatâ€™s how much the public records would have cost through the federal judiciaryâ€™s paywalled PACER record system, which charges eight cents a page for most legal filings.
â€œI think its pretty silly they go after people who use the library to try to get access to public court documents,â€ Swartz said. â€œIt is pretty silly that instead of calling me up, they sent an FBI agent to my house.â€
The feds also checked Swartzâ€™s Facebook page, ran his name against the Department of Labor to figure out his work history, looked for outstanding warrants and prior convictions, checked to see if his mobile phone number had ever come up in a federal wiretap or pen register, and checked him against the records in a private data brokerâ€™s database.
The Great Court Records Caper began last year when the judiciary and the Government Printing Office experimented with giving away free access to PACER at 17 select libraries around the country. Swartz decided to use the trial to grab as many of the public court records as he could and, perversely, release them to the public.
He visited one of the libraries â€” the 7th U.S. Circuit Court of Appeals library in Chicago â€” and installed a small PERL script heâ€™d written. The code cycled sequentially through case numbers, requesting a new document from PACER every three seconds. In this manner, Swartz got nearly 20 million pages of court documents, which his script uploaded to Amazonâ€™s EC2 cloud computing service.
Or, as the FBI report put it, the public records were â€œexfiltrated.â€
The script ran for a couple of weeks â€” from September 4 to 22, until the court systemâ€™s IT department realized something was wrong. Someone was downloading everything. None of the records, of course, were private or sealed, and Lexis Nexis has a copy of of PACERâ€™s database that it sells a high markup. But Swartz wasnâ€™t paying anything.
The Government Printing Office abruptly shut down the free trial and reported to the FBI that PACER was â€œcompromised,â€ the FBI file reveals. The Administrative Office of the U.S. Courts told the FBI in March that Swartz had gained unauthorized access to the free PACER account.
â€œAARON SWARTZ would have known his access was unauthorized because it was with a password that did not belonged [sic] to him,â€ reads the FBI report summarizing the judiciaryâ€™s position.
Swartz says his script only ran on the library computer. It didnâ€™t use a password at all, but used the PACER authentication cookie set in the PCâ€™s browser.
He donated the 19,856,160 pages to public.resource.org, an open government initiative spearheaded by Carl Malamud as part of a broader project to make public as many government databases as Malamud can find. It was Malamud who previouslyÂ shamed the SEC into putting all its EDGAR filings online in the â€™90s, and he used $600,000 in donations to buy 50 years of documents from the nationâ€™s appeals court, which he promptly put on the internet for anyone to download in bulk. …