Google Says “No, No” to Mr. or Ms. Pirate; What About Hate Speech?

Fred von Lohmann posted that Google has changed its algorithm. Now “it’ll start generally downranking sites that receive a high volume of copyright infringement notices from copyright holders.” The Verge reports that:

because its existing copyright infringement reporting system generates a massive amount of data about which sites are most frequently reported — the company received and processed over 4.3 million URL removal requests in the past 30 days alone, more than all of 2009 combined. Importantly, Google says the search tweaks will not remove sites from search results entirely, just rank them lower in listings. Removal of a listing will still require a formal request under the existing copyright infringement reporting system — and Google is quick to point out that those unfairly targeted can still file counter-notices to get their content reinstated into search listings.

The data-driven basis makes sense to me. So what other areas could be monitored and adjusted? I disagree with the idea that search engines should take on policing roles for certain speech that Danielle Citron and others have urged. But this shift may open the door to more arguments for Google to be a gatekeeper and policer of content. Assuming enough data is available, Google or any data-driven service, could make decisions to include or exclude entries (or shift ranking). Those moves already happen. But the difficult question will now be why or why not act on some issues but not others. James Grimmelman has a work in progress on search and speech that gets into this question. I believe the algorithm issues still control. Nonetheless, by nodding to the copyright industry, Google may be opening the door to further calls to be the Internet’s gatekeeper. Of course, if it does that, others will attack Google for doing just that from competition and other angles.

Contrarian Statutory Interpretation Continued (CDA Edition)

Following my contrarian post about how to read the Computer Fraud and Abuse Act, I thought I would write about the Communication’s Decency Act. I’ve written about the CDA before (hard to believe it has been almost 3 years!), but I’ll give a brief summary here.

The CDA provides immunity from the acts of users of online providers. For example, if a user provides defamatory content in a comment, a blog need not remove the comment to be immune, even if the blog receives notice that the content is defamatory, and even if the blog knows the content is defamatory.

I agree with most of my colleagues who believe this statute is a good thing for the internet. Where I part ways from most of my colleagues is how broadly to read  the statute.

Since this is a post about statutory interpretation, I’ll include the statute:

Section 230(c)(1) of the CDA states that:

No provider or user of an interactive computer service shall be treated as the publisher or speaker of any information provided by another information content provider.

In turn, an interactive computer service is:

any information service, system, or access software provider that provides or enables computer access by multiple users to a computer server, including specifically a service or system that provides access to the Internet and such systems operated or services offered by libraries or educational institutions.

Further, an information content provider is:

any person or entity that is responsible, in whole or in part, for the creation or development of information provided through the Internet or any other interactive computer service.

So, where do I clash with others on this? The primary area is when the operators of the computer service make decisions to publish (or republish) content.  I’ll give three examples that courts have determined are immune, but that I think do not fall within the statute:

  1. Web Site A pays Web Site B to republish all of B’s content on Site A. Site A is immune.
  2. Web Site A selectively republishes some or all of a story from Web Site B on Site A. Site A is immune.
  3. Web Site A publishes an electronic mail received by a reader on Site A. Site A is immune.

These three examples share a common thread: Site A is immune, despite selectively seeking out and publishing content in a manner that has nothing to do with the computerized processes of the provider. In other words, it is the operator, not the service, that is making publication determinations.

To address these issues, cases have focused on “development” of the information. One case, for example, defines development as a site that “contributes materially to the alleged illegality of the conduct.” Here, I agree with my colleagues that development is being defined too broadly to limit immunity. Development should mean that the provider actually creates the content that is displayed. For that reason, I agree with the Roommates.com decision, which held that Roommates developed content by providing pre-filled dropdown lists that allegedly violated the Fair Housing Act. It turns out that the roommate postings were protected speech, but that is a matter of substance, and not immunity. The fact that underlying content is eventually vindicated does not mean that immunity should be expanded. To the extent some think that the development standard is limited only to development of illegal content (something implied by the text of the Roommates.com decision), I believe that is too limiting. The question is the source of the information, not the illegality of it.

The burning issue is why plaintiffs continue to rely on “development” despite its relatively narrow application. The answer is that this is all they currently have to argue, and that is where I disagree with my colleagues. I believe the word “interactive” in the definition must mean something. It means that the receipt of content must be tied to the interactivity of the provider. In other words, receipt of the offending content must be automated or otherwise interactive to be considered for immunity.

Why do I think that this is the right reading? First, there’s the word “interactive.” It was chosen for a reason. Second, the definition of “information content provider” identifies information “provided through the Internet or any other interactive computer service.” (emphasis added). This implies that the provision of information should be based on interactivity or automation.

There is support in the statute for only immunizing information directly provided through interactivity. Section, 230(d), for example, requires interactive service providers to notify their users about content filtering tools. This implies that the information being provided is through the interactive service.  Sections 230(a) and (b) describe the findings and policy of Congress, which describe interactive services as new ways for users to control information and for free exchange of ideas.

I think one can read the statute more broadly than I am here. But I also believe that there is no reason to do so. The primary benefit of Section 230 is a cost savings mechanism. There’s is no way many service providers can screen all the content on their websites for potentially tortious activity. There’s just no filter for that.

Allowing immunity for individualized editorial decisions like paying for syndicated content, picking and choosing among emails, and republishing stories from other web sites runs directly counter to this cost saving purpose.  Complaining that it costs too much to filter interactive user content is a far cry from complaining that it costs to much to determine whether an email is true before making a noninteractive decision to republish it. We should want our service providers to expend some effort before republishing.

Social Search; It’s Might Be Around for a Bit

Hey! Bing is innovating! It has added social to search based on its relationship with Facebook. Oh wait, Google did that with Google+. So is this innovation or keeping up with the Joneses, err Pages and Brins? I thought this move by MS would happen faster given that FB and MS have been in bed together for some time. So did Google innovate while Microsoft and Facebook imitated? Maybe. Google certainly plays catch-up too. The real questions may turn on who executes and/or can execute better. That seems to be part of the innovation game too.

Facebook is top dog in social; Google in search. The thing they both (with MS lurking in the wings to make a big comeback (an odd thing given how well MS does as it is)) are doing is to take recommendations to a new level (with ads thrown in of course). I have tried logged in search. I must say I was surprised. To be clear, I find there is mainly rot in social network data just as there is in search. Whether I would have used Google+ had I not been at Google is unclear. Probably not. But I did. Then I searched for some law review articles and some basic technology information. WOW. The personal results at the top had links to blog posts by people whom I followed on Google + AND THEY WERE…RELEVANT. Blew my mind. My search time went down and I found credible sources faster. Will that last? Who knows? Someone may find ways to game the system, but the small experiences make me hopeful. Now to Facebook and Bing.

If Google can do well with a much smaller set of users for Google +, Facebook and Bing might do really well. After all, Facebook has the social piece and MS has some search computer science types. Whoever wins here may offer the next thing in search. I like conducting logged out searches and logged in. When logged in, I like the potential for seeing things from friends and people I trust. For example, if I start to be interested in cameras and search gives me posts by friends I’d ask anyway, that is a pretty cool result. I can read the post and call the friend for deeper advice or just use what they posted.

All in this space will, of course, cope with privacy concerns etc. But I think that this new level of relevance has the chance to co-exist with those concerns and users may flock to one of these services to have results well-beyond the current ones in search without social. In other words, let the games continue.

Hey Look at Me! I’m Reading! (Or Not) Neil Richards on Social Reading

Do you want everyone to know what book you read, film you watch, search you perform, automatically? No? Yes? Why? Why Not? It is odd to me that the ideas behind the Video Privacy Protection Act do not indicate a rather quick extension. But there is a debate about whether our intellectual consumption should have privacy protection, and if so, what that should look like. Luckily, Neil Richards has some answers. His post on Social Reading is a good read. In response to the idea that automatic sharing is wise and benefits all captures some core points:

Not so fast. The sharing of book, film, and music recommendations is important, and social networking has certainly made this easier. But a world of automatic, always-on disclosure should give us pause. What we read, watch, and listen to matter, because they are how we make up our minds about important social issues – in a very real sense, they’re how we make sense of the world.

What’s at stake is something I call “intellectual privacy” – the idea that records of our reading and movie watching deserve special protection compared to other kinds of personal information. The films we watch, the books we read, and the web sites we visit are essential to the ways we try to understand the world we live in. Intellectual privacy protects our ability to think for ourselves, without worrying that other people might judge us based on what we read. It allows us to explore ideas that other people might not approve of, and to figure out our politics, sexuality, and personal values, among other things. It lets us watch or read whatever we want without fear of embarrassment or being outed. This is the case whether we’re reading communist, gay teen, or anti-globalization books; or visiting web sites about abortion, gun control, or cancer; or watching videos of pornography, or documentaries by Michael Moore, or even “The Hangover 2.”

And before you go off and say Neil doesn’t get “it” whatever “it” may be, note that he is making a good distinction: “when we share – when we speak – we should do so consciously and deliberately, not automatically and unconsciously. Because of the constitutional magnitude of these values, our social, technological, professional, and legal norms should support rather than undermine our intellectual privacy.”

I easily recommend reading the full post. For those interested in a little more on the topic, the full paper is forthcoming in Georgetown Law Journal and available here. And, if you don’t know Neil Richards’ work (SSRN), you should. Even if you disagree with him, Neil’s writing is of that rare sort where you are better off by reading it. The clean style and sharp ideas force one to engage and think, and thus they also allow one to call out problems so that understanding moves forward. (See Orwell, Politics and the English Language). Enjoy.

Infrastructure: The Social Value of Shared Resources

I am excited to announce that Oxford University Press has published my book, Infrastructure: The Social Value of Shared Resources. I owe a huge debt to my Madisonian colleagues for their support along the way. I will post more about the book in the next few weeks, but here are some links and a short abstract:

The book is described here (OUP site) and here (Amazon). The Introduction and Table of Contents are available here.

Short abstract:

“Infrastructure resources are at the center of many contentious public policy debates, ranging from what to do about our crumbling roads and bridges, to whether and how to protect of our natural environment, to patent law reform, to electromagnetic spectrum allocation, to providing universal health care, to energy policy, to network neutrality regulation and the future of the Internet. Each involves a battle to control infrastructure resources, set the terms and conditions under which the public gets access, and determine how the infrastructure and various infrastructure-dependent systems evolve over time. This book advances strong economic arguments for managing and sustaining infrastructure resources as commons. The book identifies resource valuation and attendant management problems that recur across many different fields and many different resource types, and it develops a functional economic approach to understanding and analyzing these problems and potential solutions.”