big data Archives

We are releasing three longitudinal datasets of Yelp review recommendations with over 2.5M unique reviews.

September 14, 2022 by Ryan Amos

By Ryan Amos, Roland Maio, and Prateek Mittal Online reviews are an important source of consumer information, play an important role in consumer protection, and have a substantial impact on businesses’ economic outcomes. Some of these reviews may be problematic; for example, incentivized reviews, reviews with a conflict of interest, irrelevant reviews, and entirely fabricated […]

Filed Under: Other Topics Tagged With: big data, data release, dataset, longitudinal, Reviews

What should we do about re-identification? A precautionary approach to big data privacy

March 19, 2015 by Arvind Narayanan

Computer science research on re-identification has repeatedly demonstrated that sensitive information can be inferred even from de-identified data in a wide variety of domains. This has posed a vexing problem for practitioners and policy makers. If the absence of “personally identifying information” cannot be relied on for privacy protection, what are the alternatives? Joanna Huey, […]

Filed Under: Privacy & Security Tagged With: big data, Privacy, re-identification

My Bill to #OpenPACER in memory of #aaronsw – Open for Comment and Available on Github

February 1, 2013 by Stephen Schultze

I unveiled a draft bill at an event on Capitol Hill this week. It is drafted in Legislative XML, allows you to comment, and the code is available on github. Here’s the video: The Open PACER Act provides for free and open access to electronic federal court records. The courts currently offer an expensive and […]

Filed Under: Uncategorized Tagged With: big data, courts, Law, open access

Smart Campaigns, Meet Smart Voters

November 19, 2012 by Ed Felten

Zeynep pointed to her New York Times op-ed, “Beware the Smart Campaign,” about political campaigns collecting and exploiting detailed information about individual voters. Given the emerging conventional wisdom that the Obama campaign’s technological superiority played an important role in the President’s re-election, we should expect more aggressive attempts to micro-target voters by both parties in […]

Filed Under: Privacy & Security Tagged With: big data, Politics, Privacy

My NYT Op-Ed: "Beware the Smart Campaign"

November 17, 2012 by Zeynep Tufekci

I just published a new opinion piece in the New York Times, entitled “Beware the Smart Campaign”. I react to the Obama campaign’s successful use of highly quantitative voter targeting that is inspired by “big data” commercial marketing techniques and implemented through state-of-the-art social science knowledge and randomized field experiments. In the op-ed, I wonder […]

Filed Under: Other Topics, Privacy & Security Tagged With: big data, election, Privacy, Voting

We are releasing three longitudinal datasets of Yelp review recommendations with over 2.5M unique reviews.

What should we do about re-identification? A precautionary approach to big data privacy

My Bill to #OpenPACER in memory of #aaronsw – Open for Comment and Available on Github

Smart Campaigns, Meet Smart Voters

My NYT Op-Ed: "Beware the Smart Campaign"

Contributors

Archives by Month

We are releasing three longitudinal datasets of Yelp review recommendations with over 2.5M unique reviews.

What should we do about re-identification? A precautionary approach to big data privacy

My Bill to #OpenPACER in memory of #aaronsw – Open for Comment and Available on Github

Smart Campaigns, Meet Smart Voters

My NYT Op-Ed: "Beware the Smart Campaign"

What We Discuss

Contributors

Archives by Month