<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>paidContent &#187; data</title>
	<atom:link href="http://paidcontent.org/tag/data/feed/" rel="self" type="application/rss+xml" />
	<link>http://paidcontent.org</link>
	<description>The economics of digital content</description>
	<lastBuildDate>Thu, 20 Jun 2013 06:43:49 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='paidcontent.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/89ee7e1250b4095eefb87d28e6e64947?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>paidContent &#187; data</title>
		<link>http://paidcontent.org</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://paidcontent.org/osd.xml" title="paidContent" />
	<atom:link rel='hub' href='http://paidcontent.org/?pushpress=hub'/>
		<item>
		<title>Through a PRISM darkly: Tracking the ongoing NSA surveillance story</title>
		<link>http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/</link>
		<comments>http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#comments</comments>
		<pubDate>Fri, 07 Jun 2013 18:50:47 +0000</pubDate>
		<dc:creator>Mathew Ingram</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[apple]]></category>
		<category><![CDATA[big-data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[leak]]></category>
		<category><![CDATA[NSA]]></category>
		<category><![CDATA[prism]]></category>
		<category><![CDATA[secrecy]]></category>
		<category><![CDATA[secret]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[surveillance]]></category>
		<category><![CDATA[Terrorism]]></category>
		<category><![CDATA[U.S.]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=655651</guid>
		<description><![CDATA[The past few days have seen a blizzard of leaks about surveillance activity by the government's ultra-secret NSA arm, including data collection from phone companies and internet giants. Here is what you need to know about this developing story.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=230823&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>It was a relatively quiet week for internet news until <em>Guardian</em> blogger Glenn Greenwald dropped a bombshell on Thursday, <a href="http://www.guardian.co.uk/world/2013/jun/06/nsa-phone-records-verizon-court-order">with a story that showed</a> the National Security Agency was collecting data from Verizon thanks to a secret court order. But that was just the beginning: the <em>Washington Post</em> later revealed <a href="http://www.washingtonpost.com/investigations/us-intelligence-mining-data-from-nine-us-internet-companies-in-broad-secret-program/2013/06/06/3a0c0da8-cebf-11e2-8845-d970ccb04497_story.html">an even broader program</a> of surveillance code-named PRISM, which involved data collection from the web&#8217;s largest players &#8212; including Google, Facebook and Apple &#8212; and then the <em>Wall Street Journal</em> said data is <a href="http://online.wsj.com/article/SB10001424127887324299104578529112289298922.html">also being gathered</a> from ISPs and credit-card companies.</p>
<p>This story is moving so quickly that it is hard to keep a handle on all of the developments, not to mention trying to follow the denials and non-denials from those who are allegedly involved, and the threads that tie this particular story to the <a href="http://www.motherjones.com/politics/2013/06/timeline-nsa-domestic-surveillance-bush-obama">long and sordid history</a> of the U.S. government&#8217;s surveillance of its own citizens. So we thought it would be useful to try and collect what we know so far in a single post, which will be updated as often as possible with new information.</p>
<table border="0" width="100%" cellpadding="3" cellspacing="3">
<tr>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#Guardian">1) The Guardian leak</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#PRISM">5) Tracking down PRISM</a></td>
<td> <a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#door">9) Is there a back door?</a></td>
</tr>
<tr>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#widens">2) The leak widens</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#ripples">6) The ripples spread</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#work">10) How it might work</a></td>
</tr>
<tr>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#WaPo">3) The Washington Post leak</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#goog">7) Google&#8217;s denial</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#good">11) For your own good</a></td>
</tr>
<tr>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#fallout">4) The fallout</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#zuck">8) Zuckerberg&#8217;s denial</a></td>
<td><a href="http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/#snowdon">12) The leaker revealed</a></td>
</tr>
</table>
<p><a name="Guardian" id="Guardian"><br />
<h2 id="the-guardian-leak">The Guardian leak</h2>
<p></a></p>
<p><em>Guardian</em> blogger and former lawyer Glenn Greenwald reports that the <a href="http://www.guardian.co.uk/world/2013/jun/06/nsa-phone-records-verizon-court-order">NSA has gotten a secret order</a> from the Foreign Intelligence Surveillance Court that allows it to collect data about phone calls made by &#8220;millions of customers&#8221; on the Verizon network: location data, time and other identifying info about the call &#8212; everything except the actual content of the calls themselves (the <em>Guardian</em> has a background piece about <a href="http://www.guardian.co.uk/world/2013/jun/06/phone-call-metadata-information-authorities">what kind of metadata</a> is available with such an order). </p>
<blockquote id="quote-the-national-securit"><p>&#8220;The National Security Agency is currently collecting the telephone records of millions of US customers of Verizon, one of America&#8217;s largest telecoms providers, under a top secret court order issued in April. The order&#8230; requires Verizon on an &#8216;ongoing, daily basis&#8217; to give the NSA information on all telephone calls in its systems, both within the US and between the US and other countries.&#8221;</p></blockquote>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/verizon-court-order.png"><img src="http://gigaom2.files.wordpress.com/2013/06/verizon-court-order.png?w=708" alt="Verizon court order"    class="aligncenter size-full wp-image-655716" /></a></p>
<p><a name="widens" id="widens"><br />
<h2 id="the-leak-widens">The leak widens</h2>
<p></a></p>
<p>Other stories that follow the <em>Guardian</em> report quote anonymous sources saying the Verizon court order is a renewal of an order that has been in place for some time, and add that other telecom companies <a href="http://online.wsj.com/article/SB10001424127887324299104578529112289298922.html">such as AT&amp;T are also involved</a> in similar programs. Greenwald notes in his story that the NSA started a program of bulk collection of telephone, internet and email records in 2001 under President Bush and this later caused controversy <a href="http://yahoo.usatoday.com/news/washington/2006-05-10-nsa_x.htm">when it was reported in 2006</a> that the NSA had been saving all of this information and was analyzing it to try and detect terrorism.</p>
<p>Information-security experts and other industry watchers note after Greenwald&#8217;s story is published that the NSA and other government agencies <a href="http://www.techdirt.com/articles/20130606/23460923352/trip-down-memory-lane-people-warned-what-would-happen-when-congress-passed-bills-to-enable-vast-spying.shtml">have had these kinds of abilities</a> for years thanks to laws such as the Protect America Act and the FISA Amendments Act. ProPublica has a roundup of what the government can find out about you and your behavior <a href="http://www.propublica.org/special/no-warrant-no-problem-how-the-government-can-still-get-your-digital-data">without a search warrant</a>, and security expert Bruce Schneier says that what we don&#8217;t know about the government&#8217;s surveillance programs <a href="http://www.theatlantic.com/politics/print/2013/06/what-we-dont-know-about-spying-on-citizens-scarier-than-what-we-know/276607/">is even more frightening</a> than what we do know. </p>
<p>Meanwhile, our Stacey Higginbotham wonders whether the NSA story <a href="http://gigaom.com/2013/06/05/will-the-latest-nsa-surveillance-scandal-be-a-wake-up-call-for-the-power-of-data/">will be a wakeup call</a> about the power of big data, while Derrick Harris looks at how the security agency and other government entities <a href="http://gigaom.com/2013/06/06/heres-how-the-nsa-analyzes-all-that-call-data/">analyze the vast amounts</a> of information that come from such programs.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/ohanian-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/ohanian-tweet.png?w=708" alt="Ohanian tweet"    class="aligncenter size-full wp-image-655718" /></a></p>
<p>Freelance journalist Joshua Foust argues that the NSA revelations won&#8217;t cause most people to change their behavior &#8212; including their habit of voting for politicians who enact the kind of legislation that permits such surveillance &#8212; <a href="http://joshuafoust.com/nine-dashed-off-points-on-the-nsa-scandal/">because they simply don&#8217;t care enough</a> about the issue. Some experts said the kind of data the NSA is getting can be very powerful when it comes to finding patterns of behavior, but research from the Cato Institute says that <a href="http://www.cato.org/publications/policy-analysis/effective-counterterrorism-limited-role-predictive-data-mining">even mining large amounts</a> of data can turn out to be not that helpful when it comes to catching terrorists.</p>
<p>The <em>Wall Street Journal</em>, meanwhile, said that the NSA&#8217;s surveillance program was &#8220;legal and necessary&#8221; and the furor over the disclosure of this program was misplaced:</p>
<blockquote id="quote-nobodys-civil-libert2"><p>&#8220;Nobody&#8217;s civil liberties are violated by tech companies or banks that constantly run the same kinds of data analysis. We bow to no one in our desire to limit government power, but data-mining is less intrusive on individuals than routine airport security. The data sweep is worth it if it prevents terror attacks that would lead politicians to endorse far greater harm to civil liberties.&#8221;</p></blockquote>
<p><a name="WaPo" id="WaPo"><br />
<h2 id="the-washington-post-leak">The Washington Post leak</h2>
<p></a></p>
<p>Within hours of the <em>Guardian</em> story appearing, the <em>Washington Post</em> reports that it has been <a href="http://www.washingtonpost.com/investigations/us-intelligence-mining-data-from-nine-us-internet-companies-in-broad-secret-program/2013/06/06/3a0c0da8-cebf-11e2-8845-d970ccb04497_story.html">leaked an internal slide presentation</a> from the NSA that describes a program it calls PRISM &#8212; which involves the collection of email and other personal data from internet companies including Google, Microsoft, Facebook, Apple and Yahoo. According to the <em>Post</em> report (and a subsequent <em>Guardian</em> report based on a similar leak), this program has been underway since at least 2007, and involves what one NSA slide refers to as &#8220;data collected directly from the servers&#8221; of the companies named.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/prism-screenshot.png"><img src="http://gigaom2.files.wordpress.com/2013/06/prism-screenshot.png?w=708" alt="prism screenshot"    class="aligncenter size-full wp-image-655721" /></a></p>
<p>All of the companies who are reportedly involved in PRISM (which refers to them as &#8220;partners&#8221;) <a href="http://gigaom.com/2013/06/06/silicon-valley-denies-reports-the-u-s-government-has-direct-access-to-its-servers/">deny any knowledge of such a program</a>, and say they only provide data when forced to do so by court order, and that they have no &#8220;back door&#8221; systems that would allow the NSA to do what it claims to be doing. These denials are <a href="http://thinkprogress.org/security/2013/06/06/2118531/direct-access-nsa-spying/?mobile=nc">met by widespread skepticism</a>, and many observers &#8212; including TechCrunch founder turned VC Michael Arrington &#8212; wonder why insiders working at the tech giants allegedly involved in the program <a href="http://uncrunched.com/2013/06/06/triangulating-on-truth-the-totalitarian-state/">wouldn&#8217;t have leaked the information earlier</a>.</p>
<p><a name="fallout" id="fallout"><br />
<h2 id="the-ongoing-fallout">The ongoing fallout</h2>
<p></a></p>
<p>Some tech-industry observers say the denials from internet companies may be true, because they aren&#8217;t convinced the companies in question would even have to know about the NSA&#8217;s collection practices in order for them to work. The original <em>Washington Post</em> story is updated early Friday to note that it&#8217;s <a href="http://www.washingtonpost.com/investigations/us-intelligence-mining-data-from-nine-us-internet-companies-in-broad-secret-program/2013/06/06/3a0c0da8-cebf-11e2-8845-d970ccb04497_story.html">not clear whether &#8220;direct access&#8221;</a> to the servers of those companies would be required, and quotes from another leaked document that says the program allows NSA officers to send &#8220;content tasking instructions directly to equipment installed at company-controlled locations,&#8221; which could mean boxes installed at ISP switches.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/hippeau-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/hippeau-tweet.png?w=708" alt="hippeau tweet"    class="aligncenter size-full wp-image-655722" /></a></p>
<p>Several sources note that former AT&amp;T employee Mark Klein <a href="http://www.washingtonpost.com/wp-dyn/content/article/2007/11/07/AR2007110700006.html">revealed in 2007</a> that he had come across documents that showed the telecom company installed equipment &#8212; using glass prisms as &#8220;splitters&#8221; &#8212; that allowed the NSA to <a href="http://seattletimes.com/html/politics/2004001159_spying08.html">make a copy of the data stream coming</a> from the AT&amp;T network and send it to data-storage centers operated by the security agency. This was alleged to be part of a larger program that stored telephone calls, emails and other internet activity for the government and had been underway for years.</p>
<p>Some network analysts speculate that the NSA <a href="http://www.guardian.co.uk/world/2013/jun/07/nsa-prism-records-surveillance-questions">may be making use of equipment</a> installed at CDNs (content delivery networks), which handle much of the data traffic for companies like Google and Yahoo. Laws passed in the U.S. require equipment makers such as Cisco <a href="http://t.co/OyeCis6GE5">to build into their products</a> a way for law enforcement officials to tap into the streams they carry, and the NSA could be searching those streams directly instead of copying or storing all the data itself (since the cost of the program is a relatively cheap-sounding $20 million, according to the Post leak).</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/gore-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/gore-tweet.png?w=708" alt="gore tweet"    class="aligncenter size-full wp-image-655723" /></a></p>
<p>In a statement about the leaks, the Office of the Director of National Intelligence said that it does its best to work &#8220;within the constraints of the law&#8221; to collect information related to national security, and <a href="http://www.dni.gov/index.php/newsroom/press-releases/191-press-releases-2013/868-dni-statement-on-recent-unauthorized-disclosures-of-classified-information">that unauthorized leaks such as those</a> to the Guardian and Post &#8220;threatens potentially long-lasting and irreversible harm to our ability to identify and respond to the many threats facing our nation.&#8221;</p>
<p><a name="PRISM" id="PRISM"><br />
<h2 id="trying-to-track-down-prism">Trying to track down PRISM</h2>
<p></a></p>
<p>A search for entities that might be involved in the NSA program turns up software from a relatively secretive startup called Palantir &#8212; which has been funded by the CIA through its investment arm &#8212; that <a href="http://talkingpointsmemo.com/archives/2013/06/is_this_who_runs_prism.php">happens to be named PRISM</a>. According to descriptions of the software, it allows clients of Palantir to sift through massive amounts of data and find patterns quickly. </p>
<p>Others are skeptical, however, that the software described could be used to do what the NSA appears to be doing, and security-industry sources say the NSA usually builds its own products and doesn&#8217;t like to use those from third parties. On Friday afternoon, <a href="http://www.theverge.com/2013/6/7/4406760/palantir-denies-prism-software-related-to-surveillance">Palantir told The Verge</a>: &#8220;Palantir&#8217;s Prism platform is completely unrelated to any US government program of the same name.&#8221;</p>
<p>Former Reuters social-media editor Matthew Keys said on Twitter that he had found several references to the PRISM program in classified job listings dating back to 2007:</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/matthew-keys-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/matthew-keys-tweet.png?w=708" alt="Matthew Keys tweet"    class="aligncenter size-full wp-image-655839" /></a></p>
<p>Not wanting to be left out, the secretive activist group Anonymous released some classified documents that refer to Defense Department information technology &#8212; but they appear to be mostly jargon-filled <a href="http://venturebeat.com/2013/06/07/anonymous-defense-department-leak/">descriptions of the department&#8217;s IT infrastructure</a>, with little or no connection to PRISM or any NSA-related data collection practices.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/baio-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/baio-tweet.png?w=708" alt="baio tweet"    class="aligncenter size-full wp-image-655726" /></a></p>
<p><a name="ripples" id="ripples"><br />
<h2 id="the-ripples-spread-outside-the">The ripples spread outside the U.S.</h2>
<p></a></p>
<p>As our man in Europe &#8212; David Meyer &#8212; noted in a couple of posts Friday morning, the repercussions from the PRISM and NSA revelations <a href="http://gigaom.com/2013/06/07/nsa-spying-scandal-fallout-expect-big-impact-in-europe-and-elsewhere/">are being felt in Europe</a> as well, with some critics calling for changes to the so-called &#8220;Safe Harbor&#8221; program, which allows data about EU citizens to be stored by non-EU companies. And the <em>Guardian</em> has reported that the U.K. government <a href="http://www.guardian.co.uk/technology/2013/jun/07/uk-gathering-secret-intelligence-nsa-prism">appears to have been getting</a> information via the PRISM program, which was designed to focus on the communication activity of non-U.S. residents (since U.S. law still technically prevents the government from spying on its own citizens without a warrant).</p>
<p>Meanwhile, President Obama &#8212; whom many critics have accused of carrying on with surveillance programs started by his Republican predecessor, despite his disavowal of such methods while campaigning &#8212; said through a spokesman that he <a href="http://www.politico.com/story/2013/06/report-nsa-verizon-call-records-92315.html?hp=t1">&#8220;welcomes the discussion&#8221;</a> about privacy and security:</p>
<blockquote id="quote-the-president-welcom3"><p>&#8220;The president welcomes the discussion of the trade-off between security and civil liberties. The close examination of some of these complicated issues could cause people to arrive at differing opinions&#8230; The president welcomes that debate.&#8221;</p></blockquote>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/jared-keller-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/jared-keller-tweet.png?w=708" alt="Jared Keller tweet"    class="aligncenter size-full wp-image-655779" /></a></p>
<p>Late Friday, the <em>Guardian</em> <a href="http://www.guardian.co.uk/world/2013/jun/07/obama-china-targets-cyber-overseas">posted another security-related scoop</a>, publishing what it called a &#8220;secret presidential directive&#8221; that orders the U.S. government&#8217;s top national security and intelligence officials to draw up a list of potential overseas targets that the U.S. could hit with cyber-attacks. The story goes on to say this operation:</p>
<blockquote id="quote-can-offer-unique-and4"><p>&#8220;can offer unique and unconventional capabilities to advance US national objectives around the world with little or no warning to the adversary or target and with potential effects ranging from subtle to severely damaging&#8221;</p></blockquote>
<p><a name="goog" id="goog"><br />
<h2 id="google-denial-and-sir-tim-bern">Google denial and Sir Tim Berners-Lee</h2>
<p></a></p>
<p>The creator of the world wide web, Sir Tim Berners-Lee, <a href="http://www.webfoundation.org/2013/06/web-inventor-speaks-out-on-prism/">posted a statement</a> at the Web Foundation blog saying:</p>
<blockquote id="quote-today%e2%80%99s-reve5"><p>&#8220;Today’s revelations are deeply concerning. Unwarranted government surveillance is an intrusion on basic human rights that threatens the very foundations of a democratic society. I call on all Web users to demand better legal protection and due process safeguards for the privacy of their online communications, including their right to be informed when someone requests or stores their data.&#8221;</p></blockquote>
<p>And Google co-founder Larry Page posted a response Friday afternoon to the accusations in the <em>Guardian</em> and <em>Post</em> stories, written with Chief Legal Officer David Drummond, saying the company <a href="http://googleblog.blogspot.ca/2013/06/what.html">does not provide the government</a> with &#8220;back door&#8221; access to its servers, and had never heard of the PRISM program until Thursday:</p>
<blockquote id="quote-press-reports-that-s6"><p>&#8220;Press reports that suggest that Google is providing open-ended access to our users’ data are false, period&#8230; Any suggestion that Google is disclosing information about our users’ Internet activity on such a scale is completely false.&#8221;</p></blockquote>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/soghoian-tweet1.png"><img src="http://gigaom2.files.wordpress.com/2013/06/soghoian-tweet1.png?w=708" alt="Soghoian tweet1"    class="aligncenter size-full wp-image-655831" /></a></p>
<p><a name="zuck" id="zuck"><br />
<h2 id="zuckerberg-denial">Zuckerberg denial</h2>
<p></a></p>
<p>Facebook co-founder and CEO Mark Zuckerberg posted a statement about PRISM on his Facebook page late Friday, <a href="https://www.facebook.com/zuck/posts/10100828955847631">saying he wanted to respond personally</a> to the &#8220;outrageous press reports&#8221; about his company&#8217;s involvement in the surveillance scheme. In language very similar to the Google denial, Zuckerberg said the network has not been part of any program to give the U.S. government &#8220;direct access&#8221; to its servers.</p>
<blockquote id="quote-facebook-is-not-and-7"><p>&#8220;Facebook is not and has never been part of any program to give the US or any other government direct access to our servers. We have never received a blanket request or court order from any government agency asking for information or metadata in bulk, like the one Verizon reportedly received. And if we did, we would fight it aggressively. We hadn&#8217;t even heard of PRISM before yesterday.&#8221;</p></blockquote>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/ashkan-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/ashkan-tweet.png?w=708" alt="Ashkan tweet"    class="aligncenter size-full wp-image-655880" /></a></p>
<p><a name="door" id="door"><br />
<h2 id="does-the-nsa-even-need-a-back-">Does the NSA even need a back door?</h2>
<p></a></p>
<p>Christopher Mims at the <em>Atlantic</em> business site Quartz quotes NSA veteran and whistle-blower William Binney &#8212; who was part of a group <a href="http://en.wikipedia.org/wiki/William_Binney_(U.S._intelligence_official)">that asked the Defense Department</a> to investigate the NSA in 2002 &#8212; saying the security agency could probably get its hands <a href="http://qz.com/92369/why-nsa-has-access-to-80-of-online-communication-even-if-google-doesnt-have-a-back-door/">on about 80 percent of the web traffic</a> that passes through the U.S. without even having direct access to the servers of companies like Google. That&#8217;s because the NSA has access to at least one of the largest communications hubs on the continent, <a href="https://www.eff.org/nsa-spying">as described by</a> the Electronic Frontier Foundation.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/ambinder-tweet.png"><img src="http://gigaom2.files.wordpress.com/2013/06/ambinder-tweet.png?w=708" alt="Ambinder tweet"    class="aligncenter size-full wp-image-655851" /></a></p>
<p>The <em>Wall Street Journal</em> posted a story that quoted unnamed security experts who said the tech companies mentioned in the PRISM presentation could be telling the truth about not providing &#8220;direct access&#8221; to their servers, <a href="http://online.wsj.com/article/SB10001424127887324798904578531672407107306.html?mod=WSJ_hpp_LEFTTopStories">but still have their data collected</a> by the NSA. The <em>Journal</em> said U.S. officials told the paper that the NSA &#8220;receives copies of the data through a system they set up with a court order.&#8221;</p>
<blockquote id="quote-one-industry-executi8"><p>&#8220;One industry executive familiar with the handling of data requests from U.S. intelligence agencies said companies have set up ways to cope with the volume of data by automating parts of the process. This method would allow data to be funneled to intelligence agencies without the need for manual steps by company employees.&#8221;</p></blockquote>
<p>At The Daily Beast, writer Megan McArdle <a href="http://www.thedailybeast.com/articles/2013/06/07/internet-companies-deny-they-re-helping-the-nsa-collect-user-data-should-we-believe-them.html">looked at the issue of whether</a> tech company denials should be believed or not, and quoted privacy expert Julian Sanchez from the Cato Institute saying there are a number of ways that the NSA could get the data it wants without requiring direct access, including the &#8220;secret room&#8221; with splitter equipment that Mark Klein described at AT&amp;T (mentioned above):</p>
<blockquote id="quote-most-likely-is-that-9"><p>&#8220;Most likely&#8230; is that they&#8217;ve got something akin to the &#8220;Secret Room&#8221; that Mark Klein disclosed in AT&amp;T hubs where traffic is being cloned (the companies would need to provide the relevant SSL encryption keys) split off into NSA&#8217;s own machines. It would be literally true, in that case, that the NSA does not have direct access to Google&#8217;s servers.&#8221;</p></blockquote>
<p><a name="work" id="work"><br />
<h2 id="how-prism-might-work-in-practi">How PRISM might work in practice</h2>
<p></a></p>
<p>Late Friday, the <em>New York Times</em> posted a story that said some tech companies resisted the NSA&#8217;s demands <a href="http://www.nytimes.com/2013/06/08/technology/tech-companies-bristling-concede-to-government-surveillance-efforts.html">to provide easier ways to get access</a> to user data &#8212; including Twitter &#8212; but that some consented, opened up discussions with the security agency about developing methods to share that data, and even &#8220;changed their computer systems to do so.&#8221;</p>
<blockquote id="quote-in-at-least-two-case10"><p>&#8220;In at least two cases, at Google and Facebook, one of the plans discussed was to build separate, secure portals, like a digital version of the secure physical rooms that have long existed for classified information, in some instances on company servers. Through these online rooms, the government would request data, companies would deposit it and the government would retrieve it.&#8221;</p></blockquote>
<p>In other words, &#8220;companies were essentially asked to erect a locked mailbox and give the government the key&#8221; and Facebook actually built such a system, the NYT story said. Declan McCullagh at CNET explained in a post that according to his sources, <a href="http://news.cnet.com/8301-13578_3-57588337-38/no-evidence-of-nsas-direct-access-to-tech-companies/">all that the PRISM process does is automate</a> something that is required under FISA (the Foreign Intelligence Surveillance Act) &#8212; so court orders are given to the tech companies and they have simply made the process of handing over that information easier.</p>
<p>Marc Ambinder, a security expert who writes for The Week, also described his understanding of how PRISM functions &#8212; in a nutshell, PRISM <a href="http://theweek.com/article/index/245360/solving-the-mystery-of-prism">is just a piece of software that allows</a> the NSA to collect and interpret data that is handed over under FISA. The actual software itself isn&#8217;t classified, which is why mentions of it show up online and in job postings. In McCullagh&#8217;s piece, a former NSA lawyer says that the slide presentation the Washington Post published is &#8220;suffused with a kind of hype that makes it sound more like a marketing pitch than a briefing.&#8221;</p>
<p>Meanwhile, for those trying to keep track at home, the Electronic Frontier Foundation has put together a <a href="https://www.eff.org/nsa-spying/timeline">comprehensive timeline of events related to</a> NSA surveillance activity over the past decade:</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/eff-spying-timelines.png"><img src="http://gigaom2.files.wordpress.com/2013/06/eff-spying-timelines.png?w=708" alt="EFF spying timelines"    class="aligncenter size-full wp-image-655988" /></a></p>
<p><a name="good" id="good"><br />
<h2 id="it-was-for-your-own-good">It was for your own good</h2>
<p></a></p>
<p>First tech companies claimed they didn&#8217;t know anything about PRISM and weren&#8217;t supplying data (or at least not direct access), and now the story some sources close to those companies are telling is that they set up portals or some other method of complying with FISA requests in order to &#8220;protect the innocent,&#8221; <a href="http://techcrunch.com/2013/06/08/cooperation-methods-protected-innocents-from-prism/">according to a post at TechCrunch</a>.</p>
<blockquote id="quote-the-nsa-may-have-wan11"><p>&#8220;The NSA may have wanted full firehoses of data from Google, Facebook and other tech giants, but the companies attempted to protect innocent users from monitoring via compliance systems that created segregated data before securely handing it over as required by law.&#8221;</p></blockquote>
<p>The <em>Guardian</em> has responded to criticisms of its original description of PRISM and the whole notion of &#8220;direct access&#8221; &#8212; as well as the <a href="http://plus.google.com/+google/posts/TMh6gUVrwMq">repeated denials from Google executives</a> and others that this has been taking place &#8212; by posting another slide from the leaked NSA presentation. While some have speculated (as mentioned above) that PRISM could mean simply sucking data from ISP equipment, the <a href="http://www.guardian.co.uk/world/2013/jun/08/nsa-surveillance-prism-obama-live?guni=Network%20front:network-front%20full-width-1%20bento-box:Bento%20box:Position2#block-51b36893e4b0cc6424372292">NSA slide contrasts this method</a> of getting data with PRISM&#8217;s, which it describes again as &#8220;collection directly from the servers&#8221; of the companies mentioned. </p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/guardian-slide.png"><img src="http://gigaom2.files.wordpress.com/2013/06/guardian-slide.png?w=708" alt="Guardian slide"    class="aligncenter size-full wp-image-656012" /></a></p>
<p>The Director of National Intelligence <a href="http://www.dni.gov/index.php/newsroom/press-releases/191-press-releases-2013/872-dni-statement-on-the-collection-of-intelligence-pursuant-to-section-702-of-the-foreign-intelligence-surveillance-act">released another statement</a> on Saturday, calling the disclosures by the <em>Guardian</em> and <em>Washington Post</em> about NSA data collection &#8220;reckless&#8221; and filled with &#8220;significant misimpressions.&#8221; So DNI James Clapper said he had declassified some details about the program, <a href="http://www.dni.gov/files/documents/Facts%20on%20the%20Collection%20of%20Intelligence%20Pursuant%20to%20Section%20702.pdf">published in a fact sheet</a> (PDF link). Among other things, it says:</p>
<blockquote id="quote-prism-is-not-an-undi12"><p>&#8220;PRISM is not an undisclosed collection or data mining program. It is an internal government  computer system used to facilitate the government’s statutorily authorized collection of foreign intelligence information from electronic communication service providers under court supervision, as authorized by Section 702 of the Foreign Intelligence Surveillance Act.&#8221;</p></blockquote>
<p>The <em>Washington Post</em> published a follow-up story on Saturday that described the PRISM process in much the same way as earlier stories from the Guardian and the New York Times: as a system or software that <a href="http://www.washingtonpost.com/world/national-security/us-company-officials-internet-surveillance-does-not-indiscriminately-mine-data/2013/06/08/5b3bb234-d07d-11e2-9f1a-1a7cdee20287_story_1.html">allowed the NSA to process FISA requests</a> for information more quickly &#8212; and the paper reiterated earlier statements that because the program was top secret, only a few individuals within those companies would even know about it, let alone be able to discuss it. According to the Post:</p>
<blockquote id="quote-executives-at-some-o13"><p>&#8220;Executives at some of the participating companies, who spoke on the condition of anonymity, acknowledged the system’s existence and said it was used to share information about foreign customers with the NSA and other parts of the nation’s intelligence community.&#8221;</p></blockquote>
<p>Much of the criticism about the original <em>Post</em> story and the <em>Guardian</em> story has focused on the description of PRISM as allowing &#8220;direct access&#8221; to the servers of companies like Google, Facebook and Yahoo &#8212; something the leaders of those companies have strenuously denied providing. The most recent <em>Post</em> story suggests that at least some of the debate over this term is semantic, and that its sources say PRISM did allow the NSA to get data from those companies directly: </p>
<blockquote id="quote-intelligence-communi14"><p>&#8220;Intelligence community sources said that this description, although inaccurate from a technical perspective, matches the experience of analysts at the NSA. From their workstations anywhere in the world, government employees cleared for PRISM access may &#8216;task&#8217; the system and receive results from an Internet company without further interaction with the company’s staff.&#8221;</p></blockquote>
<p><a name="snowdon" id="snowdon"><br />
<h2 id="nsa-whistle-blower-reveals-his">NSA whistle-blower reveals his identity</h2>
<p></a></p>
<p>In another bombshell, the <em>Guardian</em> <a href="http://www.guardian.co.uk/world/2013/jun/09/edward-snowden-nsa-whistleblower-surveillance">revealed the identity of the whistle-blower</a> who sent them the leaked documents about PRISM and the NSA surveillance program: he is Edward Snowden, a 29-year-old former technical assistant at the Central Intelligence Agency, and he is now living in Hong Kong and expects he will &#8220;never see home again.&#8221; He said his family doesn&#8217;t know about his activities, and that he fully expects to be charged and potentially face jail time for his actions.</p>
<p>In <a href="http://www.guardian.co.uk/world/2013/jun/09/nsa-whistleblower-edward-snowden-why?guni=Network%20front:network-front%20full-width-1%20bento-box:Bento%20box:Position1:sublinks">an interview</a> with the <em>Guardian</em>, Snowden says that he gradually became frustrated with what the NSA was doing and believed it was wrong &#8212; but originally held off on leaking anything because he thought Barack Obama would change those policies when he was elected president. But Snowden says the president continued with &#8220;the policies of his predecessor&#8221; and so he decided to come forward and let the American public know what was happening behind closed doors:</p>
<blockquote id="quote-i-dont-want-to-live-15"><p>&#8220;I don&#8217;t want to live in a society that does these sort of things … I do not want to live in a world where everything I do and say is recorded. That is not something I am willing to support or live under.&#8221;</p></blockquote>
<p>Snowden also said the documents he leaked clearly show that &#8220;the NSA routinely lies in response to Congressional inquiries about the scope of surveillance in America&#8221; and that the abilities that he had as a contractor with the CIA were beyond what most people can even imagine:</p>
<blockquote id="quote-you-are-not-even-awa16"><p>&#8220;You are not even aware of what is possible. The extent of their capabilities is horrifying. We can plant bugs in machines. Once you go on the network, I can identify your machine. You will never be safe whatever protections you put in place.&#8221;</p></blockquote>
<h2 id="the-reaction">The reaction</h2>
<p>In a post written for <em>The Atlantic</em> magazine, James Fallows said that the <a href="http://www.theatlantic.com/politics/archive/2013/06/edward-snowden-in-hong-kong/276692/">most frightening and important part</a> about PRISM and the rest of the NSA surveillance activity revealed by Snowden is that it is all legal under the Foreign Intelligence Surveillance Act and other legislation.</p>
<blockquote id="quote-that-these-programs-17"><p>&#8220;That these programs are legal &#8212; unlike the Nixon &#8220;Plumbers&#8221; operation, unlike various CIA assassination programs, unlike other objects of whistle-blower revelations over the years &#8212; is the most important fact about them. They&#8217;re being carried out in &#8220;our&#8221; name, ours as Americans, even though most of us have had no idea of what they entailed.&#8221;</p></blockquote>
<p>Fallows &#8212; and others such as <a href="http://editors.talkingpointsmemo.com/archives/2013/06/whats_the_deal_with_hong_kong.php?ref=fpblg">Talking Points Memo founder</a> Josh Marshall &#8212; raised some question marks about the wisdom of Snowden&#8217;s choice of Hong Kong, which is still part of China and therefore not particularly open to harboring whistle-blowers. However, according to some experts in the law, Hong Kong might be a good place to seek asylum because of a loophole that <a href="http://www.globalpost.com/dispatch/news/regions/asia-pacific/china/130610/why-edward-snowden-hong-kong-extradition-asylum">could allow Snowden to remain there</a> indefinitely. </p>
<p>Icelandic MP Birgitta Jonsdottir, an early supporter of WikiLeaks and of freedom-of-information laws in general, told <em>Forbes</em> magazine that she plans to try and <a href="http://www.forbes.com/sites/andygreenberg/2013/06/09/icelandic-legislator-im-ready-to-help-nsa-whistleblower-seek-asylum/">get her country to offer</a> Snowden political asylum. But observers of the political scene in Iceland say this might be more difficult than it would have been in the past, since the new Conservative government is <a href="http://preview.reuters.com/2013/6/10/iceland-may-not-be-the-haven-us-leaker-hopes-1">seen as more friendly to</a> the Obama administration.</p>
<p>Daniel Ellsberg &#8212; the man <a href="http://en.wikipedia.org/wiki/Daniel_Ellsberg#The_Pentagon_Papers">who leaked the famous &#8220;Pentagon Papers&#8221;</a> in 1971 and revealed that the government had been lying about the Vietnam War &#8212; said in a piece written for the <em>Guardian</em> that Snowden&#8217;s leaks give the United States a chance to &#8220;roll back what is tantamount to an executive coup against the U.S. constitution.&#8221; Ellsberg said that Snowden&#8217;s revelations were the most important leak in the history of the United States, including his own.</p>
<blockquote id="quote-since-911-there-has-18"><p>&#8220;Since 9/11, there has been, at first secretly but increasingly openly, a revocation of the bill of rights for which this country fought over 200 years ago. In particular, the fourth and fifth amendments of the US constitution, which safeguard citizens from unwarranted intrusion by the government into their private lives, have been virtually suspended.&#8221;</p></blockquote>
<p>Meanwhile, David Kirkpatrick &#8212; author of the book &#8220;The Facebook Effect&#8221; &#8212; asked whether the secrecy and privacy invasions involved in the PRISM program <a href="https://www.linkedin.com/today/post/article/20130609225334-16549-did-obama-just-destroy-the-u-s-internet-industry?_mSplash=1">might impair the growth of</a> social networks and cloud services like Facebook.</p>
<blockquote id="quote-do-we-really-want-to19"><p>&#8220;Do we really want to impair such powerful tools for spreading dialogue, political discourse, and U.S. values? Is it worthwhile to impair the extraordinary financial and commercial success of these great flagships for the American economy? Does Obama want Facebook et al just to be seen as tools of American power?&#8221;</p></blockquote>
<p>Politico took a look at <a href="https://www.propublica.org/article/nsa-black-hole-5-basic-things-we-still-dont-know-the-governments-snoop">some of the things that we still don&#8217;t know</a> about PRISM and the activity involved in the NSA&#8217;s surveillance program &#8212; including how much data the spy agency has been collecting from phone companies as well as tech companies like Google, whether this data collection has actually thwarted any specific terrorist attempts or not (something that is the subject of much debate) and how exactly the PRISM program works in practice.</p>
<p>Meanwhile, the Daily Beast has a piece that looks at the group within the U.S. intelligence apparatus that hunt down leakers like Snowden, <a href="http://www.thedailybeast.com/articles/2013/06/10/inside-the-q-group-the-directorate-hunting-down-andrew-snowden.html">a kind of internal police force</a> called the Associate Directorate for Security and Counterintelligence &#8212; or the Q Group for short. And Salon magazine has a feature and interview with Laura Poitras, the documentary film-maker who was contacted by Snowden and <a href="http://www.salon.com/2013/06/10/the_woman_behind_the_nsa_scoops/">later helped both the <em>Post</em> and the <em>Guardian</em></a> write their stories about the leak.</p>
<p>Got anything I am missing? Let me know at <a href="mailto:mathew@gigaom.com">mathew@gigaom.com</a></p>
<p><em>Post and thumbnail images courtesy of <a href="http://www.shutterstock.com/gallery-540784p1.html">Shutterstock / Lightspring</a> and the Washington Post</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=230823&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=71625"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=71625" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/06/07/through-a-prism-darkly-tracking-the-ongoing-nsa-surveillance-story/feed/</wfw:commentRss>
		<slash:comments>25</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/06/shutterstock_94364473.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/06/shutterstock_94364473.jpg?w=150" medium="image">
			<media:title type="html">Surveillance</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0bdf7ab171ade0708a11fa3378e6d8cb?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Mathew</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/verizon-court-order.png" medium="image">
			<media:title type="html">Verizon court order</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/ohanian-tweet.png" medium="image">
			<media:title type="html">Ohanian tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/prism-screenshot.png" medium="image">
			<media:title type="html">prism screenshot</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/hippeau-tweet.png" medium="image">
			<media:title type="html">hippeau tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/gore-tweet.png" medium="image">
			<media:title type="html">gore tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/matthew-keys-tweet.png" medium="image">
			<media:title type="html">Matthew Keys tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/baio-tweet.png" medium="image">
			<media:title type="html">baio tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/jared-keller-tweet.png" medium="image">
			<media:title type="html">Jared Keller tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/soghoian-tweet1.png" medium="image">
			<media:title type="html">Soghoian tweet1</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/ashkan-tweet.png" medium="image">
			<media:title type="html">Ashkan tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/ambinder-tweet.png" medium="image">
			<media:title type="html">Ambinder tweet</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/eff-spying-timelines.png" medium="image">
			<media:title type="html">EFF spying timelines</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/guardian-slide.png" medium="image">
			<media:title type="html">Guardian slide</media:title>
		</media:content>
	</item>
		<item>
		<title>Why digital book publishers are starting to embrace data</title>
		<link>http://paidcontent.org/2013/04/17/why-digital-book-publishers-are-starting-to-embrace-data/</link>
		<comments>http://paidcontent.org/2013/04/17/why-digital-book-publishers-are-starting-to-embrace-data/#comments</comments>
		<pubDate>Wed, 17 Apr 2013 19:49:01 +0000</pubDate>
		<dc:creator>Eliza Kern</dc:creator>
				<category><![CDATA[data]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[open road integrated media]]></category>
		<category><![CDATA[paidcontent live 2013]]></category>
		<category><![CDATA[sourcebooks]]></category>
		<category><![CDATA[the atavist]]></category>

		<guid isPermaLink="false">http://paidcontent.org/?p=227901</guid>
		<description><![CDATA[How are book publishers learning more about our evolving reading habits? Not surprisingly, ebook publishers are turning the industry toward thinking more about making data-driven decisions.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=227901&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>With more than 20 percent of Americans <a href="http://libraries.pewinternet.org/2012/12/27/e-book-reading-jumps-print-book-reading-declines/" target="_blank">over the age of 16 having read an ebook in the past year</a>, and publishers seeing more than 20 percent of revenues come from ebook sales, there’s no question the future of ebooks is bright, and the industry has a lot of potential customers.</p>
<p>But how exactly ebook publishers reach that audience and how the industry tracks who’s interested in reading what is less clear. A series of ebook publishers who spoke at our <a href="http://event.gigaom.com/paidcontent/schedule/?utm_source=media&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=227901+why-digital-book-publishers-are-starting-to-embrace-data&amp;utm_content=elizakern" target="_blank">PaidContent Live conference in New York on Wednesday</a> talked about the critical importance of gathering data on readership and consumption, and using it to transform the industry:</p>
<p>“The old eveolution of the book publishers used to be very allergic to data. And what you just heard is a very different approach from that. For us it’s about metadata and surfacing. And then rinse and repeat,” said <a href="http://event.gigaom.com/paidcontent/speakers/?utm_source=media&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=227901+why-digital-book-publishers-are-starting-to-embrace-data&amp;utm_content=elizakern#dominique_raccah">Dominique Raccah</a>, the publisher and CEO of Sourcebooks. “Metadata is a new term in our industry, but it really is the key.”</p>
<p>Raccah pointed out that unless publishers know who is reading the content, it’s hard to craft specific marketing messages or know what people respond to:</p>
<p>“It’s really important to know that book publishers know a lot about what touches readers,” she said. “So it’s important to help craft those messages in interesting ways.”</p>
<p><a href="http://event.gigaom.com/paidcontent/speakers/?utm_source=media&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=227901+why-digital-book-publishers-are-starting-to-embrace-data&amp;utm_content=elizakern#rachel_chou">Rachel Chou</a>, the CMO for Open Road Integrated Media, said they’ve seen a lot of success working with Twitter, as well as sponsored stories in Facebook, to drive traffic and understand where customers are coming from.</p>
<p>“Then after a while, you start understanding what the best partners are,” she said.</p>
<p>Evan Ratliff, founder and CEO of <a href="https://www.atavist.com/">Atavist</a>, said they have a small team but because they’re especially focused on finding customers by building up the Atavist brand, understanding data on the company’s products is important.</p>
<p>“We’re also on a very small level, we’re experiment with different ways of reaching people and social media,” he said.</p>
<p><a href="http://paidcontent.org/2013/04/17/paidcontent-live-2013-coverage/">Check out the rest of our paidContent Live 2013 coverage here</a>, and a video embed of the session follows below:</p>
<p><iframe src="http://new.livestream.com/accounts/74987/events/2000322/videos/16657153/player?autoPlay=false&amp;height=360&amp;mute=false&amp;width=640" width="640" height="360" frameborder="0" scrolling="no"></iframe><br>
A transcription of the video follows on the next page</p>
<p><a href="http://paidcontent.org/2013/04/17/why-digital-book-publishers-are-starting-to-embrace-data/2/">Go to page 2 (of 2) on paidContent .</a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=227901&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=937916"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=937916" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://paidcontent.org/2013/04/17/why-digital-book-publishers-are-starting-to-embrace-data/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2013/04/img_3411.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2013/04/img_3411.jpg?w=150" medium="image">
			<media:title type="html">paidContent Live 2013 Rachel Chou Open Road Integrated Media Laura Hazard Owen</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/bd7905cba2440e49d86bd328573730f7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">elizakern</media:title>
		</media:content>
	</item>
		<item>
		<title>Buy laxative, get a fiber ad on Facebook: social network mulls expanding offline reach</title>
		<link>http://paidcontent.org/2013/02/22/buy-laxative-get-a-fiber-ad-on-facebook-social-network-mulls-expanding-offline-reach/</link>
		<comments>http://paidcontent.org/2013/02/22/buy-laxative-get-a-fiber-ad-on-facebook-social-network-mulls-expanding-offline-reach/#comments</comments>
		<pubDate>Fri, 22 Feb 2013 19:54:24 +0000</pubDate>
		<dc:creator>Jeff John Roberts</dc:creator>
				<category><![CDATA[advertising]]></category>
		<category><![CDATA[Dalalogix]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Epsilon]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[targeted ads]]></category>

		<guid isPermaLink="false">http://paidcontent.org/?p=225000</guid>
		<description><![CDATA[Facebook plans to tap into the loyalty reward data collected by retailers like drug stores to serve you ads based on your off-line purchases.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=225000&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><span style="font-size:13px;line-height:19px;">It&#8217;s clever but still creepy: Facebook is exploring partnerships</span><span style="font-size:13px;line-height:19px;"> with data companies that provide loyalty cards to retailers like drug stores and retailers in order to show you ads based on your shopping habits.</span></p>
<p>As <a href="http://adage.com/article/digital/facebook-partner-acxiom-epsilon-match-store-purchases-user-profiles/239967/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+adage/homepage+%28Advertising+Age+-+Homepage%29">AdAge reports</a>, Facebook may obtain the phone numbers and email addresses that customers supply to drug stores in return for discounts and loyalty points. The company will then match the numbers and emails with user Facebook profiles in order to create new marketing opportunities. This means, for instance, that I might buy ex-lax at the store and then see ads for All-Bran in my Facebook feed.</p>
<p>This merging of online and offline data creates powerful marketing opportunities and, as ad industry people like to point out, will let people see ads they&#8217;re theoretically interested in seeing. In this perfect world of data-based marketing, no company will waste their money showing baby food or make-up ads to the likes of me &#8212; and I will be happier because I don&#8217;t have to see those ads.</p>
<p>The flip side, of course, is the creepy factor. Facebook and the data companies (in this case Epsilon, Acxiom and Datalogix) do take care to protect so-called &#8220;PPI&#8221; (protected personal information) by scrubbing out names and just using other techniques to make the data anonymous. But one still shudders about the unencrypted information escaping into the wild through carelessness or hacking. In my case, it just increases my resolve to provide retailers with fictitious names and numbers.</p>
<p>For now, we may still be a ways of from the onslaught of fiber ads in our Facebook. According to AdAge, the new &#8220;custom audience&#8221; options are still winding their way through nervous corporate legal departments.</p>
<p>(Image by gcpics via Shutterstock)</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=225000&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=464032"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=464032" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://paidcontent.org/2013/02/22/buy-laxative-get-a-fiber-ad-on-facebook-social-network-mulls-expanding-offline-reach/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2013/02/shutterstock_100141400.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2013/02/shutterstock_100141400.jpg?w=150" medium="image">
			<media:title type="html">Fiber</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/05dfcf765f1554b08954bb9e1ee63363?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">jeffjohnroberts</media:title>
		</media:content>
	</item>
		<item>
		<title>If Twitter wants to be a media company, it needs to get a lot better at relevance</title>
		<link>http://paidcontent.org/2013/01/22/if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance/</link>
		<comments>http://paidcontent.org/2013/01/22/if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance/#comments</comments>
		<pubDate>Tue, 22 Jan 2013 22:56:31 +0000</pubDate>
		<dc:creator>Mathew Ingram</dc:creator>
				<category><![CDATA[curation]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[Future of Media]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google plus]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[paidContent Live]]></category>
		<category><![CDATA[Relevance]]></category>
		<category><![CDATA[social-media]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://paidcontent.org/?p=223581</guid>
		<description><![CDATA[Evidence of Twitter's ambition to become a media entity continues to accumulate, but if it wants to fulfil its role as a digital-media player, it is going to have to get a lot better at finding relevant content for its users.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=223581&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We’ve been arguing for some time now that <a href="http://gigaom.com/2012/01/31/sorry-dick-but-twitter-is-definitely-a-media-entity/">Twitter is becoming a media entity</a> in its own right, and some of the company’s moves around the Summer Olympics and other events have helped <a href="http://gigaom.com/2012/08/20/twitter-at-the-crossroads-growing-up-is-hard-to-do/">flesh out that theory</a>. John Battelle of Federated Media argues much the same thing in a new blog post — that Twitter wants to become a media company, and that doing so <a href="http://battellemedia.com/archives/2013/01/portrait-of-twitter-as-a-young-media-company.php">means curating and even creating</a> or “co-creating” content for its users. While this is undoubtedly true, Twitter is going to need to become a lot better at relevance and discovery if it really wants to be a new-media player.</p>
<p>In his post, Battelle describes how his thinking has been influenced by some of the recent offerings Twitter has come up with around broadcast events — such as <a href="http://oscars.topsy.com/">the “Oscars Index,”</a> (a partnership with Topsy) which tracks the sentiment around the Oscar-nominated movies and personalities leading up to the Academy Awards by analyzing tweets about them. Although he doesn’t mention it, Twitter also recently announced an even more ambitious effort <a href="http://blog.twitter.com/2012/12/coming-soon-nielsen-twitter-tv-rating.html">to create a verified “Twitter TV Rating”</a> for TV shows as part of a partnership with media-metrics company Nielsen.</p>
<p><a href="http://paidcontent.org/2013/01/22/if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance/twoscar/" rel="attachment wp-att-223583"><img src="http://gigaompaidcontent.files.wordpress.com/2013/01/twoscar.png?w=708&#038;h=429" alt="Twitter Oscars" width="708" height="429" class="aligncenter size-large wp-image-223583"></a></p>
<h2 id="having-lots-of-data-is-great-r">Having lots of data is great — relevance is better</h2>
<p>The Federated Media founder points out quite rightly that one of the things that makes Twitter a potential gold mine — both for media companies and for advertisers — is the number of signals that the network can accumulate about users, their behavior and their interests. <a href="http://www.cnn.com/2013/01/07/tech/social-media/library-congress-twitter/index.html">More than half a billion tweets a day</a> is a lot of data, and somewhere in the midst of that are the keys to delivering better content, and better advertising (which is increasingly becoming the same thing, a topic we’ll be discussing <a href="http://event.gigaom.com/paidcontent/?utm_source=media&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=223581+if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance&amp;utm_content=mathewingram">at paidContent Live in New York</a> on April 17). As Battelle puts it in his post:</p>
<blockquote id="quote-twitter-presents-a-m"><p>“Twitter presents a massive search problem/opportunity. For example, Twitter’s gotten better and better at what’s called “entity extraction” – identifying a person, place, or thing, then associating behaviors and attributes around that thing… Real time entity extraction crossed with signals like those described above is the Holy Grail.”</p></blockquote>
<p>This is fundamentally the same goal that both Google and Facebook are focused on as well: how do you show users only things that are relevant to them, and hide those that aren’t — in real time? Facebook has gotten criticism for <a href="http://www.wired.com/business/2012/11/george-takei-facebook/">the way it tweaks the news feed</a> based on its algorithms, but the reality is that most users don’t want to see everything that streams through their networks. And Google started its Google+ social network, and <a href="http://gigaom.com/2011/09/30/its-official-google-will-be-connected-to-everything/">built it into everything it does</a>, in part because it needs more data signals about its users.</p>
<p>The problem for all of these companies is that doing this is really, really hard — every user’s stream consists of billions of data signals, and deciphering which are meaningful and which aren’t is a complicated business. To get a sense of how difficult it is, all you have to do is <a href="https://twitter.com/i/discover">look at Twitter’s “Discover” tab</a>, or the “Trends” listings, or look at the promoted tweets and promoted trends that show up in your stream (of course, Facebook is almost as bad with its sponsored pages, and it has orders of magnitude more data).</p>
<h2 id="relevance-is-the-key-to-digita">Relevance is the key to digital media</h2>
<p><a href="http://paidcontent.org/2013/01/22/if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance/twitter-icons/" rel="attachment wp-att-94651"><img src="http://gigaompaidcontent.files.wordpress.com/2012/02/twitter-icons-o.jpg?w=150&#038;h=100" alt="Twitter Icons" width="150" height="100" class="alignleft size-thumbnail wp-image-94651"></a></p>
<p>As Battelle notes, Twitter has gotten better at discovery, and <a href="http://blog.twitter.com/2012/05/discover-better-stories.html">the revamped version of its Discover tab</a> is better than it used to be — and so are the suggestions Twitter sends to users for other accounts they should follow. But the Discover tab in particular is still light-years away from where it needs to be in order for it to be a compelling content-discovery mechanism for users, and <a href="http://gigaom.com/2012/01/20/twitter-acquisition-confirms-that-curation-is-the-future/">the curated email newsletter Twitter sends out</a> is even worse: it shows me Canadian news because I live in Toronto, even though it knows (or should) that I rarely ever tweet about that topic.</p>
<p>Simply put, relevance is the key attribute for any digital-media entity in the 21st century. Newspapers and other traditional sources of content are terrible at suggesting or curating relevant content for individual readers, but no one really expects them to be any good at it — they have zero experience in doing that. Twitter, however, has enough data that it arguably should be much better than it is. And it needs to get there quickly, before Google or Facebook (or <a href="http://allthingsd.com/20130121/new-yahoo-coo-henrique-de-castro-hints-at-the-future-of-the-web-portal-pro-tip-get-personal/">god forbid, even Yahoo</a>) get much better.</p>
<p>Coming up with visualizations about the Oscars or highlighting tweets about NASCAR may be useful for reeling in big media brands, but users are going to need a little more than that before they trust Twitter to curate content for them.</p>
<p><em>Post and thumbnail images <a href="http://creativecommons.org/licenses/by-sa/2.0/deed.en">courtesy</a> of <a href="http://www.shutterstock.com/gallery-710830p1.html">Shutterstock / noporn</a> and <a href="http://battellemedia.com/archives/2013/01/twitters-makin-media.php">John Battelle</a></em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=223581&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=848824"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=848824" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://paidcontent.org/2013/01/22/if-twitter-wants-to-be-a-media-company-it-needs-to-get-a-lot-better-at-relevance/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2013/01/shutterstock_106636133.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2013/01/shutterstock_106636133.jpg?w=150" medium="image">
			<media:title type="html">social media</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0bdf7ab171ade0708a11fa3378e6d8cb?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Mathew</media:title>
		</media:content>

		<media:content url="http://gigaompaidcontent.files.wordpress.com/2013/01/twoscar.png?w=708" medium="image">
			<media:title type="html">Twitter Oscars</media:title>
		</media:content>

		<media:content url="http://gigaompaidcontent.files.wordpress.com/2012/02/twitter-icons-o.jpg?w=150" medium="image">
			<media:title type="html">Twitter Icons</media:title>
		</media:content>
	</item>
		<item>
		<title>Music site This Is My Jam could spin out from Echo Nest</title>
		<link>http://gigaom.com/2013/01/13/music-site-this-is-my-jam-could-spin-out-from-echo-nest/</link>
		<comments>http://gigaom.com/2013/01/13/music-site-this-is-my-jam-could-spin-out-from-echo-nest/#comments</comments>
		<pubDate>Sun, 13 Jan 2013 13:00:40 +0000</pubDate>
		<dc:creator>Bobbie Johnson</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Andreas Jansson]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Echo Nest]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[Hannah Donovan]]></category>
		<category><![CDATA[last.fm]]></category>
		<category><![CDATA[Matthew Ogle]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[music data]]></category>
		<category><![CDATA[Online Music]]></category>
		<category><![CDATA[pandora]]></category>
		<category><![CDATA[Ralph Cowling]]></category>
		<category><![CDATA[social-media]]></category>
		<category><![CDATA[song-sharing site]]></category>
		<category><![CDATA[spotify]]></category>
		<category><![CDATA[the echo nest]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=601406</guid>
		<description><![CDATA[A year after it launched as a skunkworks project inside music data company The Echo Nest, trendy social music site This Is My Jam is "looking at options" for going independent — as well as getting ready to launch some fun new site exploration features.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=223254&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Buzzy song-sharing site <a href="http://www.thisismyjam.com">This Is My Jam</a> could be going independent from its parent company as it prepares to take the next step in its evolution.</p>
<p>The site, which lets people share their favorite music track-by-track, has proven an underground hit online less than a year after launching publicly: more than 100,000 users have signed up, sharing over 900,000 songs. But the London-based service was started as a pet project inside music data company <a href="http://www.echonest.com">The Echo Nest</a> — and it&#8217;s now exploring what happens next.</p>
<p>&#8220;Up until this point we&#8217;ve been incubated by The EchoNest, but now we&#8217;re looking at options for spinning out in our own right,&#8221; creator Matthew Ogle told me.</p>
<p>It&#8217;s been an unusual course for This Is My Jam so far — in fact, Ogle says that the site &#8220;wasn&#8217;t supposed to happen for a whole bunch of reasons&#8221;. Chief among them? The fact he&#8217;d decided to move out of the online music industry after leaving his role as head of web product at <a href="http://www.last.fm">Last.fm</a> back in 2010.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/01/timj-logo_cs5.jpg"><img src="http://gigaom2.files.wordpress.com/2013/01/timj-logo_cs5.jpg?w=230&#038;h=300" alt="This Is My Jam logo" width="230" height="300"  class="alignleft size-medium wp-image-601414" /></a>&#8220;Despite swearing off online music forever, in less than a year I&#8217;d been convinced by the awesome folks at The Echo Nest that we could do some cool stuff together,&#8221; he says. &#8220;They didn&#8217;t hire me to make Jam, they hired me in a dual role to be their man in Europe — evangelizing at hackdays, talking to developers — and also to be a kind of internal product skunkworks, prototyping stuff based on new APIs and using that to spark new direction that The Echo Nest could be going with their data.&#8221;</p>
<p>But when it turned out that Jam, an idea he&#8217;d been throwing for a while, had more going for it than the other skunkworks proposals, Ogle&#8217;s focus switched and the parent company funded development. Jam now has four full-time staff, as Ogle brought on former Last.fm refugee Hannah Donovan, and engineers Ralph Cowling and Andreas Jansson.</p>
<h2 id="the-failure-of-frictionless">The failure of frictionless</h2>
<p>The site is one of my favorite services to have launched in the last year or two, and it&#8217;s one I&#8217;ve found myself going back to more often than I expected. Posting your favorite tracks and browsing the tracks of others turns the site into a curious mixture of status update and radio station. It&#8217;s a great tool for telling people about the music you like — lots of people use it to showcase their mood, for example. But it&#8217;s also a treasure trove of music, allowing you to dig around in the tastes of others. Because it&#8217;s almost exclusively focused on what people are listening to <em>now</em>, it has a real-time quality to it… yet the conscious decision that goes into making your choice means that the end result is more personal than Pandora but way more curated than Spotify&#8217;s frictionless sharing. </p>
<p><a href="http://gigaom2.files.wordpress.com/2013/01/thisismyjam.jpg"><img src="http://gigaom2.files.wordpress.com/2013/01/thisismyjam.jpg?w=300&#038;h=200" alt="This Is My Jam screenshot" width="300" height="200"  class="alignleft size-medium wp-image-601407" /></a>In fact, says Ogle, the failure of frictionless sharing to provide anything more than a fleeting dip into a raging river of data left an interesting gap for Jam to fill.</p>
<p>&#8220;I couldn&#8217;t quite believe that in 2011 that social song sharing wasn&#8217;t just a solved problem,&#8221; he explains. &#8220;One thing that we talked about at last.fm a lot — you know that amazing moment in real life when someone grabs you and say &#8216;you have to hear this song, check it out&#8217; and they put the headphones in your ears or put a record on. Anyone who cares about music even a little bit has had that moment.&#8221;</p>
<p>&#8220;We&#8217;re in this golden age of social media, we&#8217;re always connected to all of our friends at all times, and there&#8217;s big data around music — Spotify and Facebook have basically taken scrobbling to the mainstream — [so] there should be more ways than ever for me to go &#8216;I want to hear some new tunes, what are my friends listening to?&#8217; and get good stuff… not just whatever they happened to accidentally listen to on Spotify. Conversely there was no way for me to share a song that people would still see five hours later. Everything was being forced in real-time.&#8221;</p>
<h2 id="curationexploration">Curation/exploration</h2>
<p>Donovan, who heads up the site&#8217;s design, points out that isn&#8217;t just vanity that drives — the performative aspect of social media where you are showing off your taste to others — but a sort of shared curation where users collaborate to uncover interesting tracks, point to classics or dig up forgotten material. </p>
<p>&#8220;This was actually really cool when we discovered this happening, because there&#8217;s no other music service on the internet where it&#8217;s OK to have old stuff mixed in too,&#8221; she says. &#8220;Either everything is organized around the music data thing — singles live inside albums that live inside artists — or it&#8217;s promotional in some respect, in which case they&#8217;re always pushing the latest album or the latest single that just came out.&#8221;</p>
<p>&#8220;On top of that I think there&#8217;s something around our culture today that&#8217;s driven by newness, and how things on the internet always have to be the latest and the newest. I heard somebody say when you&#8217;re curating something, you don&#8217;t necessarily want just the latest or the newest, you want to dig up old things that were really great and put them back in context alongside newer things, or mixed in with other stuff. That&#8217;s the job of the curator and that&#8217;s what makes it really enjoyable for the user or the observer.&#8221;</p>
<p>&#8220;We thought about curation a lot when we were starting Jam. The overall effect is that our users became the curators of this music, and we wound up with this lovely space where you could get Prince and Fleetwood Mac right next to the latest trendy pop band.&#8221;</p>
<p><a href="http://gigaom.files.wordpress.com/2010/01/echonest.jpg"><img src="http://gigaom.files.wordpress.com/2010/01/echonest.jpg?w=708" alt="echonest"    class="alignright size-full wp-image-251805" /></a>That is, in turn, helping the product develop further forward. Coming very soon are some new additions to the site&#8217;s <a href="http://www.thisismyjam.com/explore">Explore</a> pages, which will offer up some new ways for users to browse the site. Explore categories will include &#8220;Breaking&#8221; (songs that have been jammed for the first time recently, and subsequently shared), &#8220;Rare&#8221; (songs that have been jammed only once), and another one that lists just the first jams of newly-registered users. Although it could act as a way for users to introduce themselves to the service, the first jams could also become a sort of <em>Greatest Hits</em> package.</p>
<p>&#8220;A lot of people sign up and start with their favorite song of all time,&#8221; says Ogle. </p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=223254&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=454219"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=454219" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/01/13/music-site-this-is-my-jam-could-spin-out-from-echo-nest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/ogle-donovan-thisismyjam2.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/ogle-donovan-thisismyjam2.jpg?w=150" medium="image">
			<media:title type="html">matt ogle and hannah donovan of This Is My Jam</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6e5c23eccd5022fef0059f01c98c2ea4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">bobbiejohnson</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/timj-logo_cs5.jpg?w=230" medium="image">
			<media:title type="html">This Is My Jam logo</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/thisismyjam.jpg?w=300" medium="image">
			<media:title type="html">This Is My Jam screenshot</media:title>
		</media:content>

		<media:content url="http://gigaom.files.wordpress.com/2010/01/echonest.jpg" medium="image">
			<media:title type="html">echonest</media:title>
		</media:content>
	</item>
		<item>
		<title>Hang on, maybe Facebook did drive Black Friday sales after all</title>
		<link>http://gigaom.com/2012/11/28/hang-on-maybe-facebook-did-drive-black-friday-sales-after-all/</link>
		<comments>http://gigaom.com/2012/11/28/hang-on-maybe-facebook-did-drive-black-friday-sales-after-all/#comments</comments>
		<pubDate>Wed, 28 Nov 2012 22:26:55 +0000</pubDate>
		<dc:creator>Jeff John Roberts</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ad tech]]></category>
		<category><![CDATA[aggregated knowledge]]></category>
		<category><![CDATA[Conversion Rates]]></category>
		<category><![CDATA[cyber shopping]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[online-advertising]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=589046</guid>
		<description><![CDATA[A study of Black Friday cyber-shopping said that social media advertising was a big bust with few people buying things in response to an ad from Facebook. The story is very different if you use other metrics to define "responded."<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221312&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>They say Facebook has an ad problem because, unlike Google, the social network is like a party &#8212; and no one wants to leave a party to go shopping. This theory got a boost this week when an IBM study of Black Friday suggested that social media&#8217;s effect on cyber-shopping was nearly zilch.</p>
<p>D&#8217;oh! This is precisely the sort of thing that retailers who have just laid out a million bucks for Facebook or Twitter ads don&#8217;t want to hear. But here&#8217;s <a href="http://www-01.ibm.com/software/marketing-solutions/benchmark-reports/black-friday-2012.html?cm_mmc=holiday2012-benchmark-reports-_-press-release-_-wire-_-text-link">the study</a>, showing that Facebook ads only led to 0.68 percent of shoppers making an online purchase on Black Friday (and Twitter ads produced no sales at all!) :</p>
<p><a href="http://gigaom.com/2012/11/28/hang-on-maybe-facebook-did-drive-black-friday-sales-after-all/screen-shot-2012-11-28-at-4-33-23-pm/" rel="attachment wp-att-589064"><img  alt="" src="http://gigaom2.files.wordpress.com/2012/11/screen-shot-2012-11-28-at-4-33-23-pm.png?w=708"   class="aligncenter size-full wp-image-589064" /></a></p>
<p>Fortunately for retailers (and Facebook), there is more to the story. According to a media intelligence service, <a href="http://www.aggregateknowledge.com/#&amp;panel1-1">Aggregated Knowledge</a> (AK), Facebook ads have a big effect on online purchase decisions &#8212; leading to a 51 percent increase in conversions (the term for when people buy or do something in response to an ad). AK also says Facebook ads are a way for brands to reach big target audiences they wouldn&#8217;t reach elsewhere online, and that &#8220;conversion rates&#8221; are 72 percent cheaper than for other online channels.</p>
<p>AK knows this because it works with large social media advertisers to collect data about Facebook user behavior. This data collection is possible because Facebook recently <a href="http://www.reuters.com/article/2012/11/16/us-facebook-ads-idUSBRE8AF0WC20121116">provided e-tailers with tools</a> that will let them track users to see who responded to an ad. (Yes, there&#8217;s a creepy factor here though major sites like Amazon and Google have long used similar tools).</p>
<p>AK&#8217;s numbers are good news for Facebook but the social network will still have to persuade advertisers and the media to accept them. The challenge lies in getting ad types to accept the notion of &#8220;multitouch attribution&#8221; rather than &#8220;last touch.&#8221; In plain English, this means that conversions should be counted even if the occur hours or days after you leave Facebook &#8212; for instance, you see a bath salts ad on Facebook, surf dozens of other websites, and then buy those bath salts the next day. (TechCrunch&#8217;s Josh Constine has made the same point well <a href="http://techcrunch.com/2012/11/26/the-social-commerce-attribution-problem/">here</a>).</p>
<p>Seen from this lens, Facebook&#8217;s &#8220;party&#8221; problem isn&#8217;t so bad. After all, we discover lots of new desires at a party &#8212; that girl&#8217;s dress, that guy&#8217;s cowboy hat, those delicious cocktail wienies &#8212; that we go out and buy for ourselves later on.</p>
<p><em>(Image by <a href="http://www.shutterstock.com/gallery-463936p1.html">Ariwasabi</a> via Shutterstock)</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221312&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=285034"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=285034" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/11/28/hang-on-maybe-facebook-did-drive-black-friday-sales-after-all/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/11/shutterstock_43788973.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/11/shutterstock_43788973.jpg?w=150" medium="image">
			<media:title type="html">shutterstock_43788973</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/05dfcf765f1554b08954bb9e1ee63363?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">jeffjohnroberts</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/11/screen-shot-2012-11-28-at-4-33-23-pm.png" medium="image" />
	</item>
		<item>
		<title>Should publishers invest in audience data? Depends on the publisher</title>
		<link>http://paidcontent.org/2012/11/26/should-publishers-invest-in-audience-data-depends-on-the-publisher/</link>
		<comments>http://paidcontent.org/2012/11/26/should-publishers-invest-in-audience-data-depends-on-the-publisher/#comments</comments>
		<pubDate>Mon, 26 Nov 2012 14:05:51 +0000</pubDate>
		<dc:creator>Jeff John Roberts</dc:creator>
				<category><![CDATA[ad tech]]></category>
		<category><![CDATA[advertising]]></category>
		<category><![CDATA[bluekai]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Omar Tawakol]]></category>
		<category><![CDATA[programmatic buying]]></category>
		<category><![CDATA[Vikram Somaya]]></category>

		<guid isPermaLink="false">http://paidcontent.org/?p=221054</guid>
		<description><![CDATA[Web publishers can buy tools that let them identify and segment their readers, and then combine that information with other customer data to offer fine-grained audience options to advertisers. As the ad market gets more demanding, the tools may become essential for some - but not all.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221054&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Many publishers feel buffeted by data tools that let advertisers bid for online ads in real time. The tools have led to a decline in ad prices but, now, publishers have more opportunities to level the playing field by tapping into data sets of their own.</p>
<p>This, anyways, is the promise of BlueKai, a company that helps publishers and brands collect information and marketing insights about their audiences. BlueKai&#8217;s primary customers have always been advertisers but now it&#8217;s turning more attention to publishers, <a href="http://www.bluekai.com/publishers-use-cases.php">offering tools</a> that help them create reader profiles and develop data-driven ad sale strategies.</p>
<p>&#8220;The world before for publishers was &#8216;I have these great audiences, I have these places&#8217; please buy them&#8217;,&#8221; said BlueKai CEO Omar Tawakol in a recent phone interview. Now, though, ad buyers want publishers to supply information that lets them reach very specific groups of people &#8212; soccer dads, not soccer moms for instance. Publishers can offer this information by mining their audience data (including registration info and log-in location) and combining it with data BlueKai has about those users&#8217; web-surfing activities. Such tools can also help publishers track when the same customer is coming to the site through a desktop, tablet or cell phone device.</p>
<p>The data tools are fine in theory but are they a practical investment for publishers who could spend the money on writers or developers instead? According to one executive, some publishers might not have a choice.</p>
<p>Vikram Somaya, a former VP of Operations and Audience at Reuters, says that audience data has become &#8220;table stakes&#8221; for premium publishers if they want to be considered for major ad purchases by national brands. In practice, this means that a company like Ford might only consider buying online ads for a given campaign if the publisher can provide a target audience of car buyers.</p>
<p>&#8220;Anyone who hasn’t picked an [audience data] product, is behind&#8230; it’s not an option to ignore anymore,&#8221; said Somaya in a phone interview. He acknowledged though that, for now, BlueKai-type tools may only be practical for big publishers that are in Comscore&#8217;s list of top 500 web sites. (Somaya, who is now a VP at the Weather Company, used to work at BlueKai but said his opinion is impartial).</p>
<p>A publisher&#8217;s size, however, may not be the only consideration about whether it should invest in audience data. The nature of its readers is a consideration too. The more general a publisher&#8217;s audience, the greater its need for data. As the <a href="http://www.ft.com/cms/s/0/4e8de6ee-3274-11e2-916a-00144feabdc0.html#axzz2CsKGjXnP">FT reported </a>last week (reg. req&#8217;d), Time Inc has just launched a major new ad strategy based on mining customer information so as to offer better-placed ads.</p>
<p>On the other hand, audience data is less critical for more specialized publications. As an executive for a major New York based publication, who did not want to be named, explained by email:</p>
<blockquote><p>While we are a BlueKai customer, our experience is that most advertisers do not require audience segmentation data as part of their buy. It certainly sometimes comes up, but it’s the relatively rare exception, not the rule. My guess is that it’s probably more important for publishers with a more mass audience. Our audience is relatively targeted (and valuable) already and so segmentation isn’t needed to get the audience advertisers are seeking.</p></blockquote>
<p>What this means in practice is that niche publishers can likely forgo fancy data tools for now.</p>
<p><em>(Image by <a href="http://www.shutterstock.com/gallery-776821p1.html">Dusit</a> via Shutterstock)</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221054&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=716855"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=716855" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://paidcontent.org/2012/11/26/should-publishers-invest-in-audience-data-depends-on-the-publisher/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2012/11/shutterstock_99799694.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2012/11/shutterstock_99799694.jpg?w=150" medium="image">
			<media:title type="html">shutterstock_99799694</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/05dfcf765f1554b08954bb9e1ee63363?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">jeffjohnroberts</media:title>
		</media:content>
	</item>
		<item>
		<title>Data isn&#8217;t just the new oil, it&#8217;s the new money. Ask Zoë Keating</title>
		<link>http://gigaom.com/2012/11/20/data-isnt-just-the-new-oil-its-the-new-money-ask-zoe-keating/</link>
		<comments>http://gigaom.com/2012/11/20/data-isnt-just-the-new-oil-its-the-new-money-ask-zoe-keating/#comments</comments>
		<pubDate>Wed, 21 Nov 2012 02:31:13 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big-data]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[digital-copyright]]></category>
		<category><![CDATA[online data]]></category>
		<category><![CDATA[pandora]]></category>
		<category><![CDATA[streaming media]]></category>
		<category><![CDATA[user data]]></category>
		<category><![CDATA[web privacy]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=586855</guid>
		<description><![CDATA[In the fight about royalties from streaming media services like Pandora, Popular cellist Zoë Keating says she's willing to give up the money in exchange for data. It's an idea that's gaining traction elsewhere, too, as more companies are paying consumers for their truly valuable data.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221002&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>People love to call data the new oil, but that might be selling it short. It&#8217;s only oil when we&#8217;re talking about pools of unrefined data like the stuff web companies collect, which has to be processed and transformed into something useful. There are certain types of data, though &#8212; especially data about consumers &#8212; that are as good as money in the bank without any work at all. And if you don&#8217;t believe me, ask popular cellist Zoë Keating.</p>
<p>As <a href="http://www.nytimes.com/2012/11/05/business/media/fight-growing-over-online-royalties.html?_r=0">a bill attempting to lower the royalty rates</a> paid to artists by streaming music services such as Pandora works its way through Congress, Keating <a href="http://zoekeating.tumblr.com/post/35737991443/what-i-want-from-internet-radio">took to her Tumblr blog last week</a> and offered a solution that both sides should listen to, but won&#8217;t. You might have <a href="http://www.billboard.biz/bbbiz/industry/digital-and-mobile/value-of-music-streaming-is-data-says-artist-1008018162.story">read about her stance in Billboard</a> or <a href="http://www.itworld.com/big-data/317769/data-ultimate-internet-music-royalty?page=0,1">ITworld</a> already, <a href="http://entertainment.slashdot.org/story/12/11/20/0312215/one-musicians-demand-from-pandora-mandatory-analytics">or perhaps on Slashdot</a>. If you haven&#8217;t, here it is in a nutshell, from Keating&#8217;s blog: &#8220;The law only demands I be paid in money, which at this point in my career is not as valuable as information. I’d rather be paid in data.&#8221;</p>
<p>Leaving aside the entire issue about royalties and copyright (and privacy policies), her statement is still powerful. Keating understands that in order to prosper in a world of digital music &#8212; just like in the world of e-commerce, digital publishing, you name it &#8212; information is power. The names, email address and perhaps mobile numbers of individuals listening to her music are nice, clean data that Keating could use with little to no analytic effort by reaching out to fans when a new tour is coming to town or a new album drops.</p>
<p>Actually, Keating <a href="http://zoekeating.tumblr.com/post/36160121213/more-about-data-vs-royalties">noted in a subsequent blog post on Tuesday</a> that even less-personal data can have a material impact on a performer&#8217;s bottom line. Using postal code data provide to her from iTunes sales, she&#8217;s able to plan tours more efficiently because she knows, or can make a safe assumption, that she has paying fans in certain cities.</p>
<p>Touring and merchandise sales remain most artists&#8217; primary means of income, and the current royalty rate of $.0011 per play doesn&#8217;t add up fast (<a href="https://docs.google.com/spreadsheet/ccc?key=0AkasqHkVRM1OdGhjdExSMzYyMXFZUkZNSUJrY3MwNXc&amp;pli=1#gid=0">at least according to Keating&#8217;s math</a>), so it&#8217;s easy to see why she &#8212; and probably many other up-and-coming or niche performers &#8212; would rather have the data that properties like Pandora almost certainly have.</p>
<p>And whether Keating knows it or not, the idea of using data as a substitute for money extends beyond web radio stations and musicians arguing about royalties. A couple weeks ago, I <a href="http://gigaom.com/data/will-consumers-trade-the-keys-to-the-data-castle-for-a-5-gift-card/">highlighted a handful of attempts</a> to convince consumers to hand over, in exchange for cash rewards or product discounts, valuable data that advertisers can&#8217;t collect by tracking their online activity. This is data such as recent and future purchases, personal interests, your web-surfing habits and where you shop in the physical world.</p>
<p>Just like Keating is willing to forgo one-tenth of one cent per play (real money, even if not a lot) in exchange for data, these brands are willing to trade cash or something like it for data <a href="http://gigaom.com/data/5-ideas-to-help-everyone-make-the-most-of-big-data/">they don&#8217;t have to run through a Hadoop cluster and seven segmentation algorithms</a> before they can tie it to a real person. They know they have to give a little bit in order to improve upon the status quo that&#8217;s good, but not nearly good enough for their purposes.</p>
<p>Previously, the notion that data is the currency of the web meant users gave away their behavior data to web sites in exchange for free services. Slowly but surely, however, that notion seems to be evolving. Maybe Zoë Keating wants data in lieu of royalties for the privilege of streaming her music, and maybe a web site wants my offline location data enough to give me a gift card worth enough that I&#8217;d hand it over. Either way, it&#8217;s all about the realization that some data is worth its weight &#8212; and then some &#8212; in cold, hard cash.</p>
<p><em>Feature image courtesy of <a href="http://www.flickr.com/photos/eschipul/3351462308/sizes/m/in/photostream/">Flickr user eschipul</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=221002&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=715703"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=715703" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/11/20/data-isnt-just-the-new-oil-its-the-new-money-ask-zoe-keating/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/11/zoe-keating.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/11/zoe-keating.jpg?w=150" medium="image">
			<media:title type="html">Zoe Keating</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Prismatic&#8217;s Bradford Cross: First we understand media, then the world</title>
		<link>http://gigaom.com/2012/10/02/prismatics-bradford-cross-first-we-understand-media-then-the-world/</link>
		<comments>http://gigaom.com/2012/10/02/prismatics-bradford-cross-first-we-understand-media-then-the-world/#comments</comments>
		<pubDate>Tue, 02 Oct 2012 18:11:57 +0000</pubDate>
		<dc:creator>Mathew Ingram</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Bradford Cross]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[entrepreneur]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[Future of Media]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[newspapers]]></category>
		<category><![CDATA[Prismatic]]></category>
		<category><![CDATA[profile]]></category>
		<category><![CDATA[social-networks]]></category>
		<category><![CDATA[startup]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567094</guid>
		<description><![CDATA[Prismatic founder Bradford Cross doesn't come from a traditional media background -- he is a data scientist who specializes in machine learning -- but what he is doing with content recommendations says a lot about how the media business is evolving and what the future might look like.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=218564&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Prismatic co-founder and CEO Bradford Cross doesn&#8217;t have big dreams &#8212; he just wants to <a href="http://gigaom.com/2012/05/03/prismatic-wants-to-be-the-newspaper-for-a-digital-age/">revolutionize the way that we consume media</a>, and then after that he wants to bring his brand of data-powered artificial intelligence to every other form of consumer behavior. Right now, Prismatic is a news aggregation and recommendation engine, similar to other services such as News360 or Zite, but Cross says it is <a href="http://gigaom.com/2012/09/18/prismatic-takes-on-twitter-in-the-race-to-build-a-better-serendipity-engine/">only the first stage in his plan</a> to bring personalization to other aspects of our lives &#8212; to become a kind of smart assistant and serendipity engine for the world.</p>
<p>These dreams may be difficult to see when you look at the cramped office Cross and his five co-workers inhabit in San Francisco (a converted apartment that had a mattress on the floor the last time I visited) but they come to life when you talk to him. Of course, that passion also means it can be hard to get a word in edgewise sometimes, as the Prismatic founder holds forth on his vision for the future, or how current recommendation services are failing users, or the state of affairs in the Twitterverse. Like a lot of startup founders, Cross can be a bit of a whirling dervish of ideas &#8212; but listening to him is almost always worth it.</p>
<p>Prismatic is interesting in part because it isn&#8217;t run by media experts or anyone with journalism training; in fact, Cross has no media background of any kind. His specialty is data science and machine learning, and he says he chose to focus initially on news recommendation because it is an obvious problem &#8212; the <a href="http://gigaom.com/2012/09/18/prismatic-takes-on-twitter-in-the-race-to-build-a-better-serendipity-engine/">deluge of information that we are all submerged in requires smart filters</a> &#8212; and solving it will help lead the way to other similar problems. And one of the reasons why media players of all kinds would be wise to pay attention is that this data-powered filtering ability could be the key to whatever success the media industry has in the future, as traditional gatekeepers are replaced by crowd-powered and algorithm-driven sources.</p>
<p>&#8220;Brad is probably one of the leaders right now in using big data and machine learning to provide consumer services,&#8221; says Jason Freedman, his former co-founder at Flightcaster, the startup that the two worked on for several years <a href="http://blog.flightcaster.com/flightcaster-acquired">before selling it last year</a>. Using that knowledge of data science and machine learning, Cross says that he hopes to eventually learn so much about his users via Prismatic that he can start making smart recommendations about almost everything, from movies or books to varieties of scotch or even people:</p>
<blockquote id="quote-the-idea-is-that-we-"><p>&#8220;The idea is that we become this trusted agent that you rely on to show you things, and over time we can really start to learn a lot about you. We do care a lot about [news recommendation], but we&#8217;ve also thought through how it&#8217;s a stepping stone to something much bigger. And a lot of what we do in the background, and how we slice and dice data and so on&#8230; is relevant across a really wide range of problems.&#8221;</p></blockquote>
<h2 id="lessons-learned-from-failure">Lessons learned from failure</h2>
<p><a href="http://gigaom2.files.wordpress.com/2012/10/screen-shot-2012-10-02-at-11-04-12-am.png"><img src="http://gigaom2.files.wordpress.com/2012/10/screen-shot-2012-10-02-at-11-04-12-am.png?w=300&#038;h=200" alt="" title="Screen Shot 2012-10-02 at 11.04.12 AM" width="300" height="200"  class="alignleft size-medium wp-image-568998" /></a></p>
<p>Cross started out in computer science at Virginia Polytechnic Institute, but later switched to finance after he grew bored with his computer courses. Even then, the now-32-year-old founder was interested in large-scale machine learning projects and what they were capable of &#8212; and that desire would cost him dearly, at least in the short term. After university, he and a friend started their own investment fund and did fairly well (a 30-percent return in their first year, Cross says) and he parlayed that into a job with a small hedge fund called O&#8217;Higgins Asset Management in Miami. </p>
<p>Given his own fund to run, he made even better returns and generated a substantial income for himself, even though he was just 23 at the time. But Cross says he was obsessed with starting his own trading firm, based on his own software. Unfortunately, he admits now that he didn&#8217;t really know what he was doing, and the job he took on &#8212; building his own data-driven trading algorithms &#8212; was far beyond what he was capable of at the time, and he says he learned a painful lesson about entrepreneurialism:</p>
<blockquote id="quote-i-basically-failed-a2"><p>&#8220;I basically failed at it, and lost all the money I had made through actual trading, which is ironic. I had no idea what I was doing. It was a pretty hard-core problem, one of the more complicated things in the world to do. It was a very interesting experience that lasted for about 12 months. I knew it wasn&#8217;t going to work but I didn&#8217;t want to let it go. And eventually I realized I just wasn&#8217;t very good at building software.&#8221;</p></blockquote>
<p>Humbled by that experience, the Prismatic founder says he realized that he needed a lot more than just some stock-trading skills and a computer-science background in order to do what he had in mind, so he decided to try and get a job that would allow him to broaden his skill set. Freedman says this is a classic Cross-type move: when he realizes he needs to learn certain things in order to advance, the Prismatic founder figures out a way to get that knowledge. &#8220;Brad has this thing where he realizes something is holding him back and he just goes into a 100-percent all-out focused sprint to get better at it,&#8221; says his Flightcaster co-founder.</p>
<p>Not long ago, Freedman says he told Cross that if he wanted to hire engineers, he would have to become better known in the hacker community &#8212; but his blog was old and stale, and he never spent any time on <a href="http://news.ycombinator.com">Hacker News</a> or other popular sites. &#8220;So he just went all out, and built a new profile, and wrote all these really smart blog posts, got a new blog design and in seven days had learned everything you need to know about using social media,&#8221; says Freedman, who is now running a startup called 42floors.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/10/1_product_feeds__2329fb9d.jpg"><img src="http://gigaom2.files.wordpress.com/2012/10/1_product_feeds__2329fb9d.jpg?w=604&#038;h=407" alt="" title="1_product_feeds__2329fb9d" width="604" height="407"  class="aligncenter size-large wp-image-568999" /></a></p>
<h2 id="the-building-blocks-of-a-smart">The building blocks of a smart recommendation engine</h2>
<p>After blowing most of his savings on his failed attempt to build a trading system, Cross wound up getting a job with <a href="http://www.thoughtworks.com/about-us">ThoughtWorks</a>, a consulting company that he contacted when his venture was falling apart. The company&#8217;s specialty was getting parachuted in to fix things at other companies when they were coming off the rails, and one of the companies Cross spent a lot of time working with was Google. The team spent much of its time on the operating system that powered Google&#8217;s storage clusters, but he also worked on consumer products such as Books, Cross says (and met his wife, a Slovak software engineer).</p>
<blockquote id="quote-i-didnt-especially-e3"><p>&#8220;I didn&#8217;t especially enjoy it, but it taught me a lot &#8212; it taught me a lot about what not to do. You get an understanding of really gnarly technical issues [and] at the same time, you develop this sixth sense where you can see early on where something bad will lead you. That was a huge experience for me.&#8221;</p></blockquote>
<p>After the Google stint came Flightcaster, a Y Combinator-funded startup. Freedman and his partner Evan Konwiser needed a more technical co-founder in order to get their funding from the incubator (&#8220;someone who could actually build what they were talking about,&#8221; Cross says) and so they brought Cross in to run the technical side. He designed and built the systems that <a href="http://www.datawrangling.com/how-flightcaster-squeezes-predictions-from-flight-data">sucked in terabytes worth of data from airlines, weather services and other sources</a> and made recommendations about routes or flights. &#8220;Flightcaster was a very early permutation of this kind of thing [social recommendations], which is the whole reason why I joined,&#8221; Cross says.</p>
<p>Once Flightcaster was sold, Cross finally had enough time, knowledge and resources to start his own venture, and Prismatic <a href="http://measuringmeasures.com/blog/2011/1/9/flightcaster-gets-acquired-i-go-on-to-start-woven.html">was born in early 2011 and initially known as Woven</a>. The idea was to use the machine-learning techniques that Flightcaster was based on to filter through and understand social data, and then use that to make recommendations about news and other content. In other words, to create <a href="http://gigaom.com/2012/05/03/prismatic-wants-to-be-the-newspaper-for-a-digital-age/">a kind of customized newspaper for users based on their activity</a> on social networks like Twitter and Facebook.</p>
<h2 id="can-a-mad-scientist-reinvent-t">Can a mad scientist reinvent the way media works?</h2>
<p>Facebook may have close to a billion users and vast storehouses of knowledge &#8212; as does Google &#8212; and Twitter may be <a href="http://gigaom.com/2012/09/22/twitter-ceo-wish-list-curation-tools-tweet-downloads-tom-brady/">focusing more on the kind of smart curation</a> that Prismatic is talking about as the service evolves, but Cross says he is convinced that his small company (which was seed funded with $1.2 million from investors including Battery Ventures and Javelin Venture Partners) has a better chance of solving this problem than any of its much larger competitors &#8212; even though that might seem to some like an excess of hubris:</p>
<blockquote id="quote-google-wont-get-this4"><p>&#8220;Google won&#8217;t get this right, Twitter won&#8217;t get this right, Facebook won&#8217;t get this quite right, Amazon won&#8217;t even get this right &#8212; the company that gets it right needs to have it in its DNA. We think this is a Trojan horse into a much bigger thing&#8230; in five years time or 10 years time, AI will be all over our daily lives, everything we interact with will be intelligent, and the interfaces to it will be completely different. Backtracking from that very distant kind of vision led us to start in this place.&#8221;</p></blockquote>
<p>There&#8217;s no question that the odds of Prismatic succeeding are almost astronomical &#8212; not only are there other older, better-funded players going after a similar goal, including News360 and Zite and even Flipboard, but Twitter and Google and Facebook are not likely to let an upstart take away what could be the future of their business.</p>
<p>If sheer devotion to wrestling with big problems is the secret, then Cross has a good head-start: Freedman says he remembers how Cross was always taking the engineers away for 18-hour hackathons to solve pressing issues with the code base. &#8220;One day I looked around the office and it was just my co-founder and I,&#8221; he says. &#8220;Everyone else was at Brad&#8217;s house, in his living room, working these crazy hours, but also just having fun. He loves a big problem.&#8221; Cross is definitely still a &#8220;mad-scientist type,&#8221; Freedman says, &#8220;but he&#8217;s also a lot more than that.&#8221;</p>
<p>When we spoke recently, Cross was busy trying to manage a hundred different aspects of being a small six-person startup: a deluge of interest based on <a href="http://gigaom.com/2012/08/23/prismatic-wants-to-conquer-the-new-frontier-mobile-news/">the launch of Prismatic&#8217;s new iPhone app</a> (an iPad app is coming soon), calls from potential acquirers, and the need to raise financing to fund the growth that he expects &#8212; or wants &#8212; to see in the future. Is it worth it, I asked? &#8220;It&#8217;s not worth it unless you have something really important that you&#8217;re trying to do,&#8221; he said. &#8220;And I think we do.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=218564&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=396523"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=396523" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/02/prismatics-bradford-cross-first-we-understand-media-then-the-world/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/brad-cross.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/brad-cross.png?w=150" medium="image">
			<media:title type="html">Brad Cross</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0bdf7ab171ade0708a11fa3378e6d8cb?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Mathew</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/screen-shot-2012-10-02-at-11-04-12-am.png?w=300" medium="image">
			<media:title type="html">Screen Shot 2012-10-02 at 11.04.12 AM</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/1_product_feeds__2329fb9d.jpg?w=604" medium="image">
			<media:title type="html">1_product_feeds__2329fb9d</media:title>
		</media:content>
	</item>
		<item>
		<title>The disappearing web: Information decay is eating away our history</title>
		<link>http://gigaom.com/2012/09/19/the-disappearing-web-information-decay-is-eating-away-our-history/</link>
		<comments>http://gigaom.com/2012/09/19/the-disappearing-web-information-decay-is-eating-away-our-history/#comments</comments>
		<pubDate>Thu, 20 Sep 2012 00:09:41 +0000</pubDate>
		<dc:creator>Mathew Ingram</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[arab spring]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[social-media]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=564770</guid>
		<description><![CDATA[The ability to distribute real-time information through social networks like Twitter is a powerful thing, but a new study points out that one of the downsides of this phenomenon is the fact that much of the content that gets linked to eventually disappears.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=218025&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>One of the characteristics of the modern media age &#8212; at least for anyone who uses the web and social media a lot &#8212; is that we are surrounded by vast clouds of rapidly changing information, whether it&#8217;s blog posts or news stories or Twitter and Facebook updates. That&#8217;s great if you like real-time content, but there is a not-so-hidden flaw &#8212; namely, that <a href="http://en.wikiquote.org/wiki/Heraclitus">you can&#8217;t step into the same stream twice</a>, as Heraclitus put it. In other words, much of that information may (and probably will) disappear as new information replaces it, and small pieces of history wind up getting lost. According to a recent study, which <a href="http://arxiv.org/abs/1209.3026">looked at links shared through Twitter about news events</a> like the Arab Spring revolutions in the Middle East, this could be turning into a substantial problem.</p>
<p>The study, which MIT&#8217;s Technology Review highlighted <a href="http://www.technologyreview.com/view/429274/history-as-recorded-on-twitter-is-vanishing-from/?ref=rss">in a recent post by the Physics arXiv blog</a>, was done by a pair of researchers in Virginia, Hany SalahEldeen and Michael Nelson. They took a number of recent major news events over the past three years &#8212; including the Egyptian revolution, Michael Jackson&#8217;s death, the elections and related protests in Iran and the outbreak of the H1N1 virus &#8212; and tracked the links that were shared on Twitter about each. Following the links to their ultimate source showed that an alarming number of them had simply vanished.</p>
<h2 id="after-two-and-a-half-years-30-">After two and a half years, 30 percent had disappeared</h2>
<p>In fact, the researchers said that within a year of these events, an average of 11 percent of the material that was linked to had disappeared completely (and another 20 percent had been archived), and after two-and-a-half years, close to 30 percent had been lost altogether and 41 percent had been archived. Based on this rate of information decay, the <a href="http://arxiv.org/abs/1209.3026">authors predicted that</a> more than 10 percent of the information about a major news event will likely be gone within a year, and the remainder will continue to vanish at the rate of .02 percent per day.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/09/screen-shot-2012-09-19-at-7-57-47-pm.png"><img  title="Twitter research chart" src="http://gigaom2.files.wordpress.com/2012/09/screen-shot-2012-09-19-at-7-57-47-pm.png?w=604&#038;h=394" alt="" width="604" height="394" class="aligncenter size-large wp-image-564774" /></a></p>
<p>It&#8217;s not clear from the research why the missing information disappeared, but it&#8217;s likely that in many cases blogs have simply shut down or moved, or news stories have been archived by providers who charge for access (something that many newspapers and other media outlets do to generate revenue). But as the Technology Review post points out, <a href="http://www.technologyreview.com/view/429274/history-as-recorded-on-twitter-is-vanishing-from/?ref=rss">this kind of information can be extremely valuable</a> in tracking how historical events developed, such as the Arab Spring revolutions &#8212; which the researchers note was the original impetus for their study, since they were trying to collect as much data as possible for the one-year anniversary of the uprisings.</p>
<p>Other scientists, and particularly librarians, have also raised red flags in the past about the rate at which digital data is disappearing. The National Library of Scotland, for example, <a href="http://www.bbc.co.uk/news/uk-scotland-18250826">recently warned that key elements of Scottish digital life</a> were vanishing into a &#8220;black hole,&#8221; and asked the government to fast-track legislation that would allow libraries to store copies of websites. Web pioneer Brewster Kahle is probably the best known digital archivist as a result of <a href="http://archive.org/about/">his Internet Archive project</a>, which keeps copies of websites dating back to the early days of the web (Kahle also has a related project called <a href="http://openlibrary.org/about">the Open Library</a>).</p>
<h2 id="getting-access-to-social-data-">Getting access to social data is not easy</h2>
<p><a href="http://gigaom2.files.wordpress.com/2011/02/2149309015_0de38248c9_z.png"><img  title="Birdhouses" src="http://gigaom2.files.wordpress.com/2011/02/2149309015_0de38248c9_z.png?w=210&#038;h=140" alt="" width="210" height="140" class="alignleft size-thumbnail wp-image-297095" /></a></p>
<p>Although the Virginia researchers didn&#8217;t deal with it as part of their study, a related problem is that much of the content that gets distributed through Twitter &#8212; not just websites that are linked to in Twitter posts, but the content of the posts themselves &#8212; is difficult and/or expensive to get to. <a href="http://gigaom.com/2011/06/01/new-twitter-search-is-nice-but-still-needs-work/">Twitter&#8217;s search is notoriously unreliable</a> for anything older than about a week, and access to the complete archive of your tweets is only provided to those who can make a special case for needing it, <a href="http://www.poynter.org/latest-news/mediawire/146785/andy-carvin-obtains-database-of-all-95000-tweets/">such as Andy Carvin</a> of National Public Radio (who is writing a book about the way he chronicled the Arab Spring revolutions).</p>
<p>As my colleague Eliza Kern noted in a recent post, an external service called Gnip <a href="http://gigaom.com/2012/09/19/for-a-price-gnip-brings-you-access-to-all-public-tweets-ever-sent/">now has access to the full archive</a> of Twitter content, which it will provide to companies for a fee. And Twitter-based search and discovery engine Topsy also has an archive of most of the full &#8220;firehose&#8221; of tweets &#8212; although it focuses primarily on content that is retweeted a lot &#8212; and <a href="http://www.usatoday.com/tech/products/story/2012-06-10/pegoraro-twitter-archive/55465622/1">provides that to companies</a> for analytical purposes. But neither can be linked to easily for research or historical archiving purposes. The Library of Congress also has an archive of Twitter&#8217;s content, but it isn&#8217;t easily accessible and <a href="http://www.buzzfeed.com/jwherrman/so-is-the-library-of-congress-still-archiving-twi">it&#8217;s not clear whether new content is being added</a> or not.</p>
<p>Twitter has <a href="http://searchengineland.com/twitter-tweet-archive-tool-coming-128537">talked about providing a service</a> that would let users download their tweets at some point, but it hasn&#8217;t said when such a thing would be available &#8212; and even if users did create their own archive in this way (or by <a href="http://gigaom.com/2011/11/17/twitter-is-a-stream-but-its-also-a-reservoir-of-data/">using tools like Thinkup</a> from former Lifehacker editor Gina Trapani) it would be difficult to link those in a way that would provide the kind of connected historical information the Virginia study is describing. And it&#8217;s not just Twitter: there is no easy way to get access to an archive of Facebook posts either, although <a href="http://www.makeuseof.com/tag/bulk-download-facebook-data-information-archives/">users in Europe can request access</a> to their own archive as a result of a legal ruling there.</p>
<p>For better or worse, much of the content flowing around us seems to be just as insubstantial as the clouds that it is hosted in, and the existing tools we have for trying to capture and make sense of it simply aren&#8217;t up to the task. The long-term social effects of this digital amnesia remain to be seen.</p>
<p><em>Post and thumbnail images <a href="http://creativecommons.org/licenses/by-sa/2.0/deed.en">courtesy</a> of Shutterstock user <a href="http://www.shutterstock.com/gallery-1040698p1.html">Ribah</a> and Flickr user <a href="http://www.flickr.com/photos/seeminglee/2149309015/">See-ming Lee</a></em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=paidcontent.org&#038;blog=33319749&#038;post=218025&#038;subd=gigaompaidcontent&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=517098"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/PaidContent_RSS_300x250&#038;sz=300x250&#038;c=517098" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/09/19/the-disappearing-web-information-decay-is-eating-away-our-history/feed/</wfw:commentRss>
		<slash:comments>30</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_111156467.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_111156467.jpg?w=150" medium="image">
			<media:title type="html">shutterstock_111156467</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0bdf7ab171ade0708a11fa3378e6d8cb?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Mathew</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/09/screen-shot-2012-09-19-at-7-57-47-pm.png?w=604" medium="image">
			<media:title type="html">Twitter research chart</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/02/2149309015_0de38248c9_z.png?w=210" medium="image">
			<media:title type="html">Birdhouses</media:title>
		</media:content>
	</item>
	</channel>
</rss>
