Since we had to take down our 1.2 million “Osama bin Laden” tweets, we have substituted data gathered from Facebook that we are able to share. It should satisfy the curiosity of researchers who don’t normally handle big data but might want to dip a toe in. The new sample datasets include comments on the Target Facebook page, as well as the Wikileaks and Speaker Boehner Facebook pages.
The biggest dataset we have from Facebook consists of over 830,000 comments on the official White House Facebook page. This data was downloaded via the Graph API, which is a key component of the Facebook data sharing architecture. At Texifter, we are big fans of the data streams made possible by the Graph API.
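For readers curious what downloading comments via the Graph API might look like, here is a minimal Python sketch. It is not the code we used to build these datasets. It assumes the commonly documented pattern of a `/{object-id}/comments` edge that returns a `data` list plus a `paging.next` URL; the base URL, the `limit` parameter, and the helper names (`fetch_json`, `first_page_url`, `collect_comments`) are illustrative assumptions, and a real request needs a valid access token with the right permissions.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL; current Graph API versions may require a versioned path.
GRAPH_ROOT = "https://graph.facebook.com"


def fetch_json(url):
    """Fetch a URL and decode its JSON body (needs network access and a valid token)."""
    with urlopen(url) as response:
        return json.load(response)


def first_page_url(object_id, access_token, limit=100):
    """Build the URL for the first page of comments on a post or page object."""
    query = urlencode({"access_token": access_token, "limit": limit})
    return f"{GRAPH_ROOT}/{object_id}/comments?{query}"


def collect_comments(start_url, fetch=fetch_json):
    """Follow 'paging.next' links until no further page is advertised.

    `fetch` is injectable so the paging logic can be exercised offline.
    """
    comments = []
    url = start_url
    while url:
        page = fetch(url)
        comments.extend(page.get("data", []))
        # Stop when the response carries no 'next' link.
        url = page.get("paging", {}).get("next")
    return comments
```

With a real token, something like `collect_comments(first_page_url("<object-id>", "<access-token>"))` would return one dictionary per comment. For a page with hundreds of thousands of comments, you would also want rate-limit handling and incremental writes to disk rather than one in-memory list.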