Tag Archives: Data Mining

Coding Text Using the “QDAP Method” – Part One

Posted on May 10, 2011 by Stuart Shulman

We did it! The free, open source, Web-based, university-hosted, FISMA-compliant “Coding Analysis Toolkit” (CAT) recorded its one millionth coding choice. Pretty much all the credit goes to Texifter CTO and chief CAT architect Mark Hoy, who has put in many … Continue reading →

Posted in general | Tagged Code Text, Data Mining, DiscoverText, Machine Classifiers, Machine Learning, Social Media, statistics, Texifter, Text Analysis | 1 Comment

CAT on the Brink of 1 Million Recorded Coding Choices

Posted on May 7, 2011 by Stuart Shulman

Texifter manages the Coding Analysis Toolkit (CAT), which is a free, open source, Web-based and FISMA-compliant system launched in the fall of 2007 and hosted by the University of Pittsburgh. CAT is the precursor to PCAT and DiscoverText. This is … Continue reading →

Posted in general | Tagged Adjudication, analytics, CAT, Data Mining, DiscoverText, FISMA-Compliant, Social Media, Texifter | Comments Off

Twitter and History March On

Posted on May 7, 2011 by Stuart Shulman

The Twistory saga continues. The tech news is full of stories these days about firms delivering social media brand management and public opinion results, all magically derived from massive tweet collections. Whether prudent or not, people want to know what … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | 1 Comment

Throttled, or Not?

Posted on May 6, 2011 by Stuart Shulman

I was watching TweetDeck rattle off reactions to the end of “Twistory” and the “Dustin of Twistory” when my son came in and asked: Why is your tweet stream being throttled? I had missed the message in small print, but … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | Comments Off

Data We Can Share

Posted on May 6, 2011 by Stuart Shulman

Since we had to take down our 1.2 million “Osama bin Laden” tweets, we substituted data we can share gathered from Facebook to satisfy the curiosity of researchers who don’t normally handle big data but might want to dip their … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | Comments Off

The First Draft of ‘Twistory’ Revisited – An Update on (Not) Sharing Twitter Collections

Posted on May 5, 2011 by Stuart Shulman

Researchers like new datasets. Many of us build tools and techniques that work nicely with existing data, but may perform poorly with “out-of-sample” datasets. The ability to generate new and interesting big datasets, especially ones that draw a crowd of … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | 2 Comments

1.2 Million “osama” & “bin laden” Tweets…and Counting

Posted on May 4, 2011 by Stuart Shulman

On the evening of May 1, 2011, when it was announced that Osama bin Laden had been killed, we started running repeated fetches against the Twitter API for the terms “osama” and “bin laden”. On May 3, we posted more than 1.2 … Continue reading →

Posted in general | Tagged Bin Laden, Data Mining, DiscoverText, Machine Learning, R&D, Text Analytics, Twistory, Twitter API | 6 Comments

Scraping Facebook

Posted on April 20, 2011 by Stuart Shulman

Using a core element of the Facebook architecture (the “Graph API”), we have enabled DiscoverText users to “Connect with Facebook” to register their new accounts. You can learn about this option and how to collect data off Facebook by watching … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Facebook, Facebook API, Machine Learning, R&D, Social Media | 1 Comment

Text Analysis during the 2011 State of the Union Address

Posted on February 3, 2011 by Mark J. Hoy

As part of the underlying research Texifter is doing on sentiment and topic analysis, we collected data from various Twitter and Facebook feeds in the 48 hours before and after the 2011 State of the Union address on Tuesday, January 25, 2011. Texifter … Continue reading →

Posted in general, research | Tagged API, Congressional Bills, Data Mining, DiscoverText, Obama, R&D, Social Media, State of the Union, Texifter, Text Analytics | 1 Comment
