Tag Archives: DiscoverText

Dwindling Osama bin Laden Tweets and the RT Champs

Posted on June 1, 2011 by Stuart Shulman

The running count in my DiscoverText “bin Laden” project is ~4.5 million unsharable Tweets. Though we can’t share them, we can describe them. One of the interesting features of this dataset is the rapidly dwindling Tweet rate over the month … Continue reading →

Posted in general | Tagged Bin Laden, Data Mining, DiscoverText, ReTweet, Texifter, Text Analytics, twitter | 2 Comments

Connect Existing Facebook & DiscoverText Accounts

Posted on May 24, 2011 by Mark J. Hoy

Many people have asked us “How do I import Facebook data if I have a regular DiscoverText Account?” – The short answer is that there is no way to pull in Facebook feeds within DiscoverText unless you register and log in … Continue reading →

Posted in DiscoverText, product | Tagged DiscoverText, Facebook, Import Data, Texifter, Text Analytics | 1 Comment

New DiscoverText Import Available: Congressional Bills Via GovTrack

Posted on May 18, 2011 by Mark J. Hoy

Tonight we’ve added a new import option to DiscoverText: any user with a Professional or Enterprise license (or the 30-day free trial license) can now directly import data on Federal Congressional bills. Thanks to the … Continue reading →

Posted in DiscoverText, product | Tagged Congressional Bills, DiscoverText, GovTrack API, Sunlight Labs API, Texifter | 2 Comments

Coding Text – Part Three

Posted on May 17, 2011 by Stuart Shulman

Researchers interested in large text collections and their itinerant coders tend to muddle through with limited collaborative, cross-disciplinary resources upon which to draw. The generic criteria for high-quality codebook construction and effective coding are underdeveloped, even as the tools and … Continue reading →

Posted in general | Tagged Code Text, Data Mining, DiscoverText, Machine Classifiers, Machine Learning, Social Media, statistics, Texifter, Text Analysis | 1 Comment

The Return of Google Reader Feeds in DiscoverText

Posted on May 11, 2011 by Mark J. Hoy

Several months ago, Google Reader removed the ability to directly get an RSS feed of the feeds in your reader account. Because of this, we had to take the ingestion of Google Reader feeds offline inside of … Continue reading →

Posted in DiscoverText, product | Tagged analytics, API, Data Mining, DiscoverText, Google Reader, Social Media, Texifter | Comments Off

Coding Text Using the “QDAP Method” – Part One

Posted on May 10, 2011 by Stuart Shulman

We did it! The free, open source, Web-based, university-hosted, FISMA-compliant “Coding Analysis Toolkit” (CAT) recorded its one millionth coding choice. Pretty much all the credit goes to Texifter CTO and chief CAT architect Mark Hoy, who has put in many … Continue reading →

Posted in general | Tagged Code Text, Data Mining, DiscoverText, Machine Classifiers, Machine Learning, Social Media, statistics, Texifter, Text Analysis | 1 Comment

CAT on the Brink of 1 Million Recorded Coding Choices

Posted on May 7, 2011 by Stuart Shulman

Texifter manages the Coding Analysis Toolkit (CAT), which is a free, open source, Web-based and FISMA-compliant system launched in the fall of 2007 and hosted by the University of Pittsburgh. CAT is the precursor to PCAT and DiscoverText. This is … Continue reading →

Posted in general | Tagged Adjudication, analytics, CAT, Data Mining, DiscoverText, FISMA-Compliant, Social Media, Texifter | Comments Off

Twitter and History March On

Posted on May 7, 2011 by Stuart Shulman

The Twistory saga continues. The tech news is full of stories these days about firms delivering social media brand management and public opinion results, all magically derived from massive tweet collections. Whether prudent or not, people want to know what … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | 1 Comment

Throttled, or Not?

Posted on May 6, 2011 by Stuart Shulman

I was watching TweetDeck rattle off reactions to the end of “Twistory” and the “Dustin of Twistory” when my son came in and asked: Why is your tweet stream being throttled? I had missed the message in small print, but … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | Comments Off

Data We Can Share

Posted on May 6, 2011 by Stuart Shulman

Since we had to take down our 1.2 million “Osama bin Laden” tweets, we substituted data we can share gathered from Facebook to satisfy the curiosity of researchers who don’t normally handle big data but might want to dip their … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | Comments Off

The First Draft of ‘Twistory’ Revisited – An Update on (Not) Sharing Twitter Collections

Posted on May 5, 2011 by Stuart Shulman

Researchers like new datasets. Many of us build tools and techniques that work nicely with existing data, but may perform poorly with “out-of-sample” datasets. The ability to generate new and interesting big datasets, especially ones that draw a crowd of … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Social Media, Texifter, Twistory, twitter, Twitter API | 2 Comments

1.2 Million “osama” & “bin laden” Tweets…and Counting

Posted on May 4, 2011 by Stuart Shulman

On the evening of May 1, 2011, when it was announced that Osama bin Laden had been killed, we started running repeated fetches against the Twitter API for the terms “osama” and “bin laden”. On May 3, we posted more than 1.2 … Continue reading →

Posted in general | Tagged Bin Laden, Data Mining, DiscoverText, Machine Learning, R&D, Text Analytics, Twistory, Twitter API | 6 Comments

Scraping Facebook

Posted on April 20, 2011 by Stuart Shulman

Using a core element of the Facebook architecture (the “Graph API”), we have enabled DiscoverText users to “Connect with Facebook” to register their new accounts. You can learn about this option and how to collect data off Facebook by watching … Continue reading →

Posted in general | Tagged API, Data Mining, DiscoverText, Facebook, Facebook API, Machine Learning, R&D, Social Media | 1 Comment
