Category Archives: general

Bin Laden, Oil & Foreign Policy

Immediately following President Obama’s speech on the evening of May 1 confirming the execution of Osama bin Laden, the DiscoverText team began collecting all Twitter posts that contained the keywords “Osama” and “Bin Laden.” The bin Laden project collected … Continue reading

Posted in general | Comments Off

FB Graph API Revisited

Following the blog post about oddities in the Facebook importing that Stu was experiencing while visiting the DMI summer school, I put on my detective hat and went looking for possible reasons why things that should be available from the … Continue reading

Posted in general | Comments Off

Capturing Social Media Dissent

Immediately after the judge read the not-guilty verdict in the high-profile Casey Anthony criminal case, total internet traffic to major news outlets doubled and, most importantly, outsiders took to social media outlets to display their emotions and opinions. … Continue reading

Posted in general | Comments Off

Making Sense of ESPN Tweets

Should you ever want to visualize the definition of “unstructured data”, there is no need to look beyond the beautiful chaos that is 503,000 ESPN tweets all harvested using DiscoverText. It would be an understatement to call an archive of … Continue reading

Posted in general | Comments Off

Top #GameofThrones Tweeters

Below is another in our series of training videos. In this episode, we introduce you to a great new feature for peeking into the list of top values in a particular metadata field. In the example here, I show … Continue reading

Posted in general | 2 Comments

Monitor Middle East Protests

Dear faithful users and intrigued future users of DiscoverText: My name’s Josh and I’m one of three user support specialists at Texifter LLC. For my first Texifter blog entry, I’m going to demonstrate how I’ve been using DiscoverText to capture … Continue reading

Posted in general | 3 Comments

A Classifier for the Masses

DiscoverText took a leap forward a few weeks ago with the addition of a beta text classifier from the developers over at uClassify (www.uclassify.com). Integration of this tool into a one-of-a-kind active-learning system inside DiscoverText allows users to create and … Continue reading

Posted in general | 2 Comments

70,000 Game of Thrones Tweets

I wanted to test the new Cloud Explorer developed by Texifter as part of the DiscoverText suite of text analysis tools. Since the revisions to the old tag cloud tool were inspired by comments from a Game of Thrones fan, … Continue reading

Posted in general | Comments Off

Download the Recent DiscoverText Webinar

For the next 14 days, you can download and replay the entire 145 MB DiscoverText Webinar. Your PC may require a codec from GoToMeeting to play the file properly. Many thanks to Stranded Wind for a very generous evaluation! Tweet … Continue reading

Posted in general | Comments Off

Next Free DiscoverText Webinar Tuesday 12 PM EST

Register here for the next live, free, interactive training Webinar with Dr. Shulman. This Webinar introduces new and existing DiscoverText users to the basic document ingest, search & code features, takes your questions, and demonstrates our newest tool, a machine-learning … Continue reading

Posted in general | Comments Off

Dwindling Osama bin Laden Tweets and the RT Champs

The running count in my DiscoverText “bin Laden” project is ~4.5 million unsharable Tweets. Though we can’t share them, we can describe them. One of the interesting features of this dataset is the rapidly dwindling Tweet rate over the month … Continue reading

Posted in general | 2 Comments

Coding Text – Part Three

Researchers interested in large text collections and their itinerant coders tend to muddle through with limited collaborative, cross-disciplinary resources upon which to draw. The generic criteria for high-quality codebook construction and effective coding are underdeveloped, even as the tools and … Continue reading

Posted in general | 1 Comment

Coding Text – Part Two

In Part One of the series “Coding Text the QDAP Way,” I wrote about the problem of idiosyncratic annotation and the lack of diverse, interesting and re-usable annotated data sets. Providing data for replication (when possible) is a requisite for … Continue reading

Posted in general | Comments Off

Coding Text Using the “QDAP Method” – Part One

We did it! The free, open source, Web-based, university-hosted, FISMA-compliant “Coding Analysis Toolkit” (CAT) recorded its one millionth coding choice. Pretty much all the credit goes to Texifter CTO and chief CAT architect Mark Hoy, who has put in many … Continue reading

Posted in general | 1 Comment

CAT on the Brink of 1 Million Recorded Coding Choices

Texifter manages the Coding Analysis Toolkit (CAT), which is a free, open source, Web-based and FISMA-compliant system launched in the fall of 2007 and hosted by the University of Pittsburgh. CAT is the precursor to PCAT and DiscoverText. This is … Continue reading

Posted in general | Comments Off