This is a mirror of official site: http://jasper-net.blogspot.com/

How Do You Visualize 100 GB of Google Text Data?

| Wednesday, January 12, 2011
There is an amazing series of charts that visualizes trigrams and bigrams, portions of sentences that have been extracted from Google's web data set. The graphs highlight word associations and the frequency with which we use them on web pages. Chris Harrison from Carnegie Mellon University found, for example, that the word 'he' is often tied to 'argues,' while 'she' is found often with 'loves.' There are also word-relation charts that highlight words used in combination with their opposites, such as good and bad, peace and war, and PC and Mac

Read more: Slashdot

Posted via email from .NET Info

0 comments: