Feeds:
Posts
Comments

Posts Tagged ‘words’

This new visualization is again more a ‘digital art’ creation than a data visualization. All words of the bible are added one by one as spheres. If the word is already on the screen, the size of the sphere grows. Over time spheres shrink, so that more frequent words ‘survive’. The words ‘the’ and ‘and are omitted because they are very frequent.

This experiment is created with processing and you can see it live on openprocessing.org. As physics engine JBox2D is used.

http://www.openprocessing.org/visuals/?visualID=8516

Read Full Post »

So the bible counts 12.687 distinct words.

But what are the most frequent used words ? Here the graph (made in excel).


Not very intersting. Of course the most frequent words are ‘the’, ‘and’, etc… The word ‘Lord’ is the first ‘meaningful’ word on the 14th place. ‘God’ is 28th, ‘Isreal’ is 49th and ‘Jesus’ is only 112th. Get the full in list below.

Then I tried to visualize the top ‘meaningful’ words and their occurence in he bible. I got inspired by the wunderfull ‘box office’ visualization made by the New York Times. To generate the data I used the tool processing again (it is also great for non-visual data processing) and made the graph in excel. To ease the graph lines (less irrgular peaks) i used moving averages.

Resources:
All 12.687 words with frequency count: view/download.

Read Full Post »

In this second visualisation I try to visualise the whole bible in one picture. To create the pictures i made use of a very handy application called ‘processing’ (http://processing.org). It is a simplified (but very powerfull) programming environment based on java and it’s “ideal for artists”. Check it out.

Every pixel in this 2000×500 picture represente 1 word. The color of the pixel is derived from the letters of the word and should be unique for every word (as we can show 16mio collors, that is no problem because we only have 12.687 distinct words).

This picture is not the original one as it is to heavy to put on the site. Actually it is a standard compressed jpeg so some quality is lost (todo: I should make a lossless copy available somehow).

If you look closely (click to enlarge) you can see variations of colors (horizontal bands) throuhout the picture.

In the next picture (clik to enlarge) you only see the pixels for the word “God”. I could try to make an online application where you can enter a word and get the picture visualizing the words occurence (If you want me to make you one in the meantime, just send me a message).

Read Full Post »

Day 1 : The basics

Number of words

Google Chart 'Number of words'

I start off with a very simple visualization. The number of words in the bible.

All visualizations will be based on the openly available text of the bible with the folowing reference: The King James Bible from Bill McGinnis Ministries (http://patriot.net/~bmcgin/kjvpage.html).

I use a few java programs (java.sun.com) to process the text in different ways and make calculations. The text processing goes very fast (few seconds to process the text). I devlop he programs using the eclipse IDE (www.eclipse.org)

This visualization was made with Google Charts API, an open source graphing solution by Google (http://code.google.com/intl/nl/apis/chart/image_charts.html).

The exact number of words in (this version of) the bible is 789.173. This number is generated by 12.687 distinct words. The most popular distinct word is ‘the’ and is mentioned 63.924 times. TODO : I should make a visualization on the most common words.

For some strange reasons, the graph api always relates values to a 0-1 range, and therefore we cannot show the actual value on top of the bar.

The parameters for this graph are :

cht=bvg
chs=200×300
chma=100,50,50,20
chtt=Number_of_words
chts=000000,16
chf=bg,lg,90,FFE7C6,0,76A4FB,0.5
chxt=x,y
chxr=1,0,1000000,100000
chd=t:78.9173
chxt=x,y
chxs=0,000000,12,0,lt
1,000000,12,1,lt

All documentation can by found at Google Code, Google Chart Api.

Read Full Post »