<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The DoubleThink &#187; tufte</title>
	<atom:link href="http://thedoublethink.com/tag/tufte/feed/" rel="self" type="application/rss+xml" />
	<link>http://thedoublethink.com</link>
	<description>The Art &#38; Science of the New Marketing</description>
	<lastBuildDate>Wed, 23 Jun 2010 22:04:28 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Tufte&#8217;s principles</title>
		<link>http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/</link>
		<comments>http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/#comments</comments>
		<pubDate>Wed, 05 Aug 2009 12:00:22 +0000</pubDate>
		<dc:creator>Dimitri</dc:creator>
				<category><![CDATA[Data Visualization]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[tufte]]></category>

		<guid isPermaLink="false">http://thedoublethink.com/?p=49</guid>
		<description><![CDATA[
 
 
 
Monday I wrote a post about a very annoying graph that keeps popping up in Keynote presentations.  Edward Tufte invented the concept of Chart Junk.  His most famous work &#8211; The Visual Display of Quantitative Information was published in 1983.  It is probably the most important book ever written on data visualization.  It made the [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"><a rel="attachment wp-att-245" href="http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/tufte-book/"><img class="alignnone size-full wp-image-245" title="tufte-book" src="http://thedoublethink.com/wp-content/uploads/2009/04/tufte-book.jpg" alt="tufte-book" width="236" height="286" /></a></span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Monday I wrote a post about a <a href="http://thedoublethink.com/2009/08/keynote-chart-junkkeynote-chart-junk/" target="_blank">very annoying graph</a> that keeps popping up in Keynote presentations.  Edward Tufte invented the concept of Chart Junk.  His most famous work &#8211; <em style="mso-bidi-font-style: normal;">The Visual Display of Quantitative Information was </em>published in 1983.<span style="mso-spacerun: yes;">  </span>It is probably the most important book ever written on data visualization.<span style="mso-spacerun: yes;">  I</span>t made the top 100 non-fiction books of the 20<sup>th</sup> century on Amazon.com.  It is packed with examples of best and worst practices in the history of data visualization.<span style="mso-spacerun: yes;">  </span>Even the book itself is designed beautifully.<span style="mso-spacerun: yes;">  </span>Tufte wanted its design to follow the principles it put forward.<span style="mso-spacerun: yes;">  </span>He therefore decided to publish it himself, having to take a 2<sup>nd</sup> mortgage to finance it.<span style="mso-spacerun: yes;">  </span>You have to admire the man’s determination!<span style="mso-spacerun: yes;">  </span>In this book Tufte laid out his key data visualization principles.<span style="mso-spacerun: yes;">  </span></span></span><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"></span> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Graphical Integrity</span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="text-decoration: none;"></span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"></span> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Visual representations of data must tell the truth.<span style="mso-spacerun: yes;">  </span>Tufte shows a whole range of graphs that either over or under represent the effects in the data.<span style="mso-spacerun: yes;">  </span>He does this by calculating a graph’s Lie Factor which can be calculated by dividing the size of the effect shown in the graphic by the size of the effect in the data.<span style="mso-spacerun: yes;">  </span>If the Lie Factor is greater than 1 the graph overstates the effect.<span style="mso-spacerun: yes;">  </span>Tufte goes on to list the following 6 principles of graphical integrity : </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">1. </span></span><span style="font-family: Arial; mso-fareast-font-family: Arial;"><span style="mso-list: Ignore;"><span style="font: 7pt &quot;Times New Roman&quot;;"> </span></span></span><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">The representation of numbers, as physically measured on the surface of the graph itself, should be directly proportional to the numerical quantities represented</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">2. Clear, detailed and thorough labeling should be used to defeat graphical distortion and ambiguity.<span style="mso-spacerun: yes;">  </span>Write out explanations of the data on the graph itself.<span style="mso-spacerun: yes;">  </span>Label important events in the data.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">3. Show data variation, not design variation.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">4. In time-series displays of money, deflated and standardized units of monetary measurement are nearly always better than nominal units.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">5. The number of information carrying (variable) dimensions depicted should not exceed the number of dimensions in the data.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Graphics must not quote data out of context.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Here is one of Tufte’s examples of a graph with low graphical integrity.<span style="mso-spacerun: yes;">  </span>According to Tufte the Lie Factor of this graph is 14.8.<span style="mso-spacerun: yes;">  </span>A numerical change of 53% is represented by a graphical change (size of horizontal lines) of 783%.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"><a rel="attachment wp-att-50" href="http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/integrity/"><img class="alignnone size-medium wp-image-50" title="integrity" src="http://thedoublethink.com/wp-content/uploads/2009/04/integrity-300x164.jpg" alt="integrity" width="337" height="174" /></a></span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"></strong> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Data-Ink</span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Data Ink is the ink on a graph that represents data.<span style="mso-spacerun: yes;">  </span>Tufte claims that good graphical representations maximize data-ink and erase as much non-data-ink as possible.<span style="mso-spacerun: yes;">  </span>He put forward the data-ink ratio which is calculated by 1 minus the proportion of the graph that can be erased without loss of data-information.<span style="mso-spacerun: yes;">  </span>He puts forward the following 5 principles related to data ink : </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">1. Above all else show data.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">2. Maximize the data-ink ratio.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">3. Erase non-data-ink.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">4. Erase redundant data-ink.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">5. Revise and edit</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Tufte tests these principles on a whole range of examples to come up with a wide range of fresh designs that dramatically improve the legibility of the graphs.<span style="mso-spacerun: yes;">  </span>Here is an example with a very high data-ink ratio.<span style="mso-spacerun: yes;">  </span>It’s an electroencephalogram – a graph that records the electrical activity from the brain.<span style="mso-spacerun: yes;">  </span>This graph would have a data-ink ratio of 1.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"><a rel="attachment wp-att-51" href="http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/data-ink/"><img class="alignnone size-medium wp-image-51" title="data-ink" src="http://thedoublethink.com/wp-content/uploads/2009/04/data-ink-300x252.jpg" alt="data-ink" width="300" height="252" /></a></span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Chartjunk</span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Tufte has a whole chapter dedicated to what he calls Chartjunk – the excessive and unnecessary use of graphical effects in graphs.<span style="mso-spacerun: yes;">  </span>He calls out moiré vibration, heavy grids and self-promoting graphs that are used to demonstrate the graphic ability of the designer rather than display the data.<span style="mso-spacerun: yes;">  </span>This is according to Tufte possibly the worst graph ever : “<em style="mso-bidi-font-style: normal;">A series of weird three-dimensional displays appearing in the magazine of American Education in the 1970’s delighted the connoisseurs of the graphically preposterous.<span style="mso-spacerun: yes;">  </span>Here five colors report, almost by happenstance, only 5 pieces of data (since the division within each adds to 100%).<span style="mso-spacerun: yes;">  </span>This may well be the worst graphic ever to find its way into print.”</em></span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> <a rel="attachment wp-att-56" href="http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/worst-graph/"><img class="alignnone size-full wp-image-56" title="worst-graph" src="http://thedoublethink.com/wp-content/uploads/2009/04/worst-graph.jpg" alt="worst-graph" width="300" height="514" /></a></span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Data Density</span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"></strong> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">The data density of a graph is the proportion of the total size of the graph that is dedicated displaying data.<span style="mso-spacerun: yes;">  </span>Tufte prefers high data density graphs.<span style="mso-spacerun: yes;">  </span>He wants us to maximize data density and the size of the data matrix within reason.<span style="mso-spacerun: yes;">  </span>One way of achieving this he claims is through the Shrink Principle.<span style="mso-spacerun: yes;">  </span>He claims that most graphs can be shrunk way down without losing legibility or information.</span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"><span style="text-decoration: underline;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Small Multiples</span></span></span></strong></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><strong style="mso-bidi-font-weight: normal;"></strong> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;">Small multiples are series of the same small graph repeated in one visual.<span style="mso-spacerun: yes;">  </span>Tufte says that small multiples are a great tool to visualize large quantities of data and with a high number of dimensions.<span style="mso-spacerun: yes;">  </span>Below is an example that usesTufte’s sparklines to create a graph of small multiples.<span style="mso-spacerun: yes;">  </span>According to Tufte sparklines are data-intense, design-simple, word-sized graphics.<span style="mso-spacerun: yes;">  </span>Here is an example of how they are use to show movement in stocks.<span style="mso-spacerun: yes;">  </span>This graph has pretty high data density as well. </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"> </p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><a rel="attachment wp-att-57" href="http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/sparklines/"><img class="alignnone size-full wp-image-57" title="sparklines" src="http://thedoublethink.com/wp-content/uploads/2009/04/sparklines.jpg" alt="sparklines" width="559" height="361" /></a></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
<p class="MsoNormal" style="margin: 0in 0in 0pt;"><span style="font-family: Arial; mso-bidi-font-family: 'Times New Roman';"><span style="font-size: small;"> </span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://thedoublethink.com/2009/08/tufte%e2%80%99s-principles-for-visualizing-quantitative-information/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
