Last night Ben gave a presentation on Data visualisation to the crowd at Ignite Melbourne.

Update – Video of Flink Labs’ talk at Ignite Melbourne is now available on youtube.
The PDF of his slides are available at http://flinklabs.com/slides/ignitemelbourne2010.pdf. Below are the preparation notes from the talk.
Thank you to all the organisers and audience for the wonderful event.
Lies, damned lies and MySchool
A talk given at Ignite Melbourne 2010 by Ben Hosken
Good evening. My name is Ben Hosken and I’m a founder of Flink Labs, a data visualisation studio based here in Melbourne. Tonight I’m going to talk about data visualisation, what it is, why its starting to take off and some of the potential problems we as viewers and consumers of data visualisations need to be aware of.
Data visualisation is the process of taking complex multivariate datasets and creating a visual representation of the data. Be it an interactive map, an animated infographic, or a tool for data exploration, data visualisation is at the sweet spot of computing, maths and design.
Data visualisation can be used as a tool for discovery and self exploration of datasets, potentially acting as the interface to the data. And it can also be used to communicate and engage your audience around a set of data.
All data has a point of view and a story to tell, often many stories. Data visualisation can be seen as the process of discovering and telling those stories. The stories can often be very good but even great story tellers can create bad stories.
Data visualisation can also be used as propaganda. Through deliberate misrepresentation and distortion, data visualisation can be used like many media as a tool to influence inappropriately.
So what’s happened recently that is causing the growth in data visualisation?
Our brains are hard wired to handle visual inputs. A very large amount of our brain, the visual cortex, is devoted to processing and understanding visual inputs. And our memory handles visual information better as well. We can recall information presented visually over 6 times as well as non visual information.
As growth in size and complexity of datasets in social networks, government, non-government and corporations accelerates, existing tools such as spreadsheets are no longer catering to the visualisation demands to the creators and audience.
The massive growth in computer power on devices, such as the iphone, upcoming ipad and the laptops we use on our desks, opens up the opportunity for rich interactive visualisations to be used to explore and engage with our data.
And people’s expectations are changing, as data rich experiences such as World of Warcraft improve people’s ability to process complex sets of visual data, the expectation is that these consumer experiences migrate into our day to day work and life.
The best data visualisations provide a point of view, guiding the audience around the data AND give the viewers the ability to interact with the visualisation crafting their own narrative for self exploration of the data.
But there can also be problems. Problems with the data, the way it is analysed, the way it is interpreted and the way it is represented. These problems can be created by mistake or can be intentional distortions aimed to misinform the viewer.
Over emphasis on one measure within the data, often “the average”, can lead to visualisations that ignore interesting outliers and emerging trends. It is important to understand the goal of the visualisation and the unique characteristics of the data when choosing what to visualise.
Appearing as an authoritative measure on a mainstream news show, this pie chart shown on Fox news in America is the worst example of data misrepresentation, not only is a 3D pie chart a poor method to use but in this case the raw data just doesn’t add up.
The question of “compared to what” is at the heart of any good visualisation. In this case, Senator Fielding has shown only the last 10 years of climate change data to bolster his point of view. By quoting data out of context, he distorts and misrepresents in this piece of propaganda.
Launched just over a month ago, the mySchool website gives parents of school aged children a way of viewing their schools NAPLAN test performance against nearby schools and schools of similar socio economic standing.
Based on a single 2 hour test taken every two years, the NAPLAN score provides an exceptionally narrow representation of what constitutes a good school.
With no ability to compare local schools, explore how different types of schools perform, see improvement over time and an overemphasis on averages, it has unfortunately created a service with lots of data but poor representation and data visualisation.
Creators have the responsibility to be ethical and not distort the data they are visualising. Numbers should add up, data should be quoted IN context, and the raw data should be meaningful and accurate.
And consumers of data visualisations should think critically about these new representations. Just as we have learnt to be critical of written pieces, we need to develop the skills to think visually and be critical of what can be a very powerful and persuasive medium.
In a time when we all seek to engage with our customers in more meaningful ways; data visualisation can help audiences to understand our products, services, and data in ways not previously possible.
As the volume and complexity of data increases, data visualisation can help you move beyond the endless rows and columns of faceless data, helping you communicate and engage with your audience. It’s time to move beyond the bar chart.
Thank You.


