Skip main navigation
We use cookies to give you a better experience, if that’s ok you can close this message and carry on browsing. For more info read our cookies policy.
We use cookies to give you a better experience. Carry on browsing if you're happy with this, or read our cookies policy for more information.

Mining for patterns

So now we have a huge pile of data, how do we extract useful information from this pile? It may be hard when you already know what you are looking for but much harder when you have no clue.

Let me tell you a little about my case study. The problem was a very simple one and to be honest, the dataset was small and the complexity of the data was very limited. But what was important is that the approach would also work for far larger datasets and more complex situations.

We were looking for a way to compare the spatial-temporal diffusion patterns of Measles outbreaks on Iceland. Do epidemics always diffuse in a similar way? When this is the case, this information would be very useful for future prediction. Prior to starting this research we thought that perhaps it would be possible that there are differences between epidemics, but that certain medical districts would always behave in a particular way (early or late infection - short or longer infection). This information would also be very useful. It is important to realize that before we started the research we had no idea how we should cluster the data.

We have chosen this case study (Measles in Iceland) for a number of reasons:

  • There was a long time series available
  • A relatively small dataset makes the development of a method simpler
  • Because Iceland is an island the situation (like population) has been stable for a long time

Eventually we decided to use the learning algorithm SOM (Self Organizing Map). The following steps will explain how the analysis were performed. You can also go to the article and read the full text, but before doing so, check the next step that introduces you to data mining, and learning algorithms.

Share this article:

This article is from the free online course:

Geohealth: Improving Public Health through Geographic Information

University of Twente

Get a taste of this course

Find out what this course is like by previewing some of the course steps before you join:

Contact FutureLearn for Support