4.6

## University of Twente

Skip to 0 minutes and 6 secondsWelcome to the second part of this video lecture on disease mapping of spatial analysis. In the following slides, you will be introduced to some of the methods of spatial smoothing, which can be considered when mapping disease rates. When we want to smooth out disease data, the first thing we need to do is to conceptualize what we consider to be our spatial surrounding. How this works, I will show using the following example. Consider the following five municipalities, called A, B, C, D, and E. Each of these municipalities share some borders with their neighboring municipalities, while some do not share borders. This can be conceptualized using the proximity matrix shown in the lower right side.

Skip to 0 minutes and 46 secondsHere, each row considers a municipality, and each of the column represents its neighbors. So for example, A shares a border with B and D, and does not share a border with A, C, and E. And the proximity matrix is represented by a 1 for B and D and a 0 for A, C, and E. This means that we believe that A might be influenced by what happens in B and D. The strength of this association and how this influence effects A is given in a spatial weights matrix.

Skip to 1 minute and 21 secondsThe effect of one area on its neighboring areas is defined by the spatial weights. Spatial weights can be calculated in different ways. The first method we commonly use is by taking the inverse of the distance between two neighboring areas, or more commonly, the inverse of the square distance between two neighboring areas. The second method to define spatial weights is by adjacency. In this case, an area can either be influenced or not be influenced by any of its spatial neighbors. Adjacency is defined as the order of adjacency. For example, you have direct neighbors or neighbors of your neighbors which might influence you. The third method to calculate spatial weight is by the length of the shared border.

Skip to 2 minutes and 3 secondsHere, we see a map of the Northwestern Europe with the Netherlands showing at the upper left side. The Netherlands shares 2/3 of its border with Germany and about 1/3 third of its border with Belgium. Its spatial weights, in this case would be 2/3 of Germany and 1/3 of Belgium. Germany, in contrast, shares its border with a lot of other countries. So the effect which Netherlands has on Germany will be far smaller than the effect of Germany on the Netherlands. The fourth and final methods which we present here is by interaction, or gravity model. In this case, the effect of one area on its neighbors is determined by a property of that specific area. For example, the population size.

Skip to 2 minutes and 46 secondsA larger population having a larger influence on its neighboring areas than a smaller population size.

Skip to 2 minutes and 54 secondsNow, we have conceptualized a spatial neighborhood. And if given some examples on how to define spatial weights, we can continue to calculate some smoothing averages. This formula shows you how to calculate a smooth of local attribute values. The value mu, which is the expected rate in a certain area, for example, a disease incidence rate, equals the sum of incidence rates in its surrounding neighboring areas multiplied by their relative weights, divided by the sum of all weights.

Skip to 3 minutes and 27 secondsHow this works is shown in the following example. Consider again the five municipalities, A, B, C, D and E, with values 1, 3, 4, 2 and 6. To calculate the expected value for example for area A, we use to spacial proximity matrix, which is shown in the middle. Here, the spatial weights are based on the length of the shared border. So for example, A shares 70% of its border with B at 30% of its border with D. Now to calculate the expected value for A, we take 70% times the value of B, which is 3, and 30% times the value of D, which is 2. Adding these up comes to the final expected value for area A, 2.7.

Skip to 4 minutes and 17 secondsAnother, and more sophisticated method to calculate smooth rates is by the method of the empirical Bayes. Here, global smoothers can be used to adjust each area estimate towards a global mean. Likewise, local smoothers based on the directly neighboring areas can be used to adjust each area estimate up or down, depending on the data in neighboring areas.

# Analysis of disease rates - need for spatial smoothing

A problem when mapping rates directly is that for small areas with few inhabitants, these rates are very unstable. This problem can be overcome by generating a smoothed map.

The first step in the map smoothing process is the construction of a proximity matrix. A proximity matrix defines the neighbors for each spatial unit, which are the units that share a common boundary (for example, which district is a direct neighbor of which district). In the proximity matrix each row in the matrix represents a spatial area, for example a district, and each column represents the neighbors. In a proximity matrix, neighbors are indicated by a 1 and non-neighbors are indicated by a 0. When two districts are indicated to be neighbors, we assume that they are influenced by each other. The spatial weights matrix indicates the strength of this association. Spatial weights express the effect of an area on its neighbors. Spatial weights can be calculated using different methods: inverse of the square distance between the areas (1), by adjacency (2), the length of the shared border (3) or by interaction or gravity model (4).

After defining the spatial weights we can take the step of calculating the smoothed values. When we do so, we recalculate the disease rate of the district by including the rates in the neighboring districts using the weights. This results in a smoothed expected value corrected for influences of neighboring districts.