Mapping with ggplot2: stat_binhex maps

Just some notes on the R code used for producing the map on my previous entry about erosion rates across the World.

Figure_01

Sometimes, when one has a lot of points to represent on a map, they overlap in excess resulting in an over-cluttered representation that does not adequately inform about the density of points. This was the case of the global soil erosion dataset on the referred post, so I used the stat_binhex representation on the ggplot library. The stat_binhex function bins a 2D plane, in this case a map, into a regular array of hexagons (there is an equivalent function with rectangles, too). On each hexagon, a color scale informs on the density (number) of points that lie within its surface.

Here is the code for the plot in the mentioned post, the data can be found here.

# We will use the following two libraries

library(ggplot2)
library(maptools)

# Load the dataset of points

dat <- read.table('dat.txt', head=TRUE)
head(dat)

# Load a shape file with the land border, and fortify it
# to conform to ggplot2 requirements

land <- readShapeLines('./land.shp')
land <- fortify(land, region='piece')
colnames(land)[1] <- c('lon')
head(land)

# Define the basic characteristics of the plot

p  <- ggplot(data=dat, aes(x=lon,y=lat))

A standard point plot looks cluttered:

p + geom_point()

map_1

Adding some transparency makes it slightly better, but still not satisfactory:

p + geom_point(alpha=0.25)

map_2

Binhex representation makes the things better:

p + stat_binhex(bins=200, binwidth=c(2,2))

map_3

It is possible to control the number of bins, and also the bin width. Finally, we can add the land border with geom_path(), tweak the color scale, and make some other minor modifications to the theme:

p + stat_binhex(bins=200, binwidth=c(2,2)) +
 geom_path(data=land,aes(group=group), size=0.25) +
 scale_fill_gradientn(colours=c('light gray','blue'),name='Frequency',na.value=NA) +
 coord_equal() +
 ylim(-60,90) +
 labs(x=NULL, y=NULL) +
 theme_bw() +
 theme(legend.position=c(0.075,0.28))

map_4

Please note that, depending on the spatial projection of the data, the hexagons generated by stat_binhex do not correspond to an equal land area.

Sin comentarios

Deja una respuesta

Tu email nunca será compartido con nadie.Los campos obligatorios están marcados con *