Just some notes on the R code used for producing the map on my previous entry about erosion rates across the World.
Sometimes, when one has a lot of points to represent on a map, they overlap in excess resulting in an over-cluttered representation that does not adequately inform about the density of points. This was the case of the global soil erosion dataset on the referred post, so I used the stat_binhex representation on the
ggplot library. The stat_binhex function bins a 2D plane, in this case a map, into a regular array of hexagons (there is an equivalent function with rectangles, too). On each hexagon, a color scale informs on the density (number) of points that lie within its surface.
Here is the code for the plot in the mentioned post, the data can be found here.
# We will use the following two libraries library(ggplot2) library(maptools) # Load the dataset of points dat <- read.table('dat.txt', head=TRUE) head(dat) # Load a shape file with the land border, and fortify it # to conform to ggplot2 requirements land <- readShapeLines('./land.shp') land <- fortify(land, region='piece') colnames(land) <- c('lon') head(land) # Define the basic characteristics of the plot p <- ggplot(data=dat, aes(x=lon,y=lat))
A standard point plot looks cluttered:
p + geom_point()
Adding some transparency makes it slightly better, but still not satisfactory:
p + geom_point(alpha=0.25)
Binhex representation makes the things better:
p + stat_binhex(bins=200, binwidth=c(2,2))
It is possible to control the number of bins, and also the bin width. Finally, we can add the land border with geom_path(), tweak the color scale, and make some other minor modifications to the theme:
p + stat_binhex(bins=200, binwidth=c(2,2)) + geom_path(data=land,aes(group=group), size=0.25) + scale_fill_gradientn(colours=c('light gray','blue'),name='Frequency',na.value=NA) + coord_equal() + ylim(-60,90) + labs(x=NULL, y=NULL) + theme_bw() + theme(legend.position=c(0.075,0.28))
Please note that, depending on the spatial projection of the data, the hexagons generated by stat_binhex do not correspond to an equal land area.