Species In Space

Correlation MDS-space plots added to raster.cor.plot

Here's a handy little visualization for when you're interested in the correlations between your rasters.  Basically it takes the matrix of absolute values of correlation coefficients for a set of rasters, turns it into a distance matrix, and then runs MDS on it.  The resulting coordinates are turned into a nice little plot, where highly correlated variables are plotted closer to each other than less correlated variables.  It's a good way to eyeball relationships during variable selection.  Here's one for a set of 20 bioclimatic variables.
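If you're curious what that looks like under the hood, here's a rough sketch of the idea (not the actual raster.cor.plot internals), assuming "env" is a RasterStack of your predictor layers:

library(raster)
library(ggplot2)

# Absolute correlations between layers, turned into distances and scaled with classical MDS
cors <- abs(layerStats(env, "pearson", na.rm = TRUE)$`pearson correlation coefficient`)
mds <- cmdscale(as.dist(1 - cors), k = 2)
mds.df <- data.frame(x = mds[, 1], y = mds[, 2], layer = rownames(mds))

# Highly correlated layers end up close together in the plot
ggplot(mds.df, aes(x = x, y = y, label = layer)) + geom_text() + theme_bw()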

Currently this is on the develop branch on GitHub, but I'll be merging it into the master branch as soon as it passes through testing.



Playing around with a new version of response plots

On the develop branch of the R version of ENMTools, I've been playing around with a new way of plotting marginal response functions.  In the figure above, the solid blue line represents the relative suitability of habitat as a function of an environmental gradient, with all other variables being held constant at their mean value for all presence and background points.  The green and red dashed lines represent the relative frequency of occurrence of different values of the environmental variable.  This is a Bioclim model, which is why it looks so chunky.
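For anyone wondering how a marginal response curve like the blue line gets computed, here's a hedged sketch (not the package code) for a plain GLM, where "model" is a fitted glm and "data" is a data frame of predictor values at your presence and background points:

# Vary one predictor across its observed range while holding the others at their means
marginal.response <- function(model, data, var, n = 100) {
  newdata <- data.frame(lapply(data, function(x) rep(mean(x, na.rm = TRUE), n)))
  newdata[[var]] <- seq(min(data[[var]], na.rm = TRUE),
                        max(data[[var]], na.rm = TRUE), length.out = n)
  data.frame(env = newdata[[var]],
             suitability = predict(model, newdata, type = "response"))
}

# e.g. plot(marginal.response(my.glm, env.data, "bio1"), type = "l")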

The goal here is to give you some idea of what your model is saying in the context of the data that generated it, instead of just plotting the response function.  Currently it's been tested on GAM, GLM, Bioclim, and Domain models.  I'll do some more testing and probably move it over to the main branch before the weekend.

Interactive species and model plots

Russell Dinnage has just added a really cool feature: interactive plotting of enmtools.species and enmtools.model objects.  Here's a species:

And here's a model:


The maps are zoom-able and pan-able, and draw very quickly.  Source maps are downloaded from the internet on the fly, though, which means you do need an internet connection to use these functions.

The functions in question are "interactive.plot.enmtools.species" and "interactive.plot.enmtools.model", respectively.  Both are now implemented on the master branch of ENMTools on GitHub.
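Usage is about as simple as it gets; something like the following should work (the object names here are placeholders):

interactive.plot.enmtools.species(my.species)
interactive.plot.enmtools.model(my.model)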

Recent changes: trim dupes by raster, species from csv file, and improvements to ecospat tests

I've been making a bunch of small changes recently.  Some of them are under the hood, but others are there to make your life easier.  One nice little trick is that now you can do:

species.list = species.from.file("myfile.csv", species.col = "Species")

That will create a unique enmtools.species object for every unique value in the column named "Species".  If you have one species in the file, you just get back an enmtools.species object.  If you have multiple species in the file, you get a list of enmtools.species objects.  This will save a ton of time, particularly if you're building an entire clade for the aoc tests.



I've also added a "trim.dupes.by.raster" function.  Basically you feed that a set of points and a raster, and it trims your data set down so there's a max of one point per grid cell.  Nothing dramatic, but it's something people do a lot so it's worth automating.  It's easy to use too:

new.points = trim.dupes.by.raster(old.points, env)


Finally, I tweaked a bunch of stuff for the ecospat functions.  Most of it you won't notice, but it deals with some potential errors that could come up if you had mismatched NAs in your environment rasters.  The big changes are (1) ecospat was calculating p values wrong, so now that's being done in ENMTools, and (2) the ENMTools ecospat functions will now automatically do a PCA if you pass them more than two rasters, unless told otherwise.

BITC presentation on the past, present, and future of ENMTools

In case you missed it, I gave a talk for the Biodiversity Informatics Training Curriculum the other day covering the history of ENMTools, what's happening now, and where it's going in the near future.  It's a whirlwind tour, but for those of you who are curious about what's coming this is a good overview.

Thanks to Town for having me on, and I apologize to y'all for the technical problems.  Everything worked right up until the very second we started recording.

Course Reminder: Quantitative geographic ecology using R. Glasgow, April 30 to May 4.

Hey everybody!  There are still open slots in my upcoming course with Matt Fitzpatrick entitled "Quantitative geographic ecology using R: modelling genomes, niches, and communities".  Sign up now, it's coming up fast!

The course will include introductory lectures, guided computer coding in R, and exercises for the participants, with an emphasis on visualization and reproducible workflows. All modelling and data manipulation will be performed with R. Attendees will learn to use niche modelling algorithms including Maxent, GLM, GAM, and others, and will learn both new and existing methods for conducting comparative studies using ENMs in the new ENMTools R package.  Generalized Dissimilarity Modelling (GDM) and Gradient Forest (GF) will be taught for modelling genomic and community-level data. The course is intended for intermediate R users with interest in quantitative geographical ecology.

https://www.prstatistics.com/course/quantitative-geographic-ecology-using-r-modelling-genomes-niches-and-communities-qger01/

Issues with installing ENMTools under newest versions of R

During the course I co-taught with Matt Fitzpatrick in Glasgow a few weeks ago, it came out that a number of people were having trouble installing ENMTools.  A lot of this seems to stem from the newest version of R, and in particular how it interacts with your Java installation.

Some Mac users (including me) were able to fix this just by going to the terminal, running "R CMD javareconf", and restarting R.

Windows users seemed to have much more significant problems, however.  There appeared to be several distinct issues, but here's a solution that helped several of the students, courtesy of Hirzi Luqman:

Error:
* installing *source* package ‘ENMTools’ ...
** R
** data
*** moving datasets to lazyload DB
** tests
** byte-compile and prepare package for lazy loading
Error : .onLoad failed in loadNamespace() for 'rJava', details:
  call: dyn.load(file, DLLpath = DLLpath, ...)
  error: unable to load shared object '/Library/Frameworks/R.framework/Versions/3.5/Resources/library/rJava/libs/rJava.so':
dlopen(/Library/Frameworks/R.framework/Versions/3.5/Resources/library/rJava/libs/rJava.so, 6): Library not loaded: /Library/Java/JavaVirtualMachines/jdk-9.jdk/Contents/Home/lib/server/libjvm.dylib
  Referenced from: /Library/Frameworks/R.framework/Versions/3.5/Resources/library/rJava/libs/rJava.so
  Reason: image not found
ERROR: lazy loading failed for package ‘ENMTools’
* removing ‘/Library/Frameworks/R.framework/Versions/3.5/Resources/library/ENMTools’
Installation failed: Command failed (1)


Problem:
ENMTools/rJava was looking for (and couldn't find) the library "libjvm.dylib" in the directory "/Library/Java/JavaVirtualMachines/jdk-9.jdk/Contents/Home/lib/server/", because no such directory existed (it refers to a Java 9 installation); I don't know why it looked for the file there.

Solution: I just copied the same file from the Java 8 directory (or whatever Java version is installed on the computer; in my case the directory was "/Library/Java/JavaVirtualMachines/jdk1.8.0_45.jdk/Contents/Home/lib/server/") to the R library directory "/Library/Frameworks/R.framework/Versions/3.5/Resources/lib/", and it worked.

More installation issues: "descendants" not exported from phyloclim, ecospat not found

Well, the fun just keeps rolling in with the newest R update and the ensuing package updates.  Two new issues that prevent new installations of ENMTools have just been brought to my attention.  First, the newest version of phyloclim no longer exports the "descendants" function, which ENMTools uses.  There's a hotfix for that on the develop branch of ENMTools on GitHub, where I basically just copied the code over from phyloclim for the time being so it no longer has to be imported.

The second issue is a bit more of a hassle: ecospat has been removed from CRAN, and I'm not sure why.  Given that ENMTools requires ecospat to run, this is a bit of an issue.  For now, all you can really do is install ecospat manually from the GitHub mirror of the CRAN repository, and then install ENMTools after that.  Hopefully ecospat will get updated and re-uploaded to CRAN soon, but failing that we might need to find a more permanent workaround eventually.

Minor fixes, new features

Hey all!  I've been bashing away at ENMTools for the past couple of days, just doing a bunch of bug fixes and adding some new features.  If you want to see everything you can go here, but I'll outline the highlights.

1. Added a "bg.source" argument to all modeling functions that allows you to specify whether you want to draw background from the background points or range raster stored in your enmtools.species object, or from the geographic area outlined by your environment rasters.  If you don't specify a bg.source argument, it will prioritize them in the order it did previously: background points > range raster > environment layers (see the example sketch after this list).

2. Changed the raster.pca function to return the actual pca object along with the raster layers. 

3. Fixed a persistent issue where ggplot2 plots from the hypothesis tests were excluding some values.  The fix for this isn't perfect yet (and I'm not entirely sure why), but in my experiments with it yesterday the issue is MUCH reduced.

4.  Added a plot to the output of enmtools.aoc that shows the averaged overlap values on each node in the phylogeny.  The old plots for the hypothesis tests are still there, but if you display the enmtools.aoc object those are now the second plot.  The first one is the tree, and it looks like this:
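
Returning to the bg.source argument from item 1 above, here's a hedged sketch of how a call might look.  The model type, formula, and the value "points" are illustrative placeholders; check the function documentation for the exact accepted values.

my.glm <- enmtools.glm(species = my.species, env = env,
                       f = presence ~ bio1 + bio12,
                       bg.source = "points")  # background drawn from the species' background points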



I've also added some plots I've been meaning to put in for a while.  These are called "three space plots", because they're meant to visualize the environment spaces representing the presence and background data from a model along with the environment space represented by a set of environment layers.

For these you just type threespace.plot(model, env), where your model is an enmtools.model object and your env is a set of raster layers.  That gives you something like this:



The goal here is to visualize how much of that environment space represents sets of conditions your model never saw when it was being built.  This is just a first pass and I do have more stuff planned in this direction, but I reckon this alone might be useful for some of you out there.

Predict functions for all model types

I've finally gotten around to adding predict() functions for all of the ENMTools model types.  You can now project your model onto a new time period or geographic extent, and it gives you back two things: a raster of the predicted suitability of habitats, and a threespace plot (see description here).  I've also added some more stuff under the hood that should catch some data formatting issues that were causing errors.
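Here's roughly what that looks like in practice.  The object names below are placeholders, and the exact names of the returned components may differ slightly from what's shown here:

# Project an existing ENMTools model onto new layers (e.g. a future climate stack)
future.pred <- predict(my.model, future.env)
plot(future.pred$suitability)   # raster of predicted suitability
future.pred$threespace.plot     # how much of the new environment space the model has actually seen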

Why add correlations for suitability scores?

Hey y'all!  After a conversation with some colleagues, I realized that I sort of added Spearman rank correlation as a measure of overlap to ENMTools without really explaining why.  Here is a quick and dirty sketch of my thinking on that.

Previous measures of overlap between suitability rasters that were implemented in ENMTools were based on measures of similarity between probability distributions.  That's fine as far as it goes, but my feeling is that those metrics are more useful as measures of the similarity of species' potential spatial distributions than as measures of the similarity of the underlying models.  Here's a quick example:

# Perfect positive correlation
sp1 = seq(0.1, 1.0, 0.001)
sp2 = seq(0.1, 1.0, 0.001)

You can only see one line there because the two species are the same.  I wrote a function called olaps to make these plots and to calculate three metrics of similarity that are implemented in ENMTools.
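The real code is in the gist linked at the bottom of this post, but here's a minimal sketch of what a function like olaps computes: Schoener's D, the I statistic, and Spearman rank correlation, with the suitability vectors normalized to sum to one for D and I.

olaps.sketch <- function(sp1, sp2) {
  p1 <- sp1 / sum(sp1)
  p2 <- sp2 / sum(sp2)
  list(D = 1 - 0.5 * sum(abs(p1 - p2)),                 # Schoener's D
       I = 1 - 0.5 * sum((sqrt(p1) - sqrt(p2))^2),      # I (Hellinger-based)
       cor = cor(sp1, sp2, method = "spearman"))        # Spearman rank correlation
}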

olaps(sp1, sp2)

$D
[1] 1

$I
[1] 1

$cor
[1] 1

Okay, that's all well and good - perfectly positively correlated responses get a 1 from all metrics.  Now what if they're perfectly negatively correlated?

# Perfect negative correlation
sp1 = seq(0.1, 1.0, 0.001)
sp2 = seq(1.0, 0.1, 0.001)
olaps(sp1, sp2)
$D
[1] 0.590455

$I
[1] 0.8727405

$cor
[1] -1

What's going on here?  Spearman rank correlation tells us that they are indeed negatively correlated, but D and I both have somewhat high values!  The reason is that the values of the two functions are fairly similar across a fairly broad range of environments, even though the functions themselves are literally as different as they could possibly be.  Thinking about what this means in terms of species occurrence is quite informative; if the threshold for suitability for a species to occur is low (e.g., .0005 in this cartoon example), they might co-occur across a fairly broad range of environments; both species would find env values from 250 to 750 suitable and might therefore overlap across about 2/3 of their respective ranges.  That's despite them having completely opposite responses to that environmental gradient, strange though that may seem.

So do you want to measure the potential for your species to occupy the same environments, or do you want to measure the similarity in their estimated responses to those environments?  That's entirely down to what question you're trying to answer!

Okay, one more reason I kinda like correlations:

# Random
sp1 = abs(rnorm(1000))
sp2 = abs(rnorm(1000))


Here I've created a situation where we've got complete chaos; the relationship of both species to the environment is completely random.  Now let's measure overlaps:

olaps(sp1, sp2)

$D
[1] 0.5641885

$I
[1] 0.829914

$cor
[1] -0.04745993

Again we've got fairly high overlaps between species using D and I, but Spearman rank correlation is really close to zero.  That's exactly what we'd expect if there's no signal at all.  Of course the fact that species distributions and the environment are both spatially autocorrelated means that we'll likely have higher-than-zero (at least in absolute value) correlations even if there is no causal relationship between the environment and species distributions, but at least it's nice to know that we do have a clear expected value when chaos reigns.



Code for this is here:

https://gist.github.com/danlwarren/5509c6a45d4e534c3e0c0ecf1702bbdd

Fun fact: you can run a whole bunch of models at once using ENMTools quite easily

The ENMTools R package contains a function called "species.from.file".  This takes a .csv file and creates a list of species objects, one for each unique value in the "species" column of your .csv file.  So you can do:

species.list <- species.from.file("myspecies.csv")

and you'll get back a list with ENMTools species in it. If you wanted to run a bunch of models using those species with the same settings, you could then do:

my.models <- lapply(species.list, function(x) enmtools.gam(x, climate.layers, test.prop = 0.3))

where "climate.layers" is your raster stack. That would create a list of ENMTools GAM objects, setting aside 30% of the data from each for testing.

Low memory usage options for identity.test and background.test

This is something I've been meaning to do for a while, but I just finally got around to it because it was screwing up someone's analysis.

Originally, the ENMTools R package was designed to store all replicate models in the output object for the identity and background tests.  While that's fine for low resolution or small extent studies, it got to be a real problem for people working with high-resolution data over larger geographic extents.

To deal with this, I've created options for background.test and identity.test that allow you to save the replicate models to .Rda files instead of storing them in the output object.  When called using this option, the replicate.models entries in the identity.test and background.test objects contain paths to the saved model files instead of containing the models themselves.

By default these functions just store models in the working directory, but you can specify a directory to save them to instead if you prefer. 

To run these tests using the low memory options, just pass the argument low.memory = TRUE.  If you want to pass it a directory to save to, just add rep.dir = "PATH", where PATH is your directory name.
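Putting that together, a call might look something like the following.  The species, layer, and replicate-count arguments are placeholders; the low.memory and rep.dir arguments are the new bits described above.

id <- identity.test(species.1 = sp1, species.2 = sp2, env = env,
                    type = "glm", nreps = 99,
                    low.memory = TRUE, rep.dir = "identity-reps")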

Be warned that replicate models WILL be overwritten if they exist.  It's a good idea to make a separate directory for the reps from each analysis.

This new functionality is currently only implemented on the "develop" branch, but we'll move it over to the main branch soon.

RWTY 1.0.2 now on CRAN

The newest version of RWTY, version 1.0.2 is now on CRAN.  This is a relatively minor release except for one significant bug fix: there was an issue causing one of the plotting functions to fail when it was called with a single MCMC chain (as opposed to multiple MCMC chains).  That's fixed now, and all is well.

The other major change (for the worse, IMO) is that we had to remove the "Plot Comparisons" vignette to get the package in under CRAN's size restrictions.  That's a really useful vignette, so it sucks to have to remove it.  There are just too many images to make it fit the size requirements, though, and since the entire point of that vignette is comparing those images it doesn't make sense to remove them.  You can still get the vignette installed with RWTY by installing from GitHub or just check it out here: http://danwarren.net/plot-comparisons.html

Low memory options for rangebreak tests are now available on develop branch

There are now options available for writing replicate models to disk for the rangebreak tests, instead of storing them in the output object.  This works in exactly the same way as it does for the identity and background tests, as outlined here:

http://enmtools.blogspot.com/2019/02/low-memory-usage-options-for.html

Again it's currently only on the develop branch, but we'll move it to main before too terribly long.

Now on bioRxiv: Evaluating species distribution models with discrimination accuracy is uninformative for many applications

I've just posted a new paper to bioRxiv entitled "Evaluating species distribution models with discrimination accuracy is uninformative for many applications", which I did in collaboration with Nick Matzke and Teresa Iglesias.  Many of you have seen me present this work at conferences over the past few years, but this is the first time it's been available in a publicly accessible format.  The conclusions of this paper are fairly problematic for SDM/ENM studies where the goal is to estimate the response of the species to a set of environmental gradients, rather than just to estimate its distribution, so I feel it's worthwhile to do a preemptive breakdown and FAQ about what we are, and aren't, saying with this one, and to try to respond to some obvious questions it raises.

Main conclusion: Using a very large set of simulations, we find that using discrimination accuracy (AUC/TSS/kappa) on occurrence data is, under a broad range of conditions, not very useful for selecting models that accurately estimate the niche.  Here, "accurately estimating the niche" means getting the relative suitability of habitat right.  In other words, models that get your species' current spatial distribution right are not necessarily models that accurately estimate the relative suitability of habitat and vice versa.  Under many realistic conditions, we simply cannot tell good niche estimates from bad ones this way.

How is this possible?  People have actually known that discrimination accuracy can be largely misinformative about calibration for quite some time, and this has even been brought up multiple times before in the SDM/ENM literature.  We're not really demonstrating a new problem here, so much as demonstrating the scope of a known problem.

As for how this is possible, consider the following plot:


Here we've got a bunch of presence and absence (or background/pseudoabsence) data, distributed along a one-dimensional environmental gradient.  The solid lines represent six different models we could use to explain or generalize this species' distribution along that gradient.  These are obviously very different models, right?  The issue is that discrimination accuracy essentially asks "how well do you assign higher scores to your presence data than you do to your non-presence data?", and in this example every one of these models assigns higher scores to all of the presence points than it does to the non-presence points.

At first pass the message you might get from this is "okay, discrimination accuracy is saying some models are good that seem quite wacky, but it's also saying that some models are good that might be quite sensible".  That's true, but the real issue is not just that it's assigning high discrimination accuracy to all of the models; it's actually assigning the same accuracy to all of these models.  That means that if you had these data and this set of models in hand, you would be unable to choose between them using discrimination accuracy alone; their performance is identical.

This points to the root of the problem, as we see it; if your goal is to estimate a species' continuous response to the environment, discrimination accuracy has very little signal that is useful for model selection because you may have an arbitrarily high number of possible models that have the same (or similar) discrimination accuracy.  
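If you want to convince yourself of this, here's a little toy example (all names invented) in which a smooth response and a crude step function both rank every presence above every background point, and therefore both get a perfect AUC:

pres <- runif(50, 400, 600)                          # presences clustered mid-gradient
bg   <- c(runif(50, 0, 300), runif(50, 700, 1000))   # background in the tails

smooth.model <- function(x) dnorm(x, mean = 500, sd = 100)   # smooth, unimodal response
step.model   <- function(x) as.numeric(x > 350 & x < 650)    # crude step response

auc <- function(p, a) mean(outer(p, a, ">") + 0.5 * outer(p, a, "=="))
auc(smooth.model(pres), smooth.model(bg))   # 1
auc(step.model(pres), step.model(bg))       # 1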

Using methods that involve a high number of predictors and complex response functions exacerbates this problem by a substantial amount; we found that discrimination accuracy is most useful for model selection only when you have a few (less than ~4) predictors and when the true marginal suitability function is a realizable case of the modeling algorithm.  This means that discrimination accuracy isn't very effective at model selection when you're using (for example) some sort of step function to approximate a smooth one.

That's the statistical argument, but there's an ecological argument to be made here as well: we know that species' spatial distributions are shaped by a number of phenomena besides their environmental niches (biotic interactions, historical biogeographic processes, etc.).  This means that an accurate estimate of environmental habitat suitability will, in many cases, over- or under-estimate the actual distribution of the species.  Finding that good distribution estimates sometimes aren't great niche estimates (and vice versa) therefore shouldn't be surprising from a purely biological perspective.

Does this mean my models are bad?  It doesn't necessarily mean that at all!  What it means is that, if you're in a spot where you have a set of candidate models that have a range of performance using discrimination accuracy, discrimination accuracy is often not useful for choosing between them.  If all of your models are good niche estimates (which they might be), this may just mean that you didn't necessarily pick the best niche estimate out of the batch.  

It does have broader implications for the methodological literature, however.  Over the past few decades, we have developed a set of "best practices" for SDM/ENM studies.  These "best practices" have been favored largely based on studies where people quantify models' ability to predict withheld occurrence data via some measure of discrimination accuracy.  Our results imply that those "best practices" may only be best practices when the goal is to estimate species' distributions, and we may need to entirely reevaluate which of those methods work best for making good niche estimates. 

Is it possible that you just screwed up the simulations? It's always possible, but we did a lot of checking and double-checking to make sure our simulations do what they say on the tin.  Honestly the results are worrying enough that I would be quite happy to find out that this is all just due to me screwing up the simulation at some point that I haven't managed to find yet.  I will say, however, that I've recently been doing an entirely separate set of simulations for another purpose using entirely different methods of simulating species and occurrence data, and have found very similar results.

But you didn't use method X (where X = some method of choosing predictors, selecting study area, favored algorithm, etc.)!  This is a comment that seems to pop up every time this work is presented at a meeting or seminar.  It's true that we didn't explore every possible combination of methods in our analyses, and model construction was basically automated.  We did try to explore a pretty decent space of common methods though, including seven different algorithms, two different sizes of study region, spatially partitioned vs. random model validation, and varying levels of model and niche complexity.  Some of it seemed to make a bit of difference, but the only thing we could find that seemed to really improve things markedly was making models with few predictors, and making models with biologically realistic response functions (i.e., those that matched the true niche).  

As a result of exploring all of those combinations of approaches, this study involved the construction of many thousands of models, and we can't really do that in a hands-on fashion.  "It's a lot of work" isn't really a great argument, though, so let me clarify our reasoning on this topic.  There are a few things to note here:

  1. In order to explore the relationship between discrimination accuracy and our ability to estimate the niche (which we're calling "functional accuracy"), we actually need models with a range of performance on both of those axes, including some that are actually bad at one or the other, or both.
  2. The question we're addressing is not "can we make good niche estimates?", or "can we make good distribution estimates?", or even "how do we make the best niche and/or distribution estimates?"  The question we're asking has to do with whether or not evaluating models as distribution estimates helps us pick good niche estimates.  As such, using methods that have been shown to make better distribution estimates won't necessarily affect our results at all.  In fact, even finding methods that make better distribution and niche estimates won't affect our results unless those methods also produce a tighter correlation between discrimination and functional accuracy.  Given that we have no a priori reason to think that this is the case, along with the computation time involved, we decided the issue wasn't worth investing another year of computer time in.  If, however, someone can demonstrate that there is a set of methods that produces a strong correlation between discrimination and functional accuracy I will be absolutely thrilled.  One thing I think might be particularly interesting for future exploration is sample size.  We kept sample size constant here at 100 (75 training, 25 testing).  I suspect that higher sample sizes might expand the space of methods for which discrimination accuracy is useful in model selection, but that's a question for another day.
  3. Even assuming that some combination of methods does exist that produces a strong correlation between discrimination and functional accuracy, we think it's very useful to demonstrate that there's a broad swath of approaches for which those two things are not correlated.  Even if it's not broken everywhere, we feel there's great value to demonstrating that it's broken in a lot of places.
  4. Many of the methods people in the field might be inclined to suggest exploring are considered "best practices" due to previous methodological work in the field.  That's reasonable, but we'd highlight the fact that most of those methodological studies favored those methods specifically because they maximize discrimination accuracy.  Given that we're showing that discrimination accuracy is largely uninformative here for selecting niche estimates, their performance in those studies is not necessarily relevant here; see point 2.

Why post on bioRxiv?  The paper is currently in review, and we're hopeful that it will be accepted before too long.  However, I'm currently working on several projects that build on this work, and need a citation target for it.  Others have asked for something they can cite too, so I decided to just go ahead and put it out there.

Where can I get the simulation code?  I've got a GitHub repo for it here, but I'm still in the process of making it prettier and easier to understand.  You're welcome to download it now, but if you wait a few days it will likely be far more compact and comprehensible.

How should I change my methods if I'm concerned about this?  As stated above, to me the big take-homes are:
  1. Use a small number of predictors in model construction, chosen because you think they are important in limiting the species' distribution.
  2. Use methods that produce biologically plausible response functions.
  3. Visualize marginal suitability functions, and think critically about them as biological estimates.

These suggestions basically mirror general best practices in any sort of modeling.  However, to my mind we as a field (myself very much included) have often allowed ourselves to violate these common-sense principles in pursuit of models that maximize discrimination accuracy.  In light of what Teresa, Nick, and I have found with these simulations, though, I think we should be much more careful about our models as niche estimates, and much less beholden to methods that make maps that are tightly fit to species' current spatial distributions.

One final note: In addition to de-emphasizing discrimination accuracy somewhat in empirical modeling studies, I hope that two things come out of this study.  First, I hope that future studies will examine the performance of alternative model selection metrics.  Second, I sincerely hope that this will help to emphasize how much SDM/ENM needs a robust simulation literature (see also this excellent review).  The question we examined in this study seems pretty fundamental to the field, and yet there really hadn't been much in the way of prior simulation work on it.  I think there are a ton of other low-hanging fruit out there that need to be reexamined in a simulation context; we've got a field where a lot of things have been justified by verbal arguments and appeals to common sense, and we really need to figure out whether that stuff actually works.


Addenda:

From Twitter, paraphrased: Why not evaluate other methods of measuring model fit (e.g., calibration)?  In this study, our goal was just to look at the most commonly used metrics of model fit in the literature, and so we stuck with discrimination accuracy on occurrence points.  People do measure model calibration and other aspects of model performance, but they're a tiny fraction of the published studies.  I do strongly suspect that calibration will be a much better tool for model selection when the goal is to estimate the niche, and we should definitely look at that soon.


ENMTools R package now has a DOI


I finally got some time to put together the first real beta release of ENMTools, and got a DOI for it.  If you're using the R version of ENMTools, please cite it as follows for now:

Warren, D.L., N. Matzke, M. Cardillo, J. Baumgartner, L. Beaumont, N. Huron, M. Simões, and R. Dinnage.  2019.  ENMTools (Software Package).  URL: https://github.com/danlwarren/ENMTools. doi:10.5281/zenodo.3268814

Calibration metrics added to ENMTools R package

Given our recent paper on how badly discrimination accuracy performs at selecting models that estimate ecological phenomena, the obvious next question is whether there are existing alternatives that do a better job.  The answer is an emphatic YES; calibration seems to be a much more useful method of selecting models for most purposes.

This is obviously not the first time that calibration has been recommended for species distribution models; Continuous Boyce Index (CBI) was developed for this purpose (Boyce et al. 2002, Hirzel et al. 2006), Phillips and Elith (2010) demonstrated the utility of POC plots for assessing and recalibrating models, and Jiménez-Valverde et al. (2013) argued convincingly that calibration was more useful for model selection than discrimination for many purposes.  However, discrimination accuracy still seems to be the primary method people use for evaluating species distribution models.

A forthcoming simulation study by myself, Marianna Simões, Russell Dinnage, Linda Beaumont, and John Baumgartner is demonstrating exactly how stark the difference between discrimination and calibration is when it comes to model selection; calibration metrics are largely performing fairly well, while discrimination accuracy is actively misleading for many aspects of model performance.  The differences we're seeing are sufficiently stark that I would go so far as to recommend against using discrimination accuracy for model selection for most practical purposes. 

We're writing this study up right now, and in the interests of moving things forward as quickly as possible we'll be submitting it to bioRxiv ASAP - likely within the next week or two.  As part of that study, though, I've implemented a number of calibration metrics for ENMTools models, including expected calibration error (ECE), maximum calibration error (MCE), and CBI.  We did not implement Hosmer-Lemeshow, largely because ECE is calculated in a very similar way to HL and can be used in statistical tests in much the same way, but scales more naturally (a perfect model has an ECE of 0).  In our simulation study we found that ECE was the best performing metric for model selection by far, so for now that's what I personally will be using.

Fairly soon we'll integrate the calibration metrics into all of the model construction functions (e.g., enmtools.glm, enmtools.maxent, etc.).  For now though, you can get calibration metrics and plots just by using the enmtools.calibrate function.
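For example (the model object name is a placeholder, and the exact arguments may differ slightly):

calib <- enmtools.calibrate(my.model)
calib   # ECE, MCE, CBI, and the diagnostic plots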



That will give you two different flavors of ECE and MCE as well as CBI and some diagnostic plots.  ECE and MCE calculations are done using the CalibratR package (Schwarz and Heider 2018), and CBI is calculated using functions from ecospat (DiCola et al. 2017).

Boyce, M.S., Vernier, P.R., Nielsen, S.E., and Schmiegelow, F.K.A. 2002. Evaluating resource selection functions. Ecological Modelling 157:281-300.
Hirzel, A.H., Le Lay, G., Helfer, V., Randin, C., and Guisan, A. 2006. Evaluating the ability of habitat suitability models to predict species presences. Ecological Modelling 199:142-152.

Code snippet from Tyler Smith for fast plotting of ENMTools models

Over on the ENMTools GitHub page, Tyler Smith asked a question about plotting ENMTools models.  He pointed out that large models are very slow to plot, largely because of our use of ggplot2.  We like ggplot for this because it allows us to store plots in objects easily, and makes it possible for users to modify plots after the fact using all of the features of ggplot and the extensions people have written for it.  That said, it's probably frustrating to have a long draw time if you just want to take a quick peek at your model's predictions.  Tyler provided a code chunk that does a nice quick plot using base graphics.  The end result looks a lot like the standard ggplot plots we've been returning, but takes a fraction of the time to display.  Here's that code:

library(viridis) # to match ENMTools color palette

plotTWS <- function(x, ...) {
  # Plot the suitability raster with the same viridis palette ENMTools uses
  plot(x$suitability, col = viridis(100, option = "B"),
       xlab = "Longitude", ylab = "Latitude",
       main = paste("Maxent model for", x$species.name),
       bty = 'l', box = FALSE)
  # Overlay training presences (white) and test points (green)
  points(subset(x$analysis.df, presence == 1), pch = 21, bg = "white")
  points(x$test.data, pch = 21, bg = "green")
}

We're going to see if we can work out a quicker way to do our built-in plots using ggplot, but for now this is a nice workaround!

Introductory tutorials for the R version of ENMTools

Hey everybody!  I've started recording quick tutorials on the most important bits of ENMTools.  Here's one on how to install ENMTools and all of its dependencies:



And here's one on how to build ENMTools species objects and some quick models:




