biomonitoR: an R package for managing ecological data and calculating biomonitoring indices

Alex Laini; Simone Guareschi; Rossano Bolpagni; Gemma Burgazzi; Daniel Bruno; Cayetano Gutiérrez-Cánovas; Rafael Miranda; Cédric Mondy; Gábor Várbíró; Tommaso Cancellario

doi:10.7717/peerj.14183

biomonitoR: an R package for managing ecological data and calculating biomonitoring indices

Alex Laini^1,2, Simone Guareschi ^3,4, Rossano Bolpagni¹, Gemma Burgazzi⁵, Daniel Bruno⁶, Cayetano Gutiérrez-Cánovas³, Rafael Miranda⁷, Cédric Mondy⁸, Gábor Várbíró⁹, Tommaso Cancellario^7,10

1Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy

2Department of Life Sciences and Systems Biology, University of Turin, Torino, Italy

3Estación Biológica de Doñana, Consejo Superior de Investigaciones Científicas, Sevilla, Spain

4Geography and Environment, Loughborough University, Loughborough, United Kingdom

5Institute for Environmental Sciences, University of Koblenz-Landau, Landau, Germany

6Instituto Pirenaico de Ecología (IPE), Consejo Superior de Investigaciones Científicas, Zaragoza, Spain

7Department of Environmental Biology, University of Navarra, Pamplona, Spain

8French Agency for Biodiversity (OFB), Vincennes, France

9Centre for Ecological Research, Institute of Aquatic Ecology, Debrecen, Hungary

10Water Research Institute, National Research Council (CNR), Verbania, Italy

DOI: 10.7717/peerj.14183

Published: 2022-10-14
Accepted: 2022-09-14
Received: 2022-07-22

Academic Editor: Eric Ward

Subject Areas: Biodiversity, Bioinformatics, Ecology, Data Science, Freshwater Biology
Keywords: Bioassessment, Decision-making tools, Ecological indicators, Environmental management, Functional metrics, Taxonomic indices

Copyright: © 2022 Laini et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Laini A, Guareschi S, Bolpagni R, Burgazzi G, Bruno D, Gutiérrez-Cánovas C, Miranda R, Mondy C, Várbíró G, Cancellario T. 2022. biomonitoR: an R package for managing ecological data and calculating biomonitoring indices. PeerJ 10:e14183 https://doi.org/10.7717/peerj.14183

The authors have chosen to make the review history of this article public.

Abstract

The monitoring of biological indicators is required to assess the impacts of environmental policies, compare ecosystems and guide management and conservation actions. However, the growing availability of ecological data has not been accompanied by concomitant processing tools able to facilitate data handling and analysis. Multiple common challenges limit the usefulness of biomonitoring information across ecosystems and biological groups. Biomonitoring data analysis is currently constrained by time-consuming steps for data preparation and a data processing environment with limited integration in terms of software, biological groups, and protocols. We introduce biomonitoR, a package for the R programming language that addresses technical challenges for the management of ecological data and metrics calculation. biomonitoR implements most of the biological indices currently used or proposed in different fields of ecology and water resource management. Its combination of customizable functions aims to support a transferable and comprehensive biomonitoring workflow in a user-friendly environment. biomonitoR represents a versatile toolbox with five main assets: (i) it checks taxonomic information against reference datasets allowing for customization of trait and sensitivity scores; (ii) it supports heterogeneous taxonomic resolution allowing computations at multiple taxonomic levels; (iii) it calculates multiple biological indices, including metrics for both broad and stressor-specific ecological assessments; (iv) it enables user-friendly data visualization, helping both decision-making processes and data interpretation; and (v) it allows working with an interactive web application straight from R. Overall, biomonitoR can benefit the wide biomonitoring community, including environmental private consultants, ecologists and natural resource managers.

Introduction

The Anthropocene has brought multiple and emerging threats to natural ecosystems and their biological communities (Reid et al., 2019). The monitoring of biological indicators (so called “biomonitoring”) is crucial to evaluate the impacts of environmental policies, compare ecosystems and guide management and conservation actions (Goldsmith, 1991; Jackson et al., 2016). In this context, biomonitoring represents a leading example of how ecological knowledge can help to address broad societal issues (Friberg, 2014). Over the years, research and institutional activities (e.g., long-term monitoring observatories, citizen science programmes) as well as the growing capacity for data storage (e.g., international platforms like Global Biodiversity Information Facility), have facilitated access to an increasing amount of ecological and biological data covering large areas, time frames and taxonomic groups (e.g., Grenié et al., 2022). Such heterogeneous sources and data availability brought new challenges and have not been adequately coupled with corresponding processing tools able to fully explore the great potential of biological databases, especially from a monitoring perspective.

Due to their long-standing exposure to human exploitation (Feio et al., 2021; Friberg, 2014), the case of freshwater ecosystems is emblematic. A multitude of information has been produced and indices have been developed for freshwater systems, especially following the implementation of environmental legislation such as the Water Framework Directive in Europe, the Clean Water Act in USA or the Canadian Protection Act (Birk et al., 2012; Vitecek, Johnson & Poikane, 2021). In this case, biomonitoring relies on a set of biological elements such as phytoplankton, phytobenthos, macrophytes, macroinvertebrates, and fish for assessing the status and integrity of communities and ecosystems (Birk et al., 2012). Complexity arises due to the different type of metrics used in freshwater biomonitoring including diversity measures (e.g., taxa richness, Shannon index) and biological indices that rely on taxon-specific sensitivity (Buss et al., 2015). Functional trait-based indices (e.g., Mondy et al., 2012; Laini et al., 2022) also emerged as novel tools able to provide complementary information and potentially enhanced performance over traditional indices (Bruno et al., 2016; Cadotte, Carscadden & Mirotchnick, 2011, but see Wilkes et al., 2017 for an exception). Moreover, new and consolidated threats (e.g., biological invasions, climate change) call for further pressure-based indices, like biocontamination (Arbačiauskas et al., 2008) and flow intermittency (Chadd et al., 2017) metrics.

In this context, an integrative data processing environment for biomonitoring is still lacking and would facilitate data-processing and index calculation, which are currently poorly integrated in terms of specific national protocols and informatic programs. One of the most important data-processing challenges is addressing the inconsistency among taxonomic, trait, and sensitivity scores because mismatches can lead to excluding taxa from analyses (Guareschi & Wood, 2019). Likewise, for highly diverse taxa (e.g., invertebrates), data frequently relies on mixed taxonomic resolution (e.g., species, genus, and family) due to inherent difficulties in identifying taxa at species level (e.g., in early life stages) (Jones, 2008). Heterogeneous taxonomic resolution and nomenclature discrepancies complicate assigning sensitivity scores or trait values to taxa and requires time-consuming data manipulations to properly organize the dataset. A thorough exploitation of these data, which are often only partially explored, will optimize the use of ecological data and reduce inefficiencies (Patterson et al., 2010; Grenié et al., 2022). Similarly, functional indices calculated from trait-based information require complex calculations that may prevent their wide use in biomonitoring (Maire et al., 2015).

Here, we introduce biomonitoR, a new R package (R Core Team, 2021) supporting the calculation of a wide range of biomonitoring indices and the effortless management of biological information from different sources (Fig. 1). The package favours a smooth and reproducible workflow in biomonitoring science and allows both new operations (e.g., metrics calculation and traits manipulation) as well as existing ones currently requiring the use of several packages. This versatility will help biomonitoR gain a prominent position among the informatic tools in ecology and monitoring. A detailed comparison with popular R packages focusing on community data analysis is illustrated in Table 1, displaying the niche occupied by biomonitoR within them.

Materials & Methods

Import data

The first step consists in comparing user’s data to a reference taxonomic dataset with the function as_biomonitor. User’s data consists in a data.frame with taxa on the first column and occurrences/abundances on the following columns, while the taxonomic reference dataset includes the relationships of taxa among levels. biomonitoR comes with four built-in reference datasets for diatoms, macrophytes, macroinvertebrates and fish derived from Diat.barcode (Rimet et al., 2019), AlgaeBase (Guiry et al., 2014), freshwatercology.info (Schmidt-Kloiber & Hering, 2015), and FishBase (Froese & Pauly, 2019), respectively. To ensure an up-to date and global taxonomy information, several functions (e.g., get_gbif_taxa_tree) are already available, permitting the user to build a biomonitoR reference dataset from online resources such as the Global Biodiversity Information Facility (GBIF), World Register of Marine Species (WoRMS), National Biodiversity Network Atlas (NBN Atlas) and International Union for Conservation of Nature (IUCN) (SM1). Moreover, users can also provide their own reference dataset or build it from a taxonomic tree with the function ref_from_tree. This is a crucial step in data handling to organize and clean the dataset as well as unify writing styles (e.g., lexical variants). Taxa names that do not match with any taxon of the reference taxonomy are excluded to ensure consistency of further steps. Nonetheless, suggestions are proposed either silently or interactively by setting the correct_names argument to TRUE. To explore the general data structure, the object generated by as_biomonitor can be subsetted or plotted by leveraging on the interactive plotly package (Sievert, 2020). The second step consists in using the function aggregate_taxa to aggregate taxa at different taxonomic levels according to the user’s data resolution. The obtained object can be used to calculate all the indices implemented in biomonitoR at the desired taxonomic resolution.

Figure 1: Example of workflow in environmental and biomonitoring science (from field work, data manipulation to results dissemination).
biomonitoR represents the toolbox that supports biomonitoring tasks and fluently connects the different phases.

Download full-size image

DOI: 10.7717/peerj.14183/fig-1

Table 1:

Comparison between the biomonitoR package and other R packages.

Comparison between the biomonitoR package and four popular R packages for analysing ecological community data. Package versions and dates are also specified.

Tasks and measures	vegan 2.6-2 2022-04-17	FD 1.0-12.1 2022-05-02	taxize 0.9.100 2022-04-22	BiodiversityR 2.14-3 2022-08-06	biomonitoR 0.9.3 2022-06-13
Richness and diversity metrics	Yese.g., diversity, fisher.alpha, specnumber, simpson.unb, taxondive, tsallis	No	No	Yese.g., diversityresults	Yes allrich, berpar, brillouin, esimpson, fisher, get_taxa_abundance, get_taxa_richness, invberpar, invsimpson, mcintosh, margalef, menhinick, pielou, richness, shannon, simpson
Biomonitoring indices	No	No	No	No	Yes aspt, bioco, bmwp, dehli, epsi, ept, eptd, fuzzy_trait_ratio, get_taxa_abundance, get_taxa_richness, ibmr, igold, life, psi, whpt,
Functional indices	Yese.g., treedive	Yese.g., dbFD, fdisp	No	No	Yes csi, cwm, f_disp, f_divs, f_red, f_eve, f_rich, fuzzy_trait_ratio
Comparison with reference dataset, handling heterogeneous taxonomic resolution	No	No	Yese.g., classification	No	Yes as_biomonitor, get_gbif_taxa_tree, get_iucn_taxa_tree, get_nbn_taxa_tree, get_worms_taxa_tree
Managing trait information	No	No	No	No	Yes assign_traits, average_traits, manage_traits, sample_traits
Ordinations, clustering	Yese.g., cca, rda, metaMDS	No	No	Yese.g., CAPdiscrim	No
Beta diversity analysis	Yese.g., betadiver	No	No	No	No
Null modelling	Yese.g., hiersimu, commsim	No	No	No	No

DOI: 10.7717/peerj.14183/table-1

Multiple taxonomic levels and back trace

biomonitoR calculates ecological indices at different taxonomic levels. The desired taxonomic level (e.g., species, family) can be specified with the argument tax_lev. This is a key feature to develop robust methods based on taxonomic sufficiency, the level of taxonomic detail to which organisms must be identified to recognize ecological patterns (Jones, 2008).

To keep track of the computations performed by biomonitoR, the original information can be obtained by setting the traceB argument to TRUE. This is especially useful when mismatches between taxonomic and sensitivity/traits datasets lead to the exclusion of some taxa. To further assure the transparency of the entire process, default scores are made available by using the show_scores function. biomonitoR is designed to be flexible, such that users can use their own scores or trait information. For example, in river biomonitoring, the calculation of the internationally used Average Score Per Taxon index (ASPT; Hawkes, 1998, see SM2) can be adapted to user-defined scores and taxonomic aggregation rules (e.g., combining taxa with similar stressor-specific responses). This feature can be extremely helpful when users want to test indices sensitivity or for simulation studies.

Tools for managing trait-based information and indices availability

biomonitoR allows users to overcome problems frequently arising during the computation of trait-based indices. The package includes several functions to manage functional traits information allowing the calculation of a range of functional metrics most of which are based on the FD (Laliberté & Legendre, 2010) and ade4 packages (Dray & Dufour, 2007). For example, the function assign_traits matches the taxa to those of the traits dataset. This is particularly useful when traits of several taxa in the trait dataset (e.g., species- and genus-level) can be assigned to a higher taxonomic level (e.g., family level). Currently, biomonitoR comes with a built-in macroinvertebrate trait dataset (European fauna: Schmidt-Kloiber & Hering, 2015; Tachet et al., 2010) but users can also add their own (see an example of loading a dataset including North American insect traits in SM3). A finer control over trait information can be achieved with the manage_traits function, that allows users to select the traits belonging to the nearest taxon in the taxonomic tree (e.g., SM3). Finally, traits at fine levels (e.g., species or genus) can be averaged at coarser levels (family) (average_traits) or a random taxon (e.g., genus) can be selected as representative (sample_traits). The calculation of some functional indices (e.g., functional richness, dispersion and evenness) requires summarizing trait variation into a low dimensional trait space. This can be achieved through a principal coordinate analysis based on a matrix of Euclidean distances or Gower dissimilarities. The quality of the trait space can be evaluated and adjusted using the function select_pcoa_axes to avoid poor representations (Maire et al., 2015). This function implements three ways of evaluating the quality of the functional space: (i) the correlation between the Euclidean distance of the n selected axis and the overall distance; (ii) the r² proposed by Legendre & Legendre (2012) and (iii) the mean squared deviation approach proposed by Maire et al. (2015). If the species-by-species distance matrix is not Euclidean, select_pcoa_axes calculates the performance of four widely used transformations (Cailliez, Lingoes, square root and quasi-Euclidean; see Legendre & Legendre, 2012). Sometimes, data transformation is not enough to make the species-by-species matrix Euclidean (e.g., when two or more taxa share the same traits) with potential negative effects on downstream data analysis. To resolve this issue, biomonitoR comes with two approaches. The first option aggregates the abundance of taxa with the same traits (zerodist_rm). The second option (add_bias_to_traits) differentiates taxa with identical trait profiles by adding a small amount (random bias) to the traits of each taxon (see R code and examples in SM3).

Overall, both taxonomic-based indices and more novel measures still pending to be fully incorporated in national or international biomonitoring schemes are available in the package. biomonitoR currently allows the calculation of more than 30 indices (including diversity, biomonitoring, and functional measures) as well as a wide range of complementary taxonomic measures (e.g., richness or abundance of a specific taxon, pair and combination of taxa, see Table 2 and examples of applications in SM2).

Data visualization

biomonitoR relies on the plotly package for providing flexible and high-quality graphs. Three types of plots are currently available to visualize the data. The first is a Sankey diagram that links indicator taxa (De Cáceres & Legendre, 2009) from a desired to an upper taxonomic level through groups provided by the user or identified by a cluster analysis (e.g., Fig. 2). The second plots the prevalence of a taxonomic level (e.g., family) within an upper taxonomic level (e.g., order, see SM4a-c). The third is an interactive barplot reporting the proportion of taxa within a desired taxonomic level, with sites ordered according to the results of a cluster analysis (see examples and codes in SM4d).

Table 2:

Summary of the main indices implemented in the biomonitoR package. Full details about references, acronyms and R functions are available in Table SM2.

Index type	Index name	Target group
Taxonomic-based indices (richness, diversity and evenness)	Richness or abundance of a taxon or combination of taxa, Berger-Parker, Berger-Parker inverse, Brillouin, Fisher alpha, Margalef, McIntosh, Menhinick, Pielou, Shannon, Simpson, Simpson evenness, Simpson inverse	All biological groups
Biomonitoring indices (ecological and single pressure assessments)	ASPT^a, BMWP^a, WHPT^a, PSI^a, EPSI^a, LIFE^a, Flow-T, DEHLI, EPT, log₁₀(SEL_EPTD +1), 1-GOLD, RCI, ACI, SBCI, IBMR	Macroinvertebrates, Macrophytes, other biological groups
Functional trait-based indices	Functional Richness, Diversity, Redundancy, Evenness and Dispersion, Community trait specialization index, Taxon Specialization Index, Community Weighted Mean	All biological groups

DOI: 10.7717/peerj.14183/table-2

Notes:

aThese indices can be calculated using the composite family approach that aggregates families having similar ecological requirements. Moreover, biomonitoR provides, by default, four sets of sensitivity scores for ASPT and BMWP (two developed UK, one in Italy and one in Spain) and two sets of sensitivity scores for LIFE (developed in UK).

Figure 2: Example of Sankey plot application (function plot_indicator_taxa) showing indicator taxa for an aquatic insect community in intermittent and perennial river sites.
Orders are listed on the left and families/genera on the right.

Download full-size image

DOI: 10.7717/peerj.14183/fig-2

Results & Discussion

Package installation and basic usage example

The biomonitoR package (version 0.9.3) is available on GitHub (https://github.com/alexology/biomonitoR). The package can be installed as follows:

library(devtools)

install_github(“alexology/biomonitoR”, ref = “main”, build_vignettes = TRUE)

An R Shiny implementation (Chang et al., 2021), as interactive app of the package, is available from the R console running the command line:

library(shiny)

runGitHub(“biomonitoR_app”, “TommasoCanc”, ref = “main”, subdir = “biomonitoR_app”)

as well as in the webpage (https://tcanceco.shinyapps.io/biomonitoR_app/).

These complementary options allow working with a user-friendly graphical interface that will benefit different audiences (either familiar or not with R programming language), thus making biomonitoR widely accessible and interactive.

Usage example

# Load the package

library(biomonitoR)

# Load one of the built-in macroinvertebrate datasets (mi_prin)

data(mi_prin)

# Details about the dataset

?mi_prin

# Import data in the biomonitoR format

data_asb <- as_biomonitor(mi_prin, group = “mi”)

# Inspect the overall community structure by plotting the object

plot(data_asb)

# Aggregate taxa at different taxonomic levels

data_agg <- aggregate_taxa(data_asb)

# Calculate a taxonomic index

# e.g., Shannon index at taxa and family level

shannon(data_agg, tax_lev = “Taxa”)

shannon(data_agg, tax_lev = “Family”)

# Calculate a biomonitoring index

# e.g., ASPT index with function aspt (see ?aspt for details)

aspt(data_agg)

Detailed scripts for running further analyses (e.g., data manipulation, biomonitoring indices calculations, function customization, plots) are provided in SM2-3-4. The package is being regularly updated and we encourage interested users to look for the latest version in GitHub and to report any suggestion for further development.

Limitations and perspectives

Despite its flexibility, biomonitoR may presents some punctual limitations. The reference dataset needs to be implemented according to the taxonomic levels that are currently available in the package. This can be an issue in the occasional cases where species-groups, life stage information or operational taxonomic units are needed. With minor adjustments (e.g., considering species groups as species) their use is still possible with the current implementation. Moreover, according to Tables 1–2, one of the main biomonitoR assets (i.e., multiple index calculation) is biased towards the biomonitoring of running waters. This is due to the unparalleled pressures affecting freshwater biodiversity worldwide and the long history of biological monitoring in rivers, that date back to the beginning of the 20th century (Bonada et al., 2006). Nonetheless, a detailed biomonitoring toolbox, not constrained at local scale, is still missing (see Table 1) and biomonitoR aspires to fill this gap. It should be noted that, despite the primary focus at European level, the package can potentially achieve a more international perspective after further customizations. Most of the river biomonitoring metrics tested in Africa, South-Central America, Mexico, Australia, and Asia are, in fact, directly related to the Biological Monitoring Working Party scoring systems (BMWP/ASPT, Aschalew & Moog, 2015; Eriksen et al., 2021; Feio et al., 2021) originally developed in the UK and already available in biomonitoR (Table 2).

Regardless of the primary freshwater focus, the package can instantly manage biological data from other domains (e.g., marine, terrestrial) and communities (e.g., non-aquatic plants, vertebrates) if a reference dataset is uploaded by the users or specifically built from online sources with the internal functions. In addition, general-purpose indices for biodiversity analysis and assessment (e.g., taxonomic and functional measures) can be already computed.

Conclusions

Rigorous and updated biomonitoring procedures and tools are crucial for biodiversity conservation research (e.g., early warning, long-term assessment, and observatories) and to properly inform science-based policies. Here we presented biomonitoR, a new R tool that creates the basis for a common environment for dataset management that allows users to be in an advantageous position in research and applied fields related to environmental science.

The set of functions, available in a single open access package, support task automation and greatly facilitate repeatability and reproducibility among studies. Therefore, biomonitoR may also ensure greater consistency among biomonitoring programs which could lead to more far-reaching analyses at spatial and temporal scales.

The package represents a flexible toolbox for (i) checking taxonomic, sensitivity and traits information against a reference taxonomic database; (ii) managing heterogeneous taxonomic resolution and computing indices at multiple taxonomic levels; (iii) calculating metrics and biological indices for both general and stressor-specific ecological assessment; (iv) interpreting results through interactive data visualization; (v) working with a user-friendly web application directly available in R.

Overall, biomonitoR enables high-performing options for better management and harmonization of ecological and biomonitoring data while facilitating the interpretation and dissemination of results. Therefore, the package has significant value in decision-making processes and benefits the wide biomonitoring community.

Supplemental Information

R file with usage examples

Example code for community data analyses with the biomonitoR package

DOI: 10.7717/peerj.14183/supp-1

Download

Reference datasets in biomonitoR and list of the main metrics available in biomonitoR

DOI: 10.7717/peerj.14183/supp-2

Download

Default plots in biomonitoR: Examples and code

DOI: 10.7717/peerj.14183/supp-3

Download

[1] Arbačiauskas K, Semenchenko V, Grabowski M, Leuven RSEW, Paunović M, Son MO, Csányi B, Gumuliauskaite S, Konopacka A, Nehring S, Van der Velde G, Vezhnovetz V, Panov VE. 2008. Assessment of biocontamination of benthic macroinvertebrate communities in European inland waterways. Aquatic Invasions 3(2):211-230

[2] Aschalew L, Moog O. 2015. Benthic macroinvertebrates based new biotic score ETHbios for assessing ecological conditions of highland streams and rivers in Ethiopia. Limnologica 52:11-19

[3] Birk S, Bonne W, Borja A, Brucet S, Courrat A, Poikane S, Solimini A, Van de Bund W, Zampoukas N, Hering D. 2012. Three hundred ways to assess Europe’s surface waters: an almost complete overview of biological methods to implement the Water Framework Directive. Ecological Indicators 18:31-41

[4] Bonada N, Prat N, Resh VH, Statzner B. 2006. Developments in aquatic insects biomonitoring: a comparative analysis of recent approaches. Annual Review of Entomology 51(1):495-523

[5] Bruno D, Gutiérrez-Cánovas C, Velasco J, Sánchez-Fernández D. 2016. Functional redundancy as a tool for bioassessment: a test using riparian vegetation. Science of the Total Environment 566–567:1268-1276

[6] Buss DF, Carlisle DM, Chon T-S, Culp J, Harding JS, Keizer-Vlek HE, Robinson WA, Strachan S, Thirion C, Hughes RM. 2015. Stream biomonitoring using macroinvertebrates around the globe: a comparison of large-scale programs. Environmental Monitoring and Assessment 187(1):4132

[7] Cadotte MW, Carscadden K, Mirotchnick N. 2011. Beyond species: functional diversity and the maintenance of ecological processes and services. Journal of Applied Ecology 48(5):1079-1087

[8] Chadd RP, England JA, Constable D, Dunbar MJ, Extence CA, Leeming DJ, Murray-Bligh JA, Wood PJ. 2017. An index to track the ecological effects of drought development and recovery on riverine invertebrate communities. Ecological Indicators 82:344-356

[9] Chang W, Cheng J, Allaire JJ, Sievert C, Schloerke B, Xie Y, Allen J, McPherson J, Dipert A, Borges B. 2021. Shiny: web application framework for R. R package version 1.6.0 software

[10] De Cáceres MD, Legendre P. 2009. Associations between species and groups of sites: indices and statistical inference. Ecology 90(12):3566-3574

[11] Dray S, Dufour A. 2007. The ade4 package: implementing the duality diagram for ecologists. Journal of Statistical Software 22(4):1-20

[12] Eriksen TE, Brittain JE, Søli G, Jacobsen D, Goethals P, Friberg N. 2021. A global perspective on the application of riverine macroinvertebrates as biological indicators in Africa, South-Central America, Mexico and Southern Asia. Ecological Indicators 126:107609

[13] Feio MJ, Hughes RM, Callisto M, Nichols SJ, Odume ON, Quintella BR, Kuemmerlen M, Aguiar FC, Almeida SF, Alonso-EguíaLis P, Arimoro FO, Dyer FJ, Harding JS, Jang S, Kaufmann PR, Lee S, Li J, Macedo DR, Mendes A, Mercado-Silva N, Monk W, Nakamura K, Ndiritu GG, Ogden R, Peat M, Reynoldson TB, Rios-Touma B, Segurado P, Yates AG. 2021. The biological assessment and rehabilitation of the world’s rivers: an overview. Water 13(3):371

[14] Friberg N. 2014. Impacts and indicators of change in lotic ecosystems. WIREs Water 1(6):513-531

[15] Froese R, Pauly D. 2019. FishBase. World Wide Web electronic publication.

[16] Goldsmith FB. 1991. Monitoring for conservation and ecology. London: Springer Science & Business Media. 296

[17] Grenié M, Berti E, Carvajal-Quintero J, Dädlow GML, Sagouis A, Winter M. 2022. Harmonizing taxon names in biodiversity data: a review of tools, databases and best practices. Methods in Ecology and Evolution

[18] Guareschi S, Wood PJ. 2019. Taxonomic changes and non-native species: an overview of constraints and new challenges for macroinvertebrate-based indices calculation in river ecosystems. Science of the Total Environment 660:40-46

[19] Guiry MD, Guiry GM, Morrison L, Rindi F, Miranda SV, Mathieson AC, Parker BC, Langangen A, John DM, Bárbara I, Carter CF, Kuipers P, Garbary DJ. 2014. AlgaeBase: an on-line resource for algae. Cryptogamie, Algologie 35:105-115

[20] Hawkes HA. 1998. Origin and development of the biological monitoring working party score system. Water Research 32(3):964-968

[21] Jackson MC, Weyl OLF, Altermatt F, Durance I, Friberg N, Dumbrell AJ, Piggott JJ, Tiegs SD, Tockner K, Krug CB, Leadley PW, Woodward G. 2016. Recommendations for the next generation of global freshwater biological monitoring tools. Advances in Ecological Research 55:615-636

[22] Jones FC. 2008. Taxonomic sufficiency: the influence of taxonomic resolution on freshwater bioassessments using benthic macroinvertebrates. Environmental Reviews 16:45-69

[23] Laini A, Burgazzi G, Chadd R, England J, Tziortzis I, Ventrucci M, Vezza P, Viaroli P, Wood PJ, Guareschi S. 2022. Using invertebrate functional traits to improve flow variability assessment within European rivers. Science of the Total Environment 832:155047

[24] Laliberté E, Legendre P. 2010. A distance-based framework for measuring functional diversity from multiple traits. Ecology 91:299-305

[25] Legendre P, Legendre L. 2012. Numerical ecology (III Edition). Amsterdam: Elsevier. 1006

[26] Maire E, Grenouillet G, Brosse S, Villéger S. 2015. How many dimensions are needed to accurately assess functional diversity? A pragmatic approach for assessing the quality of functional spaces. Global Ecology and Biogeography 24(6):728-740

[27] Mondy CP, Villeneuve B, Archaimbault V, Usseglio-Polatera P. 2012. A new macroinvertebrate-based multimetric index (I2M2) to evaluate ecological quality of French wadeable streams fulfilling the WFD demands: a taxonomical and trait approach. Ecological Indicators 18:452-467

[28] Patterson DJ, Cooper J, Kirk PM, Pyle RL, Remsen DP. 2010. Names are key to the big new biology. Trends in Ecology & Evolution 25(12):686-691

[29] R Core Team. 2021. A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. software

[30] Reid AJ, Carlson AK, Creed IF, Eliason EJ, Gell PA, Johnson PTJ, Kidd KA, MacCormack TJ, Olden JD, Ormerod SJ, Smol JP, Taylor WW, Tockner K, Vermaire JC, Dudgeon D, Cooke SJ. 2019. Emerging threats and persistent conservation challenges for freshwater biodiversity. Biological Reviews 94(3):849-873

[31] Rimet F, Gusev E, Kahlert M, Kelly M, Kulikovskiy M, Maltsev Y, Mann D, Pfannkuchen M, Trobajo R, Vasselon V, Bouchez A. 2019. Diat.barcode, an open-access barcode library for diatoms: new insights. Database release version 7 software

[32] Schmidt-Kloiber A, Hering D. 2015. www.freshwaterecology.info—an online tool that unifies, standardises and codifies more than 20,000 European freshwater organisms and their ecological preferences. Ecological Indicators 53:271-282