CaecilianTraits, an individual level trait database of Caecilians worldwide

CaecilianTraits, an individual level trait database of Caecilians worldwide

Background & Summary

Biodiversity is a multidimensional concept that extends beyond species richness alone1,2. It has become increasingly popular to study biodiversity based on functional traits3. These studies have tested hypotheses and yielded important conclusions in the formation of biodiversity4, biodiversity conservation5,6, community ecology7, species coexistence8 and ecosystem stability8,9. Functional trait variation mirrors different adaptive capacities among species. However, differences in traits exist not only among species but also among individuals10. Variations in traits among individuals shape differences in their abilities to adapt to environments and interact with other species or individuals, thereby influencing their survival. Infraspecific variations in functional traits are also related to the ability to adapt environment changes and various disturbances. Considering the climatic changes and habitat alterations caused by human activities, it is urgently important to understand trait variation at the individual level11. Individual trait-based approaches have been applied in studies of plant ecology10,12. However, most zoological databases provide data only at the species level13,14. Meanwhile, databases providing continuous morphological traits are rare15,16. High-resolution continuous data can provide fine-grained resolution about the ecological roles of species17,18. Furthermore, discrete data or functional groupings may be difficult to capture differences at the individual level. Instead, continuous measurement data can better capture variations in adaptive capabilities among individuals.

The order Gymnophiona of Amphibia, commonly known as caecilians, is a very ancient19 and unique group of amphibians20. The morphology, habits and life history of caecilians are very different from those of other amphibian species20,21,22. With the exception of a few aquatic or semi-aquatic species, they are predominantly fossorial, inhabiting subterranean habitats or forest leaf litter23. Consequently, they are among the least understood groups of amphibians24. A comprehensive database of morphological traits will help us to better understand this taxon. Moreover, trait database is also important for biodiversity conservation of caecilians. The biodiversity of caecilian may already be under severe threat, but we still lack knowledge about their biodiversity and threats they face24. Currently, 222 species of caecilians have been described25,26. Among the 202 species listed in the IUCN Red List, 91 are classified as Data Deficient27. Trait-based predictions of threat levels are widely used across taxa28,29. A comprehensive database will facilitate forecasting the potential threats faced by caecilians. Additionally, we do not yet have a comprehensive understanding of amphibian biodiversity30,31. On average, about 153 new amphibian species were descripted each year in last decade25,26. For Gymnophiona, 67 species were descripted since 2000, accounting about 30% of all species. Accessible and comprehensive morphological traits databases will also facilitate taxonomists in exploring the biodiversity of caecilians.

Based on amounts of literature32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99, we compiled the most comprehensive morphological traits database of caecilians at the individual level100. Our primary source of data was species description literature, as morphological traits were descripted in detail in such literature. Our database encompasses 218 out of 222 species across the order Gymnophiona (Fig. 1). Among these species, 72.94% have morphological traits recorded for multiple individuals (Fig. 1). We calculated the standard deviation to represent the degree of intraspecific variation for the most complete trait. The infraspecific variation supported the importance of considering individual level data in ecological or evolutionary studies. At the same time, we explored the relationship between standard deviation and sample size. The scatter plot shows a triangular pattern, indicating that there is no clear relationship between sample size and standard deviation, but when the sample size is small, a few species may exhibit very large standard deviations (Fig. 2). Therefore, we recommended increasing the sample size if it was possible to reduce the impact of random errors. We hope that this database will facilitate macroecological and macroevolutionary studies of this interesting and important taxon.

Fig. 1
CaecilianTraits, an individual level trait database of Caecilians worldwide

Number of data records and traits of each genus.

Full size image
Fig. 2
figure 2

(a) Data completeness for each trait in the individual-level dataset; (b) Data completeness for each trait in the species-level dataset; (c)The scatter plot shows the relationship between the standard deviation of total length and the sample size. Descriptions of the traits are in Supplementary Table 1.

Full size image

Methods

We organised the literature of all species based on taxonomic databases25,26. Due to numerous taxonomic revisions, a species may have multiple synonyms, which made searching literature using the species name directly difficult to implement. For the taxonomic system, we considered both Amphibian Species of the World25 and AmphibiaWeb26. Whenever possible, we recorded data for every specimen for each species. In cases where species description work did not provide measurements for each specimen, we recorded the mean and standard deviation of the traits. When mean values of traits were available and the holotype specimen was also descripted, we recorded both sets of data. We included as many morphological traits as possible to offer researchers more options when using the database (Fig. 2 and Supplementary Table 1). This approach also made our database more useful to other researchers, such as taxonomists.

To facilitate species level ecological and evolutionary studies, we also compiled morphological, ecological, and life history data at the species level. Based on species descriptions from the IUCN Red List and AmphibiaWeb, we collected information about habitat, microhabitat, mode of reproduction (oviparous or viviparous), mode of development (presence or absence of a water-dependent larval stage), and clutch size. For species not listed in IUCN Red List and AmphibiaWeb, we referred to field records in the original descriptions and other natural history records to compile the data. Additionally, we organized the biome101 for species based on distribution data from IUCN Red List. Biomes comprising less than 5% of the species range were not considered. For species without distribution data in the IUCN Red List, we extracted the biome information using distribution data collected from GBIF (https://www.gbif.org/) and original species description literature. We integrated these data with morphometric data. For the morphometric data, we primarily used individual level data to calculate averages. However, when population level data had a larger sample size than individual level data in our database, we chose the population level data to represent the species’ traits.

Data Records

The data100 table was organized in an excel file. The first sheet recorded the description of the data. The second sheet was the individual level dataset. The first column recorded the references. The second, third, and fourth columns recorded the taxonomy of the species, which included family, genus and binomial species name. The fifth column recorded the type the data, which were individual-level data (I), population-level data (P) or species-level data (S). The sixth column recorded the specimen number. For population level data, this column recorded the information on sampling locations. The seventh column recorded the sex information, M indicated male, F indicated female and J indicated juvenile. The subsequent columns recorded the measurements (Fig. 2). Descriptions of each trait were also listed in Supplementary Table 1. The third sheet was the species-level dataset. For the species-level dataset, the first three columns recorded the taxonomy of the species. The next columns recorded biome, habitats, microhabitats, metadata of microhabitats, reproduction, presence or absence of aquatic larvae, clutch size, metadata of reproduction, and references. The subsequent columns recorded the species level morphometric data.

For the individual-level dataset, a total of 1523 data entries were recorded in this dataset, including 1482 individual level data entries for 215 species, 35 population level data entries for 6 species, and 6 species level data entries for 4 species. Among the species level data, two species had data for both males and females. For the individual level data, the highest data completeness was 60.7%, with an average completeness of 26.11%, meaning that each record contained an average of 14.62 traits. For the population level data, the average data completeness was 33.0%. There were ten traits that covered more than 70% of the species. Among the individual level data, eight traits had a coverage rate exceeding 70%, with the highest coverage still being total length at 99.3% (Supplementary Table 1). This trait is also the most commonly used trait in ecological studies.

For the species-level dataset, habitats have the highest completeness at 96% (Supplementary Table 1). The completeness of biome and microhabitats is also high, both exceeding 90%. In contrast, the completeness for reproductive mode and whether larvae depend on water is relatively low, at 68% and 59%, respectively (Supplementary Table 1). The completeness for traits about clutch size are the lowest, with the highest completeness for maximum clutch size being only 7%. For measurement data, the completeness of most traits is slightly higher than that of the individual-level dataset (Supplementary Table 1). The highest coverage was for total length, which covered 98% of the species. We also calculated the data completeness by family (Supplementary Table 2). For the data with the highest completeness, total length, families with a larger number of species, such as Caeciliidae and Ichthyophiidae, are more likely to have missing data.

Compared to other databases16, our database provides individual-level measurement data and covered the most species. To facilitate researchers interested in species-level information, we have also compiled species-level data. Notably, the microhabitat data in our dataset differs from those in other datasets102,103. Moreover, we observed differences in microhabitat types from different sources102,103,104,105. Therefore, we had provided metadata information to support future research.

Technical Validation

We employed two strategies to ensure the accuracy in the data included in the database. First, we used different methods, such as boxplots or frequency histograms, to detect potential mistakes. We created a boxplot for each trait to quickly identify the potential mistakes. For example, misplacement of decimal points was a common mistake during data collection, and this type of error could be quickly detected through boxplot visualization. If the trait of a record was identified as a potential mistake, we would recheck the data of that species. When creating boxplots, we used the mean values instead of raw values for species level and population level data. Second, we randomly selected 10% of the species and checked the original sources again. The database recorded a total of 218 species. We randomly selected 22 species and re-verified the data for these species.

Related Articles

Energy metabolism in health and diseases

Energy metabolism is indispensable for sustaining physiological functions in living organisms and assumes a pivotal role across physiological and pathological conditions. This review provides an extensive overview of advancements in energy metabolism research, elucidating critical pathways such as glycolysis, oxidative phosphorylation, fatty acid metabolism, and amino acid metabolism, along with their intricate regulatory mechanisms. The homeostatic balance of these processes is crucial; however, in pathological states such as neurodegenerative diseases, autoimmune disorders, and cancer, extensive metabolic reprogramming occurs, resulting in impaired glucose metabolism and mitochondrial dysfunction, which accelerate disease progression. Recent investigations into key regulatory pathways, including mechanistic target of rapamycin, sirtuins, and adenosine monophosphate-activated protein kinase, have considerably deepened our understanding of metabolic dysregulation and opened new avenues for therapeutic innovation. Emerging technologies, such as fluorescent probes, nano-biomaterials, and metabolomic analyses, promise substantial improvements in diagnostic precision. This review critically examines recent advancements and ongoing challenges in metabolism research, emphasizing its potential for precision diagnostics and personalized therapeutic interventions. Future studies should prioritize unraveling the regulatory mechanisms of energy metabolism and the dynamics of intercellular energy interactions. Integrating cutting-edge gene-editing technologies and multi-omics approaches, the development of multi-target pharmaceuticals in synergy with existing therapies such as immunotherapy and dietary interventions could enhance therapeutic efficacy. Personalized metabolic analysis is indispensable for crafting tailored treatment protocols, ultimately providing more accurate medical solutions for patients. This review aims to deepen the understanding and improve the application of energy metabolism to drive innovative diagnostic and therapeutic strategies.

Tissue macrophages: origin, heterogenity, biological functions, diseases and therapeutic targets

Macrophages are immune cells belonging to the mononuclear phagocyte system. They play crucial roles in immune defense, surveillance, and homeostasis. This review systematically discusses the types of hematopoietic progenitors that give rise to macrophages, including primitive hematopoietic progenitors, erythro-myeloid progenitors, and hematopoietic stem cells. These progenitors have distinct genetic backgrounds and developmental processes. Accordingly, macrophages exhibit complex and diverse functions in the body, including phagocytosis and clearance of cellular debris, antigen presentation, and immune response, regulation of inflammation and cytokine production, tissue remodeling and repair, and multi-level regulatory signaling pathways/crosstalk involved in homeostasis and physiology. Besides, tumor-associated macrophages are a key component of the TME, exhibiting both anti-tumor and pro-tumor properties. Furthermore, the functional status of macrophages is closely linked to the development of various diseases, including cancer, autoimmune disorders, cardiovascular disease, neurodegenerative diseases, metabolic conditions, and trauma. Targeting macrophages has emerged as a promising therapeutic strategy in these contexts. Clinical trials of macrophage-based targeted drugs, macrophage-based immunotherapies, and nanoparticle-based therapy were comprehensively summarized. Potential challenges and future directions in targeting macrophages have also been discussed. Overall, our review highlights the significance of this versatile immune cell in human health and disease, which is expected to inform future research and clinical practice.

Achieving at-scale seascape restoration by optimising cross-habitat facilitative processes

Cross-habitat facilitative processes can enhance seascape restoration outcomes but there is uncertainty around the spatial dependencies of these processes across habitats. We synthesised the influence of environmental parameters on six processes underpinning cross-habitat facilitation and identified the linear distances over which they operate between habitats. All six process types occur at distances commonly used in seascape restoration demonstrating how harnessing facilitation can scale-up restoration to meet national and international goals.

Advance in peptide-based drug development: delivery platforms, therapeutics and vaccines

The successful approval of peptide-based drugs can be attributed to a collaborative effort across multiple disciplines. The integration of novel drug design and synthesis techniques, display library technology, delivery systems, bioengineering advancements, and artificial intelligence have significantly expedited the development of groundbreaking peptide-based drugs, effectively addressing the obstacles associated with their character, such as the rapid clearance and degradation, necessitating subcutaneous injection leading to increasing patient discomfort, and ultimately advancing translational research efforts. Peptides are presently employed in the management and diagnosis of a diverse array of medical conditions, such as diabetes mellitus, weight loss, oncology, and rare diseases, and are additionally garnering interest in facilitating targeted drug delivery platforms and the advancement of peptide-based vaccines. This paper provides an overview of the present market and clinical trial progress of peptide-based therapeutics, delivery platforms, and vaccines. It examines the key areas of research in peptide-based drug development through a literature analysis and emphasizes the structural modification principles of peptide-based drugs, as well as the recent advancements in screening, design, and delivery technologies. The accelerated advancement in the development of novel peptide-based therapeutics, including peptide-drug complexes, new peptide-based vaccines, and innovative peptide-based diagnostic reagents, has the potential to promote the era of precise customization of disease therapeutic schedule.

The spatial coupling and its influencing mechanism between rural human-habitat heritage and key rural tourism villages in China

Exploring the influencing factors and its influencing mechanism of the spatial coupling between rural human-habitat heritage (RHH) and key rural tourism villages (RTV) at county scale from the perspective of space can expand the theoretical research on the spatial coupling mechanism between RHH and RTV, and further provide theoretical reference and data support for the coordinated development and high-quality development of RHH and RTV in China. At the same time, previous studies have failed to systematically analyze the influencing factors and influencing mechanisms of the spatial coupling between RHH and RTV at the county scale, which restricted decision makers from formulating coordinated development measures between RHH and RTV at the macro level. In this study, bivariate spatial autocorrelation model and spatial coupling coordination model were used to quantitatively analyze the spatial coupling level between RHH and RTV at the county scale in China. Then, the linear regression (OLS) model, geographically weighted regression (GWR) model, and optimal parameter GeoDetector (OPGD) model were integrated to systematically analyze the linear influencing, spatial heterogeneity effect and interactive effect of natural environment and socioeconomic factors on the spatial coupling level between RHH and RTV in China, and explore the interactive influencing mechanism. The results show that the spatial coupling level of RHH and RTV in China show a significant east-west differentiation. There were 2024, 473, 293, 55 and 6 areas of severe, moderate, mild, basic and moderate coordination between RHH and RTV, respectively. Among them, severe and moderate discoordination areas are mainly distributed in Northeast China, arid and semi-arid areas in Western China, plateau areas in Southwest China, densely populated urban agglomerations and plains agricultural areas in the Middle East China. Mild discoordination areas and basic and moderate coordination areas are mainly located in transition zones in mountainous and plain areas, economically developed mountainous and hilly counties along the southeastern coast, and coastal tourist cities. Economic and population factors are the fundamental factors that affect the spatial coupling between RHH and RTV. Rural tourism facilities and rural public service facilities are important external driving forces for the coupling development of RHH and RTV, and Sociocultural environment factors are the important internal driving forces. Different surface forms, different climate conditions and different ecological environment conditions can form different natural textures and spatial organizations. Suitable climate conditions, sufficient water sources and ecological environment conditions can form more suitable rural settlement construction conditions and production and living conditions, and ultimately affect the protection and activation of rural human settlement heritage and the development and layout of key tourist villages. The spatial coupling relationship between RHH and RTV is the result of the complex interaction between the natural directivity law caused by natural environmental factors and the humanistic directivity law caused by human social and economic activities.

Responses

Your email address will not be published. Required fields are marked *