Data Leakage and Loss in Biodiversity Informatics

dc.contributor.authorPeterson, T.A.
dc.contributor.authorAsase, A.
dc.contributor.authorCanhos, D.A.L.
dc.contributor.authorde Souza, S.
dc.contributor.authorWieczorek, J.
dc.date.accessioned2019-07-26T10:00:01Z
dc.date.available2019-07-26T10:00:01Z
dc.date.issued2018-11
dc.description.abstractThe field of biodiversity informatics is in a massive, “grow-out” phase of creating and enabling large-scale biodiversity data resources. Because perhaps 90% of existing biodiversity data nonetheless remains unavailable for science and policy applications, the question arises as to how these existing and available data records can be mobilized most efficiently and effectively. This situation led to our analysis of several large-scale biodiversity datasets regarding birds and plants, detecting information gaps and documenting data “leakage” or attrition, in terms of data on taxon, time, and place, in each data record. We documented significant data leakage in each data dimension in each dataset. That is, significant numbers of data records are lacking crucial information in terms of taxon, time, and/or place; information on place was consistently the least complete, such that geographic referencing presently represents the most significant factor in degradation of usability of information from biodiversity information resources. Although the full process of digital capture, quality control, and enrichment is important to developing a complete digital record of existing biodiversity information, payoffs in terms of immediate data usability will be greatest with attention paid to the georeferencing challenge.en_US
dc.identifier.otherDOI: 10.3897/BDJ.6.e26826
dc.identifier.urihttp://ugspace.ug.edu.gh/handle/123456789/31794
dc.language.isoenen_US
dc.publisherBiodiversity Data Journalen_US
dc.subjectBiodiversity dataen_US
dc.subjectDigitizationen_US
dc.subjectFitness for useen_US
dc.subjectGeographic referencingen_US
dc.subjectGeoreferencingen_US
dc.subjectInformaticsen_US
dc.subjectPlaceen_US
dc.subjectTaxonen_US
dc.subjectTimeen_US
dc.subjectUsabilityen_US
dc.titleData Leakage and Loss in Biodiversity Informaticsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Data Leakage and Loss in Biodiversity Informatics.pdf
Size:
518.85 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.6 KB
Format:
Item-specific license agreed upon to submission
Description: