Automating the calculation of the ad hoc distance thresholds for DNA barcoding identification using two R functions
Authors
Sonet, G.
Jordaens, K.
Nagi, ZT.
Breman, F.
De Meyer, M.
Backeljau, T.
Virgilio, M.
Discipline
Biological sciences
Subject
Invertebrates
Audience
Scientific
Date
2013Metadata
Show full item recordDescription
The probability of erroneous DNA barcoding identification is higher when identification is based on a large distance between the query (the barcode sequence of the specimen to identify) and its best match in a reference barcode library. Yet, the number of these incorrect identifications can be reduced by setting a distance threshold above which identification is considered unreliable. To this end, it has been proposed to use an ad hoc distance threshold producing identifications with a user-defined error probability (e.g. 5%). Here, we propose two R scripts to automate the calculation of ad hoc distance thresholds for reference libraries of DNA barcodes. Both scripts, a user manual and an example file are freely available upon request on the JEMU website (http://jemu.myspecies.info).
Citation
Sonet, G.; Jordaens, K.; Nagi, ZT.; Breman, F.; De Meyer, M.; Backeljau, T.; Virgilio, M. (2013). Automating the calculation of the ad hoc distance thresholds for DNA barcoding identification using two R functions. , 5th International Barcode of Life Conference (Kunming, P.R. China), 174,Identifiers
Type
Conference
Peer-Review
No
Language
eng