
Systematic reviews involve searching multiple bibliographic databases to identify eligible studies. As this type of evidence synthesis is increasingly pursued, the use of various electronic platforms can help researchers improve the efficiency and quality of their research. We examined the accuracy and efficiency of commonly used electronic methods for flagging and removing duplicate references during this process.

Methods: A heterogeneous sample of references was obtained by conducting a similar topical search in MEDLINE, Embase, Cochrane Central Register of Controlled Trials, and PsycINFO databases. References were de-duplicated via manual abstraction to create a benchmark set. The default settings were then used in Ovid multifile search, EndNote desktop, Mendeley, Zotero, Covidence, and Rayyan to de-duplicate the sample of references independently. Using the benchmark set as reference, the number of false-negative and false-positive duplicate references for each method was identified, and accuracy, sensitivity, and specificity were determined.
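
The comparison against the benchmark set described in this section can be illustrated with a minimal sketch. It assumes that a "positive" is a reference flagged as a duplicate, so a false negative is a benchmark duplicate the tool missed and a false positive is a unique reference the tool wrongly flagged. The function name `evaluate_dedup`, the `DedupMetrics` container, and the reference IDs are hypothetical and are not taken from the study; the counts are invented for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DedupMetrics:
    accuracy: float
    sensitivity: float
    specificity: float


def evaluate_dedup(all_ids, benchmark_duplicates, flagged_duplicates):
    """Compare a tool's flagged duplicates with a manually built benchmark.

    all_ids: every reference ID in the sample.
    benchmark_duplicates: IDs the manual benchmark marks as duplicates.
    flagged_duplicates: IDs the tool under test marked as duplicates.
    """
    all_ids = set(all_ids)
    if not all_ids:
        raise ValueError("the reference sample is empty")
    benchmark = set(benchmark_duplicates) & all_ids
    flagged = set(flagged_duplicates) & all_ids

    tp = len(flagged & benchmark)            # duplicates correctly flagged
    fp = len(flagged - benchmark)            # unique references wrongly flagged
    fn = len(benchmark - flagged)            # duplicates the tool missed
    tn = len(all_ids - benchmark - flagged)  # unique references correctly kept

    nan = float("nan")
    return DedupMetrics(
        accuracy=(tp + tn) / len(all_ids),
        sensitivity=tp / (tp + fn) if (tp + fn) else nan,
        specificity=tn / (tn + fp) if (tn + fp) else nan,
    )


# Illustrative data only: 10 references, 4 true duplicates in the benchmark.
metrics = evaluate_dedup(
    all_ids=range(1, 11),
    benchmark_duplicates={2, 5, 7, 9},
    flagged_duplicates={2, 5, 7, 10},
)
print(metrics)
```

In this made-up example the tool misses one duplicate (ID 9) and wrongly flags one unique reference (ID 10), giving an accuracy of 0.8, a sensitivity of 0.75, and a specificity of 5/6.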
