Introduction

Ever wondered whether your favourite preprint has been published, or needed to link a publication to its preprint? Then {doilinker} can help! It implements a method developed by Cabanac, Oikonomidi and Boutron that matches preprints and publications using metadata retrieved from Crossref.

Installation

The package can be installed from GitHub:

# install.packages("remotes")
remotes::install_github("lazappi/doilinker")

Usage

First, load the package:

library(doilinker)

Then search for records linked to a DOI. In this example the query is a preprint DOI, so the results are candidate publications:

results <- search_doi_links("10.1101/133173")
#> Retrieving query data from Crossref...
#> Decided query is a preprint
#> Retrieving results data from Crossref...
#> Found 20 results
#> Checking results for matches...
#> Done!
head(results)
#> # A tibble: 6 × 39
#>   alternative.id container.title            created    deposited published.print
#>   <chr>          <chr>                      <chr>      <chr>     <chr>          
#> 1 1305           Genome Biology             2017-09-12 2023-08-… 2017-12        
#> 2 NA             PLOS Computational Biology 2018-06-25 2023-09-… NA             
#> 3 2546           Genome Biology             2021-12-15 2023-11-… 2021-12        
#> 4 2388           Genome Biology             2021-06-08 2021-06-… 2021-12        
#> 5 NA             Bioinformatics             2022-08-25 2022-10-… 2022-10-14     
#> 6 NA             Development                2019-07-11 2021-04-… 2019-07-01     
#> # ℹ 34 more variables: published.online <chr>, doi <chr>, indexed <chr>,
#> #   issn <chr>, issue <chr>, issued <chr>, member <chr>, prefix <chr>,
#> #   publisher <chr>, score <chr>, source <chr>, reference.count <chr>,
#> #   references.count <chr>, is.referenced.by.count <chr>, title <chr>,
#> #   type <chr>, update.policy <chr>, url <chr>, volume <chr>, language <chr>,
#> #   short.container.title <chr>, author <list>, link <list>, reference <list>,
#> #   page <chr>, subject <chr>, funder <list>, update_to <list>, …

The results contain information retrieved from Crossref. Most of the columns are self-explanatory but a few to be aware of are:

  • score - The query score returned by Crossref. It is not used for matching but can be a useful sanity check.
  • match - Whether each result is a match according to the Cabanac, Oikonomidi and Boutron method.

Please refer to the Crossref documentation for more details on the other fields.
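
As a rough illustration of working with these two columns, the sketch below uses a small, invented data frame in place of real results. The DOIs and scores are made up, and the match column is assumed to be logical; only the column names come from the output above. Note that score is stored as character, so it is converted before sorting.

```r
# Toy stand-in for the tibble returned by search_doi_links().
# All values are invented for illustration only.
results <- data.frame(
  doi   = c("10.0000/example.a", "10.0000/example.b", "10.0000/example.c"),
  score = c("41.2", "87.5", "63.0"),  # character, as in the printed output
  match = c(FALSE, TRUE, FALSE),      # assumed to be a logical column
  stringsAsFactors = FALSE
)

# Convert the score to numeric; sorting it as text would give misleading results
results$score <- as.numeric(results$score)

# Keep only the rows flagged as matches (which() also drops any NA values)
matched <- results[which(results$match), ]

# Rank every candidate from highest to lowest Crossref score
ranked <- results[order(results$score, decreasing = TRUE), ]

print(matched$doi)
print(ranked$doi)
```

Checking the ranked scores next to the match column is a quick way to see whether the matched record is also the one Crossref considers most relevant.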

Parameters

By default, all results retrieved from Crossref are returned. Set filter_matches = TRUE to keep only the matches, and use the limit parameter to change the number of results returned.
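
A minimal sketch of both parameters, reusing the DOI from the example above. The value 50 is an arbitrary choice for illustration, and the guard is only there so the snippet does nothing when the package is not installed. Both calls query Crossref, so they need a network connection.

```r
# Only run when doilinker is installed; these calls query Crossref over the network
if (requireNamespace("doilinker", quietly = TRUE)) {
  # Keep only the results flagged as matches
  matches <- doilinker::search_doi_links("10.1101/133173", filter_matches = TRUE)

  # Change the number of results returned (50 is an arbitrary example value)
  more_results <- doilinker::search_doi_links("10.1101/133173", limit = 50)
}
```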

Citation

If you use this package, please cite the original publication for the linking method. Citing the package directly as well would be great 😉 but crediting the original authors is the important part:

Cabanac G, Oikonomidi T, Boutron I. “Day-to-day discovery of preprint-publication links.” Scientometrics. 2021;1–20. DOI: 10.1007/s11192-021-03900-7

@ARTICLE{Cabanac2021-ge,
  title    = "Day-to-day discovery of preprint-publication links",
  author   = "Cabanac, Guillaume and Oikonomidi, Theodora and Boutron, Isabelle",
  journal  = "Scientometrics",
  pages    = "1--20",
  month    =  apr,
  year     =  2021,
  language = "en",
  doi      = "10.1007/s11192-021-03900-7"
}