tidypmc: Parse Full Text XML Documents from PubMed Central

Parse XML documents from the Open Access subset of Europe PubMed Central <https://europepmc.org> including section paragraphs, tables, captions and references.

Version: 2.0
Imports: xml2, tokenizers, stringr, tibble, dplyr, readr
Suggests: europepmc, tidytext, rmarkdown, knitr, testthat, covr
Published: 2024-08-27
DOI: 10.32614/CRAN.package.tidypmc
Author: Chris Stubben [aut, cre]
Maintainer: Chris Stubben <chris.stubben at hci.utah.edu>
BugReports: https://github.com/ropensci/tidypmc/issues
License: GPL-3
URL: https://github.com/ropensci/tidypmc
NeedsCompilation: no
Materials: NEWS
CRAN checks: tidypmc results

Documentation:

Reference manual: tidypmc.pdf
Vignettes: Parse PMC FTP files (source, R code)
Introduction to tidypmc (source, R code)

Downloads:

Package source: tidypmc_2.0.tar.gz
Windows binaries: r-devel: tidypmc_2.0.zip, r-release: tidypmc_2.0.zip, r-oldrel: tidypmc_2.0.zip
macOS binaries: r-release (arm64): tidypmc_2.0.tgz, r-oldrel (arm64): tidypmc_2.0.tgz, r-release (x86_64): tidypmc_2.0.tgz, r-oldrel (x86_64): tidypmc_2.0.tgz
Old sources: tidypmc archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidypmc to link to this page.