Provides a 'tm' Source to create corpora from articles exported from the 'LexisNexis' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages). Note that the file format is highly unstable: there is no warranty that this package will work for your corpus, and you may have to adjust the code to adapt it to your particular format.
Version: | 1.4.1 |
Imports: | utils, NLP, tm (≥ 0.6), xml2, ISOcodes |
Published: | 2019-10-19 |
DOI: | 10.32614/CRAN.package.tm.plugin.lexisnexis |
Author: | Milan Bouchet-Valat [aut, cre], Tom Nicholls [ctb] |
Maintainer: | Milan Bouchet-Valat <nalimilan at club.fr> |
BugReports: | https://github.com/nalimilan/R.TeMiS/issues |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: | https://github.com/nalimilan/R.TeMiS |
NeedsCompilation: | no |
Materials: | NEWS |
In views: | NaturalLanguageProcessing |
CRAN checks: | tm.plugin.lexisnexis results |
Reference manual: | tm.plugin.lexisnexis.pdf |
Package source: | tm.plugin.lexisnexis_1.4.1.tar.gz |
Windows binaries: | r-devel: tm.plugin.lexisnexis_1.4.1.zip, r-release: tm.plugin.lexisnexis_1.4.1.zip, r-oldrel: tm.plugin.lexisnexis_1.4.1.zip |
macOS binaries: | r-release (arm64): tm.plugin.lexisnexis_1.4.1.tgz, r-oldrel (arm64): tm.plugin.lexisnexis_1.4.1.tgz, r-release (x86_64): tm.plugin.lexisnexis_1.4.1.tgz, r-oldrel (x86_64): tm.plugin.lexisnexis_1.4.1.tgz |
Old sources: | tm.plugin.lexisnexis archive |
Reverse imports: | R.temis |
Reverse suggests: | RcmdrPlugin.temis |
Please use the canonical form https://CRAN.R-project.org/package=tm.plugin.lexisnexis to link to this page.