Efforts are made to make Chinese text mining easier, faster, and robust to errors. Document term matrix can be generated by only one line of code; detecting encoding, segmenting and removing stop words are done automatically. Some convenient tools are also supplied.
Version: | 0.2.3 |
Depends: | R (≥ 3.6.0) |
Imports: | jiebaR, NLP, tm (≥ 0.7), stringi, slam (≥ 0.1-37), Matrix, purrr |
Published: | 2020-09-11 |
DOI: | 10.32614/CRAN.package.chinese.misc |
Author: | Jiang Wu [aut, cre] (from Capital Normal University) |
Maintainer: | Jiang Wu <textidea at sina.com> |
License: | GPL-3 |
URL: | https://github.com/githubwwwjjj/chinese.misc/blob/master/README.md |
NeedsCompilation: | no |
CRAN checks: | chinese.misc results |
Reference manual: | chinese.misc.pdf |
Package source: | chinese.misc_0.2.3.tar.gz |
Windows binaries: | r-devel: chinese.misc_0.2.3.zip, r-release: chinese.misc_0.2.3.zip, r-oldrel: chinese.misc_0.2.3.zip |
macOS binaries: | r-release (arm64): chinese.misc_0.2.3.tgz, r-oldrel (arm64): chinese.misc_0.2.3.tgz, r-release (x86_64): chinese.misc_0.2.3.tgz, r-oldrel (x86_64): chinese.misc_0.2.3.tgz |
Old sources: | chinese.misc archive |
Reverse imports: | LDABiplots, LDAShiny |
Please use the canonical form https://CRAN.R-project.org/package=chinese.misc to link to this page.