Package: grafzahl 0.0.12
grafzahl: Supervised Machine Learning for Textual Data Using Transformers and 'Quanteda'
Duct tape the 'quanteda' ecosystem (Benoit et al., 2018) <doi:10.21105/joss.00774> to modern Transformer-based text classification models (Wolf et al., 2020) <doi:10.18653/v1/2020.emnlp-demos.6>, in order to facilitate supervised machine learning for textual data. This package mimics the behaviors of 'quanteda.textmodels' and provides a function to setup the 'Python' environment to use the pretrained models from 'Hugging Face' <https://huggingface.co/>. More information: <doi:10.5117/CCR2023.1.003.CHAN>.
Authors:
grafzahl_0.0.12.tar.gz
grafzahl_0.0.12.zip(r-4.7)grafzahl_0.0.12.zip(r-4.6)grafzahl_0.0.12.zip(r-4.5)
grafzahl_0.0.12.tgz(r-4.6-any)grafzahl_0.0.12.tgz(r-4.5-any)
grafzahl_0.0.12.tar.gz(r-4.7-any)grafzahl_0.0.12.tar.gz(r-4.6-any)
grafzahl_0.0.12.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
grafzahl/json (API)
| # Install 'grafzahl' in R: |
| install.packages('grafzahl', repos = c('https://gesistsa.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/gesistsa/grafzahl/issues
Pkgdown/docs site:https://gesistsa.github.io
- ecosent - A Corpus Of Dutch News Headlines
- supported_model_types - Supported model types
- unciviltweets - A Corpus Of Tweets With Incivility Labels
Last updated from:9040661931. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 183 | ||
| source / vignettes | OK | 196 | ||
| linux-release-x86_64 | OK | 164 | ||
| macos-release-arm64 | OK | 129 | ||
| macos-oldrel-arm64 | OK | 110 | ||
| windows-devel | OK | 122 | ||
| windows-release | OK | 77 | ||
| windows-oldrel | OK | 97 | ||
| wasm-release | OK | 121 |
Exports:detect_condadetect_cudaget_amharic_datagrafzahlhydratesetup_grafzahltextmodel_transformeruse_nonconda
Dependencies:assertthatclicodetoolscpp11farverfastmatchforeachggplot2glmnetgluegowergtablehereisobandISOcodesiteratorsjsonlitelabelinglatticelifecyclelimemagrittrMatrixpngquantedaR6rappdirsRColorBrewerRcppRcppEigenRcppTOMLreticulaterlangrprojrootS7scalesshapeSnowballCstopwordsstringisurvivalvctrsviridisLitewithrxml2yaml
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Detecting Miniconda And Cuda | detect_conda detect_cuda |
| A Corpus Of Dutch News Headlines | ecosent |
| Download The Amharic News Text Classification Dataset | get_amharic_data |
| Fine tune a pretrained Transformer model for texts | grafzahl grafzahl.character grafzahl.corpus grafzahl.default textmodel_transformer |
| Create a grafzahl S3 object from the output_dir | hydrate |
| Prediction from a fine-tuned grafzahl object | predict.grafzahl |
| Setup grafzahl | setup_grafzahl |
| Supported model types | supported_model_types |
| A Corpus Of Tweets With Incivility Labels | unciviltweets |
| Set up grafzahl to be used on Google Colab or similar environments | use_nonconda |
