Package: grafzahl 0.0.11
grafzahl: Supervised Machine Learning for Textual Data Using Transformers and 'Quanteda'
Duct tape the 'quanteda' ecosystem (Benoit et al., 2018) <doi:10.21105/joss.00774> to modern Transformer-based text classification models (Wolf et al., 2020) <doi:10.18653/v1/2020.emnlp-demos.6>, in order to facilitate supervised machine learning for textual data. This package mimics the behaviors of 'quanteda.textmodels' and provides a function to setup the 'Python' environment to use the pretrained models from 'Hugging Face' <https://huggingface.co/>. More information: <doi:10.5117/CCR2023.1.003.CHAN>.
Authors:
grafzahl_0.0.11.tar.gz
grafzahl_0.0.11.zip(r-4.5)grafzahl_0.0.11.zip(r-4.4)grafzahl_0.0.11.zip(r-4.3)
grafzahl_0.0.11.tgz(r-4.4-any)grafzahl_0.0.11.tgz(r-4.3-any)
grafzahl_0.0.11.tar.gz(r-4.5-noble)grafzahl_0.0.11.tar.gz(r-4.4-noble)
grafzahl_0.0.11.tgz(r-4.4-emscripten)grafzahl_0.0.11.tgz(r-4.3-emscripten)
grafzahl.pdf |grafzahl.html✨
grafzahl/json (API)
# Install 'grafzahl' in R: |
install.packages('grafzahl', repos = c('https://gesistsa.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/gesistsa/grafzahl/issues
- ecosent - A Corpus Of Dutch News Headlines
- supported_model_types - Supported model types
- unciviltweets - A Corpus Of Tweets With Incivility Labels
Last updated 8 months agofrom:f340adce85. Checks:OK: 1 NOTE: 6. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 10 2024 |
R-4.5-win | NOTE | Oct 10 2024 |
R-4.5-linux | NOTE | Oct 10 2024 |
R-4.4-win | NOTE | Oct 10 2024 |
R-4.4-mac | NOTE | Oct 10 2024 |
R-4.3-win | NOTE | Oct 10 2024 |
R-4.3-mac | NOTE | Oct 10 2024 |
Exports:detect_condadetect_cudaget_amharic_datagrafzahlhydratesetup_grafzahltextmodel_transformeruse_nonconda
Dependencies:assertthatclicodetoolscolorspacefansifarverfastmatchforeachggplot2glmnetgluegowergtablehereisobandISOcodesiteratorsjsonlitelabelinglatticelifecyclelimemagrittrMASSMatrixmgcvmunsellnlmepillarpkgconfigpngquantedaR6rappdirsRColorBrewerRcppRcppEigenRcppTOMLreticulaterlangrprojrootscalesshapeSnowballCstopwordsstringisurvivaltibbleutf8vctrsviridisLitewithrxml2yaml
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Detecting Miniconda And Cuda | detect_conda detect_cuda |
A Corpus Of Dutch News Headlines | ecosent |
Download The Amharic News Text Classification Dataset | get_amharic_data |
Fine tune a pretrained Transformer model for texts | grafzahl grafzahl.character grafzahl.corpus grafzahl.default textmodel_transformer |
Create a grafzahl S3 object from the output_dir | hydrate |
Prediction from a fine-tuned grafzahl object | predict.grafzahl |
Setup grafzahl | setup_grafzahl |
Supported model types | supported_model_types |
A Corpus Of Tweets With Incivility Labels | unciviltweets |
Set up grafzahl to be used on Google Colab or similar environments | use_nonconda |