Generate a data frame containing stop words.

generate_stopwords(stopwords, combine_with = NULL)

Arguments

stopwords

Character vector. Vector containing stop words.

combine_with

Data frame containing stop words. Optional. Data frame provided here must have only two columns, namely word and lexicon. This data frame is combined with the data frame created from stopwords. Exemplary data frames are

Value

Data frame containing stop words.

Details

Generate data frame containing stop words from a character vector. This data frame consists of two columns, namely word, containing the stop words, and lexicon, containing the string "self-defined". Additionally, the created data frame can be combined with other stop words containing data frames, e.g. tidytext::stop_words or stopwords_miretrieve.

References

Silge, Julia, and David Robinson. 2016. “tidytext: Text Mining and Analysis Using Tidy Data Principles in R.” JOSS 1 (3). The Open Journal. https://doi.org/10.21105/joss.00037.

See also