tidytuesday

Textrecipes series: Feature Hashing

This is the fourth blog post in the textrecipes series where I go over the various text preprocessing workflows you can do with textrecipes. This post will be showcasing how to perform feature hashing) (also known as the hashing trick).

Textrecipes series: TF-IDF

This is the third blog post in the textrecipes series where I go over the various text preprocessing workflows you can do with textrecipes. This post will be showcasing how to perform term frequency-inverse document frequency (Tf-IDF for short).

Textrecipes series: lexicons

This is the second blog post in the textrecipes series where I go over the various text preprocessing workflows you can do with textrecipes. This post will be covering how to use lexicons to create features.

Textrecipes series: Term Frequency

This is the first blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes. In this post will we start simple with term frequencies.

tidytuesday: Part-of-Speech and textrecipes with The Office

This post was written before the change to textrecipes to support spacyr as an engine to step_tokenize(). It is still a good demonstration of how to use a custom tokenizer.