tidytext

tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.

Features

Tokenizes text into tidy format (unnest_tokens)
Supports sentiment lexicons (e.g. Bing, NRC) and TF-IDF computation
Converts tm or quanteda objects into tidy data formats
Easy integration with dplyr/ggplot2 for analysis and visualization
Functions for n-grams, word co-occurrence, and document-term matrices
Compatible with existing tidy data pipelines in R

Project Samples

Project Activity

See All Activity >

Follow tidytext

tidytext Web Site

User Reviews

Be the first to post a review of tidytext!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Related Categories

R Natural Language Processing (NLP) Tool

Registered

2025-07-30

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
kama DEI

kama.ai is a Responsible AI Agent platform that blends knowledge graph AI with advanced generative models for trustworthy Hybrid AI Agents. It empowers industries such as finance, education, healthcare, and Indigenous services with culturally aware, ethical, and accurate AI. By incorporating...

See Software
Enterprise Bot

Enterprise Bot, based in Switzerland, is a pioneer in Conversational AI, Process Automation, and Generative AI. With the trust of esteemed enterprise giants across industries like Generali, SIX, SBB, DHL, and SWICA, Enterprise Bot is revolutionizing both customer and employee experiences....

See Software
Quaeris

Align analytics to your everyday business workflows. Your business relies on people, data and documents, but the process of using them is broken. QuaerisAI enables seamless downstream workflows across your People, Documents and Data Assets. Use natural language search on data, documents and...

See Software
GPT-4

GPT-4 (Generative Pre-trained Transformer 4) is a large-scale unsupervised language model, yet to be released by OpenAI. GPT-4 is the successor to GPT-3 and part of the GPT-n series of natural language processing models, and was trained on a dataset of 45TB of text to produce human-like text...

See Software