Word Cloud displays the most relevant words and short phrases in word cloud format. Text variables often contain words that are highly indicative of the response. Use Word Cloud on either the Insights page or the Leaderboard. Operationally, each version of the model behaves the same—use the Leaderboard tab to view a word cloud while investigating an individual model and the Insights page to access, and compare, each word cloud for a project. Additionally, word clouds are available for multimodal datasets (i.e., datasets that mix images, text, categorical, etc.)—a word cloud is displayed for all text from the data.
The Word Cloud for a model is based on the data used to train that model, not on the entire dataset. For example, a model trained on a 32% sample size will result in a Word Cloud that reflects those same 32% of rows.
Word clouds are supported in the following model types:
- Binary classification: All variants of ElasticNet Classifier (linear family models) with the exception of TinyBERT ElasticNet classifier and FastText ElasticNet classifier.
- Multiclass: Stochastic Gradient Descent
- Regression: Ridge Regressor, ElasticNet Regressor, Lasso Regressor
Click Word Cloud to display the chart:
- Text strings are displayed in a color spectrum from blue to red, with blue indicating a negative effect and red indicating a positive effect.
- Text strings that appear more frequently are displayed in a larger font size, and those that appear less frequently are displayed in smaller font sizes.
With a Word Cloud, you can:
- Mouse over a word to display the coefficient value (1) specific to that word.
- For multiclass projects, use the Select Class dropdown (2) to choose the specific class you want to see the word cloud for.
- Check the Filter Stop Words box (3) to remove stop words (commonly used terms that can be excluded from searches) from the display.
See this note for a description of how DataRobot handles single-character "words."