Reviewing Automated Insights

Automated Insights Dialog Basics

WATCH A TUTORIAL:  https://lityx.com/automated-insights-tutorial/

 

When you open the dialog, if the insights are not ready yet, you will see a message explaining the reason (for example, they may still be in progress, or you may never have set up insights for the dataset).

If they are ready, you will see the target variables listed in the dropdown at the top of the dialog.  Your options will look like the following:

 

 

Target Variable to Analyze - select the target variable whose Automated Insights you wish to review.  If a variable you requested insights for does not appear here, it likely means there was an error computing insights for that variable.

Insight Type - You will see three options: Key Drivers, Multivariate Segmentation, and Segmentation Details.  Each is described in more detail below.

Go and Refresh buttons - Click Go after changing either the variable or the insight type.  Click Refresh if Automated Insights was still processing when you opened the dialog.  If processing has since completed, the variable list will be populated.

 

Insight Type - Key Drivers

The Key Drivers insight provides an evaluation of the most important predictors of the selected target variable.  Predictors are ranked by relative importance, with a score of 100 assigned to the variable found to be most predictive.  These insights should be interpreted as directional - a starting point for understanding and further exploration or deeper machine learning modeling.
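The "100 for the most predictive variable" scale can be illustrated with a minimal Python sketch.  This is not the product's actual calculation, which is not described here; the raw scores, the variable names, and the scale_to_100 function are all hypothetical, and the sketch simply assumes each predictor already has a non-negative raw importance score.

```python
def scale_to_100(raw_scores):
    """Rescale non-negative scores so the largest becomes 100, then rank them."""
    top = max(raw_scores.values())
    if top <= 0:
        raise ValueError("at least one score must be positive")
    scaled = {name: score / top * 100.0 for name, score in raw_scores.items()}
    # Highest relative importance first.
    return sorted(scaled.items(), key=lambda item: item[1], reverse=True)


# Hypothetical raw importance scores for three predictors.
raw = {"region": 0.21, "tenure": 0.42, "age": 0.105}
ranked = scale_to_100(raw)

for name, importance in ranked:
    print(f"{name}: {importance:.0f}")
```

Read this way, a predictor scored 50 carries roughly half the raw importance of the top-ranked predictor, which is why the ranking is best treated as directional.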

 

Insight Type - Multivariate Segmentation

The Multivariate Segmentation insight type provides insight into how multiple variables combine to improve understanding of the target variable.  The output is displayed as a tree.  The full dataset is the starting point at the top of the segmentation diagram.  The dataset is successively split into smaller segments based on the values of predictor variables.  Each "node" in the tree structure represents a segment of the dataset, and is evaluated for its effect on the target variable.  See below for more information on evaluating the information provided.
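To make the idea of splitting a dataset into a tree of segments more concrete, here is a minimal Python sketch of a generic decision-tree style splitter.  It is not the algorithm the product uses, which is not documented here.  The Segment class, the split function, the "spread of segment means" criterion, and the sample records are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class Segment:
    rule: str                      # how this segment is defined
    rows: list                     # records that fall into the segment
    children: list = field(default_factory=list)

    @property
    def size(self):
        return len(self.rows)

    @property
    def target_mean(self):
        return mean(r["target"] for r in self.rows)


def split(segment, predictors, min_size=2):
    """Recursively split on the predictor whose groups differ most in mean target."""
    best = None
    for p in predictors:
        groups = {}
        for r in segment.rows:
            groups.setdefault(r[p], []).append(r)
        # Skip predictors that do not split the data or create tiny segments.
        if len(groups) < 2 or any(len(g) < min_size for g in groups.values()):
            continue
        means = [mean(r["target"] for r in g) for g in groups.values()]
        spread = max(means) - min(means)
        if best is None or spread > best[0]:
            best = (spread, p, groups)
    if best is None:
        return segment  # leaf: nothing left to split on
    _, p, groups = best
    remaining = [q for q in predictors if q != p]
    for value, rows in groups.items():
        child = Segment(f"{p} = {value}", rows)
        segment.children.append(split(child, remaining, min_size))
    return segment


# Hypothetical records: two predictors and a numeric target.
data = [
    {"plan": "basic", "region": "east", "target": 10},
    {"plan": "basic", "region": "east", "target": 12},
    {"plan": "basic", "region": "west", "target": 20},
    {"plan": "basic", "region": "west", "target": 22},
    {"plan": "premium", "region": "east", "target": 40},
    {"plan": "premium", "region": "east", "target": 42},
    {"plan": "premium", "region": "west", "target": 50},
    {"plan": "premium", "region": "west", "target": 52},
]
root = split(Segment("All records", data), ["plan", "region"])
```

In this sample the top node splits on plan because the plan groups differ most in average target, and each plan segment is then split by region.  A real segmentation would typically use a more robust splitting criterion and stopping rule.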

 

 

NOTE: The Key Drivers and Multivariate Segmentation outputs provide different views of the key relationships in the dataset.  The Key Drivers result is a univariate estimate of overall impact that evaluates each predictor variable one at a time, independently of the others.  The Multivariate Segmentation allows you to evaluate how combinations of variables play a role in making predictions.  As a result, a variable that appears often in the Multivariate Segmentation tree, even if only in lower branches, could rank as high as or higher in the Key Drivers list than variables that appear near the top of the tree.

 

Display and exporting options

The Multivariate Segmentation often has more segments than can be displayed on the screen at one time.  In addition, you may want to customize which pieces of information are shown, or export the tree.  Here are some options you have:

Panning and Zooming - use your mouse to drag the tree display up, down, left or right to pan around larger trees.  You can also zoom in or out (to display less or more of the tree on your screen) by holding the Shift key down and scrolling your mouse wheel.

Re-centering - If you have panned and zoomed often and want to easily return to the starting position of the tree, with the top node centered and back to normal zoom, use the "Re-center" button as shown in the screenshot above.

Display Options - Use the Display Options dropdown to turn on or off any of the pieces of information shown in each segment box.  This can be used to show only what you are interested in, or to reduce the amount of space taken by the full tree.  In particular, you can use the Tree Depth dropdown within the Display Options to restrict how many levels of the tree are shown.

Export - Use the Export button to export the tree to JPG, PNG, or PDF format.  All of the selected display options are respected in the exported file.  Exporting to PDF gives you additional flexibility to edit the tree further and adjust it to your own specifications.  For help with that, see the documentation here: Export to PDF and Resize

 

Insight Type - Segmentation Details

The Segmentation Details insight can be thought of as a tabular view of the graphical Multivariate Segmentation.  Each segment (node) in the Multivariate Segmentation is represented as a row in the Segmentation Details table, along with information on that segment.  The table includes a full description of the segment definition.  The rows can be sorted by clicking on column headings.  In particular, it can be useful to sort the table by the average value of the target variable to immediately see which segments tend to lead to strong (or weak!) results.  The Segment Size column provides information on how prevalent each segment is overall.
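If you work with segment information outside the dialog, the same kind of sorted view can be reproduced in a few lines of Python.  The sketch below is illustrative only: the segment rows, the field names (definition, size, avg_target), and the details_table function are hypothetical, and they do not represent an export format or API of the product.

```python
# Hypothetical segment rows, one per node of a segmentation tree.
segments = [
    {"definition": "All records", "size": 8, "avg_target": 31.0},
    {"definition": "plan = basic", "size": 4, "avg_target": 16.0},
    {"definition": "plan = premium", "size": 4, "avg_target": 46.0},
    {"definition": "plan = premium AND region = west", "size": 2, "avg_target": 51.0},
]


def details_table(rows, total_size, descending=True):
    """Add each segment's share of the dataset and sort by average target value."""
    with_share = [dict(row, share=row["size"] / total_size) for row in rows]
    return sorted(with_share, key=lambda row: row["avg_target"], reverse=descending)


table = details_table(segments, total_size=8)

for row in table:
    print(f'{row["definition"]:<35} {row["share"]:>5.0%} {row["avg_target"]:>6.1f}')
```

Sorting in descending order puts the strongest segments first.  Reading the share column alongside the average helps you avoid over-interpreting a segment that performs well but covers only a small part of the dataset.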