Aroh's Microsoft 365 Musings: Power BI and AI

Hi All,

On 14 Mar 2020 (Sat), I spoke on a Power BI and AI - The Kingdom of possibility.

In recent months, Power BI Team has released a couple of AI visuals in Power BI Desktop.
I will walk through each visual and there benefits.

Sample Power BI PBIX and Excel Datasets are stored here.

Video:

Power BI Dataflow (Power BI Service Only)

There are many definitions about Power BI Dataflow

Self-service data prep or Self-self ETL (Extract-Transform-Load)

The way that is reusable & repeatable for others

A collection of entities and entities are similar to tables

Definition

Dataflow is simply Power Query in the cloud.

Power Query process that runs in the cloud independently from any Power BI reports.

Dataflows are used to ingest, transform, integrate, and enrich big data/AI

A lot of ML capabilities are built on top of Dataflow. (More computation power on cloud than Desktop)

Step by Step to create a Dataflow

Open browser window and type PowerBI.com

Create a new App workspace or use an existing app workspace.

Create a new Dataflow

Click on Add new entities

I have already created a Customer Feedback Excel file in OneDrive for Business account

:

Under the Power Query window of Power BI Dataflow, select Excel.

Under Authentication, link dropdown use Organizational account

and type full URL for OneDrive Excel account

https://insightsg-my.sharepoint.com/personal/aroh_shukla_Insight_com/Documents/14Mar2020Datacamp/customerfeedback.xlsx

Load all the tables and click on Transform data.

You can do a basic transformation like User first row as headers for this dataset and do it for all rows.

Click on Save & Close button.

Save the Dataflow.

Open Power BI desktop >> Get Data >> Power BI dataflows.

Locate Customer Feedback and select all tables.

Align Data Relationships in the Data Relationship view

Now, you can use the same Power BI Desktop's tools and techniques

Key Influencers Visual (Power BI Desktop)

When to use key influencers

>> Factors affect the metric being analyzed.

>> Contrast the relative importance of these factors
How AI helps

>> Visual a couple of algorithms like ML.NET, One-hot encoding, Replace missing value, etc.
cc
Considerations

>> Direct Query is not supported

>> Azure Analysis Services and SSAS is not supported

>> Publish to Web is not supported

>> NET FX 4.6 above or higher is required.

>> Aggregates and measures are not supported.
It’s the 1st AI visual in Power BI.
Finds patterns of your data to try and explain what influences your metric and value that goes up and down.
Right now, it does not support measures.
Note that its AI visual, the data set must have ample information so that AI can find patterns.
USE CASE:

You are the Project Manager who is interested in looking into data about customers who are using our cloud service who have given low ratings.
SAMPLE DATASET - Let us check the dataset

COMPANY
The table shows the company size.

CUSTOMER TABLE(45 K rows)
Tables detail customer and company ID. Important rating column that Key Influencer value will use.

DEVICE (69 ROWS)
Provides devises information such as browser, mobile, tablet, etc.

GEOGRAPHY

ROLE
The role of a person in the organization

SUBSCRIPTION
The type of subscription.

SUPPORT TICKETS (126K rows)
Number of support tickets with customer and dates

STEP BY STEP

Open the latest version of Power BI Desktop.
Let’s drag the Key Influencer Visual. From the Customer table, drag the Rating in the Analyse column.

We have all the information about the customer who has given us a low or high rating

Note you have 2 tabs on Key Influencer Visual (KIV) called Key Influencers and Top Segments.

You also have a drop-down that influences the Rating (key measure) which has a value of High or Low.

We need to look for the factor that customers gave a low rating.
Drag Country-Region column of Geography table and place under Explain by of KIV
We also, drag Role in Org from the Roles table,

Company size from the Company table, the theme they chose from the Customer table,

and subscription type from the Subscription table
Notice that every time we drag a column in the Explain by column the Key Influencers visuals re-run the analysis based on factors that impact the low rating
E.g. France was top the Influencers, but we add more dimensions and it looks like Role in Organisation as the consumer is the top Influencers.
What is the meaning of the consumer is key Influencer?

Let us dissect somebody is a consumer of cloud service of giving low rating is 2.5 times is higher if they are administrator or publisher.

Consumer rating is 14.93% and the average score is 5.78% of all roles (admins and publisher)

Likelihood of average score is 5.78% and 14.93% customer rating result in 2.57 times.
14.93 / 5.75 = 2.557 times
Key Influencer visual also considers the number of data points. It is a very important point.
Note that the Administrator role also has a high value of low ratings of 13.42%

But the Administrator role is NOT showing up in Key Influencers visual at all

If we drag rating from Country Table, Role in Org from Role table and rating count from Country table. and rating column from Country Table.

Role in Org from Role Table

Drag rating column and rating count from Customer Table

and the number of counts for the administrator is much lower.
We are also able to compare different types of variables and rank them alongside each other. E.g. the 2^nd influencer is Theme where usability rating is low, and the rating is 28.68% as compared to all other themes which are 11.35%

We can filter the ratings that we are driving low by checking the box – Only show values that are influencers.

By checking the box, we are presented with only 4 key factors that are giving low ratings.
Let’s check what happens if you look for company size. Drag Company size as the table

and make it as a slicer.
The consumer is not the lowest rating, but security becomes the key influencer for larger enterprises and consumer does not appear at all.
The Key Influencer Visual also works for the continuous fields as well. For example, we drag Tenure to the Key Influencer Visual

• Tenure becomes the top influencer with rating is as low as 29.49% with an average score of 10.39%. That is leading to the likelihood of tenure to 29.49 / 10.39 = 2.8 times.
What does it mean? If tenure increases then the likelihood of a low rating also increases.
This means the customers who have been with us longer, are giving us a lower rating.
Let's look at the number of support tickets, and we want to see if the number of support tickets that customer impacts their likelihood of giving us a low rating.

Customers can have no support tickets, one support ticket or many support tickets.

Drag the Support Ticket ID to Key Influencers Visual

· We can see that when the count of support tickets goes up, the likelihood of receiving a low rating increases.
Now, we have seen many factors that influence a specific attribute that effects. But we want to which segments are giving us a low rating then we have to move to the Top Segments tab.

The height of the bubble represents what proportion of that segment has low ratings.
The size of the bubble represents how many customers we have in a particular segment.

Let us click on Segment 1 bubble

· We see the customers who are not the publisher, and the number of support tickets is more than 4. The tenure is more than 29.

74% of the customer has given us a low rating.