Excel Guide for Data Analysts
Excel Guide for Data Analysts
DATE :
Aim:
Features of Ms-Excel:
 Better Drag-and-Drop: Do you want to move a group of cells? Excel’s drag and drop
 feature lets you reposition selected portion of your spreadsheet by simply dragging
 them with your mouse.
 Cell tips and Scroll Tips: To help you get around better with mouse, Excel now
 includes scroll tips. When you click and drag a scroll bar, a small window tells you what
 row or column you are heading for.
 Number Formatting: It’s easy to format numbers with excel’s new number formatting
 feature. Select your numbers and choose cells command from format menu.
 Templates and Template wizard: Excel’s template facility has been greatly enhanced.
 You can choose from a variety of elegantly designed templates for your home or
 business. You can even have a template wizard link your worksheets to a database.
 Shared Lists: you can now have worksheets that are shared simultaneously over a
 network.
 Conditional Formatting: Conditional formatting helps users to quickly focus on
 important aspects of a spreadsheet or to highlight errors and to identify important
 patterns in data.
 Sorting and Filtering: Excel spreadsheets help us make sense of large amounts of data.
 To make it easier to Dind what you need, you can reorder the data or pick out just the
 data you need, based on parameters you set within Excel. Sorting and Diltering your
 data will save you time and make your spreadsheet more effective.
 Excel Charts:Excel charts help you communicate insights & information with ease. By
 choosing your charts wisely and formatting them cleanly, you can convey a lot.
EX. NO: 2(i) NUMERICAL OPERATIONS (MAX, MIN, AVG, SUM, SQRT, ROUND)
DATE :
Aim:
 To get the input from user and perform numerical operations like MAX, MIN, AVG, SUM, SQRT,
ROUND using MS Excel.
Description:
i. The Average function calculates the average, or arithmetic mean, of its arguments.
Formulas:
 
 SUM(number1, [number2], …)
 
 AVERAGE(number1, [number2], …)
 
 MAX(B2:B6)
 
 MIN(B2:B6)
 
 SQRT(number)
 
 ROUND(number, num_digits)
 OUTPUT:
Result:
Thus, the numerical operations on exploratory data analysis were successfully executed and
verified.
EX. NO: 2(ii) IMPORT/EXPORT OPERATIONS
DATE :
Aim:
Description:
Import Data:
Export Data
 Click on "Save As," choose a location, and select Excel Workbook as the Dile format.
 Click on "Save As," choose a location, and select the desired text-based format.
4. PDF:
 Select the data you want to export.
 Click on "Save As," choose a location, and select PDF as the Dile format.
Output:
Import:
Export Data
Result:
Thus, the import/export operations were successfully executed and verified.
EX. NO: 3 PERFORM STATISTICAL OPERATIONS
DATE :
Aim:
 To perform statistical operations like mean, mode, standard deviation, skewness and
kurtosis by using MS Excel.
Description:
Mean :
Mean is also known as average of all the numbers in the data set which is calculated
by below equation.
Median :
Mode :
Mode is the number which occur most often in the data set.Here 150 is occurring
twice so this is our mode.
Variance :
Variance is the numerical values that describe the variability of the observations
from its arithmetic mean and denoted by sigma-squared(σ2 )
Standard Deviation :
Skewness:
Kurtosis:
Procedure:
1. If you haven't already installed the Analysis ToolPak , Click the Microsoft Office
 button, then click on the Excel Options , and then select Add-Ins , Click Go, check
 the Analysis ToolPak box, and click Ok.
2. Select Data tab, then click on the Data Analysis option, then selects Descriptive
 Statistics from the list and Click Ok. [Data tab >> Data Analysis >> Descriptive
 Statistics].
 OUTPUT:
Result:
Thus, the statistical operations were successfully performed and verified.
EX. NO: 4 PERFORM Z-TEST, T-TEST & ANOVA
DATE :
Aim:
Description:
T-Test:
The t-Test is used to test the null hypothesis that the means of two populations are equal.
Procedure:
1. First, perform an F-Test to determine if the variances of the two populations are
equal. This is not the case.
2. On the Data tab, in the Analysis group, click Data Analysis.
4. Click in the Variable 1 Range box and select the range A2:A7.
5. Click in the Variable 2 Range box and select the range B2:B6.
6. Click in the Hypothesized Mean Difference box and type 0 (H0: μ1 - μ2 = 0).
8. Click OK.
Z-Test:
The Z.TEST function is one such hypothesis test function. It tests the mean of the two
sample data sets when the variance is known and the sample size is large.
Procedure:
 1. First, we need to calculate the variables for these two values using the
 VAR.P function.
 3. For the “Variable 1 Range,” select “Student 1” scores. For the “Variable 2
 Range,” select “Student 2” scores.
ANOVA Test:
A single factor or one-way ANOVA is used to test the null hypothesis that the means of
several populations are all equal.
Procedure:
3. Click in the Input Range box and select the range A2:C10.
 5. Click OK.
Output:
T-Test:
Z-Test:
 Anova Test:
Result:
Thus, the Z-test, T-test, and ANOVA were successfully performed and verified.
EX. NO: 5(i) PERFORM DATA PRE-PROCESSING OPERATIONS - HANDLING
 MISSING DATA
DATE :
Aim:
Description:
Handling missing data is an essential step in data pre-processing. In Microsoft Excel, you
can perform various operations to handle missing data effectively. Here are some common
techniques:
3. Click on "Conditional Formatting" and choose "Highlight Cells Rules" > "Text
 that Contains..."
Using Filter:
1. Click on the Dilter icon in the header of the column with missing data.
2. Unselect the checkbox for blank values to hide rows with missing data.
3. Copy the visible data or delete the rows.
Using Formulas:
Using Trendline:
2. Add a trendline.
3. Display the equation on the chart and use it to interpolate missing values.
Output:
Using COUNTBLANK
Function:
3. Replace Missing
Result:
Thus, the data preprocessing operations for handling missing data were successfully performed and
verified.
EX. NO: 5(ii) PERFORM DATA PRE-PROCESSING OPERATIONS - NORMALIZATION
DATE :
Aim:
Description:
To “normalize” a set of data values means to scale the values such that the mean of all of the
values is 0 and the standard deviation is 1.
The formula that we used to normalize a given data value, x, was as follows:
Normalized value = (x – x) / s
where
,
 x = data value
x = mean of dataset
If a particular data point has a normalized value greater than 0, it’s an indication that the
data point is greater than the mean. Conversely, a normalized value less than 0 is an
indication that the data point is less than the mean.
Procedure:
 1. Find the mean by using the =AVERAGE(range of values) function to Dind the
 mean of the dataset.
Result:
Thus, the data preprocessing operations for normalization were successfully performed and verified.
EX. NO: 6 PERFORM DIMENSIONALITY REDUCTION OPERATION USING
 PCA, KPCA & SVD
DATE:
Aim:
 To perform dimensionality reduction operation using PCA, KPCA & SVD using MS
Excel
.
Description:
1. KPCA involves applying the kernel trick to extend PCA for non-linear data.
3. Consider using programming languages with libraries that support KPCA, such as
 scikit- learn in Python.
1. SVD decomposes a matrix into three other matrices, representing the singular
 vectors and singular values.
2. While Excel has a singular value decomposition function (SVD), it may not be
 sufDicient for more advanced applications.
3. Manual steps for SVD:
PCA:
Result:
Thus, the dimensionality reduction operations using PCA, KPCA, and SVD were successfully
performed and verified.
EX. NO: 7 PERFORM BIVARIATE AND MULTIVARIATE ANALYSIS ON THE
 DATASET
DATE:
Aim:
Description:
Bivariate analysis
Bivariate analysis is the simultaneous analysis of two variables. It explores the concept of
the relationship between two variables whether there exists an association and the
strength of this association or whether there are differences between two variables and the
signiDicance of these differences.
Procedure:
Multivariate Analysis
Multivariate:
Result:
Thus, the bivariate and multivariate analysis on the dataset were successfully performed and
verified.
EX. NO: 8 APPLY AND EXPLORE VARIOUS PLOTTING FUNCTIONS ON THE
 DATA SET.
DATE:
Aim:
To apply and explore various plotting functions on the data set by using MS-Excel.
Description:
1. Bar/Column Graphs
A bar graph shows information about two or more groups. Bar graphs are mainly used to
make comparisons across a range.
2. Pie Graphs
A pie chart is nothing but a circular graph representing data in the form of a pie/circle. It is
divided into different sections, each one representing a proportion of the whole.
3. Line Graphs
A line graph is formed by connecting a series of values/data points using straight lines. A
line graph can be used when you want to check whether the values are increasing or
decreasing over some time.
4. Scatter Plot
A scatter plot, also called a coordinate graph, uses dots to represent the data values for two
different variables, one on each axis. This graph is used to Dind a pattern/ relationship
between two sets of data.
5. Area Chart
An area chart depicts the change of two or more data points over time. They are similar to
the line charts, except the area charts are Dilled with color below the line. This chart is
useful to visualize the area of various series relative to each other.
Output:
Dataset:
2. Pie Graphs
 Census
 Resident
 Total
 Populatio
 n-
 AB:Qr-1-2
 000
 AlabamaAlaskaArizonaArkansasCaliforniaColorado ConnecticutDelawareDistrict of
 ColumbiaFloridaGeorgiaHawaii IdahoIllinoisIndianaIowaKansasKentucky
 LouisianaMaineMarylandMassachusettsMichiganMinnesota
3. Line Graphs
4. Scatter Plot
 5. Area Chart
Result:
Thus, the application and exploration of various plotting functions on the dataset were successfully
performed and verified.
EX. NO: 9 FEATURES OF POWER BI DESKTOP
DATE :
Aim:
Power BI Desktop
 Power BI Desktop is a free application you install on your local computer that lets you connect
to, transform, and visualize your data. With Power BI Desktop, you can connect to multiple different
sources of data, and combine them (often called modeling) into a data model. This data model lets
you build visuals, and collections of visuals you can share as reports, with other people inside your
organization. Most users who work on business intelligence projects use Power BI Desktop to
create reports, and then use the Power BI service to share their reports with others.
People who are responsible for such tasks are often considered data analysts (sometimes
referred to as analysts) or business intelligence professionals (often referred to as report
creators). Many people who don't consider themselves an analyst or a report creator use
Power BI Desktop to create compelling reports, or to pull data from various sources. They
can build data models, and then share the reports with their coworkers and organizations.
There are three views available in Power BI Desktop, which you select on the left side of the
canvas. The views, shown in the order they appear, are as follows:
 
 Report: You create reports and visuals, where most of your creation time is
 spent.
 
 Data: You see the tables, measures, and other data used in the data model
 associated with your report, and transform the data for best use in the report's
 model.
 
 Model: You see and manage the relationships among tables in your data
 model.
Connect to data
To get started with Power BI Desktop, the Dirst step is to connect to data. There are many
different data sources you can connect to from Power BI Desktop.
To connect to data:
 The Get Data window appears, showing the many categories to which Power
 BI Desktop can connect.
2. When you select a data type, you're prompted for information, such as the URL and
 credentials, necessary for Power BI Desktop to connect to the data source on your
 behalf.
 3. After you connect to one or more data sources, you may want to transform the
 data so it's useful for you.
With Power Query Editor, you make changes to your data, such as changing a data type,
removing columns, or combining data from multiple sources. It's like sculpting: you start
with a large block of clay (or data), then shave off pieces or add others as needed, until the
shape of the data is how you want it.
Create visuals
After you have a data model, you can drag fields onto the report canvas to create visuals. A
visual is a graphic representation of the data in your model. There are many different types
of visuals to choose from in Power BI Desktop. The following visual shows a simple column
chart.
Create reports
A collection of visuals, in one Power BI Desktop Dile, is called a report. A report can have
one or more pages, just like an Excel Dile can have one or more worksheets. With Power BI
Desktop you can create complex and visually rich reports, using data from multiple sources,
all in one report that you can share with others in your organization.
EX. NO: 10 PREPARE & LOAD DATA
DATE :
Aim:
Procedure:
Get Data:
 Click on "Home" in the Power BI ribbon.
 Choose the type of data source you want to connect to (e.g., Excel, SQL Server,
SharePoint, etc.).
Query Editor:
Power BI will open the Query Editor, where you can transform and shape your data.
 Use the Query Editor tools to clean, Dilter, and transform the data as needed.
 You can remove unnecessary columns, Dilter rows, replace values, etc.
Merge and Append Queries:
If needed, you can merge tables or append queries to combine data from multiple sources.
Once you're done with data transformation, close the Query Editor.
Load Data:
 Click on "Close & Apply" to load the data into Power BI.
Data Model:
Create Visualizations:
Use the Fields pane to drag and drop Dields onto the report canvas to create visualizations.
Customize Visualizations:
Create Relationships:
If you have multiple tables, deDine relationships between them in the data model.
Save Report:
Share Report:
Share the report with others by publishing it to the Power BI service or exporting it in
different formats.
OUTPUT:
Result:
The data preparation and loading process in Power BI Desktop was successfully executed, enabling
further analysis and visualization of the data.
 EX. NO: 11 DEVELOP THE DATA MODEL
DATE :
Aim:
Procedure:
1. Load Data:
Click on "Home" and select "Get Data" to connect to your data source.
Use the Query Editor to clean, transform, and shape the data as needed.
Click on "Close & Apply" to load the data into Power BI.
2. Create Relationships:
1. Manage Relationships:
Go to the "Data" view by clicking on the "Data" icon in the left sidebar.
2. Degine Relationships:
Select a Dield in one table and drag it to the corresponding Dield in another table.
Fine-tune relationships using the "Manage Relationships" dialog (click on "Model" in the
left sidebar and then "Manage Relationships").
2. Write Formulas:
Write DAX (Data Analysis Expressions) formulas to create calculated columns based on
existing data.
4. Create Measures:
1. Add Measures:
Click on "Modeling" in the left sidebar, and then click on "New Measure" to create measures.
Write DAX formulas to create measures for aggregations, calculations, and other analytics.
1. Manage Fields:
Arrange tables and Dields in the "Fields" pane for better organization.
results.
1. Optimize Model:
Ensure that data types are set correctly for each column.
 Output:
Result:
Successfully developed the data model in Power BI Desktop, incorporating the prepared data and
establishing relationships to support analysis and visualization.
 EX. NO: 12 DAX CALCULATIONS
DATE :
Aim:
Procedure:
 Go to Data View:
Enter a name for the column and write your DAX formula.
2. Create Measures:
5. Statistical Functions:
SalesStdev = STDEV.P('Sales'[SalesAmount]
6. Text Functions:
Result:
Successfully implemented DAX calculations in Power BI Desktop, enabling advanced analysis and
insights into the data.
 EX. NO: 13 DESIGN A REPORT
DATE :
Aim:
Procedure:
1. Connect to Data:
 Open Power BI Desktop:
Click on "Home" and select "Get Data" to connect to your data source.
 Transform and Load Data:
Use the Query Editor to clean and transform the data as needed.
Click on "Close & Apply" to load the data into Power BI.
2. Create Visualizations:
 Go to Report View:
Drag and drop visualizations from the Visualizations pane onto the report canvas.
 Congigure Visuals:
ConDigure each visual by dragging Dields from the Fields pane to the appropriate well in the
visual.
 Arrange Visuals:
Use the "View" tab to apply themes for consistent colors and styles across visuals.
 Add Titles and Text Boxes:
Use the "Text box" tool to add titles, subtitles, and annotations.
4. Create Interactivity:
 Add Slicers:
Drag Dields to the "Slicer" visual to allow users to Dilter data dynamically.
 Create Drillthroughs:
Right-click on a data point and select "Drillthrough" to create detailed drillthrough pages.
 Use Bookmarks:
Create bookmarks to save speciDic views or states of the report.
 Use Measures:
Drag and drop measures from the Fields pane to create dynamic and calculated insights.
 Add Trendlines and Reference Lines:
 Test Interactivity:
Interact with slicers, buttons, and other interactive elements to ensure they work as
expected.
 Save and Share:
 Publish to Power BI Service:
If you want to share the report online, click on "Publish" to publish it to the Power BI
service.
 Export Report:
Export the report in different formats (PDF, PowerPoint) for sharing with others.
 Output:
Result:
Successfully designed a comprehensive report in Power BI Desktop, presenting key insights and
trends in the data through interactive visualizations and intuitive navigation options.
 EX. NO: 14 CREATE A DASHBOARD AND PERFORM DATA ANALYSIS
DATE :
Aim:
Procedure:
1. Connect to Data:
Click on "Home" and select "Get Data" to connect to your data source.
 Transform and Load Data:
Use the Query Editor to clean and transform the data as needed.
Click on "Close & Apply" to load the data into Power BI.
2. Create Visualizations:
 Go to Report View:
Drag and drop visualizations from the Visualizations pane onto the report canvas.
 Congigure Visuals:
ConDigure each visual by dragging Dields from the Fields pane to the appropriate well in the
visual.
3. Build Dashboard:
 Go to Dashboard View:
 Arrange Tiles:
In the Dashboard view, arrange the pinned visuals (tiles) to create a visually appealing
layout.
 Add Text Boxes and Images:
Enhance the dashboard by adding text boxes, images, and other elements for context.
4. Create Interactivity:
 Add Slicers:
Drag Dields to the "Slicer" visual to allow users to Dilter data dynamically.
 Create Drillthroughs:
Right-click on a data point and select "Drillthrough" to create detailed drillthrough pages.
 Use Bookmarks:
 Use Measures:
Drag and drop measures from the Fields pane to create dynamic and calculated insights.
 Add Trendlines and Reference Lines:
 Test Interactivity:
Interact with slicers, buttons, and other interactive elements to ensure they work as
expected.
 Save and Share:
 Publish to Power BI Service:
If you want to share the report online, click on "Publish" to publish it to the Power BI
service.
 Export Report:
Export the report in different formats (PDF, PowerPoint) for sharing with others.
Output:
Result:
Successfully created an interactive dashboard in Power BI Desktop, enabling data analysis and
visualization of key insights and trends for informed decision-making.
EX. NO: 15 PRESENTATION OF A CASE STUDY- OPERATIONAL
 PROCESS IMPROVEMENT FOR A GROUP OF DENTAL
 HOSPITALS
DATE :
Aim:
The Challenge
The client wanted to improve their operational processes to make use of Power BI reports
to identify the pitfalls of the processes, take steps to overcome the pitfalls identiDied in
order to maximize their revenue. They also wanted to capture certain traits related to
patients, such as which topographical region showed most patients.
The Solution
We helped them infer an assessment index based on the above information. This traits of
this index helped them identify where they were lacking in their assessment process and
they improved it to a considerable level. Also, they could now predict when and how many
interns to hire.
Other most import factor was treatment. We developed the reports that helped them study
the treatment process. This included analysing the following parameters,
Several other reports, such as a map showing topography of the patients, regularity of
employees, year-year and quarter-quarter revenue, branch wise revenue etc. were also
developed.