Prepare the Data
1.1 Connecting to Data
·
Discover and connect to data sources
Learn how to connect Power BI to
different data sources or to an existing shared semantic model.
·
Adjust data source settings
Modify settings such as credentials
and privacy levels to ensure secure and appropriate access.
·
Select the right storage mode
Decide between using Import or
DirectQuery mode, depending on your data and performance needs.
·
Use parameters for flexibility
Create and manage parameters to make
your reports more dynamic and easier to update.
1.2 Profiling and Cleaning Data
·
Review data properties and statistics
Explore the data in Power Query
Editor by checking statistics and column details before loading it.
·
Fix inconsistencies and quality issues
Address problems like mismatched
data types, unexpected or missing values, and other data integrity concerns.
·
Handle errors during import
Resolve any errors that occur while
bringing data into Power BI to ensure smooth processing.
🔍 Key points to remember:
- Always
examine and verify data types in Power Query Editor before loading;
incorrect types can block relationships or calculations later.
- You
can adjust data types either in Power Query Editor or directly within the
Power BI Desktop’s Report View.
- Any
changes you make — like altering data types — are saved as "Applied
Steps," making it easy to track or undo transformations.
Power Query Editor Tips for Cleaning Data
·
Rename queries to make them clearer and
easier to manage.
·
Replace unexpected values to standardize
data.
·
Fill in or replace null values where
appropriate.
·
Remove duplicate rows to ensure data
uniqueness.
1.3 Transforming and Loading Data
· Assign
the right data types to each column for accurate calculations and
relationships.
· Create
or modify columns using custom formulas and transformations to prepare your
data for analysis.
· Group
and aggregate data to summarize details, such as computing totals or
averages by category.
· Reshape
your data by pivoting, unpivoting, or transposing tables to match your
reporting needs.
· Convert
semi-structured data (like JSON or XML) into tables that Power BI can
analyze.
· Build
fact and dimension tables to support a robust data model.
· Know
when to use reference vs duplicate queries — reference queries reuse logic
without copying data, while duplicates make a full copy, which can impact
performance.
· Merge
and append queries:
- Append
adds rows from one table to another, best when columns match. It doesn't
automatically remove duplicates.
- Merge
combines columns from two tables into a single one, based on a common key
column.
· Define
keys and relationships by identifying the correct columns to link tables
together.
· Configure
how queries load into your data model to optimize performance and control
what gets loaded into Power BI.
0 Comments