logo资料库

[PDF]Analyzing Data with Power BI and Power Pivot for Excel.pdf

第1页 / 共267页
第2页 / 共267页
第3页 / 共267页
第4页 / 共267页
第5页 / 共267页
第6页 / 共267页
第7页 / 共267页
第8页 / 共267页
资料共267页,剩余部分请下载后查看
Analyzing Data with Microsoft Power BI and Power Pivot for Excel Alberto Ferrari and Marco Russo
PUBLISHED BY Microsoft Press A division of Microsoft Corporation One Microsoft Way Redmond, Washington 98052-6399 Copyright © 2017 by Alberto Ferrari and Marco Russo. All rights reserved. No part of the contents of this book may be reproduced or transmitted in any form or by any means without the written permission of the publisher. Library of Congress Control Number: 2016931116 ISBN: 978-1-5093-0276-5 Printed and bound in the United States of America. First Printing Microsoft Press books are available through booksellers and distributors worldwide. If you need support related to this book, email Microsoft Press Support at mspinput@microsoft.com . Please tell us what you think of this book at https://aka.ms/tellpress . This book is provided “as-is” and expresses the author’s views and opinions. The views, opinions and information expressed in this book, including URL and other Internet website references, may change without notice. Some examples depicted herein are provided for illustration only and are fictitious. No real association or connection is intended or should be inferred. Microsoft and the trademarks listed at https://www.microsoft.com on the “Trademarks” webpage are trademarks of the Microsoft group of companies. All other marks are property of their respective owners. Acquisitions Editor: Devon Musgrave Editorial Production: Polymath Publishing Technical Reviewer: Ed Price Copy Editor: Kate Shoup Layout Services: Shawn Morningstar Indexing Services: Kelly Talbot Editing Services Proofreading Services: Corina Lebegioara Cover: Twist Creative • Seattle
Contents at a glance Introduction Chapter 1 Introduction to data modeling Chapter 2 Using header/detail tables Chapter 3 Using multiple fact tables Chapter 4 Working with date and time Chapter 5 Tracking historical attributes Chapter 6 Using snapshots Chapter 7 Analyzing date and time intervals Chapter 8 Many-to-many relationships Chapter 9 Working with different granularity Chapter 10 Segmentation data models Chapter 11 Working with multiple currencies Appendix A Data modeling 101 Index
Contents Introduction Who this book is for Assumptions about you Organization of this book Conventions About the companion content Acknowledgments Errata and book support We want to hear from you Stay in touch Chapter 1. Introduction to data modeling Working with a single table Introducing the data model Introducing star schemas Understanding the importance of naming objects Conclusions Chapter 2. Using header/detail tables Introducing header/detail Aggregating values from the header Flattening header/detail Conclusions Chapter 3. Using multiple fact tables Using denormalized fact tables Filtering across dimensions Understanding model ambiguity Using orders and invoices Calculating the total invoiced for the customer Calculating the number of invoices that include the given order of the given customer Calculating the amount of the order, if invoiced Conclusions Chapter 4. Working with date and time Creating a date dimension Understanding automatic time dimensions Automatic time grouping in Excel Automatic time grouping in Power BI Desktop Using multiple date dimensions Handling date and time Time-intelligence calculations Handling fiscal calendars Computing with working days Working days in a single country or region Working with multiple countries or regions Handling special periods of the year Using non-overlapping periods Periods relative to today Using overlapping periods Working with weekly calendars
Conclusions Chapter 5. Tracking historical attributes Introducing slowly changing dimensions Using slowly changing dimensions Loading slowly changing dimensions Fixing granularity in the dimension Fixing granularity in the fact table Rapidly changing dimensions Choosing the right modeling technique Conclusions Chapter 6. Using snapshots Using data that you cannot aggregate over time Aggregating snapshots Understanding derived snapshots Understanding the transition matrix Conclusions Chapter 7. Analyzing date and time intervals Introduction to temporal data Aggregating with simple intervals Intervals crossing dates Modeling working shifts and time shifting Analyzing active events Mixing different durations Conclusions Chapter 8. Many-to-many relationships Introducing many-to-many relationships Understanding the bidirectional pattern Understanding non-additivity Cascading many-to-many Temporal many-to-many Reallocating factors and percentages Materializing many-to-many Using the fact tables as a bridge Performance considerations Conclusions Chapter 9. Working with different granularity Introduction to granularity Relationships at different granularity Analyzing budget data Using DAX code to move filters Filtering through relationships Hiding values at the wrong granularity Allocating values at a higher granularity Conclusions Chapter 10. Segmentation data models Computing multiple-column relationships Computing static segmentation Using dynamic segmentation Understanding the power of calculated columns: ABC analysis
Conclusions Chapter 11. Working with multiple currencies Understanding different scenarios Multiple source currencies, single reporting currency Single source currency, multiple reporting currencies Multiple source currencies, multiple reporting currencies Conclusions Appendix A. Data modeling 101 Tables Data types Relationships Filtering and cross-filtering Different types of models Star schema Snowflake schema Models with bridge tables Measures and additivity Additive measures Non-additive measures Semi-additive measures Index
Introduction Excel users love numbers. Or maybe it’s that people who love numbers love Excel. Either way, if you are interested in gathering insights from any kind of dataset, it is extremely likely that you have spent a lot of your time playing with Excel, pivot tables, and formulas. In 2015, Power BI was released. These days, it is fair to say that people who love numbers love both Power Pivot for Excel and Power BI. Both these tools share a lot of features, namely the VertiPaq database engine and the DAX language, inherited from SQL Server Analysis Services. With previous versions of Excel, gathering insights from numbers was mainly a matter of loading some datasets and then starting to calculate columns and write formulas to design charts. Yes, there were some limitations: the size of the workbook mattered, and the Excel formula language was not the best option for huge number crunching. The new engine in Power BI and Power Pivot is a giant leap forward. Now you have the full power of a database and a gorgeous language (DAX) to leverage. But, hey, with greater power comes greater responsibility! If you want to really take advantage of this new tool, you need to learn more. Namely, you need to learn the basics of data modeling. Data modeling is not rocket science. It is a basic skill that anybody interested in gathering insights from data should master. Moreover, if you like numbers, then you will love data modeling, too. So, not only is it an easy skill to acquire, it is also incredibly fun. This book aims to teach you the basic concepts of data modeling through practical examples that you are likely to encounter in your daily life. We did not want to write a complex book on data modeling, explaining in detail the many complex decisions that you will need to make to build a complex solution. Instead, we focused on examples coming from our daily job as consultants. Whenever a customer asked us to help solve a problem, if we felt the issue is something common, we stored it in a bin. Then, we opened that bin and provided a solution to each of these examples, organizing them in a way that it also serves as a training on data modeling. When you reach the end of the book, you will not be a data-modeling guru, but you will have acquired a greater sensibility on the topic. If, at that time, you look at your database, trying to figure out how to compute the value you need, and you start to think that—maybe—changing the model might help, then we will have accomplished our goal with this book. Moreover, you will be on your path to becoming a successful data modeler. This last step—that is, becoming a great data modeler—will only come with experience and after many failures. Unfortunately, experience is not something you can learn in a book. Who this book is for This book has a very wide target of different kind of people. You might be an Excel user who uses Power Pivot for Excel, or you may be a data scientist using Power BI. Or you could be starting your career as a business-intelligence professional and you want to read an introduction to the topics of data modeling. In all these scenarios, this is the book for you. Note that we did not include in this list people who want to read a book about data modeling. In fact, we wrote the book thinking that our readers probably do not even know they need data modeling at all. Our goal is to make you understand that you need to learn data modeling and then give you some insights into the basics of this beautiful science. Thus, in a sentence if you are curious about what data modeling is and why it is a useful skill, then this is the book for you. Assumptions about you We expect our reader to have a basic knowledge of Excel Pivot Tables and/or to have used Power BI as a reporting and modelling tool. Some experience in analysis of numbers is also very welcome. In the book, we do not cover any aspect of the user interface of either Excel or Power BI. Instead, we focus only on data models, how to build them, and how to modify them, so that the code becomes easier to write. Thus, we cover “what you need to do” and we leave the “how to do it” entirely to you. We did not want to write a step-by-step book. We wanted to write a book that teaches complex topics in an easy way. One topic that we intentionally do not cover in the book is the DAX language. It would have been impossible to treat data modeling and DAX in the same book. If you are already familiar with the language, then you will benefit from reading the many pieces of DAX spread throughout this book. If, on the other hand, you still need to learn DAX, then read The Definitive Guide to DAX , which is the most comprehensive guide to the DAX language and ties in well with the topics in this book. Organization of this book The book starts with a couple of easy, introductory chapters followed by a set of monographic chapters, each one covering some specific kind of data model. Here is a brief description of each chapter: Chapter 1 , “Introduction to data modeling ,” is a brief introduction to the basic concepts of data modeling. Here we introduce what data modeling is, we start speaking about granularity, and we define the basic models of a data warehouse—that is, star schemas, snowflakes, normalization, and denormalization. Chapter 2 , “Using header/detail tables ,” covers a very common scenario: that of header/detail tables. Here you will find discussions and solutions for scenarios where you have, for example, orders and lines of orders in two separate fact tables. Chapter 3 , “Using multiple fact tables ,” describes scenarios where you have multiple fact tables and you need to build a report that mixes them. Here we stress the relevance of creating a correct dimensional model to be able to browse data the right way. Chapter 4 , “Working with date and time ,” is one of the longest of the book. It covers time intelligence calculations. We explain how to build a proper date table and how to compute basic time intelligence (YTD, QTA, PARALLELPERIOD, and so on), and then we show several examples of working day calculations, handling special periods of the year, and working correctly with dates in general. Chapter 5 , “Tracking historical attributes ,” describes the use of slowly changing >dimensions in your model. This chapter provides a
分享到:
收藏