Category Archives: Business Intelligence

MDX: Tail

2020-08-13 Cordt

To retrieve the last member of a set you can use the TAIL() function.

The following example returns the total turnover over the last member in 2020-02 for company 4711.

SELECT {
	TAIL(NONEMPTY([DatumStapel].[JMT].[T].Members, { ([Datenherkunft].[Datenherkunft].&[200], [Company].[Company].&[4711], [DatumStapel].[JMT].[M].&[2020]&[2]) }))
} on columns
, { [Measures].[Umsatz] } on rows
FROM [Finance]

MDX: ParallelPeriod with CurrentMember

2020-08-12 Cordt

Problem

You want to create a Calculated Member that displays the Totals of the previous year using ParallelPeriod.

Solution

See below for a Year- and at the end of the post for a Month-based version.
It is important to mention, that the Hierarchy used in ParallelPeriod function is the same Hierarchy used on an axis!
And another clue is, to use CurrentMember property on the hierarchy’s level and not below on a Sub-Level (here: J, M or T)!

WITH MEMBER [Measures].[UmsatzNegiertLY] AS
'
	([Measures].[Umsatz Negiert], 
	ParallelPeriod([DatumStapel].[JMT].[J]
		, 1
		,[DatumStapel].[JMT].CurrentMember)
	)
'
SELECT 
{ [DatumStapel].[JMT].[J].&[2019] : [DatumStapel].[JMT].[J].&[2020] } ON columns
,{
	[Measures].[Umsatz Negiert], 
	Measures.UmsatzNegiertLY } ON rows
FROM Finance

WITH MEMBER [Measures].[UmsatzNegiertLY] AS
'
	([Measures].[Umsatz Negiert], 
	ParallelPeriod([DatumStapel].[JMT].[M]
		, 12
		,[DatumStapel].[JMT].CurrentMember)
	)
'
SELECT 
{ [DatumStapel].[JMT].[M].&[2019]&[1] : [DatumStapel].[JMT].[M].&[2020]&[12] } ON columns
,{
	[Measures].[Umsatz Negiert], 
	Measures.UmsatzNegiertLY } ON rows
FROM Finance

M (Power Query), Power BI

Power Query (M-Language) Apply Where Clause to Table DateTime.Now() / GetDate()

2019-10-22 Cordt

If you want to apply a WHERE clause to the Data of a table, for example to limit the result to only valid rows (where GETDATE() between ValidFrom and ValidTo) go to the Advanced Query Editor and add the Table.SelectRows() statement, as follows:

let
    Quelle = Sql.Database("localhost\DWH_DEV", "DWH", [CommandTimeout=#duration(0, 0, 5, 0)]),
    etl_Datastore = Quelle{[Schema="etl",Item="Datastore"]}[Data],
    ds = Table.SelectRows(etl_Datastore, each [ValidFrom] <= DateTime.Date(DateTime.LocalNow()) and [ValidTo] >=DateTime.Date(DateTime.LocalNow()))
in
    ds

Power BI

Add KPI Symbol to Table Visual in Power BI

2019-10-22 Cordt

If you want to display an Icon within a Table instead of a boolean True/False Text, go ahead and apply the screenshotted solution.

Remember that you have to use a Measure as basement field (Basierend auf Feld). I’ve included an example to the right in the grayed-out rectangle (Anz_Failed = … ).

Possible result may be the following…

Business Intelligence, Performance

SQL Server Connection Packet Size

2019-10-21 Cordt

What is the “best” PacketSize argument to establish a connection to a SQL Server?
The answer is: It depends (on your very specific network)!

In our case a setting of a value higher than 1500 Bytes does not have a positive outcome. That is because the MTU of the Network has a maximum of 1.500 Bytes per Packet. This is often the case, because the implementation of the IP Protocol v2 uses this as the maximum packet size.

To figure out, what is the maximum value in your network, use the following Power Shell Script, originally authored by Thomas Stringer (http://sqlsalt.com), found in this great article by Robert L Davies (http://sqlsoldier.net/wp/sqlserver/networkpacketsizetofiddlewithornottofiddlewith):

$UpperBoundPacketSize = 1500
 
if ($args -eq "-Q") {$QuietMode = 1}
else {$QuietMode = 0}
 
$IpToPing = "mp-bi"
 
do {
    if ($QuietMode -eq 0) {Write-Host "Testing packet size $UpperBoundPacketSize"}
    $PingOut = ping $IpToPing -n 1 -l $UpperBoundPacketSize -f
    $UpperBoundPacketSize -= 1
} while ($PingOut[2] -like "*fragment*")
 
$UpperBoundPacketSize += 1
 
$Mtu = $UpperBoundPacketSize + 28
 
Write-Host "MTU: $Mtu" -ForegroundColor Green

Business Intelligence, Security

Reset MySQL Root Password On Windows

2019-10-17 Cordt

If the installation took place using the Windows Installer, you need to do the following:

Stop the MySQL service (via services.msc)
Create a Text File with Read Access to Everyone (easiest way) containing the following code. In this example the file is named C:\Temp\mysql_reset_root.txt:
Invoke the following cmd command (depending on your MySQL Version you may adjust the directory name) with Administrative Access:
In my case after invoking the command it dit not return a value and idled for more than 30 minutes. But I was able to login very shortly.

-- Text File Content of C:\Temp\mysql_reset_root.txt

ALTER USER 'root'@'localhost' IDENTIFIED BY '{YOURNEWPASSWORDHERE}';

-- Run Cmd with Administrator rights...

C:\Program Files\MySQL\MySQL Server 8.0\bin\mysqld --defaults-file="C:\\ProgramData\\MySQL\\MySQL Server 8.0\\my.ini" --init-file=C:\\Temp\\mysql_reset_root.txt

For more information go here… https://dev.mysql.com/doc/mysql-windows-excerpt/5.7/en/resetting-permissions-windows.html

Analysis Services, Business Intelligence, SQL Server Features

Add Linked Server to OLAP Cube in SQL Server

2019-10-15 Cordt

To use the OpenQuery() procedures within SQL Server to query a SQL Server Analysis Services Cube, you need to add a Linked Server with Credentials of a valid Domain User, as below:

USE [master]
GO

/****** Object:  LinkedServer [CUBE_FINANCEV2]    Script Date: 10/15/2019 11:53:16 AM ******/
EXEC master.dbo.sp_addlinkedserver @server = N'CUBE_FINANCEV2', @srvproduct=N'MSOLAP', @provider=N'MSOLAP', @datasrc=N'mp-bi', @catalog=N'Finance-v2'
 /* For security reasons the linked server remote logins password is changed with ######## */
EXEC master.dbo.sp_addlinkedsrvlogin @rmtsrvname=N'CUBE_FINANCEV2',@useself=N'False',@locallogin=NULL,@rmtuser=N'MUC\mp_BI_SQL_AS',@rmtpassword='########'
EXEC master.dbo.sp_addlinkedsrvlogin @rmtsrvname=N'CUBE_FINANCEV2',@useself=N'True',@locallogin=N'MUC\mp_BI_SQL_AS',@rmtuser=NULL,@rmtpassword=NULL
GO

EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'collation compatible', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'data access', @optvalue=N'true'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'dist', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'pub', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'rpc', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'rpc out', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'sub', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'connect timeout', @optvalue=N'45'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'collation name', @optvalue=null
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'lazy schema validation', @optvalue=N'false'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'query timeout', @optvalue=N'0'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'use remote collation', @optvalue=N'true'
GO
EXEC master.dbo.sp_serveroption @server=N'CUBE_FINANCEV2', @optname=N'remote proc transaction promotion', @optvalue=N'true'
GO

To Query the Processing Status or the last date and time of processing of the Cube, you may create and execute the following Stored Procedure (that uses the created Linked Server from above):

USE [DWH]
GO
/****** Object:  StoredProcedure [etl].[GetCubeProcessingStatus]    Script Date: 14.10.2019 18:21:42 ******/
SET ANSI_NULLS ON
GO
SET QUOTED_IDENTIFIER ON
GO
-- =============================================
-- Author:		Cordt Bien
-- Create date:		29.05.2019
--
-- Last Modified:	Date, Name (short reason.)
--
-- Description:		Returns the date and time of the last processing of a cube or perspective. May return multiple rows.
--
-- Info:			EXEC etl.GetCubeProcessingStatus 'Finance-v2', NULL
--			OR		EXEC etl.GetCubeProcessingStatus 'Finance-v2', 'Datev-Finance'
-- =============================================
CREATE PROCEDURE [etl].[GetCubeProcessingStatus]
		@cubeDatabaseName sysname = 'Finance-v2',	-- must not be null!
		@cubeOrPerspectiveName sysname = NULL		-- optional to limit returned result to one row.
--WITH EXECUTE AS 'InternalExecutor' -- (IF YOU USE THE "WITH EXECUTE AS" clause, the Execution will fail due to Security Context Exceptions of the Linked Server)
AS
BEGIN
	SET NOCOUNT ON;

	-- Check Preconditions
	BEGIN TRY
		IF @cubeDatabaseName IS NULL RAISERROR('cubeDatabaseName must not be null!', 11, 1);

	END TRY
	BEGIN CATCH	
		EXEC internal.ReThrowError @@PROCID;
		RETURN -1;
	END CATCH

	-- Implementation
	BEGIN TRY

		IF @cubeDatabaseName = 'Finance-v2' BEGIN
			SELECT DatabaseName, CubeOrPerspectiveName, CubeOrPerspectiveDisplayName, DATEADD(hour, DATEDIFF(hour, GETUTCDATE(), GETDATE()), LastProcessDate) as LastProcessDate, BaseCubeName 
			FROM OpenQuery(CUBE_FINANCEV2
					, N'SELECT [CATALOG_NAME]  AS DatabaseName,[CUBE_NAME] AS  CubeOrPerspectiveName,[CUBE_CAPTION] AS CubeOrPerspectiveDisplayName,[LAST_DATA_UPDATE] AS LastProcessDate,[BASE_CUBE_NAME] AS BaseCubeName FROM $system.mdschema_cubes WHERE CUBE_SOURCE=1')
			WHERE (@cubeOrPerspectiveName IS NULL OR CAST(CubeOrPerspectiveName as nvarchar(128)) = @cubeOrPerspectiveName);

		END
		ELSE BEGIN
			RAISERROR('A cubeDatabaseName other than "Finance-v2" is currently not supported or implemented! DEV-Info: Add a new linked server to the requested cube and insert it to this stored procedure (etl.GetCubeProcessingStatus).', 11, 1);

			SELECT TOP 0 DatabaseName, CubeOrPerspectiveName, CubeOrPerspectiveDisplayName, DATEADD(hour, DATEDIFF(hour, GETUTCDATE(), GETDATE()), LastProcessDate) as LastProcessDate, BaseCubeName 
			FROM OpenQuery(CUBE_FINANCEV2
					, N'SELECT [CATALOG_NAME]  AS DatabaseName,[CUBE_NAME] AS  CubeOrPerspectiveName,[CUBE_CAPTION] AS CubeOrPerspectiveDisplayName,[LAST_DATA_UPDATE] AS LastProcessDate,[BASE_CUBE_NAME] AS BaseCubeName FROM $system.mdschema_cubes WHERE CUBE_SOURCE=1')
			WHERE (@cubeOrPerspectiveName IS NULL OR CAST(CubeOrPerspectiveName as nvarchar(128)) = @cubeOrPerspectiveName);
		END

	END TRY
	BEGIN CATCH	
		EXEC internal.ReThrowError @@PROCID;
		RETURN -2;
	END CATCH
END

Business Intelligence, SQL Server Features

Create dbo.SysSSISLOG Table in your Database

2019-06-27 Cordt

When you script your database and want to include the dbo.SYSSSISLOG table, because you’ve created Views or procedures referencing it, the following piece of code does the job:

if not exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[sysssislog]'))
CREATE TABLE [sysssislog] (	[id] [int] NOT NULL IDENTITY PRIMARY KEY,	[event] [sysname] NOT NULL,	[computer] [nvarchar] (128) NOT NULL,	[operator] [nvarchar] (128) NOT NULL,	[source] [nvarchar] (1024) NOT NULL,	[sourceid] [uniqueidentifier] NOT NULL,	[executionid] [uniqueidentifier] NOT NULL,	[starttime] [datetime] NOT NULL,	[endtime] [datetime] NOT NULL,	[datacode] [int] NOT NULL,	[databytes] [image] NULL,	[message] [nvarchar] (2048) NOT NULL,) ON [PRIMARY] TEXTIMAGE_ON [PRIMARY]
go
CREATE PROCEDURE [dbo].[sp_ssis_addlogentry]  @event sysname,  @computer nvarchar(128),  @operator nvarchar(128),  @source nvarchar(1024),  @sourceid uniqueidentifier,  @executionid uniqueidentifier,  @starttime datetime,  @endtime datetime,  @datacode int,  @databytes image,  @message nvarchar(2048)AS  INSERT INTO sysssislog (      event,      computer,      operator,      source,      sourceid,      executionid,      starttime,      endtime,      datacode,      databytes,      message )  VALUES (      @event,      @computer,      @operator,      @source,      @sourceid,      @executionid,      @starttime,      @endtime,      @datacode,      @databytes,      @message )  RETURN 0
go
execute [sp_MS_marksystemobject] N'[dbo].[sp_ssis_addlogentry]'
go
execute [sp_MS_marksystemobject] N'[sysssislog]'
go

The previous script is the same code that Visual Studio or SSIS itself executes when running a ETL Package the first time.

Business Intelligence, Performance, Uncategorized

Fastest Way to retrieve Data from Data Warehouse in Excel via VBA Macro

2019-06-25 Cordt

Use a code similar to this one in VBA:

Include following References: Microsoft ActiveX Data Objects 6.* Library

Be sure to return values as “datetime” instead of “date” or “datetime2” to get recognized as a date value in Excel.

Public Sub GetDwhData()
    Dim wksTarget As Worksheet: Set wksTarget = Worksheets(1)
    Dim cellAddressStart As String: cellAddressStart = "B2"
    
    Application.ScreenUpdating = False
    Application.Calculation = xlCalculationManual
    
    On Error GoTo ErrHandling
    
    Set objMyConn = New ADODB.Connection
    Set objMyRecordset = New ADODB.Recordset
    Dim strSQL As String

    objMyConn.ConnectionString = "Provider=SQLOLEDB;Data Source=tcp:localhost,1344;Initial Catalog=DWH;Integrated Security=SSPI"
    objMyConn.Open

    strSQL = "select TOP 1000 * from dbo.Datasource"

    Set objMyRecordset.ActiveConnection = objMyConn
    objMyRecordset.Open strSQL

    'Copy Data to Excel...
    wksTarget.Range(cellAddressStart).CopyFromRecordset (objMyRecordset)
        
        
    objMyRecordset.Close
    objMyConn.Close
    
    wksTarget.Calculate 'Formeln neu berechnen
        
ErrHandling:
    Application.Calculation = xlCalculationSemiautomatic
    Application.ScreenUpdating = True
    
End Sub

Analysis Services

SSAS: How to Sort parent Hierarchy Members of a Dimension by another Attribute

2019-05-20 Cordt

Scenario: You have a small Dimension containing a Hierarchy. You need all the child members as well as the parent members to be sorted.

Solution excerpt: Add the Sort Attribute by which the Parent Attribute should be ordered by as a new Attribute Relationship to the Dimension and then set the OrderBy property to Attribute Key and the OrderByAttribute to the Sort Attribute

Example

Parent Attribute: “PuL”
Child Attribute: “BwaZeileKey“
Child Sort Attribute: “Sortierung“
Parent Sort Attribute: “PulSortierung“

Extract of BwaZeileDim which is joined to PulDim that contains the PulSortierung

Result

See how the parent members are sorted by a userdefined key attribute and not by name

Business Intelligence, SQL Server Errors

SSIS Error: 0xc001000e The connection manager “” is not found.

2019-03-25 Cordt

Solution:

If your .dtsx XML file (search for LogProv) does not contain the tag “DTS:ConfigString=“, you missed to populate the Connection Configuration DropDown in the Logging-Wizard dialog:

<DTS:LogProviders>
    <DTS:LogProvider
      DTS:ConfigString="CON_DWH"
      DTS:CreationName="Microsoft.LogProviderSQLServer"
      DTS:Description="Schreibt Protokolleinträge für Ereignisse in eine SQL Server-Datenbank."
      DTS:DTSID="{9FE46402-8C22-49AB-887C-056C89DD00CE}"
      DTS:ObjectName="SSIS-Protokollanbieter für SQL Server">
      <DTS:ObjectData>
        <InnerObject />
      </DTS:ObjectData>
    </DTS:LogProvider>
  </DTS:LogProviders>

Business Intelligence, SQL Server Errors

Fehler im OLAP-Speichermodul: Ein doppelter Attributschlüssel wurde bei der Verarbeitung gefunden

2019-03-22 Cordt

Fehler im OLAP-Speichermodul: Ein doppelter Attributschlüssel wurde bei der Verarbeitung gefunden:
Tabelle: TABELLE,
Spalte: SPALTENNAMEINTABELLE,
Wert: 0.
Das Attribut ist ‘ATTRIBUTENAMEINDIMENSION‘.

Mögliche Lösung:

Gehe zu ATTRIBUTENAMEINDIMENSION und entferne die Name-/ValueColumn-Einträge.

Ursache m.M.n: “Bug/Fehlverhalten” in SSAS

Weitere Hilfe, Warnungsfehler in Hierarchien hier:

https://social.msdn.microsoft.com/Forums/sqlserver/de-DE/91e6b21c-e9ad-43ea-8eb8-d6b5f543d27d/ssas-defaulteigenschaften-von-dimensionen-fhren-zu-warnungen

Analysis Services

Supported Data Sources of SSAS Multidimensional vs. Tabular vs. Excel Power Pivot

2016-03-10 Cordt

Supported Data Sources

Source and Version	Multidimensional				Tabular				Excel Power Pivot
Source and Version	2008 R2	2012	2014	2016	2012	2014	2016	2016 Direct Query	Excel Power Pivot
SQL Server 7.0 / 2000	yes	no	no	no	no	no	no	no	no
SQL Server 2005	yes	yes	yes	(no)?	yes	yes	(no)?	(no)?	yes
SQL Server 2008 – 2012	yes	yes	yes	yes	yes	yes	yes	yes	yes
SQL Server 2014	?	(yes)*	yes	yes	(yes)*	(yes)*	yes	yes	(yes)*
SQL Server 2016	?	(yes)*	(yes)*	yes	(yes)*	(yes)*	yes	yes	(yes)*
SQL Azure Database	yes	yes	yes	yes	yes	yes	yes	yes	yes
SQL Server Parallel Data Warehouse (PDW) / Analytics Platform System (APS)	yes	yes	yes	yes	yes	yes	yes	yes	yes
Oracle relational 9i, 10g, 11g	yes (9.0)	yes	yes	yes	yes	yes	yes	yes	yes
Oracle relational 12g	?	?	?	yes	?	?	yes	yes	?
Teradata relational V2R6, V12	yes (v2R6)	yes	yes	yes	yes	yes	yes	yes	yes
IBM DB2 relational V8.1	yes (with EE only)	yes	yes	yes	yes	yes	yes	no	yes
Informix relational V11.10	no	yes	yes	yes	yes	yes	yes	no	yes
Sybase relational	no	yes	yes	yes	yes	yes	yes	no	yes
Text Files (.CSV, .TAB, .TXT)	no	no	no	no	yes	yes	yes	no	yes
Microsoft Excel 97-2007 (.xlsx, .xlsm, xlsb, xltx, xltm)	no	no	no	no	yes	yes	(no)?	no	yes
Microsoft Excel 2010 (.xlsx, .xlsm, xlsb, xltx, xltm)	no	no	no	no	yes	yes	yes	no	yes
Microsoft Excel 2013-2016 (.xlsx, .xlsm, xlsb, xltx, xltm)	no	no	no	no	(no)?	(no)?	yes	no	?
Excel documents in SharePoint	no	no	no	no	no	no	no	no	no
Microsoft Access 2003	yes (x86 only)	yes	?	?	yes	yes	no	no	yes
Microsoft Access 2007	yes (x86 only)	yes	yes	?	yes	yes	no	no	yes
Microsoft Access 2010	?	yes	yes	yes	yes	yes	yes	no	yes
Microsoft Access 2013	?	?	yes	yes	(no)?	?	yes	no	?
Microsoft Access 2016	?	?	?	yes	(no)?	?	yes	no	?
Access documents in SharePoint	no	no	no	no	no	no	no	no	no
Power Pivot Workbook published to SharePoint	no	no	no	no	yes	yes	yes	no	yes
SSAS Analysis Services Cube (2005)	no	no	no	no	yes	yes	(no)?	no	yes
SSAS Analysis Services Cube (2008-2008R2)	no	no	no	no	yes	yes	yes	no	yes
SSAS Analysis Services Cube (2012-2014)	no	no	no	no	(yes)*	(yes)*	yes	no	(yes)*
SSAS Analysis Services Cube (2016)	no	no	no	no	?	?	yes	no	(yes)*
Data Feeds Atom 1.0 format (exposed via WCF Data Service) (.atomsvc or .atom)	no	no	no	no	yes	yes	yes	no	yes
Office Database Connection files	no	no	no	no	yes	yes	yes	no	yes

Analysis Services

Add KPI (Key Performance Indicator) based on Delivery Days between two Dates to Tabular Project

2016-03-10 Cordt

You have two Dates in your Tabular Project

Create a new calculated column:

Give it the name “DeliveryDays“.

2. Enter following formula:

=IF(isblank([OrderDate]);0;IF([ShipDate]<[OrderDate];0;[ShipDate]-[OrderDate]))*1

3. Go to the measure grid (bottom) and enter a new measure to the previously created column:

Average of DeliveryDays:=AVERAGE([DeliveryDays])

4. Create KPI by right clicking on the newly created Average measure:

5. A new dialog will pop up and you have to define the KPI details:

After hitting OK you will notice a KPI Indicator Symbol in your measure grid:

6. Analyze the Results in Excel:

This was a quick tutorial of how to create a KPI using a new calculated column in a Tabular Project using SSDT Visual Studio 2013.

Business Intelligence

BUG SSDT-BI Visual Studio The operation could not be completed. No such interface supported. (when saving project)

2016-03-04 Cordt

Problem

You retrieve the Exception:

Microsoft Visual Studio

The operation could not be completed. No such interface supported.

Solution

Go to Visual Studio Menu TOOLS | Options…

Under Project and Solutions | General check the Option to Save a Project during creation which is by default unchecked:

This workaround is only a solution to new projects. For your current project you may go to the temporary folder (Right Click on your Project and Open Path in Explorer) and copy the contents of your project to another place.

Business Intelligence

Data Warehouse Semantics Basics

2016-03-03 Cordt

Surrogate Keys

Surrogate Keys or IDs uniquely identify a record in a data set (table). Typically they (should) consist of only one column. Best example: ID column that is automatically incremented.

A Surrogate Key is also known as Meaningless Key.

Surrogate Keys should be used for internal purposes inside a relational database. They should be hidden to end users. To display unique identifiers to end users you may want to introduce an Alternate Key.

Surrogate Keys are “artificial” or a “substitute“. Some folks say a Surrogate Key is always a substitute for something and if there is nothing to substitue they cannot use the term Surrogate Key. I encourage you to be as precise as necessary.

Alternate Keys

Alternate Keys usually are a readable identifier for humans. You may want to include Alternate Keys if you need to display readable identifiers for a record to end users. It also may contain certain “meaning” in its value. This might be the case if it is derived from multiple columns of a record.

Alternate Keys may be artificial but are not limited to this. For example: A book record may contain an internal ID. It may also carry a ISBN number that may uniquely identify this book. Another example could be a product that has a Product No used in publications and order systems. This column may also be used as an Alternate Key column.

Natural Keys

Natural Keys are columns that out-of-nature create a unique identifier for a record. They contain readable text. Natural Keys may be created across multiple columns. Examples for a natural key are an ISBN number, a social insurance number or the tuple {FirstName, LastName, Address, BirthDate}.

Natural Keys are fine and you may want to use them as a basis for your Alternate Keys for display purposes.

But !NEVER! use Natural Keys as the Primary Key column to create relations between tables!! Many years ago I worked in a fashion company in which they used readable product numbers (consisting of a category and other stuff) as Primary Keys and after 25 years of business the numbers went out. Nice work followed :-).

ID Column

An ID or Identifier column is usually a non-readable internal column used to uniquely identify a record and to create relations between tables.

I recommend naming an Identifier column with only the two letters ID. Following the DRY principle you should not repeat the table name in all columns. Sticking to this principle will also make your queries easier to read. The ID column will always be addressed by a prefix table like myTableName.ID.

A Foreign Key column is always named following the schema: Table name + ID.

If your Product table has its own ID column and a ProductCategoryID column then it is easy to understand what might be the PK (*) and what is a FK to another table.

Imagine you have a large table with 20 Foreign Keys to other tables and its own ID column. It’s easy using Intellisense to distinguish the ID column from all the other Foreign Keys without having to find out the name of the column that is the ID column of that table (as this must not be the Primary Key and therefore marked with a yellow key symbol).

I encourage you to use the term ID in relational databases and omitting the term Key from being used in an OLTP system.

You may use the term Key as a substitue for the term ID in Data Warehouse scenarios. But please decide on which term you exclusively want to use.

Don’t start mixing Key with ID columns in the same database!

Conformed Dimension

Non-Conformed Dimension

Shared Dimension

Normalized Dimension

Denormalized Dimension

Degenerated Dimension

Additive Measures

Semi-additive Measures

Non-additive Measures

Analysis Services, Business Intelligence

Analysis Services Tabular Project Dialogs in SSDT-BI

2016-03-03 Cordt

About

This tutorial gives you an overview of common dialogs and windows used to configure Analysis Services Tabular Projects.

Initial Dialogs

Create a New Project

The created file will be named TabularProject1.smproj.

Tabular Model Designer (Choose a Connection)

Further information on Compatibility Levels MSDN here.

Model.bim Properties window

Build Action: Compile | None
Copy To Output Directory: Do not copy | Copy always | Copy if newer
Data Backup: Back up to disk | Do not back up to disk

If enabled: An ABF file is automatically created when a BIM file is saved. For further information on how to a restore a .abf file see mssqltips here. For contents of the backuped .abf file see MSDN here.

Direct Query Mode: On | Off

Benefits if enabled: Data will always be up-to-date. Data sets can be huge (larger than RAM of the SSAS instance). May be faster due to query acceleration in the data source (xVelocity column indexes for example). Security can be enforced (for example row-level security in RDBMS database). Complex formulas might be faster if the back-end can optimize the query plan. Restrictions: All your data in the Tabular Model must come from a single relational data source (either SQL Server, Oracle or Teradata). You must not use data from other sources. If you need data from elsewhere you first have to create a DWH that contains all data. Then you can use this DWH as the only source for your Tabular Model. Excel in DirectQuery mode: MDX queries from Excel to a Tabular Model is only supported in SQL Server 2016 (or above) and the compatibility level is set to 1200. For further information see MSDN here.

Workspace Retention: Keep in memory | Unload from memory | Delete workspace

Keep in memory: The workspace database (that includes the model metadata, imported data and credentials) will be kept in memory after closing a model. Your model will load faster but you might lose all of your memory. Unload from memory: The workspace database will be written to disk on closing and reloaded into memory on reload. Should be used on a remote workspace database or if you have a limited RAM amount. Delete workspace: The workspace database will be deleted on closing your model. Your model will always need more time to load as it must restore any settings. Use this option when you are rarely working with your model.

Workspace Server: This is the connection to a SSAS Tabular Mode Instance used for storing and editing temporary the current in-memory model. See Workspace Database property to examine the temporary database name created on this instance.

Import a new Data Model from Data Source

This section describes the steps to import related Fact and Dimension tables from Adventureworks DW 2014. Go to the menu bar in SSDT, click on MODEL and then Import From Data Source…

The Table Import Wizard opens.

Table Import Wizard Step 1

Select the relational data source instance and Database to connect to.

Table Import Wizard Step 2

Enter the credentials of a user specific to this project to connect to the data source. You should always try to create an Active Directory user for each Project you are creating to be able to give project specific permissions to this user in the data source. Don’t use a “sa” account.

Table Import Wizard Step 3

Decide whether to select data from given tables or views or by specifying your own SQL Query.

Table Import Wizard Step 4 (Select Tables and Views)

Select the Tables/Views. In this example I checked the FactInternetSales table and then used the handy Select Related Tables button.

Preview & Filter dialog:

Table Import Wizard Step 5 (Finishing -> Importing)

The Data is transferred and the selected tables are imported into your model. Hit Close.

Configure your Tabular Model

When you switch to Diagram View, you will see the following result:

Table Properties

Default Field Set: Determines which attributes are being displayed by default when a client connects to this table. See also MSDN here.
Hidden: True | False. Specifies if the table is hidden in client tools.
Partitions: Lets you define logical partitions for single processing. Can be very helpful on huge tables. They allow parallel processing. See also MSDN here.

Source Data: View or edit the source data.

Table Behavior: Only affects PowerView. Does not apply to Excel Pivot! You can define table behavior properties for tables of primary interest (for example: Customer, Employee, …). In most cases you do not need to set properties for these table types: Date, Category, Group, Department, Lookup and Summary tables… . For details and example of table behavior see MSDN here.

Column Properties

For an (currently in-)complete property overview see MSDN here.

Column Name: Name of the column stored in the model. Also referred to as Friendly Name. This property value is the name of a column displayed in client reporting tools.
Data Category: One of the following values:

Data Format: Lets you define the displayed format of this column in client tools. Options follow (Numeric and Date based columns). See also MSDN here.

Data Type: Underlying Data Type used on Import. Do not change.
Default Label: True | False. Set to true to set a specific column as the Label column displayed in client tools for this table.
Hidden: True | False: Set to true to hide this column in client reporting tools.
Row Identifier: True | False: Set to true to set a specific column as the unique ID column. You will use a ID or surrogate key column.
Sort By Column: Specify another column to sort values in this column. You cannot specify a formula or measure.
Summarize By: By default a client reporting tool uses the SUM aggregation on values. If you need to set a specific calculation method use one of the following:

Table Detail Position: If you have a Default Field Set defined, you may change the order of a column by setting this value.

Creating a Hierarchy

We will create a Hierarchy on the Date Dimension as follows. First you will have to click the “Create Hierarchy” button on the upper right of a table:

Then you will give it a name: “Years and Parts”. After that you drag & drop the following three columns onto the Years and Parts hierarchy column:

Analyze Results with Excel Pivot

We will now analyze the results and the effects of the created hierarchy in Excel. Go to the menu bar in VS2013, open the MODEL menu and click Analyze in Excel.

Leave the “Current Windows User” option set in the following dialog and hit OK.

Excel opens with the following result:

Analysis Services, Business Intelligence

Picking the right SQL Server Data Tools (SSDT) Version for SSAS

2016-03-02 Cordt

You have SQL Server 2014 or SQL Server 2012 (SSAS/SSRS/SSIS)

and Visual Studio 2015: Download SSDT-BI Preview for VS2015
and Visual Studio 2013: Download SSDT-BI for VS2013
and Visual Studio 2012: Download SSDT-BI for VS2012

You have SQL Server 2016 CTP (SSAS/SSRS/SSIS)

and Visual Studio 2015: Download SSDT-BI Preview for VS2015

All of these SSDTs also support SQL Server 2008 and 2008 R2.

If you only have SQL Server 2008 or SQL Server 2008 R2

install Business Intelligence Development Studio (BIDS) via SQL Server 2008/R2 Setup

If you picked the SSDT-BI for VS2013 version, the installation will look like this:

Problem

Solution

Example

Result

Supported Data Sources

You have two Dates in your Tabular Project

Problem

Solution

Surrogate Keys

Alternate Keys

Natural Keys

ID Column

Conformed Dimension

Non-Conformed Dimension

Shared Dimension

Normalized Dimension

Denormalized Dimension

Degenerated Dimension

Additive Measures

Semi-additive Measures

Non-additive Measures

About

Initial Dialogs

Create a New Project

Tabular Model Designer (Choose a Connection)

Model.bim Properties window

Import a new Data Model from Data Source

Table Import Wizard Step 1

Table Import Wizard Step 2

Table Import Wizard Step 3

Table Import Wizard Step 4 (Select Tables and Views)

Table Import Wizard Step 5 (Finishing -> Importing)

Configure your Tabular Model

Table Properties

Column Properties

Creating a Hierarchy

Analyze Results with Excel Pivot

Embracing your Transact-SQL Code and Coding Tips.