Posts Tagged ‘Data Governance’

Microsoft Opens Up Collaborative and Self-Service BI

Monday, June 21st, 2010

Just over a week ago I was invited to attend an analyst briefing at the Microsoft BI conference in New Orleans that was running alongside the Microsoft TechEd conference.  The conference itself was very well attended with several thousand delegates.  Several things were on show at this event including SharePoint 2010, SQL Server 2008 R2, Office 2010,  PowerPivot, PerformancePoint services 2010. Also on show was  SQL Server Data Warehousing Edition (also known as the Madison project) – the massively parallel edition of SQLServer that will be shipped later this year.

The one thing that stood out for me was the seismic shift towards collaborative BI.   As my friend Colin White so aptly put it in the analyst briefing, “Microsoft have brought BI to collaboration rather than collaboration to BI”.  This is an important point because what it is says is that there is little point adding collaborative features to a BI platform if these are not the services associated with a mainstream collaborative platform.  There is far more value in integrating a BI platform with the company collaboration software to tap into things like collaborative workspaces, presence awareness, unified communication, shared calendar etc. etc.  In Microsoft”s case this is of course the SharePoint product which has become viral in most organisations.

It is no surprise therefore that Microsoft’s BI initiative is built around 3 main components and not just SQL Server.  These are:

  • Office,
  • SharePoint
  • Microsoft SQL Server 2008 R2

Note that SQL Server 2008 R2 includes StreamInsight, Microsoft’s complex event processing (CEP) engine and Microsoft Master Data Services

While there we were take through an excellent demo to show the power of collaboration and what it can do when integrated with BI.  It even included the Microsoft Round Table device which although it has been available for some four years, was the first time I have actually encountered one.

What the demo showed me was the speed with which BI and BI ‘components’ can be spread among a community of users. My conclusion is that integration of SQL Server 2008 R2 with Sharepoint 2010 takes this to another level in that the rate that business intelligence can be shared it is almost ‘twitter speed’.  For those of you using twitter, you will know that as soon as something of interest breaks, re-tweets can spread it across masses of people in a matter of minutes.  This is the feeling I got during the demo.  It fuels mass sharing, mass reuse and mass development of BI applications and artifacts.  In particular reports and dashboards. It certainly fits with Microsoft”s vision of BI for everyone.

Several new features open up the flood gates for collaborative BI to share intelligence with other without the need for IT. For example,

BI reports can be managed by Sharepoint in document libraries. You can also preview reports before opening them up.

Also Microsoft is fueling development by business users on the back of what power users have done, thereby bypassing IT.  This is because there is now a capability whereby Microsoft ReportBuilder 3.0 can access PowerPivot workflows uploaded to SharePoint sites.  You can also export to Excel from PowerPivot.  Power users using PowerPivot (originally referred to as Gemini), can take data from different data sources (including newly supported Atom feeds), merge and join that data. Relationships between tables can be managed inside of PowerPivot.  PowerPivot power users can then create workflows that process this data and can upload these to Sharepoint sites.  ReportBuilder 3.0 (or any BI client) can then treat the PowerPivot workflow as a data source.  Not only that but ReportBuilder can create report parts which are sharable in a report part gallery do that other users can reuse them by simply dragging an dropping the report parts onto a new report for rapid development without having to know the detail underneath.

Hopefully by now you have got the picture – power users building their own workflows in PowerPivot, publishing them to SharePoint, other users using them as data sources in reports, report parts created, and a gallery of parts to be shared across a community of users.  Powerful stuff, and we are not done yet.

In Sharepoint 2010 there is a new site template called Business Intelligence Center.  What you can now do is create a new site in SharePoint using the Business Intelligence Center template. This template includes chart web parts and Excel services workbook access. It also includes a PerformancePoint library so that you can start building your dashboard very rapidly including access to reports and report parts. With is mechanism, Microsoft is opening up dashboard development to the masses and also allowing ‘social’ performance management whereby dashboards and/or dashboard components can be rated.  All this integrated with SharePoint and Office is in my opinion going to take self-service BI development to another level that it could easily have a ‘popcorn effect’ with masses of BI being produced rapidly and IT nowhere in sight.  There is no doubt that it opens up the flood gates for business innovation and sharing.  Personalised dashboard development using PerformancePoint Services 2010 integrated with SharePoint 2010.

A Question of Governance?

My only concern with this is the issue of governance.  What Microsoft have done is to put mass development in the hands of the business.  If you think upi have seen anythng on self-service BI, just wait until SharePoint 2010, Office 2010 and SQL Server 2008 R2 move into production in your shop. You ain’t seen nothing yet.

However I see very little with respect to data governance. What about business glossaries? What about metadata lineage?  In a world of increasing regulation and legislation to prevent corporate catastrophes, can anything be audited? Can it be tracked back to where the data come from? How has the data been transformed by the power users? iWhat does the data mean?  I have as yet seen little from Microsoft in the form of metadata management and data governance despite the fact that Master Data Services is also delivered as part of this SQL Server release.  While there is no doubt that this is coming (confirmed by the Microsoft guys I spoke with on the exhibition floor booth) my only fear is will be too late.  Will the horses have already bolted with self-service BI unstoppable and off down a track without lineage to help users know that the data is trusted.

Equally, scorecard and dashboard development is bottom up. Everyone (with authority) can create their own scorecards and dashboards rapidly but there appears to be no framework whereby these can be slotted into a multi-level  strategy management unlike say SAP with SAP Strategy Management.  So what is the answer? Is it all bets are off and just let the business figure out the best way to manage on the back of socially rated scorecards and dashboards?  What happened to business strategy?  Many companies set a strategy at executive level and want enterprise wide business strategy execution.   This latter approach is top-down.  What Microsoft is fueling is bottom up.  My opinion is we need both and not one or the other.

Freedom Versus Governance – A Delicate Balancing Act

It is pretty clear then that setting aside the new SQL Server Data Warehousing Edition, this is very much a Collaborative BI release by Microsoft.  It is a major leap forward in what the business users can do for themselves.  We have two forces at work here.  Freedom versus governance.  We have to get the balance right.  Too much freedom and we could have chaos with no ability to audit what has been done or whether the BI is trusted. Too much governance and we put innovation in a straight jacket or kill it altogether.   All I would say is that IT had better get a data governance program underway soon to control data all the way out to data marts and cubes. If that is done then there is no doubt that the business can be empowered to innovate which is what should happen. Without a data governance program however, I think it is really going to be hard to get alignment with what the business is doing given the sheer speed of development that is now possible with this release.  Let’s hope governance, innovation and collaboration are a winning combination.

Enterprise Data Governance – Cheers Arthur!

Friday, September 25th, 2009

First of all let me apologise to all my readers for not having blogged for a while. This year has turned out to be manic – crazily busy. I also confess to having become addicted to twitter – a “tweetaholic” where I have been micro-blogging. If you want to see my tweets you can do so here.  So I return to my blog the day after “Arthur Day” – 250 years ago yesterday a certain young Irishman named Arthur Guinness started a beer making company in Dublin.

My topic today is that exciting topic of Enterprise Data Governance.  From research I did in a survey it was clear that many companies at the end of 2008 were not fully underway with Enterprise Data Governance in terms of getting their data under control and into a trusted, well managed state. Many had more to do in terms of organising themselves together with getting the necessary technology and processes in place to do this.  But the question I get asked the most is how do you know how well or poorly your company is governing its data? There are a few questions you can ask that will give you a good inkling.  These are as follows:

  • Do you know what data exists in your enterprise?
  • Do you have an inventory of data items in use?
  • How many names have you got for the same data item?
  • How many metrics with same name but with different formulae?
  • Do your Excel metrics formulae, DBMS metrics formulae, BI tool metrics formulae, ETL tool calculations, … all agree?

If the answer is no to any of these questions, what chance do you stand of remaining compliant or of trusting your data? If you don’t know how many different variations of a data item exist in your enterprise how can you govern your data? Some other questions to ask here from a business perspective are:

  • How many times do your core processes break because of dirty data?
  • Have your company ever messed up an order and angered a customer because of dirty data?
  • In terms of compliance, do you trust your data enough to tell it to a judge?

In my opinion what companies actually need is an interactive data map so that you can press a button and see where your customer data is or where your order’s data is.  In order to be able to do this you need to have common data definitions for your customer data attributes and for your order data attributes etc. In fact you need to have a common set of enterprise wide definitions for all core entity data, transaction data and metrics.

Having established this, the next step is to discover where you data actually is. Therefore Data Discovery technology (albeit a new area in data management) is critical help do this. In fact I would go as far as to say that without data discovery technology it is very difficult to get data under control. Increasingly therefore we are seeing vendors acquire or build this kind of software.  Once you data is located you need to map disparate data definitions for the same data to common enterprise wide definitions to be able to see where data is. Physical column names, data models, BI tool semantic layers, reports, SPREADSHEETS, files, XML schema, Access databases… If you can’t tie all these to the same corporate definitions how to you govern data?  This is where data dictionaries/ business glossaries are key.  Lineage matter.

Ultimately the objective is to get consistency across the enterprise. You have to unravel your spaghetti ball. This means you need to get organised correctly, get the right technologies in place and get the right processes in place for enterprise data governance.

Master data is also part of the program. You need to find out where your master data is maintained. How is it synchronised? What screens on what applications are used to update it? Do you know? MDMis not as simple as it looks. It is often a multi-year investment. So ask yourself do you want to start with read-only or read write? If  you buy an MDM system and you start updating master data centrally, what happens if you are still updating it also in other applications? Companies need a well thought out strategy for MDM as part of the Data Governance program. I will be addressing this in further blogs in the near future.

For now though, let me salute my fellow Irish countryman. Arthur Guinness. If you are stressed out on Data Governance at the end of a hard week there is nothing like a decent pint to help you unwind. Cheers!

Enterprise Information Management In Demand

Wednesday, August 12th, 2009

Recently I have noticed a lot of companies raising priority on Enterprise Information Management projects as it becomes clear that many companies realise that they do not have information under control. This includes both structured and unstructured data. It is clear that for unstructured data, enterprise content management, content authoring, tagging, search and taxonomy are all key. Also master data management has a part to play in deciding  facets that can be used in taxonomies.  Structured data needs to consider data naming and definitions, data modelling, data discovery, data mapping, data profiling, data cleansing, data integration, provisioning and data quality monitoring. There is a lot of work out there to do! The big question is how do you tie the two together. The secret is in MDM!  What is your strategy for content authoring, content storage, content tagging, taxonomy, search, business glossaries, data integration ……Get in touch and let me know

Data Governance Losing Priority in Some European Countries

Monday, March 30th, 2009

Having just got back from a presentation tour in mainland Europe, it seems that in the countries I have spoken that Data Governance came out with a thumbs down vote among CIOs present in my sessions.  In particular in Belgium it would appear to be not on their radar.  Having probed for feedback into what exactly is high priority among CIOs attending my sessions it is almost as if raw ‘survival’ is taking hold. In other words, any IT project linked to business survival in this tough economic climate will get attention but not much else.  Customer retention, self- service, cost reduction/containment and growth are high on the list. One CIO explained to me that his company’s priority over the next 12 months was to allow customers to customise the products and services they offer much more in the future. Therefore in addition to offering their own product lines on the web, they would be integrating their e-procurement with many back end e-suppliers so they can buy ‘on-demand’ to match what a customer wants. This means they want to allow customers to create their own custom ‘package’ before buying on-line and will stretch beyond their own products to stand out from the crowd.  It seems to me that data governance and data quality to some extent are taking a back seat in favour of investment that will keep the revenue rolling in. I would be interested in your feedback.  Is data governance a high priority in your organisation?

Trusted BI Critical in a Tough Climate

Tuesday, January 27th, 2009

It’s pretty clear that Business Intelligence is becoming even more strategic in this tough economic climate as companies seek to have greater insight to help them stay profitable and keep making money. By default almost it would seem that trusted intelligence has got to be there for BI to be reliable and support confident decision making. Therefore on the eve of the Data Governance conference which starts next week (Feb 2-5) in London I thought I would put in a plug for Enterprise Data Governance. This is a fast growing topic and requires organisational change, processes and technology to manage it successfully. Structured data needs to be formally defined and named (a shared business vocabulary) and BI systems (data models, BI tool business views, reports etc) need to be changed to reflect these commonly understood data definitions.

In addition, data in disparate systems needs to be identified and mapped to the common definitions so as to gather knowledge on how to turn disparate data into trusted data. At this point you can then make sure that data integration is done in a fashion that creates trusted data for use in BI systems. Overall it is critical that you build modular data integration jobs (e.g. 1 data integration job per dimension) so that you can re-use data integration ‘services’ if needs be to guarantee trusted data every time.  These days of course, data quality is built in to many data integration jobs and it is important to strive for this. When you have achieved this your data integration tool should provide a valuable set of metadata to support lineage should a user need to track data back to where it came from.  Once your trusted data is available then you can monitor it to keep quality high and take action if quality deteriorates over time.

I’d be interested in your thoughts on enterprise data governance. Let me know what you are doing in this space.