<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SAND News &#187; Arthur&#8217;s Blog Next Generation DW</title>
	<atom:link href="http://www.sandmtl.com/news/tag/arthurs-blog-next-generation-dw/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.sandmtl.com/news</link>
	<description></description>
	<lastBuildDate>Thu, 04 Mar 2010 04:27:16 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Building Corporate Memory Into a Next-Generation Data Warehouse</title>
		<link>http://www.sandmtl.com/news/building-corporate-memory-into-a-next-generation-data-warehouse/</link>
		<comments>http://www.sandmtl.com/news/building-corporate-memory-into-a-next-generation-data-warehouse/#comments</comments>
		<pubDate>Fri, 22 May 2009 13:00:46 +0000</pubDate>
		<dc:creator>Arthur</dc:creator>
				<category><![CDATA[Arthur's Blog]]></category>
		<category><![CDATA[Arthur's Blog Next Generation DW]]></category>

		<guid isPermaLink="false">http://www.sandmtl.com/news/?p=169</guid>
		<description><![CDATA[It is now possible to design and implement a corporate memory within the data warehouse using a number of mature, tested and well-understood products and methodologies that can be deployed relatively quickly and administered with minimal DBA overhead. These solutions can grow with relatively linear scalability in terms of both cost and performance, while providing [...]]]></description>
			<content:encoded><![CDATA[<p>It is now possible to design and implement a corporate memory within the data warehouse using a number of mature, tested and well-understood products and methodologies that can be deployed relatively quickly and administered with minimal DBA overhead. These solutions can grow with relatively linear scalability in terms of both cost and performance, while providing powerful support for both power analysts and reporting users. The ingredients for a successful data warehouse implementation that makes use of the corporate memory concept involve hardware, software and architectural design components, as listed below:</p>

<p><span id="more-169"></span>
<div class="right"><img src="http://www.sandmtl.com/news/images/portraits/ritchie_arthur.png" alt="Arthur Ritchie" /></div>
<ul>
    <li>Hardware:
<ul>
    <li>Cheap but powerful SMP hardware</li>
    <li>Reasonably priced, fast and efficient SAN devices</li>
    <li>Reasonably priced, fast and effective networks and switches for linking multiple SMP boxes together.</li>
</ul>
</li>
    <li>Architecture:
<ul>
    <li>Nearline data storage (not archiving) to hold the detail data used to build and maintain aggregates and cubes. There is a very clear differentiation between the Nearline 2.0 and archiving approaches to data and performance management.</li>
    <li>Database federation techniques enabling reduction of the amount of data in key RDBMS “hot spots” and minimization of batch window requirements</li>
    <li>Well-defined uses for aggregation, indexing, and MOLAP cubes to support reporting.</li>
</ul>
</li>
    <li>Software:
<ul>
    <li>Column-based data management technologies for avanced analytics, offering:
<ul>
    <li>The ability to change data models &#8220;on the fly&#8221; to meet emerging requirements</li>
    <li>Support for very wide tables (tens of thousands of columns), enabling very large numbers of KPIs</li>
    <li>The ability to add new data types &#8220;on the fly&#8221;</li>
    <li>The ability to allow existing applications to continue to work as data models change over time</li>
    <li>The ability to present all available data in a simple, easily usable format,  eliminating the need for analysts to navigate complex data relationships and slowly changing dimension constructs.</li>
</ul>
</li>
    <li>Better exploitation of new hardware and architectural techniques by traditional RDBMS systems</li>
    <li>The ability to support massively parallel operations, allowing users to iteratively run very complex queries to find patterns in the data without having their thought process interrupted, without impacting other power users and other reporting users, and without interfering with the ability to meet critical SLAs.</li>
</ul>
</li>
</ul>
Together, these components can work within a next-generation data warehouse design that supports standard reporting while also giving expert analysts the ability to access the raw &#8220;truth&#8221; of the original detail data. This makes it possible for the organization&#8217;s most creative thinkers to extend their thought processes &#8220;outside the box&#8221;, to challenge existing dogmas and provide alternative and hopefully more useful versions of &#8220;the truth&#8221;. In my subsequent posts, I will look more closely at the details of these various architectural components and elaborate on how each of them can work to contribute to a powerful, flexible, and efficient decision support infrastructure.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sandmtl.com/news/building-corporate-memory-into-a-next-generation-data-warehouse/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data &#8220;Dumping Grounds&#8221; and the Importance of Corporate Memory</title>
		<link>http://www.sandmtl.com/news/data-dumping-grounds-and-the-importance-of-corporate-memory/</link>
		<comments>http://www.sandmtl.com/news/data-dumping-grounds-and-the-importance-of-corporate-memory/#comments</comments>
		<pubDate>Mon, 04 May 2009 15:34:30 +0000</pubDate>
		<dc:creator>Arthur</dc:creator>
				<category><![CDATA[Arthur's Blog]]></category>
		<category><![CDATA[Arthur's Blog Next Generation DW]]></category>

		<guid isPermaLink="false">http://www.sandmtl.com/news/?p=168</guid>
		<description><![CDATA[Received wisdom about data warehousing instructs us not to create a &#8220;dumping ground&#8221; for our raw detail data. But why not? This principle is a legacy from the not-so-distant past when it was impractical to keep huge amounts of data around if it was not being actively used – so once aggregates had been built, [...]]]></description>
			<content:encoded><![CDATA[<p>Received wisdom about data warehousing instructs us not to create a &#8220;dumping ground&#8221; for our raw detail data. But why not? This principle is a legacy from the not-so-distant past when it was impractical to keep huge amounts of data around if it was not being actively used – so once aggregates had been built, the original details were simply discarded. Of course, this meant that the organization was then confined to working with a particular &#8220;version of the truth&#8221; that someone had imposed on the data; there was no way to revisit the original details should the need arise for a change of perspective. </p>

<p><span id="more-168"></span></p>

<p><div class="right"><img src='http://www.sandmtl.com/news/images/portraits/ritchie_arthur.png' alt='Arthur Ritchie' /></div>Now, however, <a href="http://www.sandmtl.com/news/tag/nearline-20/">Nearline 2.0</a> technology allows us to store massive quantities of data in an easily acccessible format without undue administration requirements, such an approach is economically viable not only in terms of storage and retrieval, but also from a human resources perspective. </p>

<p>Creating a &#8220;corporate memory&#8221; of this sort brings a number of major benefits:</p>

<ul><li>If it turns out that any or all of the original data is required for a new project (involving new aggregations, &#8220;cubes&#8221; or KPIs, for example), it is substantially less expensive and much easier to access it.</li>
<li>If it becomes necessary to assess the value of the corporate data, this can be done carefully over a period of time using data mining techniques, rather than in a rush to meet specific project deadlines.</li>
<li>The original detail data will only need to be handled once, and then can be retained as a permanent record for use in auditing the data warehouse if this becomes necessary (to prove compliance, to aid in conflict resolution, or simply to rectify &#8220;human errors” that were made at some point in the process)</li>
<li>It becomes possible to document in the metadata layer who accessed the information, when and for what reason – incidentally, this can also provide valuable insight into the abilities of those who are responsible for manipulating the raw data.</li></ul>

<p>In my next post, I&#8217;ll cover the ingredients for building Corporate Memory into a Next-Generation Data Warehouse.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sandmtl.com/news/data-dumping-grounds-and-the-importance-of-corporate-memory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Decision Support for Users Who Don&#8217;t Know What They Don&#8217;t Know</title>
		<link>http://www.sandmtl.com/news/decision-support-for-users-who-dont-know-what-they-dont-know/</link>
		<comments>http://www.sandmtl.com/news/decision-support-for-users-who-dont-know-what-they-dont-know/#comments</comments>
		<pubDate>Fri, 24 Apr 2009 13:10:46 +0000</pubDate>
		<dc:creator>Arthur</dc:creator>
				<category><![CDATA[Arthur's Blog]]></category>
		<category><![CDATA[Arthur's Blog Next Generation DW]]></category>

		<guid isPermaLink="false">http://www.sandmtl.com/news/?p=167</guid>
		<description><![CDATA[Since the beginning of the computer era, system designers have struggled to reconcile conflicting aims of performance vs. functionality and maintainability vs. adaptability. In the case of Business Intelligence, there has been no less of a need for tradeoffs in order to deliver workable systems. However, BI system design has also typically been constrained even [...]]]></description>
			<content:encoded><![CDATA[<p>Since the beginning of the computer era, system designers have struggled to reconcile conflicting aims of performance vs. functionality and maintainability vs. adaptability. In the case of Business Intelligence, there has been no less of a need for tradeoffs in order to deliver workable systems. However, BI system design has also typically been constrained even further by four fundamental realities:</p>

<p><span id="more-167"></span></p>

<p><div class="right"><img src='http://www.sandmtl.com/news/images/portraits/ritchie_arthur.png' alt='Arthur Ritchie' /></div><ul>
<li>Users don&#8217;t know what they don&#8217;t know &#8211; they will always have difficulty  articulating what they need, until they actually start working with the data.</li></p>

<p><li>We in IT tend to want to implement relatively restrictive systems with well-defined acceptance criteria, in order to get buy-in from users. This protects us from backlash – and we know that users with looming deadlines will inevitably settle for less than they really require.</li></p>

<p><li>Our IT training and experience has taught us that any system able to accommodate all the information that might be needed would be beyond our means – and even if it were affordable, it would be unlikely to perform adequately. As a result, we have gone to great lengths to determine the minimum amount of information required, then extracted that information and transformed it into the records that we retain.</li></p>

<p>However, it must be admitted that we almost never accurately judge what is “the minimum amount of information required” – and this typically leads to a series of costly attempts later on to find the missing data and retrofit it into our carefully designed data models.</p>

<p><li>Generally, corporate executives have given up on the possibility of understanding the complexities of the computer world, and so have relinquished control of computer systems to the IT department. This lack of overarching leadership often results in a conflict situation, with the IT department facing various frustrated end-user groups who “don’t know what they don’t know”.</li>
</ul> </p>

<p>If IT departments are to make any progress in implementing Decision Support systems that really work, things need to change. We urgently need to approach the task of designing data warehouses in a spirit of humility and  mutual co-operation. We must also acknowledge that some aspects of the resulting system may miss the mark &#8211;  so we will need to build into the system a means of detecting and correcting problems, and for accepting user feedback and then efficiently incorporating the necessary adaptations into  the system.</p>

<p>For a data warehouse environment to be of real value, it needs to be able to operate on the basis that users don’t know what they don’t know, by supporting the twists and turns of &#8220;incremental thinking&#8221; whereby they only gradually come to an understanding of their requirements. If a system designer comes right out and asks users what they want to know, they will typically be met with uncertainty. At this point, individuals anxious to promote their own way of thinking may step in with the assurance that they know what is really needed, while business experts who need to anticipate the effects of any number of potential scenarios from hurricanes to &#8220;credit crunches&#8221; often find themselves unable to make their voices heard. Locking in to a rigid, inflexible design on the basis of such inadequate input practically guarantees that the system will fail to deliver value over the long term.</p>

<p>In a Business Intelligence environment, incremental thinking usually involves &#8220;feeling one&#8217;s way&#8221; towards a goal by asking successive questions. Frequently, the approach will change – in terms of query strategies and the data sets we want to analyze – as insight is gained into the problem. Analysts work much like a detective will operate at the scene of a crime, picking up and examining various clues but not being able to link them together until the necessary understanding is developed. There are many spectacular examples of the enormous business value that can be derived from being able to ask any question that may arise in an analyst&#8217;s train of thought, particularly in highly dynamic activities like fuel hedging and currency trading. Success in these areas requires that plenty of historical details be available for unfettered and recursive access – allowing for the types of question that often provide the biggest business benefits.</p>

<p>However, it is obviously not necessary or even feasible to scan all the details every time a question needs to be answered. For this reason, a combination of traditional reporting and analysis architectures with a <a href="http://www.sandmtl.com/news/tag/nearline-20/">Nearline 2.0</a> repository (used to store massive amounts of detail data) can be the ideal solution. This architecture takes advantage of breakthroughs in database federation techniques and cheap, scalable SMP processors to create a massively parallel architecture that can easily scale to meet future needs, with little or no chance of technological obsolescence. In future posts I will look more closely at how such an architecture can be designed and implemented to enable high-performance, flexible decision support for business &#8220;at the speed of thought&#8221;.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sandmtl.com/news/decision-support-for-users-who-dont-know-what-they-dont-know/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Redefining the Role of IT in Business Intelligence</title>
		<link>http://www.sandmtl.com/news/redefining-the-role-of-it-in-business-intelligence/</link>
		<comments>http://www.sandmtl.com/news/redefining-the-role-of-it-in-business-intelligence/#comments</comments>
		<pubDate>Mon, 06 Apr 2009 16:00:42 +0000</pubDate>
		<dc:creator>Arthur</dc:creator>
				<category><![CDATA[Arthur's Blog]]></category>
		<category><![CDATA[Arthur's Blog Next Generation DW]]></category>

		<guid isPermaLink="false">http://www.sandmtl.com/news/?p=165</guid>
		<description><![CDATA[If our businesses are going to survive, we need to stop designing Business Intelligence systems that tell us what we want to hear, and which work well in good times but behave incomprehensibly during periods of significant inconsistency. Instead, we need to build systems that empower our best analysts to help correct flaws in our [...]]]></description>
			<content:encoded><![CDATA[<p>If our businesses are going to survive, we need to stop designing Business Intelligence systems that tell us what we want to hear, and which work well in good times but behave incomprehensibly during periods of significant inconsistency. Instead, we need to build systems that empower our best analysts to help correct flaws in our activities and identify opportunities that we can exploit. We need to be in a position where existing paradigms can be challenged and replaced by new ones on an ongoing basis. However, just as new scientific theories need to fit with observed reality, these new business approaches must be well supported by the facts as recorded in a company’s information repository.</p>

<p><span id="more-165"></span></p>

<p><div class="right"><img src='http://www.sandmtl.com/news/images/portraits/ritchie_arthur.png' alt='Arthur Ritchie' /></div>The recent crisis in financial systems, resulting in major corporations going cap-in-hand to governments for handouts, dramatically illustrates why we need to move beyond reliance on reporting systems to adoption of a challenge-based system. With a challenge-based system, key analysts and &#8220;out-of-the-box&#8221; thinkers with sufficient analytic capability and insights are given unconstrained access to the data warehouse so that they can identify problems and recommend corrective action based on the facts. In this way,  the people currently controlling the “spin” on corporate information can be educated (or bypassed if necessary), and executives can be better informed and made more responsive to constantly changing business needs and opportunities. </p>

<p>The IT department cannot continue to dictate how IT-held data will be used: users have to be empowered to make these decisions themselves within their area of primary expertise. The focus of IT should instead be on making data available to end users, ensuring that it is not corrupted and that it can be audited when required, and on infrastructural issues like security, disaster recovery and so on. The job of the IT department should thus go far beyond merely distributing preconfigured reports, to encompass the more essential task of empowering users to get what they really need, when and how they need it, and to make sure that adequate resources are in place to get the job done. In short, IT departments need to work in a similar way to utility companies, providing users with infrastructure options but not with final solutions – and as with a power company, for example, fair and appropriate charge-out policies and procedures need to be put in place, so that IT can be transformed from a cost centre into a profit centre.</p>

<p>So,  the primary role of IT should be to provide unfettered access to data, to make sure that the data will not be lost for any reason (even natural disasters), and to meet various levels of demand. While security has traditionally been another key issue for IT, even this area might be better handled by a specialized department, with IT providing assistance as needed. In any event, IT should not be able to prevent anyone with the appropriate security levels (which in some cases may exceed those of the head of IT) from getting to the data they need. Often, invocations of security or the impact on performance for “other system users” are simply a ruse to cover-up for the fact that the data is not readily available or that the system is not designed to cope with the demand.</p>

<p>We need to understand that a functional Business Intelligence (BI) system is governed by the same rules that govern any other large architectural undertaking like an office building or an elevated highway. It must meet the diversified needs of the community it serves and those of the people whose lives it affects. It also has to fit within the constraints already in place in the environment in which it is to be established – since very few of us get to work with a &#8220;clean slate&#8221;, and the legal and administrative hurdles can often be formidable.</p>

<p>Successful implementation of BI systems thus needs to be considered as part art and part science. Pure science has not been able to solve all society’s problems, and those who try to adopt a purely &#8220;scientific&#8221; approach to BI will not be successful. Fundamentally, the field of technology is not just about pursuing technical innovation, but about helping human beings become more effective and productive in pursuing their goals. In my following posts, I will be looking at how IT departments can go about designing systems that allow for just this sort of flexible, on-demand provision of information services to analysts, with the ultimate aim of enabling truly effective decision support for maximum repsonsiveness in today&#8217;s unpredictable business environment.</p>

<p>Arthur Ritchie</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sandmtl.com/news/redefining-the-role-of-it-in-business-intelligence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Business Intelligence: An Oxymoron?</title>
		<link>http://www.sandmtl.com/news/business-intelligence-an-oxymoron/</link>
		<comments>http://www.sandmtl.com/news/business-intelligence-an-oxymoron/#comments</comments>
		<pubDate>Tue, 24 Mar 2009 20:41:59 +0000</pubDate>
		<dc:creator>Arthur</dc:creator>
				<category><![CDATA[Arthur's Blog]]></category>
		<category><![CDATA[Arthur's Blog Next Generation DW]]></category>

		<guid isPermaLink="false">http://www.sandmtl.com/news/?p=164</guid>
		<description><![CDATA[An old joke has it that the term &#8220;military intelligence&#8221; is an oxymoron – and in light of the current global financial crisis, it is tempting to put &#8220;Business Intelligence&#8221; in this category as well. Our inability to predict or deal competently with major events, from wars in the Middle East to the meltdown of [...]]]></description>
			<content:encoded><![CDATA[<p>An old joke has it that the term &#8220;military intelligence&#8221; is an oxymoron – and in light of the current global financial crisis, it is tempting to put &#8220;Business Intelligence&#8221; in this category as well. Our inability to predict or deal competently with major events, from wars in the Middle East to the meltdown of global financial systems, shows just how ineffective our Business Intelligence/Data Warehousing strategies or “fit-for-purpose” reporting systems can be in responding to events as they unfold in this complex world. We are now confounded by the facts: we cannot predict the future; the largest military powers cannot conquer and control much weaker opponents; economists cannot adequately monitor essential financial systems. Automated trading systems, whose rules we once thought we understood and controlled, seem to have taken on a life of their own. </p>

<p><span id="more-164"></span></p>

<p><div class="right"><img src='http://www.sandmtl.com/news/images/portraits/ritchie_arthur.png' alt='Arthur Ritchie' /></div>In my view, unless talented analysts are given unfettered access to whatever corporate data they require and the ability to analyze it as they see fit in the context of the many external data sources that are available, we will continue to find ourselves unprepared to deal with the unexpected. Implemented correctly, corporate Business Intelligence (BI)  systems can support an organization&#8217;s best analysts as they challenge traditional business dogmas and develop a practicable way forward based on the facts, as recorded in detailed corporate data. To achieve this, IT departments need to stop acting as “data jailors” who strictly control which data will be accessible, in what form, and start empowering creative thinkers to realize their maximum potential, be it in marketing, manufacturing, distribution or some other field. In order to do this, however, IT departments need to start acting more like a power utility service: enabling &#8220;decision support&#8221; (to revive an older term for Business Intelligence) by providing corporate information or raw data as required, in the right amounts at the right time, while also serving as &#8220;consultants&#8221; who help end users access the data they require, when and how they need it. </p>

<p>In this series of blog posts, I will be discussing how an effective information infrastructure for decision support can be implemented without resorting to a “rip and replace&#8221; strategy that would involve completely scrapping the existing data warehouse. As the foundation of an optimal system for enterprise data management and decision support, I will be proposing the concept of Data Warehouse as a tiered architecture that combines three database models/technologies to support both standard reporting and &#8220;power analytics&#8221; as well as highly accessible storage of massive amounts of granular data to be used by the reporting and analytics engines.</p>

<p>Arthur Ritchie</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sandmtl.com/news/business-intelligence-an-oxymoron/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
