<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" ><channel><title>ETL Tools</title> <atom:link href="http://www.etltool.com/feed/" rel="self" type="application/rss+xml" /><link>http://www.etltool.com</link> <description>software, tools &#38; research</description> <lastBuildDate>Sun, 16 Jun 2013 07:29:54 +0000</lastBuildDate> <language>en-US</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.4.1</generator> <item><title>New Additions to the Oracle Big Data Appliance</title><link>http://www.etltool.com/new-additions-to-the-oracle-big-data-appliance/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=new-additions-to-the-oracle-big-data-appliance</link> <comments>http://www.etltool.com/new-additions-to-the-oracle-big-data-appliance/#comments</comments> <pubDate>Sun, 16 Jun 2013 07:20:06 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Big Data]]></category> <category><![CDATA[Data warehouse]]></category> <category><![CDATA[Oracle]]></category> <category><![CDATA[Performance]]></category><guid isPermaLink="false">http://www.etltool.com/?p=619</guid> <description><![CDATA[Oracle today announced the availability of Oracle Big Data Appliance X3-2 Starter Rack and Oracle Big Data Appliance X3-2 In-Rack Expansion. Oracle Big Data Appliance X3-2 Starter Rack enables customers to jumpstart their first Big Data projects with an optimally sized appliance. Oracle Big Data Appliance X3-2 In-Rack Expansion helps customers easily and cost-effectively scale their [...]]]></description> <content:encoded><![CDATA[<p>Oracle today announced the availability of Oracle Big Data Appliance X3-2 Starter Rack and Oracle Big Data Appliance X3-2 In-Rack Expansion. Oracle Big Data Appliance X3-2 Starter Rack enables customers to jumpstart their first Big Data projects with an optimally sized appliance. Oracle Big Data Appliance X3-2 In-Rack Expansion helps customers easily and cost-effectively scale their footprint as their data grows.<span id="more-619"></span></p><p>The new configurations include Oracle <a title="Big Data seems to become Big Business. Why?" href="http://www.passionned.com/data-integration/big-data/">Big Data</a> Appliance X3-2 Starter Rack, containing six Oracle Sun servers within a full-sized rack with redundant Infiniband switches and power distribution units; as well as Oracle Big Data Appliance X3-2 In-Rack Expansion, which includes a pack of six additional servers to expand the above configuration to 12 nodes and then to a full rack of 18 nodes.</p><table style="margin-bottom: 5px;" border="0" cellspacing="1" cellpadding="1" width="100%"><tbody><tr><td bgcolor="#fcfcfc"><em>This press release has been written by a vendor and published as a news item on this website. It does not necessarily reflect the results of our 100% vendor independent research.</em></td></tr></tbody></table><p>Both new systems include the existing software stack for Oracle Big Data Appliance X3-2: Oracle Linux, Oracle Hotspot Java Virtual Machine, Cloudera&#8217;s Distribution Including Apache Hadoop (CDH), Cloudera Manager and Oracle NoSQL Database.</p><p>Integrating Hadoop into existing enterprises, Oracle also provides powerful SQL access to HDFS data. This enables organizations to leverage their existing Oracle SQL skillsets and tools to seamlessly query and analyze data stored in Hadoop.</p><p>Additionally, Oracle Big Data Appliance X3-2 (full rack configuration) is now available through Oracle Infrastructure as a Service (IaaS), where organizations can obtain Oracle Big Data Appliance X3-2 on-premise, behind their firewall, for a monthly fee. With Oracle IaaS for Oracle Big Data Appliance, organizations can now eliminate upfront capital expenditures for their Big Data needs.</p><p>The Oracle Big Data Appliance X3-2 product family is comprised of Engineered Systems that simplify the implementation and management of Big Data by integrating hardware and software to acquire, organize and analyze <a title="Big Data seems to become Big Business. Why?" href="http://www.passionned.com/data-integration/big-data/">Big Data</a>.</p><h5>Supporting Quotes</h5><p>“With these new additions to the Oracle Big Data Appliance product family, Oracle is meeting the needs of a wider number of customers and expanding our offering to handle everything from the first proof of concept to a large scale enterprise deployment,” said Çetin Özbütün vice president, Data Warehousing and Big Data Technologies, Oracle. “We are focused on bringing the power of big data to our existing enterprise customers, with an integrated platform of engineered systems and key capabilities like SQL support for HDFS.”</p><p>&#8220;Oracle and Cloudera share a commitment to making Hadoop more accessible to an ever-broader set of users and organizations,&#8221; said Tim Stevens, VP of Business and Corporate Development at Cloudera. &#8220;These new options for deploying Oracle Big Data Appliance are ideal for the many enterprises that are just beginning to implement Hadoop and need a solution that enables them to start small and scale as their requirements grow.&#8221;</p><p>Source: Oracle.</p> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/new-additions-to-the-oracle-big-data-appliance/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Be a part of the biggest BI Survey in the world</title><link>http://www.etltool.com/be-a-part-of-the-biggest-bi-survey-in-the-world/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=be-a-part-of-the-biggest-bi-survey-in-the-world</link> <comments>http://www.etltool.com/be-a-part-of-the-biggest-bi-survey-in-the-world/#comments</comments> <pubDate>Fri, 14 Jun 2013 10:29:57 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Business Intelligence]]></category> <category><![CDATA[News]]></category> <category><![CDATA[Research]]></category><guid isPermaLink="false">http://www.etltool.com/?p=614</guid> <description><![CDATA[In close collaboration with BARC, Passionned Group would like to invite you to participate in The BI Survey 13, the world&#8217;s largest survey of Business Intelligence and Performance Management users. Be a part of the biggest BI Survey in the world. The BI Survey 13 is strictly vendor-independent. BARC does not accept vendor sponsorship of the [...]]]></description> <content:encoded><![CDATA[<p><img class="size-full wp-image-2502 alignleft" style="margin-right: 25px;" title="Business Intelligence Survey" src="http://www.passionned.com/wp/wp-content/uploads/bi-survey-business-intelligence.png" alt="Business Intelligence Survey" width="157" height="161" />In close collaboration with BARC, Passionned Group would like to invite you to participate in The BI Survey 13, the world&#8217;s largest survey of <a title="Business Intelligence (BI) helps organizations to improve their performance by providing the key information they need." href="http://www.passionned.com/business-intelligence/">Business Intelligence</a> and <a title="Performance Management can’t be successful without good Key Performance Indicators (KPIs)." href="http://www.passionned.com/performance-management/">Performance Management</a> users. Be a part of the biggest BI Survey in the world.<span id="more-614"></span></p><p>The BI Survey 13 is strictly vendor-independent. BARC does not accept vendor sponsorship of the Survey, and the results are analyzed, compared and published without any vendor involvement.</p><p>To take part in this year&#8217;s survey, visit:<br /> <a title="Be a part of the biggest BI Survey in the world" href="https://digiumenterprise.com/answer?link=1348-T9K4SWA8" target="_blank">https://digiumenterprise.com/answer?link=1348-T9K4SWA8</a></p><p>As a participant, you will:</p><ul><li>Receive a summary of the results from the full Business Intelligence survey;</li><li>Be entered into a draw to win one of ten $50 Amazon vouchers;</li><li>Ensure that your experiences are included in the final analyses.</li></ul><p>BARC&#8217;s annual Business Intelligence survey gathers input from a large number of organizations to analyze their buying decisions, implementation cycles and the benefits they achieve from using <a title="Business Intelligence software automates the process of selecting data from different source systems, transform that data into valuable information and combine it in a single report or dashboard. " href="http://www.passionned.com/business-intelligence/business-intelligence-software/">Business Intelligence software</a>.</p><p>You will be able to answer questions on your usage of a BI tool from any BI vendor. Your answers will be used anonymously and your personal details will never be passed on to <a title="An overview of the Business Intelligence vendors" href="http://www.passionned.com/business-intelligence/business-intelligence-vendors/">Business Intelligence vendors</a> or other third parties.</p><p>Business and technical users, as well as vendors and consultants, are all encouraged to participate.</p><p>The BI Survey 13 should take about 20 minutes to complete. For further information, please contact Nikolai Janoschek at BARC (njanoschek at barc.de).</p><p>Click on the image below to take part in The BI Survey 13<br /> <a title="Be a part of the biggest BI Survey in the world" href="https://digiumenterprise.com/answer?link=1348-T9K4SWA8" target="_blank"><img class="alignnone size-full wp-image-2494" style="margin-top: 15px; margin-bottom: 15px;" title="Be a part of the biggest BI Survey in the world" src="http://www.passionned.com/wp/wp-content/uploads/business-intelligence-survey-13-barc.jpg" alt="Be a part of the biggest BI Survey in the world" width="600" height="83" /></a></p> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/be-a-part-of-the-biggest-bi-survey-in-the-world/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Major Update of the ETL Tools &amp; Data Integration Survey</title><link>http://www.etltool.com/major-update-of-the-etl-tools-data-integration-survey/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=major-update-of-the-etl-tools-data-integration-survey</link> <comments>http://www.etltool.com/major-update-of-the-etl-tools-data-integration-survey/#comments</comments> <pubDate>Sat, 01 Sep 2012 13:29:13 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[News]]></category><guid isPermaLink="false">http://www.etltool.com/?p=494</guid> <description><![CDATA[In the last week of September 2012, Passionned&#8217;s ETL Tools &#38; Data Integration Survey has been updated to reflect the latest versions of the major ETL tools in the market. Two ETL products were added to the survey and four ETL tools have been updated to show their new functions and features. In the list [...]]]></description> <content:encoded><![CDATA[<p>In the last week of September 2012, Passionned&#8217;s ETL Tools &amp; Data Integration Survey has been updated to reflect the latest versions of the major ETL tools in the market. Two ETL products were added to the survey and four ETL tools have been updated to show their new functions and features. In the <a title="List of ETL tools" href="http://www.etltool.com/list-of-etl-tools/">list of ETL tools</a> you will find a list of all the ETL tools included in the survey, including both commercial and open source ETL tools.<span id="more-494"></span></p><p>The newest release of Informatica&#8217;s Powercenter, version 9.5, is now in the survey and the ETL products SAS Data Integration Studio, SAP Data Services, Talend&#8217;s Open Studio for Data Integration, version 5.2 and CloverETL have undergone updates.</p><p>SAP&#8217;s Data Services product has become so easy to use that they have set the standard for other products to measure themselves against.</p><p>The two products that have been added are:</p><ul><li>Syncsort DMExpress 5.5;</li><li>Expressor (QlikView) Data Integration Platform 3.7. ( now QlikView)</li></ul><p>If you want to compare all the major ETL tools on 90 criteria, or a selection of them, you can purchase the <a title="Order and download the ETL Tools &amp; Data Integration Survey" href="https://www.etltool.com/order.php">ETL Tools &amp; Data Integration Survey 2013</a>.</p><p><a href="https://www.etltool.com/order.php"><img src="http://www.etltool.com/download-the-etl-tools-survey-button.png" alt="Buy and download the ETL Survey Report" border="0" hspace="2" vspace="2" /></a></p><p>The information and insights it contains will save you a lot of time and the survey is 100% vendor independent and free from any vendor bias.</p> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/major-update-of-the-etl-tools-data-integration-survey/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Oracle Announces Availability of Oracle Exalytics In-Memory Machine</title><link>http://www.etltool.com/oracle-announces-availability-of-oracle-exalytics-in-memory-machine/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=oracle-announces-availability-of-oracle-exalytics-in-memory-machine</link> <comments>http://www.etltool.com/oracle-announces-availability-of-oracle-exalytics-in-memory-machine/#comments</comments> <pubDate>Tue, 28 Feb 2012 08:20:09 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Data Integration]]></category> <category><![CDATA[News]]></category> <category><![CDATA[Oracle]]></category> <category><![CDATA[Performance]]></category><guid isPermaLink="false">http://www.etltool.com/?p=386</guid> <description><![CDATA[Oracle today announced the availability of Oracle Exalytics In-Memory Machine, the industry’s first high-speed engineered system featuring in-memory business intelligence (BI) software and hardware to deliver extreme performance for analytic and performance management applications. Oracle today also announced availability of a new release of Oracle Business Intelligence Foundation Suite that features 87 new product capabilities [...]]]></description> <content:encoded><![CDATA[<p>Oracle today announced the availability of Oracle Exalytics In-Memory Machine, the industry’s first high-speed engineered system featuring in-memory business intelligence (BI) software and hardware to deliver extreme performance for analytic and performance management applications.<span id="more-386"></span><br /><table style="margin-bottom: 5px;" border="0" cellspacing="1" cellpadding="1" width="100%"><tbody><tr><td bgcolor="#fcfcfc"><em>This press release has been written by a vendor and published as a news item on this website. It does not necessarily reflect the results of our 100% vendor independent research.</em></td></tr></tbody></table></p><ul><li><span style="text-indent: -18pt;">Oracle today also announced availability of a new release of </span><span style="text-indent: -18pt;">Oracle Business Intelligence Foundation Suite</span><span style="text-indent: -18pt;"> that features 87 new product capabilities and enhancements including new visualizations, contextual interactions, performance and scale improvements, optimizations for Oracle Exalytics, and simplification for end users, developers and administrators. </span></li><li><span style="text-indent: -18pt;">With Oracle Exalytics, organizations can deliver custom and packaged analytic and performance management applications that run faster, are easier to use, and support more users than ever before. The engineered system enables customers to complement their dashboard and reporting projects with scenario modeling, planning and forecasting all running in a single environment.</span></li><li><span style="text-indent: -18pt;">To provide ultimate performance and scalability with reduced deployment complexity and lower Total Cost of Ownership, Oracle Exalytics is architected as an engineered system featuring the industry-leading <a title="Oracle BI Enterprise Edition (OBIEE)" href="http://www.passionned.com/business-intelligence/business-intelligence-vendors/oracle-bi-enterprise-edition-obiee/">Oracle BI</a> Foundation Suite and </span><span style="text-indent: -18pt;">Oracle TimesTen In-Memory Database for Exalytics</span><span style="text-indent: -18pt;"> enhanced for an Oracle server designed for in-memory analytics.</span></li><li><span style="text-indent: -18pt;">With interactive real-time analysis enabled by Oracle Exalytics, organizations can maximize profitability, increase revenues and market share, and react more quickly to changing business conditions.</span></li></ul><h2 class="presssubheader">Breakthrough Innovations Deliver Optimized In-Memory Analytics</h2><ul><li><span style="text-indent: -18pt;">A complete, high-performance engineered system, Oracle Exalytics features Oracle’s Sun Fire server with 1 Terabyte of RAM, powered by the Intel Xeon processor E7-4800 with a total of 40 processing cores, 40 Gb/s InfiniBand and 10 Gb/s Ethernet connectivity, and Integrated Lights Out Management.</span></li><li><span style="text-indent: -18pt;">Oracle BI Foundation includes an extensive set of new capabilities in </span><span style="text-indent: -18pt;"><a title="Oracle BI Enterprise Edition (OBIEE)" href="http://www.passionned.com/business-intelligence/business-intelligence-vendors/oracle-bi-enterprise-edition-obiee/">Oracle Business Intelligence Enterprise Edition</a></span><span style="text-indent: -18pt;"> and </span><span style="text-indent: -18pt;">Oracle Essbase</span><span style="text-indent: -18pt;"> for parallel performance and in-memory analytics that deliver to users real-time speed-of-thought visual analysis with sub-second responsiveness.</span></li><li><span style="text-indent: -18pt;">Oracle TimesTen In-Memory Database for Exalytics is a new release of the industry-leading in-memory RDBMS specifically designed for analytics.</span><ul><li><span style="text-indent: -18pt;">With new support for OLAP grouping sets, built-in analytic functions, and columnar compression, Oracle TimesTen In-Memory Database for Exalytics delivers instantaneous response over large data sets.</span></li></ul></li><li><span style="text-indent: -18pt;">Oracle Exalytics simplifies management with an innovation called heuristic adaptive in-memory caching that calculates what data is best stored in-memory for overall best performance, and adapts to changing workloads. Unlike other appliances or in-memory tools, Oracle Exalytics does not constrain applications only to data that fits in-memory, but also leverages connected databases, data warehouses and <a title="OLAP tools are widely used to analyze information from different perspectives and provide functions like drill-down, slice-and-dice and to offer a very high performance. " href="http://www.passionned.com/business-intelligence/on-line-analytical-processing-olap/olap-tools/">OLAP-tools</a>.</span></li><li><span style="text-indent: -18pt;">Benchmarking and customer testing of Oracle Exalytics demonstrates relational OLAP (ROLAP) reporting and dashboard performance gains of 10x to 100x, and multidimensional OLAP (MOLAP) modeling performance increases of up to 79x.</span></li></ul><h2 class="presssubheader">Fastest Performance for Over 80 Oracle BI and EPM Applications</h2><ul><li><span style="text-indent: -18pt;">Eighty-plus Oracle BI and Oracle Hyperion performance management applications are available today for organizations to use in conjunction with Oracle Exalytics without application changes. These include:</span><ul><li><span style="text-indent: -18pt;">All horizontal and industry-focused </span><span style="text-indent: -18pt;">Oracle BI Applications</span><span style="text-indent: -18pt;"> such as Oracle Financial Analytics, Oracle Human Resources Analytics, Oracle Sales Analytics, and Oracle Procurement and Spend Analytics, among others; and,</span></li><li><span style="text-indent: -18pt;">Oracle Hyperion Planning</span><span style="text-indent: -18pt;"> for planning and forecasting using more detailed data and scaling to support larger numbers of users with reduced plan cycle time and greater plan accuracy.</span></li></ul></li><li><span style="text-indent: -18pt;">Oracle BI Foundation Suite includes built-in mobile support for iPad and iPhone. As part of an Oracle Exalytics solution, on-the-go mobile analysis scales to support many thousands of concurrent users with interactive responsiveness.</span></li></ul><h2 class="presssubheader">Open System Supports Oracle and Third-Party Enterprise Data Sources</h2><ul><li><span style="text-indent: -18pt;">An open solution, Oracle Exalytics is suited for use in heterogeneous IT environments, and can access and analyze data from Oracle or third-party relational, <a title="OLAP is an abbreviation for On-Line Analytical Processing and gives users the ability to analyse information thoroughly from multiple perspectives easily." href="http://www.passionned.com/business-intelligence/on-line-analytical-processing-olap/">OLAP</a> or other data sources including IBM DB2, IBM Netezza, Microsoft SQL Server and Analysis Services, SAP Business Information Warehouse (BW), Sybase Adaptive Server Enterprise (ASE), and Teradata Warehouse, among others, in any combination.</span></li><li><span style="text-indent: -18pt;">Oracle Exalytics integrates with and complements </span><span style="text-indent: -18pt;">Oracle Exadata Database Machine</span><span style="text-indent: -18pt;"> via 40 Gb/s InfiniBand connection and software enhancements to deliver unparalleled query performance and sub-second interactive visualization across very large data sets.</span></li></ul><p>“One of the real advantages of Oracle Exalytics In-Memory Machine is of course the speed,” said John Anker Moeller, Senior Vice President, Nykredit. “Oracle Exalytics will give us around 35x to 70x faster reports.”</p><table style="margin-bottom: 15px;" width="100%" border="0" cellspacing="1" cellpadding="1"><tbody><tr><td bgcolor="#fcfcfc">Download our 100% vendor independent &amp; in-depth evaluation of <a href="https://www.etltool.com/order.php">Oracle Warehouse Builder &amp; Oracle Data Integrator in the ETL &amp; Data Integration Tool Survey</a>.</td></tr></tbody></table> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/oracle-announces-availability-of-oracle-exalytics-in-memory-machine/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Bikes (SQL) can&#8217;t ever beat cars (ETL tools) &#8211; Data Vault discussion</title><link>http://www.etltool.com/bikes-sql-cant-ever-beat-cars-etl-tools-data-vault-discussion/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=bikes-sql-cant-ever-beat-cars-etl-tools-data-vault-discussion</link> <comments>http://www.etltool.com/bikes-sql-cant-ever-beat-cars-etl-tools-data-vault-discussion/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:35:57 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Data Vault]]></category> <category><![CDATA[News]]></category> <category><![CDATA[SQL]]></category><guid isPermaLink="false">http://www.etltool.com/bikes-sql-cant-ever-beat-cars-etl-tools-data-vault-discussion/</guid> <description><![CDATA[Bikes (SQL) can't ever beat cars (ETL tools) - Data Vault discussion]]></description> <content:encoded><![CDATA[<p><strong>Question: Which ETL tools can support Data Vault modeling out-of-the-box? What are the challenges and issues building a data vault with ETL tools?</strong><span id="more-85"></span></p><p><strong>1. Marcel de Wit</strong> &#8211; See another <a href="http://lnkd.in/fAdkiM" target="_blank">discussion</a> on LinkedIn (still active; Dutch)</p><p><strong>2. Daan van Beek</strong> &#8211; Thanks Marcel, I did read the comments of that discussion too, it was the reason I started this discussion actually. After reading it, it was still unclear to me whether <a title="How can you quickly determine the main differences, strengths and weaknesses and the costs of ETL tools you want to buy?" href="http://www.etltool.com/">ETL tools</a> do support Data Vault out-of-the-box like slowly changing dimensions or not. So, who knows the answer(s)? The vendors?</p><p><strong>3. Jos van Dongen</strong> &#8211; Confused by your question: imho ETL tools are for <a title="What is ETL, Extract Transform and Load?" href="http://www.etltool.com/what-is-etl-extract-transform-and-load/">ETL</a>, data modeling tools are for data modeling. You ask for ETL tools that support DV modeling. So strictly speaking: none <img src='http://www.etltool.com/wp/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> . But out of the box support for DV (to some extent, depending on whether you actually want to generate a model from the source system) can be found in BI-Ready, Quipu and Wherescape 3D (still in beta); at Antonius Hospital we developed a <a title="Pentaho Data Integration is a powerful, metadata-driven ETL tool designed to bridge the gap between business and IT; Turning your company's data into increased profits." href="http://www.etltool.com/vendors/pentaho-data-integration-kettle/">Pentaho Kettle</a> framework for loading a DV.</p><p><strong>4. Juan José van der Linden </strong>- Hi, I don&#8217;t think there are ETL tools that support loading a Data Vault out of the box with special steps/transformations. Pentaho Kettle supports adding new steps as plugins. So if someone develops steps for loading hub/link/sats it can be made Data Vault aware. Quipu and BI-Ready support generating a Data Vault model and loading code from a datamodel. This model can be a business model or a source model. BI-Ready generates SQL as loading code. Quipu currently generates by default SQL code. The code generation of Quipu is template based. There are already users of Quipu building templates that generate Pentaho Kettle ETL transformations for loading the Data Vault. Regards, JJ.</p><p><strong>5. Marco Schreuder</strong> &#8211; Compared to Dimensional modeling Data Vault loading is very simple: 95% inserts. You can use a combination of a lookup transformation to determine new rows and a data transformation task to inserts them. For satellites you have to determine the old rows and enddate them. You can use a simple SQL execute task for this. So&#8230; you really don&#8217;t need a dedicated component.</p><p><strong>6. Ronald Damhof </strong>- Agree with marco. My simple answer: all do Nuff said <img src='http://www.etltool.com/wp/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /></p><p><strong>7. Juan José van der Linden</strong> &#8211; Loading hubs and links is always easy in a Data Vault. Also loading a satellite when there is only 1 change per business key per load cycle is also easy. It becomes tricky when you&#8217;re loading from a CDC system and have more changes per business key per load cycle or when you have a bi-temporal satellite. But even then a template system helps you to speed up developing.</p><p><strong>8. Erik Fransen</strong> &#8211; Only those ETL-tools that are used by qualified DWH-designers are able to deal with Data Vault and data marts schemes and related data logistics correctly.</p><p><strong>9. Daan van Beek</strong> &#8211; @jos you&#8217;re right, but why do almost all ETL tools available in the market support star schemas with slowly changing dimensions? That&#8217;s because it should make the life of a DWH developer much easier.</p><p>@marco @ronald the same holds true for star schema&#8217;s, 95% inserts. In my view <a title="ETL tools: software, tools &amp; research" href="http://www.etltool.com/">ETL tools</a> are created and used to get rid of hand coding SQL? Are you suggesting, just load a data vault with SQL and forget about ETL tools?</p><p><strong>10. Erik Fransen</strong> &#8211; @Daan that&#8217;s what we are doing at our customers, and it works great. No need for expensive ETL anymore, but the right skills and holding on to your architecture-rules is crucial.</p><p><strong>11. Martijn Evers</strong> &#8211; @Daan, You have to separate 2 different types of ETL steps:</p><p>1. Formal (semantical equivalent) model (to model) transformations (e.g. 3NF to Data Vault)<br /> 2. Pure data transformations (cleaning &amp; matching, conforming etc).</p><p>Model transformation steps change the model but leave the data as is, while data transformations steps change the data (sometimes significantly) but do not change the model/shape of the data. ETL tools can do both, and usually in one BIG push. Most ETL tools however are not very smart on model to transformations (exceptions: Kalido, BIReady, Wherescape Red) and hence deliver poor ROI there. Most Data Vault (Oriented DWH) Architectures by default separate these 2 types of ETL actions (They should IMO) to accomplish an effective ETL divide &amp; conquer strategy. Hence they can use different and far simpler tooling like (somewhat complex) metadata driven SQL or even Business Rule Engines to accomplish their task With Data Vault oriented DWH Architectures you can still use (powerful) <a title="ETL software" href="http://www.etltool.com/etl-software/">ETL software</a> but their weakness in model 2 model transformations and their focus on handling big and complex ETL steps (of any kind) is seen more as a liability then as an asset. Esp. in combining model transformations and data transformations, which can compromise things like reusability, maintainability, auditability &amp; performance.</p><p><strong>12. Daan van Beek</strong> &#8211; @Erik in general ETL tools would letting us lower the costs of data warehouse projects and maintenance by a factor of 3 to 5, are you so to speak sending all the ETL developers back to the stone age <img src='http://www.etltool.com/wp/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> @Martijn, a distinction that is helpful, but a transformation for slowly changing dimensions (formal) is almost available in all ETL tools, why do the vendors not embrace Data Vault? Are users not asking for it? Or is it to complex?</p><p><strong>13. Ronald Damhof </strong>- @daan; pls send met the research that states that ETL tools lower the costs of DWH projects &amp; maintenance by a factor 3 to 5. Never seen that kind of research, software is not responsible for lower project or maintenance costs &#8211; it never is &#8211; so pls link.</p><p>@daan; &#8220;why do vendors not embrace Data Vault&#8221;, since when is normal-sense modeling something that needs to be supported by ETL vendors? Do not get it. I was not kidding when I said all ETL tools support Data Vault. I suppose you have researched the typical &#8216;complexity&#8217; of DV mappings in ETL &#8211; can you show us what you find complex?</p><p>@daan; &#8220;users not asking for it&#8221; &#8211; that made me laugh.</p><p><strong>14. Martijn Evers </strong>- @Daan, Except for slowly changing dimensions, of which DV uses a specific and simple case for loading satellites, there is not a lot of DV specific work a classic ETL tool can do, unless it can also do model transformations, which is usually not the case. Current ETL tools just can&#8217;t provide a very good ROI on Data Vault, so you can skip them in favor of simpler approaches like generated SQL.</p><p><strong>15. Daan van Beek</strong> &#8211; @Ronald, &#8220;software is not responsible for lower project or maintenance costs &#8211; it never is&#8221; &#8211; really confused, &#8216;all&#8217; increase in productivity of the last twenty years or so is mostly due to automation (for the last centuries mainly due to technology), and you are saying software (in combination with hardware) never can&#8217;t held be responsible for that?? Why do you think that?</p><p>@Martijn, thanks for giving a quick reply to @Ronald <img src='http://www.etltool.com/wp/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> &#8220;current ETL tools just can&#8217;t provide a very good ROI on Data Vault&#8221;</p><p><strong>16. Ronald Damhof</strong> &#8211; @daan; lol &#8211; not gonna get that research, am I?? You mentioned about research regarding ETL tools (not automation in general, you can automate ETL without ETL tools)&#8230;.show me &#8211; you got numbers &#8211; factor 3 to 5 according to you in DWH projects and maintenance. I am curious what research you are referring 2. If you provide me the research I will answer your question &#8211; ok? @daan; all ETL tools support DV &#8211; the statement of Martijn does not change a thing to that statement</p><p><strong>17. Daan van Beek</strong> &#8211; @ronald, Where is this going! You see things that aren&#8217;t there. I didn&#8217;t mention the word research at all, it&#8217;s common sense like we know that bikes (SQL) can&#8217;t ever beat cars (ETL tools) or a Ferrari (PowerCenter for example) in terms of speed, functionality and usability. And by the way, you said earlier in this discussion, &#8216;all do&#8217;. It would be fair if you try at least to prove your claim.</p><p>Please feel free to comment.</p> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/bikes-sql-cant-ever-beat-cars-etl-tools-data-vault-discussion/feed/</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>Why All Content Matters: Getting Started with Unstructured Data</title><link>http://www.etltool.com/why-all-content-matters-getting-started-with-unstructured-data/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=why-all-content-matters-getting-started-with-unstructured-data</link> <comments>http://www.etltool.com/why-all-content-matters-getting-started-with-unstructured-data/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:34:14 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Business Intelligence]]></category> <category><![CDATA[News]]></category> <category><![CDATA[Unstructured data]]></category><guid isPermaLink="false">http://www.etltool.com/why-all-content-matters-getting-started-with-unstructured-data/</guid> <description><![CDATA[Why All Content Matters: Getting Started with Unstructured Data]]></description> <content:encoded><![CDATA[<div class="1234"><p><span style="color: #808080;">After users become familiar with your BI project&#8217;s benefits, they&#8217;ll likely want more. Be prepared to provide analysis of unstructured data. We&#8217;ll show you how to begin.</span> <span id="more-104"></span></p><p>5/28/2008 by Kirby Lunger</p><p>You&#8217;ve spent the last five years defining, establishing, and building an analytical environment for your organization. You received accolades for finally providing access to structured information from your company&#8217;s transactional systems through a business intelligence (BI) tool with underlying data marts, a data warehouse, and a data integration tool. Now &#8212; all of a sudden, it seems &#8212; your colleagues are asking for access to other kinds of content such as e-mail, documents, and audio-visual media through your analytical architecture so they can use this content for predictive analytics in the BI application. Where should you start?</p><p><strong>Content Definitions</strong><br /> To help your coworkers access this &#8220;unstructured&#8221; content, you first need to understand the types of data they want to retrieve. You probably have a good handle on the traditional transactional data that is housed in your analytical environment, especially the information in your databases (stored using multidimensional, relational, and other legacy formats). What your colleagues are asking for is pretty much all of the other content in the organization, which could make up as much as 80 percent of all corporate information assets. Users also want the ability to analyze content about the organization which is traditionally only available &#8220;outside the firewall!&#8221; This unstructured content is less ordered in terms of its information hierarchy, but the information is just as valuable for your performance management application. TDWI has described two types of data sources: semi-structured and unstructured. Semi-structured data includes spreadsheets, flat files, XML documents and RSS feeds. Unstructured data inside your organization is everything else you can imagine, such as e-mail, word processing files, audio-visual content, Web pages, and text fields in all of your organization&#8217;s applications. You may also be asked to provide access to, and analysis of, information outside your organization. This content comes in similar formats but is used for different business purposes.</p><p><strong>Analysis Types</strong><br /> Your colleagues need access to this semi-structured and unstructured content to answer questions such as What are our company&#8217;s contractual obligations across the enterprise? Is our organization meeting its compliance reporting requirements? Your coworkers may also want to gain access to content created outside of your company to address questions such as What do our customers think of our products and services? What are our competitors doing? What trends and buzz in the marketplace could influence our organization?</p><p><strong>Architectural Approaches</strong><br /> The most commonly accepted approach today is to use textual analytics and/or extract, transform, and load (ETL) software to impose order on a data set that may be comprised of many different types of data. These tools deconstruct textual content (often using natural language processing) into data about specific, defined items such as customers or products. These items then are translated into a traditional data structure, such as records in a database row, or entities in a hierarchy. This approach provides some clear advantages. The most obvious is that this content can be integrated into your current BI environment for presentation and analysis. This is a good incremental step if you are trying to get a sense of what value you can derive from this kind of analysis. However, this is only an incremental step to a more fully featured solution. This approach only addresses a subset of the semi-structured and unstructured data, and does not provide new analytical tools for exploring combinations of structured, semi-structured, and unstructured content. A more radical and interesting approach to this problem is to apply techniques that are not commonly used in a more structured environment, primarily from the search software field. Today, we generally assume that when a user accesses the BI platform, he has a precise query in mind such as when your sales manager asks, &#8220;What is the most recent Region One sales forecast and how does it compare to actual sales?&#8221; In contrast, most search platform users are not totally sure what they are looking for. They may have a few parameters in mind (e.g., my new car should be red and have a high-safety rating), and they need the search platform to help them find information that is as relevant as possible to their query. The sales manager using a search paradigm might ask, &#8220;I see that performance was off in Region One last quarter. What were the causes of performance decline in Region One last quarter?&#8221; Such &#8220;fuzzy&#8221; search logic implies a different approach to integrating your semi-structured and unstructured content into your analytic platform. Rather than looking at text as your only data source, you need to provide access to all types of content. This means that rather than folding this content into your data warehouse through an ETL process, you may need to consider some of the newer content ETL products just introduced for this type of initiative. Data visualization and presentation also evolve when you take a more &#8220;search&#8221;-focused approach to such content. In its simplest application, this means searching in your BI application, optimized for the content your users will be trying to search. In a more complex scenario, you could offer analysis techniques such as a content terrain (or &#8220;heat&#8221; mapping) which is similar to a regular topographical map: this visual technique demonstrates content clusters based on particular areas of concentration within your enterprise.</p><p><strong>Getting Started</strong><br /> To start down this path, you will obviously need to take a more holistic view of your organization&#8217;s information and technology architecture to learn what data is available to your end users. You also need to spend time learning what is missing today from the BI environment. Don&#8217;t be surprised if people at first cannot articulate their needs in this arena &#8212; most people do not believe current tools can support this analysis!</p><p>Source and more information: <a href="http://www.esj.com/business_intelligence/article.aspx?EditorialsID=8970" target="_blank">Enterprise Systems Journal</a></p><p>&nbsp;</p></div> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/why-all-content-matters-getting-started-with-unstructured-data/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Kimball University: Should You Use An ETL Tool?</title><link>http://www.etltool.com/kimball-university-should-you-use-an-etl-tool/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=kimball-university-should-you-use-an-etl-tool</link> <comments>http://www.etltool.com/kimball-university-should-you-use-an-etl-tool/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:34:14 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[ETL tools]]></category> <category><![CDATA[News]]></category> <category><![CDATA[Performance]]></category> <category><![CDATA[SQL]]></category><guid isPermaLink="false">http://www.etltool.com/kimball-university-should-you-use-an-etl-tool/</guid> <description><![CDATA[Kimball University: Should You Use An ETL Tool?]]></description> <content:encoded><![CDATA[<p>You can still hand-code an extract, transform and load system, but in most cases the self-documentation, structured development path and extensibility of an ETL tool is well worth the cost. Here&#8217;s a close look at the pros and cons of buying rather than building.</p><p><strong>By Joy Mundy</strong></p><p>The Extract, Transformation, and Load (<a title="What is ETL?" href="http://www.etltool.com/what-is-etl-extract-transform-and-load/">ETL</a>) system is the most time-consuming and expensive part of building a data warehouse and delivering business intelligence to your user community.<span id="more-103"></span> A decade ago the majority of ETL systems were hand crafted, but the market for <a title="ETL software - a list and comparison" href="http://www.etltool.com/etl-software/">ETL software</a> has steadily grown and the majority of practitioners now use ETL tools in place of hand-coded systems.</p><p>Does it make sense to hand-code an ETL system in 2008, or is an ETL tool a better choice? Kimball Group now generally recommends using an ETL tool, but a custom-built approach can still makes sense. This article summarizes the advantages and disadvantages of ETL tools and offers advice on making the choice that&#8217;s right for you.</p><p>ADVANTAGES OF ETL TOOLS</p><p><strong>Visual flow and self-documentation</strong>.The single greatest advantage of an ETL tool is that it provides a visual flow of the system&#8217;s logic. Each ETL tool presents these flows differently, but even the least-appealing of these user interfaces compare favorably to custom systems consisting of stored procedures, SQL and operating system scripts, and a handful of other technologies. Ironically, some ETL tools have no practical way to print the otherwise-attractive self documentation.</p><p><strong>Structured system design</strong>. ETL tools are designed for the specific problem of populating a data warehouse. Although they are only tools, they do provide a metadata-driven structure to the development team. This is particularly valuable for teams building their first ETL system.</p><p><strong>Operational resilience</strong>. Many of the home-grown ETL systems I&#8217;ve evaluated are fragile: they have too many operational problems. ETL tools provide functionality and practices for operating and monitoring the ETL system in production. You can certainly design and build a well instrumented hand-coded ETL application, and ETL tool operational features have yet to mature. Nonetheless, it&#8217;s easier for a data warehouse / business intelligence team to build on the management features of an ETL tool to build a resilient ETL system.</p><p><strong>Data-lineage and data-dependency functionality</strong>. We would like to be able to right-click on a number in a report and see exactly how it was calculated, where the data was stored in the data warehouse, how it was transformed, when the data was most recently refreshed, and what source system or systems underlay the numbers. Dependency is the flip side of lineage: we&#8217;d like to look at a table or column in the source system and know which ETL modules, data warehouse tables, OLAP cubes, and user reports might be affected by a structural change. In the absence of ETL standards that hand-coded systems could conform to, we must rely on <a title="ETL vendors" href="http://www.etltool.com/etl-vendors/">ETL vendors</a> to supply this functionality — though, unfortunately, few have done so to date.</p><p><strong>Advanced data cleansing functionality</strong>. Most ETL systems are structurally complex, with many sources and targets. At the same time, requirements for transformation are often fairly simple, consisting primarily of lookups and substitutions. If you have a complex transformation requirement, for example if you need to de-duplicate your customer list, you should use a specialized tool. Most ETL tools either offer advanced cleansing and de-duplication modules (usually for a substantial additional price) or they integrate smoothly with other specialized tools. At the very least, ETL tools provide a richer set of cleansing functions than are available in SQL. Consider buying the <a href="https://www.etltool.com/order.php">ETL &amp; Data Integration Tool Survey</a> to see how the ETL tools compare on this aspects.</p><p><strong>Performance</strong>. You might be surprised that performance is listed last under the advantages of the ETL tools. It&#8217;s possible to build a high-performance ETL system whether you use a tool or not. It&#8217;s also possible to build an absolute dog of an ETL system whether you use a tool or not. I&#8217;ve never been able to test whether an excellent hand-coded ETL system outperforms an excellent tool-based ETL system; I believe the answer is that it&#8217;s situational. But the structure imposed by an ETL tool makes it easier for an inexperienced ETL developer to build a quality system.</p><p><a href="https://www.etltool.com/order.php"><img class="alignnone" style="margin-bottom: -10px;" src="http://www.etltool.com/download-the-etl-tools-survey-button.png" alt="" /></a></p><p>Source and more information: <a href="http://www.intelligententerprise.com/channels/information_management/showArticle.jhtml?articleID=207002081&amp;pgno=1" target="_blank">Intelligent Enterprise</a></p> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/kimball-university-should-you-use-an-etl-tool/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Growing data volumes present problems for data warehouse projects</title><link>http://www.etltool.com/growing-data-volumes-present-problems-for-data-warehouse-projects/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=growing-data-volumes-present-problems-for-data-warehouse-projects</link> <comments>http://www.etltool.com/growing-data-volumes-present-problems-for-data-warehouse-projects/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:34:14 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Data warehouse]]></category> <category><![CDATA[News]]></category><guid isPermaLink="false">http://www.etltool.com/growing-data-volumes-present-problems-for-data-warehouse-projects/</guid> <description><![CDATA[Growing data volumes present problems for data warehouse projects]]></description> <content:encoded><![CDATA[<div class="1234">Many companies are seeing very significant increases in data volumes and these are having an impact on their data warehouse programmes, according to the latest survey from PMP Research. The research has been commissioned by the Evaluation Centre (www.evaluationcentre.com). <span id="more-102"></span>&nbsp;</p><p>Most of the organisations polled (68%) report that data volumes have increased substantially over the past three years, with a further 25% indicating more modest rises. Only 2% reckon that data volumes have stayed constant over that time period.</p><p>Half (54%) of the sample have already initiated data warehousing projects, with 10% currently evaluating their options and 22% considering such a development. One in five has started a programme within the past year, while a quarter (24%) have had a data warehousing programme for the past two years and the same proportion (24%) for between three and five years.</p><p>The main reason for implementing a data warehouse emerges as the desire to improve the flow of management information, cited by 61%. However, the accuracy and reliability of corporate information remains a crucial issue for many, with 28% admitting that data quality was ‘bad&#8217; prior to implementing a data warehouse.</p><p>Just 10% describe their data quality as ‘good&#8217; before implementation, and none believe it was ‘excellent&#8217;. In contrast, after implementing a data warehouse, the majority (45%) now report that data quality is ‘good&#8217; and 6% say it is ‘excellent&#8217;.</p><p>Making improvements on this scale takes time, with 16% of companies spending up to 12 months cleaning up their data and 13% more than a year. There are clear benefits from this exercise, though, as 48% now report no difficulties in analysing the data obtained from their data warehouse.</p></div> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/growing-data-volumes-present-problems-for-data-warehouse-projects/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Value-Driven Data Warehousing</title><link>http://www.etltool.com/value-driven-data-warehousing/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=value-driven-data-warehousing</link> <comments>http://www.etltool.com/value-driven-data-warehousing/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:34:13 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[Business Intelligence]]></category> <category><![CDATA[Data warehouse]]></category> <category><![CDATA[News]]></category><guid isPermaLink="false">http://www.etltool.com/value-driven-data-warehousing/</guid> <description><![CDATA[Value-Driven Data Warehousing]]></description> <content:encoded><![CDATA[<div class="1234">&#8220;I am tired of IT people. They&#8217;ll talk to you for half an hour about what they do, but by the time they&#8217;re finished, you still don&#8217;t have any idea what they&#8217;ve told you or why it&#8217;s important,&#8221; so complained a friend of mine in a recent conversation.<span id="more-101"></span>What was the irritant giving rise to my friend&#8217;s frustration? Ahem … my own ineffective attempt to explain to him what it is that I, a data warehousing/business intelligence (DW/Business Intelligence) practitioner, do.Sometimes the right metaphor is helpful. It can clarify abstract concepts for the uninitiated and, even for the expert, be a means for synchronizing designs and vocabularies and analyzing problems. To that end, let me propose a metaphor for describing what DW/Business Intelligence is all about and, perhaps more importantly, suggest where the field is broken: the metaphor is that of an information supply chain.</p><p>This metaphor &#8211; the information supply chain &#8211; is in some ways a simple extension upon thoughts already well developed by others, most notably Bill Inmon&#8217;s Corporate Information Factory (CIF), and yet the supply chain metaphor doesn&#8217;t seem to enjoy much use in the marketplace, at least not as a framework for holistically describing both the DW and Business Intelligence, together.</p><p>Inmon describes the enterprise data warehouse and the various satellite systems that surround it &#8211; operational data stores (ODSs); extract, transform and load (<a title="What is ETL?" href="http://www.passionned.com/data-integration/etl-extract-transform-and-load/">ETL</a>); data marts, etc. &#8211; as similar to a brick-and-mortar factory. The factory inputs many disparate raw ingredients (data from all the company&#8217;s operational systems), subjects those raw data inputs to a series of processes wherein the data is refined and integrated (ETL) and finally spits out a finished good on the other end (corporate information). Whether a practitioner leans more toward Kimball&#8217;s dimension bus architecture or Inmon&#8217;s CIF as the guiding vision for how to build a <a title="A data warehouse, is a database for (operational) management information, and provides a single version of the truth" href="http://www.passionned.com/data-integration/data-warehouse/">data warehouse</a> (or like this author you feel the two approaches are more complementary than contradictory), Inmon&#8217;s metaphor is simple to understand and helpful for explaining what data warehousing is all about.1</p><p>Extending the metaphor to encompass the field of business intelligence is beneficial, since definitions of Business Intelligence seem pretty slippery, and it can be unclear how Business Intelligence overlaps or is different from the enterprise data warehouse. I believe the reason for this confusion, at least in part, comes down to how the term &#8220;BI&#8221; is often used &#8211; that is to say &#8211; it is used to convey purpose, rather than function. Business Intelligence is typically defined as: the collection of <a title="Business Intelligence tools provide management and employees reliable information and insights in order to enhance decision making, social collaboration and a better performance." href="http://www.passionned.com/business-intelligence/business-intelligence-tools/">Business Intelligence tools</a> and techniques used to help companies make better decisions. This definition is very broad. It expresses a worthy purpose, but it casts a huge net within which almost anything can fall and thereby blurs important distinctions.</p><p>Read the full article: <a href="http://www.dmreview.com/article_sub.cfm?articleId=1082702" target="_new">DMReview.com</a></p></div> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/value-driven-data-warehousing/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>How To Mine All That Customer Data</title><link>http://www.etltool.com/how-to-mine-all-that-customer-data/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=how-to-mine-all-that-customer-data</link> <comments>http://www.etltool.com/how-to-mine-all-that-customer-data/#comments</comments> <pubDate>Thu, 26 Jan 2012 11:34:13 +0000</pubDate> <dc:creator>etltool</dc:creator> <category><![CDATA[News]]></category><guid isPermaLink="false">http://www.etltool.com/how-to-mine-all-that-customer-data/</guid> <description><![CDATA[How To Mine All That Customer Data]]></description> <content:encoded><![CDATA[<div class="1234"> <em>A&#8217;dam &#8211; September, 22th 2007</em> &#8211; The seeming simplicity of a term like &#8220;business intelligence&#8221; belies the rich potential its implementation can provide to a small or midsize company. Data mining is the backbone of BI&#8211;it means sorting through company data and identifying and extracting valuable information about operations.<span id="more-100"></span> In a customer-centric business, the information can be particularly compelling. We turned to Gordon Linoff, a principal at Data Miners, to learn what data mining for BI can do for small and midsize businesses.</p><p> bMighty: What is it about the current business climate that makes mining data for BI about a company&#8217;s customers so vital?</p><p> Linoff: Once upon a time&#8211;and this is a rosy view of the ancient world&#8211;the hardware store owner knew you. If you went to the grocery store, someone there knew what items you needed. If a woman walked into a bank to make a deposit and she was pregnant, [the bank manager] knew and offered her a car loan. We traded all that for lower interest rates and lower prices, we traded all that for more efficiency, but we lost the intimacy with our customers. How can we get to know our customers again? That&#8217;s one component of business intelligence.</p><p> Where do small and midsize companies fit into this BI picture?</p><p> Well, it depends upon their relationship with their customers. If they only have a handful of larger clients, they know them. But one of our clients, The Vermont Country Store, is a cataloger that sends to several million people. The company has about 400 employees. How are 400 people going to know about several million people? They need to use the information they collect in order to better serve their customers and increase revenu.</p><p> But there&#8217;s not an overall approach, because businesses are different, and there are several different levels that businesses operate on. Some are campaign-driven, to encourage customers to buy, and [business intelligence helps] them optimize each campaign and [some use] traditional predictive models associated with data mining, tailoring aspects of their Web sites to their customers or sending e-mails to their customers to return to their site.</p><p> For example, on Amazon.com, which is not a small company, if I bought Harry Potter IV, [they know] I will buy Harry Potter V, VI, and VII, [because they're asking] what&#8217;s the next type of thing that I&#8217;ll buy? You can get associations or market basket analysis. For companies in retail, association rules are something they&#8217;re interested in. What things sell together?</p><p> How can a company&#8217;s IT managers maximize the benefits of business intelligence?</p><p> The IT manager needs to enable communication&#8211;it&#8217;s the most important thing. In most companies, when [management] asks questions, [the IT manager] says it can&#8217;t be done, because people get trained not to make decisions off of data. For instance, if the marketing department of a company wants to plan a promotion, and needs to know what other items sold last year when soup was on sale, the IT manager might say, &#8220;we&#8217;ll know in three months.&#8221; So the question stops being asked and opportunities are lost. Then people start making decisions based on intuition and they [lose] the ability to make more informed decisions.</p><p> How can they get that ability to make more informed decisions?</p><p> It&#8217;s the people on the front line&#8211;the worst-paid people&#8211;who know the customers. You need a mechanism to get that information out. I have another great example. There was a guy who owned a chain of drugstores that sold chocolates on Valentine&#8217;s Day. It was big business for them. Using their BI system, the SAS OLAP server, they sliced and diced their data to discover that two of the stores had higher sales in chocolate. Both were managed by the same person.</p><p> They discovered that instead of placing the lower-priced chocolates on a lower shelf and the more expensive chocolates higher up he was mixing them together. It turns out, guys were coming in and reaching for the cheaper stuff and then saying, &#8220;she&#8217;s worth it.&#8221; This demonstrates that people on the front line know what&#8217;s going on, so you need to have systems to detect what they know. You need systems in place to communicate the best practices.</p><p> Source: <a href="http://www.forbes.com/entrepreneurstechnology/2007/09/21/microsoft-oracle-amazon-ent-tech-cx_ng_0921bmightydatamining.html">Forbes</a></div> ]]></content:encoded> <wfw:commentRss>http://www.etltool.com/how-to-mine-all-that-customer-data/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced
Object Caching 636/637 objects using disk: basic

Served from: www.etltool.com @ 2013-06-20 05:33:08 -->