SSIS Junkie

ETL and Data Integration

There is an interesting article today on The Reg, authored by a researcher from Bloor Research, who gives his views on IBM's lack of visibility in the ETL arena. Whilst IBM's efforts (or lack thereof) in this area are unlikely to affect me as a Microsoft BI practitioner, it is pleasing to see that researchers are beginning to wake up and see the benefit that ETL can bring to an enterprise and, moreover, the advantage of adopting an enterprise-wide data integration strategy [in other articles I have seen mention of Data Integration Centres of Excellence within an enterprise - an interesting concept].

Previously ETL has been thought of as a mechanism for getting data into a data warehouse, but there are bound to be many more opportunities to leverage the heavy-lifting capability of ETL tools within an enterprise as data volumes increase and, with them, the need for centralised data management. In time, I can see that the definition of an ETL tool is going to change, or at least the areas that it encompasses will.

In the article the author touches on an emerging need for synergy between two areas that were previously thought to be separate, those being ETL and EAI/EII/Data Integration/Data Federation (call it what you will); it seems that in this day and age the lines are blurring. We at Conchango have started to think about these issues and will be providing thought leadership in this area. Watch this space!

The author states: "ETL ... probably isn't the best approach if you have heterogeneous databases across your organisation". In my humble opinion, this is wrong. ETL tools are, these days, designed to be able to access heterogeneous data sources and they do it well. Indeed, the tool that I know and love, SQL Server Integration Services (SSIS), has a number of tools available for accessing data that does not necessarily exist in traditional relational format. I will concede that it probably doesn't handle unstructured data as well - perhaps that is the next step! However, I'm sure that Don Farmer would want me to point out that SSIS together with BizTalk presents a pretty compelling picture for enterprise-wide data integration. What is key here is that the tools are just that: tools. The overall message will be that the Windows Server System can fulfil this capability and it can probably do it cheaper than anyone else.


Published Friday, March 11, 2005 9:26 AM by jamie.thomson



jamie.thomson said:

I'm shocked. SSIS does not handle unstructured data as well? Get ye thence to the text mining components (Term Extraction and Term Lookup) and try them out - when combined with the fuzzy components and data mining (for finding clusters of related items by term), you can build awesome unstructured data handling.

OK, I've calmed down now. Wonderful blog, Jamie. And great points about how the Windows Server System components work together to make the most cost effective platform for enterprise integration.
March 13, 2005 5:11 AM

Philip said:

Good blog!
November 2, 2005 6:24 PM