<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-9173851091724725518</id><updated>2012-01-23T11:59:32.540-08:00</updated><title type='text'>Big Honking Databases</title><subtitle type='html'>Technology and business issues affecting BI. [Opinions expressed here are my own, and in no way constitute an endorsement or opinion of my employer]</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>48</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4857348691874282423</id><published>2010-01-28T20:46:00.000-08:00</published><updated>2010-01-28T20:47:39.006-08:00</updated><title type='text'>Insert Cricket Sound...</title><content type='html'>I've been so jammed up lately I never got a chance to update this blog. As you might have heard (if you were kind enough to check back here for new content) I have left the IT/database industry for the horology one, have started a new job about a month ago, and am in the process of moving to Switzerland to pursue a new position and embark on a new career. I won't go into the details here and will just point you to &lt;a href="http://whatwatch-jeromepineau.blogspot.com/"&gt;my new blog&lt;/a&gt; for all the juicy news.&lt;br /&gt;&lt;br /&gt;If you've been following my posts on here for the past year or so, I thank you for your readership and, most of all, your precious participation. I've learned a boatload of valuable lessons working on this blog and interacting with you since the early days.&lt;br /&gt;&lt;br /&gt;I won't be discussing software and BI issues on here any longer obviously, but if you also happen to be interested in social media, marketing, brand management, community building (oh yeah, and watches!) then as they say back in NYC where I grew up - Meetcha 'round the corner! :)&lt;br /&gt;&lt;br /&gt;Peace.&lt;br /&gt;Jerome.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4857348691874282423?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4857348691874282423/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2010/01/insert-cricket-sound.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4857348691874282423'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4857348691874282423'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2010/01/insert-cricket-sound.html' title='Insert Cricket Sound...'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4227698556768425339</id><published>2009-12-21T09:36:00.000-08:00</published><updated>2009-12-21T09:36:05.084-08:00</updated><title type='text'>Social Media Marketing 101 - The New Fountain of Youth</title><content type='html'>&lt;div style="background-color: white; counter-reset: __goog_page__ 0; font-family: Verdana; font-size: 10pt; line-height: normal; margin-bottom: 6px; margin-left: 6px; margin-right: 6px; margin-top: 6px; min-height: 1100px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;"&gt;I was recently asked to opine on what makes a social media manager and his (or her) strategy successful in today's market. Having dabbled in the field for a little while now (perhaps what can be considered a long time in this emerging medium) it forced me to stop and reflect on my own trials and tribulations in the realm of social media marketing (SMM). The first thing I pondered is how to even define "success" in social media and community management. It's not something that can be pinned down easily like an engineering project where criteria are clearly defined. Namely, does the product work as expected, is budget under control, and are the customers happy? Managing a brand or product via social media channels is more akin to wine making or political campaigning. It's something that takes considerable time, involves a myriad of tools and tactics, and, quite honestly, more than a little bit of luck. And it also takes a special kind of personality.&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;So first off, what is "social media"? It is a set of web-based communication channels connecting a brand with existing and potential customers. These typically include a &lt;a href="http://www.jeromepineau.com/"&gt;website&lt;/a&gt;, &lt;a href="http://www.twitter.com/JeromePineau"&gt;Twitter&lt;/a&gt;, video channels (YouTube or Vimeo, for example), several social networking sites like MySpace, &lt;a href="http://www.facebook.com/jerome.pineau"&gt;Facebook&lt;/a&gt;, or &lt;a href="http://www.linkedin.com/in/jeromepineau"&gt;LinkedIn&lt;/a&gt;, maybe some IM services, throw in a couple photo sites like Flikr, and perhaps some opt-in email or SMS push channels. Social media marketing (SMM) and social media optimization (SMO, a subset), is the art of juggling all these channels simultaneously to build brand recognition and increase customer acquisition. The reality is in fact a bit more complicated, but those are the basics.&amp;nbsp;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;Note that I called this SMM endeavor an "art" - and purposely so - because to the best of my knowledge, there are no clear well-defined best-practice recipes for success in this business and most companies tend to fly this bird by the seat of their pants. Some better than others. Yes, there are numerous self-proclaimed social media experts and consultants out there. People like &lt;a href="http://www.briansolis.com/"&gt;Brian Solis&lt;/a&gt;, for example, who clearly know what they're doing and contribute substantially to educating the community about what works and what doesn't but in reality, no one really knows what works for sure. There are no magic formulas or guarantees. Just lists of obvious pitfalls to avoid and generally-accepted best practices, but that's about it. So here are my distilled two cents on SMM strategy based on personal and vicarious experiences of the past couple of years.&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;b&gt;You know your SMO strategy is starting to pay off when:&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;1. Your web analytics indicate growing popularity and visit stickiness.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;2. You see increased public participation in all your social media outlets.&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;3. You start getting unsollicited interest from the press and industry influencers.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;4. You find it easier to establish relationships and partnerships with distribution channels.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;5. You start getting attention from competitors.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;6. Customers start evangelizing your product to their peers.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;7. Your brand awareness starts spreading outside your targeted segments.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;8. You start seeing measurable and sustained growth in customer acquisition.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;9. You start seeing repeat orders from customers.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;10. You pick up positive comments about the experience of doing business with you.&lt;/div&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;b&gt;What does it take to become successful at this game? A couple of tips here:&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;1. Do not profess expertise on topics you know little about. Eventually, it will show.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;2. Always remain honest. Never lie. Your most important asset is credibility. You can fix almost any mistake except credibility damage.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;3. Be genuine. Maintaining a special "online" personality is not genuine.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;4. Be obsessed about customers. Walk in their shoes. They're always right.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;5. Do not let fear of mistakes (or failure) paralyze you. It's OK to make mistakes. What's important is quick admission and correction.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;6. Be either loved or hated. Middle ground is not compelling.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;7. It's not a job, it's a way of life and a mission. Understand the expectations. It's a marriage, not a date.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;8. At the end of the day, it's all about the "story". Understanding show business is crucial.&lt;/div&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;9. If you suck at building relationships in real life, you will suck at it online. The inverse is not necessarily true.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;10. Question everything, keep an open mind, but be wary of "experts" (the field is too new).&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;I've had my share of SMM boo-boos in the past years; online successes and failures. Stuff I thought would never work (that did) and things I knew for sure would take off (but never did). There's nothing like this new media to bang some realism back into one's ego and topple pre-conceived notions. That's why I love it. The beauty of this new "frontier" is its never-ending capacity to change, evolve and teach. Try it. It'll keep you young :)&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4227698556768425339?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4227698556768425339/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/12/social-media-marketing-101-new-fountain.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4227698556768425339'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4227698556768425339'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/12/social-media-marketing-101-new-fountain.html' title='Social Media Marketing 101 - The New Fountain of Youth'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4035195819019938695</id><published>2009-12-07T12:34:00.000-08:00</published><updated>2009-12-07T12:39:03.828-08:00</updated><title type='text'>The Wailing Wall of Open Source BI</title><content type='html'>&lt;div style="background-color: white; counter-reset: __goog_page__ 0; font-family: Verdana; font-size: 10pt; line-height: normal; margin-bottom: 6px; margin-left: 6px; margin-right: 6px; margin-top: 6px; min-height: 1100px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;"&gt;Henry David Thoreau once wrote: "The mass of men lead lives of quiet desperation". Much the same can be said of the multitude of users struggling with open source reporting and analysis tools like Mondrian or Jaspersoft. The difference, of course, if that those folks happen to be pretty vocal. And nowhere more so than on those vendors' own "support" forums.&lt;br /&gt;&lt;br /&gt;I double-quote the term purposely. Because what you witness on these forums falls far (very far) short of what I consider to be minimally acceptable customer support levels. Now, it's a fact that many people find solace in these communities - after all, given the massive amounts of questions posed there, some are bound to get answered quickly and (hopefully) correctly. And it's a fact that many people achieve success (or some level of it) with open source BI solutions. At what cost, we can only surmise, but clearly, resilient, persistent and courageous people are getting some work done on these platforms.&lt;br /&gt;&lt;br /&gt;But more often than not, the levels of post abandonment (ignored questions) and the arrogant responses are high enough to be shocking. I am stunned at the number of times where posting users are treated like it's their fault. The implication is that they are stupid or negligent. As a matter of fact, if you spend the time analyzing the language semantics, what you see are fearful, timid, often desperate users mustering the courage to post questions in the hope that somehow, someday, they will be answered by the Grand Wizards of [fill in the OSS BI vendor name]. This is not unlike the Wailing Wall in Jerusalem, where masses of people stick written prayers between the stones.&lt;br /&gt;&lt;span style="font-family: 'Times New Roman'; font-size: medium;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: 'Times New Roman'; font-size: medium;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: 'Times New Roman'; font-size: medium;"&gt;&lt;div class="separator" style="clear: both; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_dCMl0ieMmTw/Sx1mjcKO-gI/AAAAAAAACwU/RgJ_RB4KBc0/s1600-h/wailingwall.gif" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_dCMl0ieMmTw/Sx1mjcKO-gI/AAAAAAAACwU/RgJ_RB4KBc0/s320/wailingwall.gif" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;/span&gt;I find that these forum "masters" are too often condescending and consequently, they generate a "master-pupil" environment I find somewhat repulsive. It reminds me a lot of the old school systems in Europe, where Professors would stand on an elevated stage above the class to seal their social superiority. There's nothing wrong with respect but in my opinion, it has to be earned. When you berate, ignore or insult those who seek to learn from you, you are far from deserving respect, Grasshopper.&lt;br /&gt;&lt;br /&gt;I do realize that these vendors provide some level of "enterprise" support for those wishing to license the software but given what you see in the public forums, it's hard to imagine it can be any better on the paid side because, quite honestly, Open Source companies simply don't "get" service. They are too often blinded by the magnificence of their code or product and forget that without users and healthy bi-directional communities, there wouln't be a company, Open Source or not.&lt;br /&gt;&lt;br /&gt;My experience navigating through several Open Source forums was similar. At the time I figured, it's just me. I'm an impatient SOB and it's my problem. But when you start analyzing these forums (and I spent hours doing so), you quickly realize the disease is wide-spread.&amp;nbsp; All of a sudden, it's not just you anymore - almost everyone has the darn infection! I often wonder why people put up with this nonsense. Deep inside, I know the answer of course: because they don't have a choice. Either they can't find an easier product to use, or they are compelled to use it for organizational reasons (as in, my boss told me to check this out &amp;lt;sigh&amp;gt;). I do believe the advent of SaaS BI analytical tools spell the end of an era for the multitudes who must suffer through Open Source to get even the simplest of reports and dashboard out in less than three months!&lt;br /&gt;&lt;br /&gt;But don't take my word for it. After all, I work on the SaaS side of BI and am likely biased. Even worse, I actually blogged a while ago about the merits of using Mondrian for OLAP. Indeed, after many months of putzing with it, I was able to accomplish something useful but I have twenty years of experience in IT. Read that again: two decades of messing around with difficult stuff and figuring out how to make it work. So sure, for someone like me (and if forced into it) you bet open source can work. But what about the poor guys who don't have that kind of experience? What about the people tasked with just getting reports or analytics done? I feel bad for those folks who invest time and considerable effort in open source solutions only to be left at the altar of success. And if you don't believe me when I say they're out there en masse, then feast your eyes on the following selected quotes and links from Pentaho and Jaspersoft OSS BI forums (they are reproduced verbatim, grammar, spelling and emotional content intact).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;"It seems I first have to dive in pentaho source code to figure how the printing function in pentaho is working"&lt;br /&gt;&lt;br /&gt;"I am new user to Pentaho. I downloaded mondrian-3.1.2.13008. Now please advise me the steps I need to follow to configure modrian in my system."&lt;br /&gt;&lt;br /&gt;"Although I'm no expert, I think Spreadsheet services has been discontinued, well I've never seen a Pentaho Employee mention otherwise"&lt;br /&gt;&lt;br /&gt;"I've been trying to use IIF all morning but I'm getting nowhere."&lt;br /&gt;&lt;br /&gt;"I have two questions, which I barely dare to ask, but I am sitting here and can't get over my problem:"&lt;br /&gt;&lt;br /&gt;"I tryed all combinations. The only combination that works is fact table,dimension tables and aggregates tables in the same Mysql schema, and a datasource that point to that schema."&lt;br /&gt;&lt;br /&gt;"Hi everybody. Though I've run the demo applications and read almost every thread about this, still I can't get an exact understanding of the relation existing between Pentaho Server and Mondrian."&lt;br /&gt;&lt;br /&gt;"I think that the preconfigured examples of JSPs, catalogs etc. are confusing at best - bad practice at worst."&lt;br /&gt;&lt;br /&gt;"I am fairly new to Mondrian, and I am stuck at ths point"&lt;br /&gt;&lt;br /&gt;"This might be very trivial for some of you but I'm really having a hard time figuring this out. So any help is greatly appreciated. Thanks so much."&lt;br /&gt;&lt;br /&gt;"I've goggle this one down to my knuckles. I suspect my chopsticks are tangled on the Mondrian/Pentaho side? I would appreciate any thoughts the community might have."&lt;br /&gt;&lt;br /&gt;"Nobody can help? If so, does anyone know where else I can go to find the answer ... I've pretty much exhausted Google's list of MDX links, and am getting a little desperate now. Don't want to tell my client "it can't be done" without good cause..."&lt;br /&gt;&lt;br /&gt;"Desperate for assistance. Last big issue to overcome and deliver my pentaho bi solution."&lt;br /&gt;&lt;br /&gt;"Need some Help again.. I told my Boss that I will get the prototype ready by tommorow. Any help would be appreciated"&lt;br /&gt;&lt;br /&gt;"I'm quite desperate.... ... please... help...."&lt;br /&gt;&lt;br /&gt;"Helloo...???? Anyone there?? My question may be silly, still any of your ideas would be much helpfull..Am really desperate to find the logic.. Am not good in java and am in verge of my project and solving this issue resolves half of my task...Please respond..!!!"&lt;br /&gt;&lt;br /&gt;"We are pretty desperate by now. We tried to manually re-insert the UNC server part into the file name, but this is getting dirtier and dirtier as we build new jobs."&lt;br /&gt;&lt;br /&gt;"Sorry bothering you again, but I'm in a desperate situationn. We are trying to run the new Pentaho GA, but we have a lot of problems when we try to run this with Oracle."&lt;br /&gt;&lt;br /&gt;"I'm really fed up now trying to make charts working over couple of days. I have created a database connection and given the field names correctly. Can someone please help me? I'm really desperate to get this working.."&lt;br /&gt;&lt;br /&gt;"I know this has been logged by both myself and others before, however I am desperate for an answer here."&lt;br /&gt;&lt;br /&gt;"I&amp;nbsp; checkd out all the sample dashboards of cdf.. but am clueless to create a dashboard of my own.. is there a dashboard builder for cdf??"&lt;br /&gt;&lt;br /&gt;"I dont consider myself a fool when it comes to working with technology, and I have found myself frustrated and confused beyond belief at what should be a seemingly simple task..."&lt;br /&gt;&lt;br /&gt;"And...regarding the timestamp....how in the heck to I tell that to not show up if I have no idea where it's coming from? Sorry if I seem frustrated, but I am."&lt;br /&gt;&lt;br /&gt;"I've just recently begun working with Pentaho, so forgive me if this is a known issue or feature...I did a search on these forums and couldn't find an answer...a frustrated contractor in DC."&lt;br /&gt;&lt;br /&gt;"I got frustrated that i cannot get a decent quality, well formatted pdf output."&lt;br /&gt;&lt;br /&gt;"However, I do have to say I really had to grind through the first 10 days or so in order to get this thing down to a point of not hitting hurdles every step of the way. In fact, I've got a colleague who is really pretty frustrated and trying to jump ship"&lt;br /&gt;&lt;br /&gt;"A few weeks ago I was very frustrated that I was completely lost withing thePentaho suite. I cannot afford training (I'm broke, plus I live too far away from the traing venues) so my only means at this stage is all of the above"&lt;br /&gt;&lt;br /&gt;"I’m not trying to start a ruckus or anything I just wanted to state when I am coming from and what I would like to see from Pentaho. Judging from the many unanswered post on this subject I think there are many more like me out there."&lt;br /&gt;&lt;br /&gt;"Sorry about the begginer question. But i'm getting frustrated"&lt;br /&gt;&lt;br /&gt;"I am getting increasingly more frustrated it is quite hard to get started. The documentation seems only to touch the very basics. "&lt;br /&gt;&lt;br /&gt;"I am a bit frustrated right now as I haven't been able to solve any of my problems."&lt;br /&gt;&lt;br /&gt;"...first, *I* don't think i'm an idiot. what frustrated me was the lack of TASK-ORIENTED information i found. i had a fairly simple, fairly small TASK i needed to GET DONE... i dont care about kettle or apis or what great problems Kettle could solve... I had a TASK and an IRATE BOSS."&lt;br /&gt;&lt;br /&gt;"I'm pretty frustrated at this point: I mean seriously how much more simple could a transformation be!?!?!?!?!?!"&lt;br /&gt;&lt;br /&gt;"If I am posting about that, it is because I ve been frustrated much!"&lt;br /&gt;&lt;br /&gt;"I was very excited to find Pentaho and eagerly wanted to create my first Dashboard. Now I’m just completely frustrated. And as I search these posts, I see thousands of people are looking for the same basic answer with little to no response from Pentaho: How do you create/deploy a non-sample dashboard; i.e. how do you actually use this thing?"&lt;br /&gt;&lt;br /&gt;"Maybe a non-IT guy doesn't want to deal with SQL queries at all...&lt;br /&gt;&lt;br /&gt;Now let's look at a couple of "answers" to many of these posts (unfortunately, questions outnumber answers significantly):&lt;br /&gt;&lt;br /&gt;"You use the search button. Version 2/3/3.5 brings in lots of changes to the way Pentaho does things, please mention which version of the BI server you are running.&lt;br /&gt;Join the Unofficial Pentaho IRC channel on freenode. Server: chat.freenode.net Channel: ##pentaho - Please try and make an effort and search the wiki and forums before posting!"&lt;br /&gt;&lt;br /&gt;"If all that is true then it is clearly a bug in mondrian and you should report it in jira."&lt;br /&gt;Here, the moderator seems to question whatever the premise of the question was (as if people bothered posting lies...)&lt;br /&gt;&lt;br /&gt;"The procedure is exactly as I have told you. You must have done something wrong somewhere. Just check through carefully;"&lt;br /&gt;Typical "you must have screwed up something" response. How encouraging and a little condescending if you ask me.&lt;br /&gt;&lt;br /&gt;"If not, please post the error. We're not clairvoyant."&lt;br /&gt;&lt;br /&gt;"There might be a simpler way... search the doc."&lt;br /&gt;&lt;br /&gt;In this &lt;a href="http://jasperforge.org/plugins/espforum/view.php?group_id=112&amp;amp;forumid=102&amp;amp;topicid=44337"&gt;post&lt;/a&gt;, a user actually bothers pasting a novel sized exception asking for help. Specifically, he asks: "&lt;span style="font-style: normal;"&gt;&lt;span style="text-align: left;"&gt;&lt;span style="font-size: x-small;"&gt;Any help or even pointers to documentation more than Spring/Acegi provides would be greatly appreciated&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;".&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;What does the Jasper guy reply with?&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;"&lt;span style="font-size: x-small;"&gt;JAAS authentication can certainly be done.&amp;nbsp;&lt;/span&gt;&lt;span style="font-size: x-small;"&gt;Have a look at the Acegi documentation...&lt;/span&gt;&lt;span style="font-size: x-small;"&gt;The forum is also a great resource. - Is he kidding? Did he even read the post?&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Here's another &lt;a href="http://jasperforge.org/plugins/espforum/view.php?group_id=112&amp;amp;forumid=102&amp;amp;topicid=57673"&gt;classic&lt;/a&gt;: a user is complaining about the advanced search in the Jasper forums. The moderator's suggestion is terse: "it is what it is". Then he suggests using Google! Are these people serious?&lt;br /&gt;&lt;br /&gt;How about this comment: "I am desperate to use JasperReports , however I am having hard time understainding it".&amp;nbsp; Or this one: "Still can't compile - getting desparate...I am getting nowhere trying to get my reports to compile."&lt;br /&gt;&lt;br /&gt;What can we conclude from this mass of desperation, frustration and confusion? Why would these vendor create and maintain such an environment to begin with? Because quite honestly, if I were looking for a BI platform for a critical project or client, and came upon these forums in the process, I'd run so fast the other way it's not even funny.&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-top: 0px;"&gt;As I mentioned above, if you have some serious software development and IT background, the luxury of time, (the right hardware and software) and a propensity for hacking complex nebulous systems, or the courage to read a &lt;a href="http://forums.pentaho.org/showthread.php?t=72301&amp;amp;highlight=difficult"&gt;600 page book&lt;/a&gt; on working with Pentaho, for example, these open source solutions might just do the trick. If your organization or client is insisting on going the Open Source route, and you have no say in the tooling selection, then clearly you have to deal with the pain, hope for the best, and pray for a check (or a job) at the end of the ordeal.&lt;br /&gt;&lt;br /&gt;But if you're just a normal analyst or consultant tasked with a BI implementation on a tight deadline with an impatient boss breathing down your neck, there is an alternative. And if you're just a guy (or gal) who needs to get the job done now (as in, this week) without the drama and headaches of open source theatrics, you'll likely be looking at a SaaS BI solution.&amp;nbsp; Now, SaaS BI may not turn out to be your bag, but you owe it to yourself to at least give it a whirl and here's why (putting my pitching hat on):&lt;br /&gt;&lt;br /&gt;It takes very little time, effort and money to try it out. In many cases, it's free. There's nothing to install or configure. It takes hours or days, not weeks or months, to build and showcase prototypes. Most of the time (as in 80%) the functionality is sufficient to do the job. And features/functionality increases dramatically with each release. Oh, and there's updated documentation. But here's the kicker: you'll actually get just-in-time support from people who actually care about your success. &amp;nbsp;As a matter of fact, they're invested in it. They will welcome your questions and use them to improve the offering. They won't ignore you and they won't treat you like an inferior species.&lt;br /&gt;&lt;br /&gt;I know this is sounding a lot like "Miracle on 34th Street". Must be the season. Truth be told, SaaS BI is not for everyone, and your mileage may vary. But it's worth an honest shot. Because the alternative is an endless Wailing Wall of pain, misery and isolation that no one in BI should have to put up with anymore.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4035195819019938695?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4035195819019938695/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/12/wailing-wall-of-open-source-bi.html#comment-form' title='37 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4035195819019938695'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4035195819019938695'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/12/wailing-wall-of-open-source-bi.html' title='The Wailing Wall of Open Source BI'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_dCMl0ieMmTw/Sx1mjcKO-gI/AAAAAAAACwU/RgJ_RB4KBc0/s72-c/wailingwall.gif' height='72' width='72'/><thr:total>37</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-5145936484658054145</id><published>2009-11-29T17:40:00.000-08:00</published><updated>2009-12-08T14:24:01.376-08:00</updated><title type='text'>Of CRM Golden Calves and Ten Commandments of SaaS Landlordship.</title><content type='html'>&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;div class="MsoNormal"&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;span style="font-size: 13px;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;I have a pet peeve concerning two things I’ve been meaning to write about lately, namely CRM and multi-tenancy. I recently read an excellent article referencing both concepts and thought it might be a good stepping stone to a quick blog post.&lt;br /&gt;&lt;br /&gt;In his article &lt;a href="http://www.destinationcrm.com/Articles/Web-Exclusives/Viewpoints/Don%E2%80%99t-Get-Conned---57660.aspx"&gt;Don’t Get Conned – The many disguises worn by software-as-a-service&lt;/a&gt;, Matt Wallach discusses CRM software and the concept of SaaS multi-tenancy. &amp;nbsp;Matt’s business is life sciences, and he discusses the state of CRM software for his industry. &amp;nbsp;His argument is along the lines of “be wary of on-premise wolves in SaaS sheep clothing”. The wolves, in this case, are on-premise CRM vendors who simply throw servers over a wall and call it a SaaS day. &amp;nbsp;The sheep (aka the good guys) are those genuine SaaS vendors riding a multi-tenant architecture. He then proceeds to define what multi-tenancy means using the well-known “neighborhood versus apartment” analogy. &amp;nbsp;Sounds innocuous enough (and the piece is well-written) but here’s the problem I have.&lt;br /&gt;&lt;br /&gt;I am going to get a lot of heat for this statement, but truth be told, CRM is a hoax. There is no such thing as CRM. &amp;nbsp;It’s all smoke and mirrors, and you cannot nurture customer relationships (or any other form of relationship) using a bunch of bits. It’s a cop-out. I know of CRM market segments and sizes. I am well aware of CRM dissemination and popularity in corporate America. I don’t question its omnipresence in virtually all businesses in one form or another. My claim, however, is that it does not and cannot work as billed. &amp;nbsp;In numerous cases, it is simply a waste of “feel-good” money. Here’s why.&lt;br /&gt;&lt;br /&gt;The companies and industries making the most use of CRM packages are those with the worst customer service. This in itself should be eye-opening. Airlines are perfect examples. Can anyone think of an industry with worse customer service? I can’t. Yet they spend gazillions on CRM and boast about it frequently. How about retail? When’s the last time these guys used CRM to do anything besides manage their “rewards programs” (who are they rewarding please tell me?) or push advertizing gimmicks? How about Telecoms? How many people are happy with their cell carriers in the US? When was the last time your interaction with a carrier’s customer service department was either useful, efficient, or pleasant? How about the automobile industry or even car rental outfits? When was the last time you were wowed by one of those big corporate CRM consumers?&lt;br /&gt;&lt;br /&gt;Now, I’m not saying all companies who use CRM manage customers poorly. Some do get lucky. And others use it as a tool, not a means to an end, which is perfectly fine. &amp;nbsp;What I am pointing out is that customer service quality seems inversely proportional to the resources spent on CRM at the corporate level. And that’s ironic. The problem is in the C-suite. These folks see CRM as a panacea but inherently, deep down, these people do not have customer service “genes”. &amp;nbsp;They have Board of Directors genes, profit margin genes, or MBA ones, but they don’t know the first thing about worshipping customers – It’s a form of autism on their part.&lt;br /&gt;&lt;br /&gt;Because in my experience, customer service is not something you can learn on the fly (although I suspect it is taught in business schools). &amp;nbsp;And it’s not something you can acquire via software. Companies, like people, are either born with it or not. It’s a nature not a nurture trait. And no amount of software or resources will change the behavior of a company whose culture isn’t obsessively centered on the customer. It sounds obvious, and everyone talks about it, but most of the time, it’s nothing more than lip service. &amp;nbsp;It’s a sham I tell you.&lt;br /&gt;&lt;br /&gt;Luckily, the solution is simple: throw the software out the window. Really, honestly, you don’t need it. You can’t even prove you need it anyway. Most of the time, you don’t know how to interpret or filter the results. And worse yet, you have too much information. &amp;nbsp;You get overloaded with it in a drunken stupor and it ends up clouding your judgment. &amp;nbsp;And shortly thereafter, you throw common sense out the window. &amp;nbsp;Malcom Gladwell makes excellent anecdotal arguments against information overload in Blink, one of his classics. &amp;nbsp;It’s a fascinating read (specifically I refer to section 4 of chapter 4). &amp;nbsp;CRM and software overload have the same effect on corporate management as those described by Gladwell affecting ER doctors’ decision making. Too much information leads to disastrous results.&lt;br /&gt;&lt;br /&gt;I’m not suggesting the demise of customer or performance management software as a whole obviously. &amp;nbsp;Keep the databases, the warehouses and the BI in-house (because analysis and insight are still obviously crucial) but take your big honking CRM software and pull the plug. You will likely learn more (and spend a whole lot less) by talking to your five top customers. If you want to stay in control and get 360 customer views, sign up for something like &lt;a href="http://www.teamsupport.com/"&gt;TeamSupport &lt;/a&gt;– it’s all you need. Most importantly, do something truly revolutionary: get out of the building and walk a mile in your customers’ shoes.&lt;br /&gt;&lt;br /&gt;Order your own product, fly your own airline (in coach), rent your own cars, dial your customer service numbers repeatedly – from a roaming zone, for example – visit your retail stores (incognito) and talk to your customers. Order your own food, visit your own bathrooms, wear your own clothes. Engage customers one on one, on the streets, in the stores, at their home, face to face, and listen like you mean it. Listen like your life (and job) depended on it – because it does. &amp;nbsp;You cannot manage customer relationships from a spreadsheet or a SQL query any more than you can steer a marriage successfully via the web (yes I know, there’s probably an iPhone app for that). It’s that simple. &amp;nbsp;Your fancy CRM system is the blue pill. Take the red one!&lt;br /&gt;&lt;br /&gt;So now, having sufficiently ticked off everyone in the CRM industry, let me tackle the SaaS “badge of honor” also known as multi-tenancy. Here’s a reality check: no one really knows what multi-tenancy means. How do I know this? Because if you ask three people what it means you will get five different answers. &amp;nbsp;And when you search for it online, you get multiple descriptions, some more convincing than others. So unless I missed something, there is no official definition, only qualified opinions.&lt;br /&gt;&lt;br /&gt;To me, the gist of it involves partitioning software application space very quickly and cheaply to accommodate exponential mass market adoption cycles (there’s a mouthful). &amp;nbsp;Exactly how you achieve this, from a technical perspective, is open to interpretation. Some people partition inside the database and share schemas (like Salesforce.com, where every tenant shares database tables), others partition on the application layer where processes may serve multiple requests, some consider virtualization to be a form of multi-tenancy, and yet others implement a hybrid approach.&lt;br /&gt;&lt;br /&gt;At the end of the day, multi-tenancy is a design methodology. Multi-tenancy (however you implement it) allows faithful cloud landlords to follow basic rules. If Moses had come down from “The Cloud” instead of Mount Sinai, he might have brought these back instead:&lt;br /&gt;&lt;br /&gt;(1) You shall provision tenants quickly without affecting other parts of the system.&lt;br /&gt;(2) You shall be able to provide deterministic system metrics on a 24/7 basis.&lt;br /&gt;(3) You shall let non-technical staff (or software) handle provisioning.&lt;br /&gt;(4) You shall monitor your platform’s health day and night.&lt;br /&gt;(5) You shall implement and frequently simulate doomsday scenarios.&lt;br /&gt;(6) You shall make it as easy to provision one tenant as it is for one hundred.&lt;br /&gt;(7) You shall segregate and protect thy tenant’s data at all cost.&lt;br /&gt;(8) You shall upgrade transparently and automatically for all tenants.&lt;br /&gt;(9) You shall allow tenants some degree of customization but not more.&lt;br /&gt;(10) You shall have a simple, clear and consistent pricing model.&lt;br /&gt;&lt;br /&gt;In my opinion, what goes on behind the scenes to support this is irrelevant because it’s irrelevant to the user. The user just wants to be “on-boarded” and provisioned pronto and safely without hassle. If you happen to be able to pull this off technically and cost-effectively, then you’re multi-tenant! If not, you have some pain coming your way.&lt;br /&gt;&lt;br /&gt;Whether or not you can support thousands of tenants per instance (however you define “instance”) is, for all intents and purposes, really your problem, not the user’s. &amp;nbsp;And I have yet to meet a prospect who’s first and foremost concern involves “multi-tenancy”.&lt;br /&gt;&lt;br /&gt;This might suggest we cease using the term as a marketing asset because I don’t think it resonates clearly. In my experience, the only things that matter to a cloud user are simplicity, service levels, trust, and cost. So let’s give users exactly what matters to them, explain it simply, and keep it real.&lt;br /&gt;&lt;br /&gt;Yours in BI.&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial;"&gt;&lt;span class="Apple-style-span" style="font-family: monospace; white-space: pre-wrap;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;BRBR45DDTK3B&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-5145936484658054145?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/5145936484658054145/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/11/of-crm-golden-calves-and-ten.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5145936484658054145'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5145936484658054145'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/11/of-crm-golden-calves-and-ten.html' title='Of CRM Golden Calves and Ten Commandments of SaaS Landlordship.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6357002593609156293</id><published>2009-11-27T09:31:00.000-08:00</published><updated>2009-12-02T15:52:18.910-08:00</updated><title type='text'>S&amp;M: Where the SaaS rubber doesn’t meet the BI road.</title><content type='html'>&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;span style="font-size: 13px;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_dCMl0ieMmTw/SxAP1fpVAjI/AAAAAAAACwE/MXQaOcZtuSY/s1600/2009-11-18+15.58.47.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_dCMl0ieMmTw/SxAP1fpVAjI/AAAAAAAACwE/MXQaOcZtuSY/s320/2009-11-18+15.58.47.jpg" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;/span&gt;&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;br /&gt;I was lucky enough to be at Dreamforce 2009 last week and wanted to pen down a few thoughts while the event is still fresh in my mind. I don’t think there was any earth-shattering news there, and I got the feeling (both onsite and online) that a lot of people didn’t really grasp the value of Benioff’s announcement (or strategy) about “socializing” the platform with &lt;a href="http://blogs.zdnet.com/BTL/?p=27542"&gt;Chatte&lt;/a&gt;r. &amp;nbsp;I, for one, certainly couldn’t make sense of Colin Powell’s presence at one of the keynotes (not sure what he can possibly offer the world of SaaS but maybe I missed something). &amp;nbsp;But overall it was an enlightening conference and here are some of my impressions (and they pertain mostly to the SaaS BI realm).&lt;br /&gt;&lt;br /&gt;First, the sheer number of bodies at the event was impressive. I understand 18,000 people took part and that is quite a large crowd given how undersold (to put it mildly) other conferences have been this year. Businesses have been reluctant to invest in conferences in 2009 as evidenced by abysmal attendance numbers and the rising popularity of “ &lt;a href="http://mervadrian.wordpress.com/2009/10/08/kalido-virtual-conference-scores-big/"&gt;virtual conferencing&lt;/a&gt;”. &amp;nbsp;So if one conference was preferred over all others, it must have been Dreamforce 2009, because it seemed like everybody and his mother sent people there.&lt;br /&gt;&lt;br /&gt;Second, I was impressed by the level of “education” the typical attendee exhibited. I didn’t really see or hear people asking basic “big picture” questions. Rather, the inquiries were very focused, deep, and to the point, revealing mature customers (buyers) who had done some serious homework. Actually, most of these folks have had meaningful experience in the cloud (some good, some bad) and knew how to hit the right vendor pressure points. From my standpoint, it is always vastly better (and more enriching) to deal with well educated buyers in a no-nonsense approach. This is exactly the user profile I experienced at Dreamforce.com manning the GoodData booth.&lt;br /&gt;&lt;br /&gt;Third, and I realize this is subjective, but to be honest, there are a lot of small clueless companies out there having nothing to do with cloud per say who clutter these shows for the publicity of slapping “cloud” onto their marketing literature. I’m not going to name names, but let’s just say when you sell gardening shoes, mailboxes, or kitchen countertops, I’m not sure you should be spending marketing dollars on Dreamforce.&lt;br /&gt;&lt;br /&gt;Fourth, I didn’t pick up any “religious” fervor at the show from either buyer or vendor sides. I assumed this was going to be a major rah-rah for everything cloud (with Open Source type of fervor) but I found the discussions to be much more measured and rational with most people objectively comparing both approaches (when applicable) with few pre-conceived notions. I believe this is a sign of industry maturation as people are getting better at separating the wheat from the chaff. I feel for the most part that SaaS limitations are well understood by most (not all) people and expectations are becoming more reasonable for the most part.&lt;br /&gt;&lt;br /&gt;Fifth, I believe that SaaS beachheads have been claimed. &amp;nbsp;This is particularly true in the BI space. This has a lot to do with perception obviously but in my opinion, the winners and losers have already been tagged. &amp;nbsp;Most companies (if not all) are fairly new in the cloud space yet already, they have public reputations as in “these guys aren’t serious” or “these folks are the ones you want to talk to”. &amp;nbsp;Obviously first-to-market matters a great deal in any industry and cloud is no different. &amp;nbsp;Except with SaaS of course, you can be first to market with a virtual product (vaporware, in the on-premise world), and it takes longer for people to read the fine print but, eventually, they do. &amp;nbsp;When people have an interest in a particular SaaS domain, they go directly to the “top dogs” without stopping anywhere else. &amp;nbsp;I believe there is plenty of space for new companies in the cloud but for those having established an early lead (perceived or not, and with compelling technology), the future seems bright.&lt;br /&gt;&lt;br /&gt;Sixth, everybody in the BI cloud space faces the exact same problems. &amp;nbsp;And no one has clear answers so this is still a very “trial and error” process. &amp;nbsp;The difference is between vendors who admit this, and those who don’t (mostly to themselves). &amp;nbsp;This is a sweeping statement but overwhelmingly, when you talk to other vendors, the same themes come out time and again. &amp;nbsp;Namely, how to scale fast enough (or onboard efficiently with minimal customer “touch”), and how to control sales and marketing costs which are turning out to be higher than anticipated. &amp;nbsp; Although adoption is growing, in my opinion, the technical hurdles are not remotely as high as the business ones.&lt;br /&gt;&lt;br /&gt;Most of the BI cloud vendors have managed to get by on minimal engineering costs. &amp;nbsp;Offshore labor is cheap enough that you can afford competent engineering teams in India, Central Europe or China (to name a few) for literally dollars a day and minimal liability. &amp;nbsp;Some of these vendors are making ends meet with two-man engineering teams! &amp;nbsp;And only two I know of have engineering teams exceeding ten people. So clearly, the money pit is elsewhere.&lt;br /&gt;&lt;br /&gt;And elsewhere is S&amp;amp;M (no, not the fun kind) namely Sales and Marketing. The original proposition for cloud was that the “service”, unlike traditional enterprise software, was going to kind of sell itself. &amp;nbsp;S&amp;amp;M budgets were going to be minimal. No more travelling field sales force, expensive face-to-face customer visits, pre or post-sales engineers. &amp;nbsp;It was all going to be “automatic” and on the web. But my limited experience contradicts this.&lt;br /&gt;&lt;br /&gt;Because now, adoption and competition are growing. &amp;nbsp;For example nowadays in SaaS BI, you have dozens of vendors. You skim enough to get to the “serious” ones (see #5 above) and now you’re left with maybe four or five guys. &amp;nbsp;Next year, there will likely be ten serious contenders. &amp;nbsp;The more competition you have, the higher sales cycles and costs get. &amp;nbsp;Next thing you know, you’re back to boots on the ground and to a more traditional enterprise software sales models. This is the danger facing many SaaS players these days. &amp;nbsp;CEOs and investors are edgy about this emerging trend. It breaks the anticipated mold.&lt;br /&gt;&lt;br /&gt;The other problem is what I call customer “touch-too-much”. &amp;nbsp;In a SaaS model, efficient on-boarding is crucial. This is not only about flipping the proverbial switch – because properly-engineered multi-tenant systems achieve this quite well – but more about the time it takes to get a user’s business problem solved. &amp;nbsp;Namely, the costly interaction spent on a given customer to handle specific needs and the amount of customization needed to achieve satisfaction (and final sign-off on the purchase order). POCs, sales cycles and marketing costs are growing.&lt;br /&gt;&lt;br /&gt;This is a huge problem in the BI space because the requirements phase can be long and even there agility is not necessarily a Holy Grail. &amp;nbsp;The basic problem is simple: it is very difficult to remove the “human factor” when implementing BI. &amp;nbsp;Cookie cutter never satisfies a particular business problem entirely. The money’s in the customization and the subject matter expertise. &amp;nbsp;You must solve difficult business problems, not engineering ones. &amp;nbsp;The same challenges apply to software engineering, and history has seen a flurry of “blissful automation” endeavors fail in that space (remember 4GL?). &amp;nbsp;At the end of the day, you can’t remove what’s between the chair and the keyboard, and you can’t efficiently and consistently automate it in software – whether in the cloud or not.&lt;br /&gt;&lt;br /&gt;Now, this is not so bad for a company like Salesforce.com because they’re a platform play. &amp;nbsp;So by definition, they provide efficient functionality (large offering surface), but it’s all fairly mediocre and “cookie cutter” – Users are free to (try and) customize their modules as they see fit. &amp;nbsp;In the analytics space, for example, Salesforce reporting and analysis is shallow. Consequently, users looking for customer data insight and trending statistics (say for pipeline analysis) will look for integrated solutions fitting their specific business needs.&lt;br /&gt;&lt;br /&gt;But for the after-market players who “plug” into something like Salesforce, it’s a major hurdle. Unless they can very quickly and cheaply customize their offerings for a myriad of different business cases, and move through POCs quickly, the SaaS hosting and costing model won’t do them much good. &amp;nbsp;In my opinion, most existing BI SaaS vendors are currently struggling with this conundrum. Lucidera seems to have as well. Its demise was a shot across the bow. &amp;nbsp;The first SaaS BI player to move past this problem wins the game and keeps the investors happy.&lt;br /&gt;&lt;br /&gt;I have a hard time thinking of a SaaS BI vendor currently striking a reasonable balance between zero S&amp;amp;M and massive S&amp;amp;M. &amp;nbsp;On one end of the scale, I see folks adamantly opposed to spending a dime on marketing and expecting serendipitous results. On the other end, I see vendors placing heavy bets on misguided or highly-targeted verticals. &amp;nbsp;I see both strategies as precarious and hope for more level-headedness in the coming months.&lt;br /&gt;&lt;br /&gt;No matter which way this goes, we're in for a wild ride. No prisoners will be taken. It's a great time to be in this business.&lt;br /&gt;&lt;br /&gt;Yours in BI.               &lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6357002593609156293?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6357002593609156293/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/11/s-where-saas-rubber-doesnt-meet-bi-road.html#comment-form' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6357002593609156293'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6357002593609156293'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/11/s-where-saas-rubber-doesnt-meet-bi-road.html' title='S&amp;M: Where the SaaS rubber doesn’t meet the BI road.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_dCMl0ieMmTw/SxAP1fpVAjI/AAAAAAAACwE/MXQaOcZtuSY/s72-c/2009-11-18+15.58.47.jpg' height='72' width='72'/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-608555457326975243</id><published>2009-10-27T12:04:00.000-07:00</published><updated>2009-10-27T12:04:32.713-07:00</updated><title type='text'>Of Birthrights and DNA in the BI Cloud World</title><content type='html'>&lt;span class="Apple-style-span" style="font-family: Verdana, sans-serif;"&gt;I have been so busy getting ramped up on &lt;a href="http://www.gooddata.com/"&gt;GoodData&lt;/a&gt; that I haven’t had a minute to post here in over a week. &amp;nbsp;On Friday, I am headed out to our R&amp;amp;D Center in &lt;a href="http://www.easyprague.cz/eecera2007/download/images/prague-bridges.jpg"&gt;Prague&lt;/a&gt;, Czech Republic - looks horrible doesn't it? :) -&lt;g&gt;&amp;nbsp;for a week to do a brain-meld with our engineering team and deep-dive the GoodData architecture. I’ve learned a lot about it in the past two weeks but clearly, there’s nothing like sitting down with the hard-core developers and discovering how the sausage has really been made for the past two years.&lt;/g&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;One of the unique things about this platform is the fact it was designed for the cloud from the get-go. “Born and bred in the cloud” as they say. &amp;nbsp;And that was several years ago, before the cloud “explosion”, when the idea of running a full BI stack in the cloud was just a fantasy to most people. &amp;nbsp;They said it couldn’t be done (matter of fact, some folks still claim that, but reality proved them wrong in 2009) but &lt;a href="http://roman.stanek.org/"&gt;Roman Stanek&lt;/a&gt; and his guys did it and I think the results speak for themselves (here's &lt;a href="http://www.gooddata.com/pr/black-duck-software-selects-good-data-for-customer-analytics/"&gt;one&lt;/a&gt; of many examples).&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;I think the “born and bred in the cloud” stamp is important because there is so much hype and hyperbole about all things “cloud” these days. Especially in the BI space. &amp;nbsp;A lot of times when you hear or read things about clouds, it tends to feel like &lt;a href="http://emap.wordpress.com/2009/10/25/cloud-computing-new-business-opportunities/"&gt;this&lt;/a&gt;&amp;nbsp;(with apologies to my Thailand readers who obviously understand that script).&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;To separate the wheat from the chaff, you really have to ask the right questions. Namely, is your architecture really cloud-based or did you simply throw bits over the wall onto a bunch of hosted boxes (or VMs)? &amp;nbsp;Do you understand the meaning and implications of real multi-tenancy? Are you simply shifting technology because it’s hip (marketing-wise), or do you truly possess cloud DNA? And on the non-technical side, do the economics make sense?&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;In the mid-1990s, ASPs (application service providers) were all the rage. Everyone wanted to “go ASP”. &amp;nbsp;I worked for some of them. But the “software” crawled, got in the users’ way, was hell to maintain and deploy, and the costing model didn’t work. &amp;nbsp;Many (most) went belly-up. But hey, at the time, it sounded good and the VCs wrote checks. &amp;nbsp;But already back then, painful lessons were learned. &amp;nbsp;Namely that it’s virtually impossible to change horses in the middle of a race.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;The inherent value of “born and bred in the cloud” platforms is not simply technical to me. &amp;nbsp;Clearly the engineering is intricate and “cool”, but what’s really at stake is what Roman calls “price-based costing”. &amp;nbsp;(Speaking of pricing and clouds, check out Roman’s latest &lt;a href="http://roman.stanek.org/2009/10/27/will-moores-law-find-its-way-to-the-cloud/"&gt;blog post&lt;/a&gt;, it’s à propos!). &amp;nbsp;This is a Peter Drucker concept but it applies really nicely to cloud-based economics. &amp;nbsp; &amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;Price-based costing is defined as “choosing a desired sales price and costing out production to meet that sales price with a desired profit margin”. &amp;nbsp;The idea is to first determine what you want to charge for a product or service, and then work the development economics backwards to achieve that price point. When you’re starting from scratch with a cloud-based technology, you can pull that off, because parameters, cost and elasticity are linearly deterministic. &amp;nbsp;The up-front costs are fairly low and the flexibility is fantastic. &amp;nbsp;Additionally, the tools are cheap, commonly available and simple. &amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;Consider that the GoodData backend is almost all coded in Perl on Linux with a REST interface! &amp;nbsp;(I’m not suggesting Perl or Linux are “simplistic” but I worked the MSFT development stack all my life, and I can see orders of magnitude in complexity reduction here). &amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So now, if you have legacy bits and a “ground-based” engineering platform (and people), you’re stuck on a price structure in this bearish economy, and you can’t just flip to a cloud model as a panacea, no matter what your Marketing people tell you. &amp;nbsp;Whether you sell proprietary or open source software, you’re in the same predicament. Sure, you can provide less functionality or cripple your offering (or worse yet, try to make do with less people), but just throwing it inside a “cloud” won’t do it economically, and the outcome will likely be painful. The DNA just isn’t there.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So when some BI vendors simply slap some code (open source or not) onto AMIs (say like Mondrian for example) and call it a BI on-demand offering with a ROLAP engine, it makes me wonder if they “get” the concept at all. &amp;nbsp;Mind you I ponder the same thing about big league BI tooling players who simply re-package their (oh so heavy) bits into virtual boxes and call it a cloud day as well.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;With a genuine “born and bred” cloud model, you have fine grained leeway in adjusting the pricing dial. Guess what, you can even turn it down all the way to zero and still survive for a long time at given data volumes. &amp;nbsp;At GoodData, for example, you can get 10MB accounts for free. &amp;nbsp;Not a huge amount, certainly, yet sufficient to “play with”. &amp;nbsp;But no matter how many people sign up for that, GoodData’s cloud platform can handle it both volume wise and cost wise at ridiculous scale and at the flip of a switch. &amp;nbsp;Scalability in a “born and bred” cloud platform is very (did I say very?) cheap. &amp;nbsp;And pricing can be finely tuned by throttling the technology, not the other way around. And this isn’t about taking a loss for marketing purposes. GoodData, like any other sound business (except OSS maybe), isn’t about giving stuff away for free, believe you me.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So I guess my point would be: caveat emptor. &amp;nbsp;The next time you’re considering replacing that big honking on-premise BI tool with the vendor’s “new improved” cloud-based on-demand version, check under the hood and make sure that “born and bred” DNA is there (hint: not likely). &amp;nbsp;And the next time you’re considering a “native” SaaS cloud vendor for your BI analytics, ask the right questions about what makes their backend tick. &amp;nbsp;You might be in for an interesting smoke and mirrors show.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-608555457326975243?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/608555457326975243/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/of-birthrights-and-dna-in-bi-cloud.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/608555457326975243'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/608555457326975243'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/of-birthrights-and-dna-in-bi-cloud.html' title='Of Birthrights and DNA in the BI Cloud World'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6637883920518479792</id><published>2009-10-19T10:46:00.000-07:00</published><updated>2009-10-19T10:51:25.004-07:00</updated><title type='text'>On Bringing the Good News While Clearing Out Fungus.</title><content type='html'>&lt;span class="Apple-style-span" style="font-family: Verdana, sans-serif;"&gt;I wanted to spend a little time sharing some good news with you. &amp;nbsp;I am going to be Technical Evangelist for a relatively new company called &lt;a href="http://www.gooddata.com/"&gt;GoodData&lt;/a&gt;. &amp;nbsp;As evangelism means "bringing the Good News" to the world, this is truly a match made in Heaven for yours truly, as I will now be bringing the GoodData news to the world.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;Many of you will have heard about GoodData before of course. &amp;nbsp;Most people in our industry know it was founded by serial entrepreneur &lt;a href="http://roman.stanek.org/"&gt;Roman Stanek&lt;/a&gt; of NetBeans and Systinet fame. &amp;nbsp;Sun acquired NetBeans circa 1999 for $10M and HP grabbed Systinet in 2006 for a mere bag of shells ($100M of them actually).&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;GoodData is backed by heavy-hitting investors including &lt;a href="http://bit.ly/7oDNP"&gt;Andreessen Horowitz&lt;/a&gt; &amp;nbsp;(Marc was also an initial angel investor) but also &lt;a href="http://oatv.com/"&gt;O'Reilly AlphaTech Ventures&lt;/a&gt; &amp;nbsp;and &lt;a href="http://www.generalcatalyst.com/"&gt;General Catalyst&lt;/a&gt;. &amp;nbsp;It's always nice when you can point to Marc Andreessen pitching your company on &lt;a href="http://bit.ly/Ng7y9"&gt;CNNMoney&lt;/a&gt;. &amp;nbsp;GoodData also cleared another &lt;a href="http://bit.ly/XFhMh"&gt;funding &lt;/a&gt;round last week as anyone following our industry will have heard. &amp;nbsp;This kind of news does not go unnoticed these days.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So what is GoodData about? &amp;nbsp;GoodData is an on-demand business intelligence platform for collaborative analytics (phew!). &amp;nbsp;This means the software runs in the cloud (Amazon's EC2 to be precise), so clearly the model benefits from all the usual technical, deployment, and financial benefits SaaS brings to the table. &amp;nbsp;But more importantly, and what really sold me on the concept, is the following equation (and those who follow me know my affinity for all things KISS):&lt;/span&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;b&gt;gd = BI - BS&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;In English, this says GoodData is business intelligence without all the "bullshiitake" (term borrowed from my hero &lt;a href="http://www.guykawasaki.com/"&gt;Guy Kawasaki&lt;/a&gt;). &amp;nbsp;So let's talk a little bit about that breed of mushroom.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;When you need to implement a BI project but depend on IT to setup infrastructure and purchase software before you can even get started, that's bullshiitake.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;When you need a PhD or significant training to start using a complicated overkill BI tool, that's bullshiitake.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;When it takes you weeks or even months to analyze time-critical data for your company and show results to an impatient boss, that's bullshiitake.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;When you can't collaborate with your peers and customers (internal or external) or integrate with outside apps while doing agile BI, that's bullshiitake.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;GoodData represents a new way of thinking and acting about BI by removing these "mushrooms". &amp;nbsp;In that sense, GoodData is really re-inventing how BI consumers and producers interact, behave, grow, and produce results together. &amp;nbsp; And that, to me, goes way beyond a pure technology play. &amp;nbsp;It is a “next curve” move and an industry-shifting vision I wanted to help forge.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So in the coming weeks and months, I am going to be talking about the technical and social aspects of this industry-shifting platform. You'll be able to reach me at &lt;a href="mailto:jerome.pineau@gooddata.com"&gt;my new work email&lt;/a&gt; and I will be twitting on the &lt;a href="http://www.twitter.com/gooddata"&gt;@gooddata&lt;/a&gt; stream as well.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;My pitch to BI professionals will be simple. If you seek independence, control of your environment, quick and deep results, and the flexibility to enhance your company’s bottom line, then you need to try out GoodData. We represent a fundamental new way of thinking about BI. &amp;nbsp;My role will be to show you how and why.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6637883920518479792?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6637883920518479792/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/on-bringing-good-news-while-clearning.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6637883920518479792'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6637883920518479792'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/on-bringing-good-news-while-clearning.html' title='On Bringing the Good News While Clearing Out Fungus.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1297171588403093053</id><published>2009-10-11T15:48:00.000-07:00</published><updated>2009-10-11T15:48:19.624-07:00</updated><title type='text'>BI in a New York Minute is Born</title><content type='html'>&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_dCMl0ieMmTw/StJeafXoRGI/AAAAAAAACvQ/L2pSBPiCqAE/s1600-h/1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_dCMl0ieMmTw/StJeafXoRGI/AAAAAAAACvQ/L2pSBPiCqAE/s320/1.png" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;In my last post, I mentioned I was going to do a "trial &amp;amp; error" run on a recent concept of providing weekly BI tidbits (news items) for people on the go. &amp;nbsp;Having had some time to reflect on this endeavor a little further over the weekend, and after consulting with some key people, I've decided to try another tack. &amp;nbsp;Fact is, I prefer keeping my main blog for "deeper" less frequent posts.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So instead, I've decided to create another way to dessiminate this information. Additionally, I'm giving up on the idea of posting items on a weekly basis. Why cover weekly periods when in fact, BI news is&amp;nbsp;occurring&amp;nbsp;pretty much 24/7 in real time nowadays including weekends. &amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;To deliver better, I have created a new blog called "&lt;a href="http://biminute.blogspot.com/"&gt;BI News in a New York Minute&lt;/a&gt;" where I simply post tidbits in real time. &amp;nbsp;When I get them, you get them. &amp;nbsp;People can subscribe to this stream via the blog (obviously) or by pulling feeds from http://biminute.blogspot.com/atom.xml. &amp;nbsp;Additionally, I've linked BI News in a New York Minute to Twitter so updates are also broadcast there when available (almost, but not annoyingly, in real time). &amp;nbsp;So if you aren't following me in twitland yet, point &lt;a href="http://www.twitter.com/JeromePineau"&gt;here&lt;/a&gt; and click "follow" :)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;This is just one modality of a more general media concept I am working on for the Business Intelligence industry. &amp;nbsp;In the meantime, I am hopeful (and grateful if) you will give me feedback and suggestions!&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;Yours in BI.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;J.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-1297171588403093053?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/1297171588403093053/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/bi-in-new-york-minute-is-born.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1297171588403093053'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1297171588403093053'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/bi-in-new-york-minute-is-born.html' title='BI in a New York Minute is Born'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_dCMl0ieMmTw/StJeafXoRGI/AAAAAAAACvQ/L2pSBPiCqAE/s72-c/1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-7198295840563576644</id><published>2009-10-09T16:13:00.000-07:00</published><updated>2009-10-09T16:25:08.612-07:00</updated><title type='text'>Your Weekly BI News in a “New York Minute”</title><content type='html'>I’m a glutton for BI information to the point of addiction. Aday rarely goes by without my “scanning” three hundred or so feeds, blogs, twits,websites and other news and information sources pertaining to businessintelligence. &amp;nbsp;I did the same thing whenI was on the software development side, mind you, but using significantly less onlinesources and way more books. I find BI to be so dynamic an industry that by thetime most books are published, the information is already obsolete.&amp;nbsp; And that’s just on the technical side. &amp;nbsp;On the business front, things change andevolve even faster.&lt;br /&gt;&lt;br /&gt;So I started looking around for an aggregated “digest” ofthe day’s salient BI news points. I was looking for something that spanned themultitude of industry areas in BI from engineering and product news (platformto front-end), to business development, partnerships, personnel (who quit,joined or got fired), distribution (SaaS and cloud issues), venture funding, importantconferences, and so on.&amp;nbsp; In a nutshell,all the areas that affect our industry one way or another. To my surprise, I couldn’tfind anything readily available.&amp;nbsp; Somethingsomeone could take on a train or pull down to a mobile device during a dailycommute to get a quick “scoop” on the week’s happenings in BI.&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Then I figured, why not try my hand at compiling such a list ona weekly basis. &amp;nbsp;I know this is fairlysubjective, but while scanning my own sources, I always “pull over” severalitems for more in-depth examination. I do this very quickly (I’m a very fastreader) and often “bucketize” these links using Delicious.com for furtherexamination and/or analysis.&amp;nbsp; Clearlypeople have different areas of interest but I figured this pruned list couldbenefit some folks with no more than a “New York Minute” to spare. &lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;I’m not sure how best to format and present this yet. I didn’twant to start off with some sort of mailing list offering because, quitehonestly, I’ve never stuck with a mailing list subscription more than a coupleof weeks myself. &amp;nbsp;So for now I’m going totry and post this on my blog on a weekly basis every Friday.&amp;nbsp; If I detect minimal interest, then I’ll try anotherapproach. &amp;nbsp;I'll also include more items in future weeks. &amp;nbsp;This is more of a "trial and error" endeavor at this point. &amp;nbsp;Meanwhile, and without furtherado (or order), here’s my BI Digest for the week of October 5&lt;sup&gt;th&lt;/sup&gt;,2009 "in a New York Minute".&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Discrete SaaS player &lt;/b&gt;&lt;a href="http://www.1010data.com/"&gt;&lt;b&gt;1010Data&lt;/b&gt;&lt;/a&gt;&lt;b&gt; scores a big &lt;/b&gt;&lt;a href="http://bit.ly/1aAx8R"&gt;&lt;b&gt;contract&lt;/b&gt;&lt;/a&gt;&lt;b&gt; in the retails space by signing up Dollar General.&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;MicroStrategy is far from Open Source but &lt;/b&gt;&lt;a href="http://bit.ly/4pyDui"&gt;&lt;b&gt;giving away&lt;/b&gt;&lt;/a&gt;&lt;b&gt; software for free nevertheless now. Trojan horse or good value?&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;Awesome &lt;/b&gt;&lt;a href="http://w.on24.com/r.htm?e=161142&amp;amp;s=1&amp;amp;k=E6ECB5B97A75E74DCCD5563EED7F9AFE"&gt;&lt;b&gt;webinar&lt;/b&gt;&lt;/a&gt;&lt;b&gt; on what was learned in the past 20 years in BI compliments of &lt;/b&gt;&lt;a href="http://www.intelsols.com/aboutus.cfm/Who%20We%20Are"&gt;&lt;b&gt;Claudia Imhoff&lt;/b&gt;&lt;/a&gt;&lt;b&gt; and &lt;/b&gt;&lt;b&gt;&lt;a href="http://www.wherescape.com/"&gt;WhereScape&lt;/a&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;a href="http://www.wherescape.com/"&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;"But can it core a apple"? On integrating OLTP and OLAP in the same database engine. One of the most commented (and educational, for me) Curt Monash &lt;/b&gt;&lt;a href="http://bit.ly/1R3I6g"&gt;&lt;b&gt;posts&lt;/b&gt;&lt;/a&gt;&lt;b&gt; I’ve seen.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;First of a two-part in-depth &lt;/b&gt;&lt;a href="http://bit.ly/7gzq5"&gt;&lt;b&gt;series&lt;/b&gt;&lt;/a&gt;&lt;b&gt; on MapReduce vs. relational databases.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;You might not be paying your DBA enough money to put up with &lt;/b&gt;&lt;a href="http://bit.ly/9QskU"&gt;&lt;b&gt;this&lt;/b&gt;&lt;/a&gt;&lt;b&gt; (alternatively, you might be using the wrong DBMS platform &lt;g&gt;).&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;BI and DW “implementors“ take heed. You may need more &lt;/g&gt;&lt;/b&gt;&lt;a href="http://bit.ly/PMLXc"&gt;&lt;b&gt;duct&lt;/b&gt;&lt;/a&gt;&lt;b&gt; tape after all (this doesn’t just apply to the software development community, believe me).&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;For a deep (and I mean deep) &lt;/b&gt;&lt;a href="http://bit.ly/QQeg2"&gt;&lt;b&gt;dive&lt;/b&gt;&lt;/a&gt;&lt;b&gt; into what customer analytics in retail are all about.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;A really novel (read: smart) &lt;/b&gt;&lt;a href="http://bit.ly/AFgmx"&gt;&lt;b&gt;way&lt;/b&gt;&lt;/a&gt;&lt;b&gt; to engage your market without the BS, hassle and expenses of conventional physical conferences as described by Merv Adrian who tried it.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;a href="http://www.blogger.com/goog_1255129840570"&gt;Dashboard&lt;/a&gt;&lt;/b&gt;&lt;b&gt;&lt;a href="http://bit.ly/5rSco"&gt;s&lt;/a&gt; in Excel 2010 (yawn). Sorry it’s been hard to get excited about MSFT BI in, oh say about 18 months now.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;Can (or should) InfoBright actually kick MonetDB’s rear-end? The &lt;/b&gt;&lt;a href="http://bit.ly/4p7W5h"&gt;&lt;b&gt;numbers&lt;/b&gt;&lt;/a&gt;&lt;b&gt; are (almost) in.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;In-memory &lt;/b&gt;&lt;a href="http://bit.ly/1j9nf6"&gt;&lt;b&gt;databases&lt;/b&gt;&lt;/a&gt;&lt;b&gt; and the &lt;/b&gt;&lt;a href="http://www.kognitio.com/"&gt;&lt;b&gt;Kognitio&lt;/b&gt;&lt;/a&gt;&lt;b&gt; men who love them.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;Well, looks like nobody is &lt;/b&gt;&lt;a href="http://bit.ly/2uxXpH"&gt;&lt;b&gt;using EC2&lt;/b&gt;&lt;/a&gt;&lt;b&gt; after all (NOT!)&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;Larry &lt;/b&gt;&lt;a href="http://bit.ly/bwVLW"&gt;&lt;b&gt;invites&lt;/b&gt;&lt;/a&gt;&lt;b&gt; Marc to Oracle Open World next week. See rumors fly...&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;At least one person is &lt;/b&gt;&lt;a href="http://bit.ly/4zKriN"&gt;&lt;b&gt;excited&lt;/b&gt;&lt;/a&gt;&lt;b&gt; about Gemini (but it’s a monkey so..).&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;g&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;If you live and breathe “fabric” and grok &lt;/b&gt;&lt;b&gt;&lt;a href="http://www.blogger.com/goog_1255129840575"&gt;fiber over Etherne&lt;/a&gt;&lt;/b&gt;&lt;b&gt;&lt;a href="http://bit.ly/bDp8z"&gt;t&lt;/a&gt;, this one’s for you.&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/g&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;a href="http://www.wherescape.com/"&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;And that's the way BI is. &amp;nbsp;Goodnight, and enjoy your weekend!&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-7198295840563576644?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/7198295840563576644/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/your-weekly-bi-news-in-new-york-minute.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7198295840563576644'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7198295840563576644'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/10/your-weekly-bi-news-in-new-york-minute.html' title='Your Weekly BI News in a “New York Minute”'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-329975621909731189</id><published>2009-09-30T10:43:00.000-07:00</published><updated>2009-09-30T11:01:56.323-07:00</updated><title type='text'>Baking MapReduce into Database Engines - Worth the Reduction Sauce?</title><content type='html'>&lt;span style="font-family: Verdana, sans-serif;"&gt;MapReduce (implementations include Hadoop and CloudDB) has gained popularity in the industry. It also serves as marketing fodder for several new-breed ADBMS vendors who now claim to support it in various forms. So what is really behind this magic pixel dust, what problems does it solve, and how relevant is it to someone deciding on a new (or additional) ADBMS platform these days? &lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;First, let’s point out MapReduce is not a technology but an algorithm. Wikipedia defines an algorithm as “an effective method for solving a problem using a finite sequence of instructions.” In MapReduce’s case, the problem being solved is the processing and analysis of very large data sets. The solution is a parallelized divide-and-conquer approach and works like this. First, you split up the “problem” into small manageable chunks. Second, you fan out each chunk in parallel to individual “work units” (maps). Third, you take individual results from each unit and recombine them into your final result (reducers). In SQL parlance, conceptually, it’s like doing a select aggregate with a group by. &lt;br /&gt;&lt;br /&gt;There are file based and database-centric applications of MapReduce in existence. Of course, the presumption is that your “problem space” can be split up in distinct pieces and recombined without information loss. And not all problems are either large enough or mathematically suited to this approach. But luckily data management is, by definition, perfectly well suited because, as a mathematician once told me “every data management task can be broken down into two and only two activities: partitioning and equivalence”. &lt;br /&gt;&lt;br /&gt;Some folks think MapReduce is a modern breakthrough concept, but they’re wrong. The application of this algorithm to the management of large data is nothing new, as pointed out by Dr. Stonebraker in a 2008 &lt;a href="http://databasecolumn.vertica.com/2008/01/mapreduce_a_major_step_back.html"&gt;posting&lt;/a&gt;. What’s “new” about MapReduce is that Google has popularized it. And the thought is, if Google can process and analyze the entire world via MapReduce, then clearly MapReduce must be the Holy Grail of monster data management. But Google has unique challenges (gargantuan data volumes), and some very impressive (and plentiful) gray matter at its disposal.&lt;br /&gt;&lt;br /&gt;Because the interesting thing about this “divide and conquer” approach is that, although fairly easy to conceptualize, it’s incredibly hard to implement properly. The human brain is really not “wired” to think in parallel. Research has shown that the top brains can at best juggle seven different objects simultaneously (and for short time periods). To understand the intellectual challenges at play here, I strongly recommend watching this Google &lt;a href="http://www.youtube.com/watch?v=yjPBkvYh-ss&amp;amp;feature=channel"&gt;video series&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;As I understand it, implementing MapReduce correctly and efficiently is probably as hard as conquering multi-threaded programming. And in twenty years, I have met three people who really understood multi-threading correctly and two of them were Russian PhDs. I've had battle-tested architects tell me they would rather shave with broken glass than tackle the risk and difficulty of multi-threading (luckily, they weren’t designing operating or flight-control systems!). My point is, it takes some pretty special skill and talent to do it right. Nothing inherently wrong with that, but it’s neither quick nor cheap.&lt;br /&gt;&lt;br /&gt;So why then would database vendors race to support MapReduce? After all, dealing with and managing relational systems is complicated enough as is. But at least people have been trained in the art for decades, and SQL is lingua franca. So the pitfalls and solutions are well established. Additionally, Codd’s premise guaranteed abstraction by separating the logical layer (SQL and a normalized schema) from the physical one (hardware and storage). But MR is heavy with non-standard cross-layer implementation details by necessity. Clearly a step backward from the KISS principle (even a “major step backwards” if you buy into Dr. Stonebraker’s argument that MapReduce is offensive).&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;Regardless, three well-known new-breeders, namely Aster Data, Vertica and Greenplum jumped on the bandwagon early on and announced “MapReduce implementations” for their product. I wondered what compelled them to invest time and resources into something that didn’t seem essential (or cheap) to the market at large. Are users really clamoring for MapReduce support in their warehouse engines?&lt;br /&gt;&lt;br /&gt;To learn more, I went to YouTube and checked out Aster’s video “&lt;a href="http://www.youtube.com/watch?v=2zuVT3kzoxA"&gt;In Database MapReduce Applications&lt;/a&gt;”. In it, I learned that graph theory problems (think: travelling salesman) were well suited to MapReduce but not SQL. Examples included social networking (LinkedIn), Government (intelligence), Telecom (routing statistic), and retail (CRM, affinities), and finance (risk, fraud). Pretty much anything that can be modeled using interconnected nodes. But a connection from a node to another is really a “relation”, and so clearly well suited to a “relational engine”. So I might have missed something.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;I also learned that existing applications typically extracted data from the database, performed some analytic work on it, and then pushed the data back into the store. In other words, they couldn’t perform processing inside the database. I found that generalization hard to swallow but reminiscent of numerous past battles on whether “business logic” belongs in the application or database layer.&lt;br /&gt;&lt;br /&gt;Aster’s implementation of MapReduce is “deep inside” their engine, from what I understand. One example I could find was yet another YouTube video called “&lt;a href="http://www.youtube.com/watch?v=-ydLymJwEeQ"&gt;In-Database MapReduce Example: Sessionize&lt;/a&gt;”. In it, Shawn Kung shows a MapReduce function being used inside a SQL statement to “sessionize” user IDs in a clickstream context. Aster also provides very basic how-to’s on their &lt;a href="http://www.asterdata.com/mapreduce/writing.php#1"&gt;website&lt;/a&gt; and &lt;a href="http://bit.ly/PWGan"&gt;blog&lt;/a&gt;. Clearly Aster is targeting this new MapReduce capability at the DBA side of their users, and it looks a lot like leveraging UDFs to me. Aster’s conclusion: “we need to think beyond conventional databases.” I’m all for that!&lt;br /&gt;&lt;br /&gt;Next, I wanted to learn about Vertica’s implementation. Especially since Vertica’s own Dr. Stonebraker had initially nailed MapReduce pretty hard as mentioned above. But Vertica’s new position seems to be that MapReduce is a-ok after all, provided it remains external and doesn't pollute the purity of the relational engine. I couldn't find much on YouTube or their website save for a press release dated 8/4/09 stating “With version 3.5, Vertica also introduces native support for MapReduce via connectivity to the standard Hadoop framework”. It seems the “scoop” on the Vertica/MapReduce wedding is best described in their corporate &lt;a href="http://databasecolumn.vertica.com/2009/09/the_scoop_on_hadoop_and_vertic.html"&gt;blog&lt;/a&gt;. Basically Vertica is OK with "integrating" or connecting to but not "ingesting" MapReduce (via Hadoop) if I understand clearly.&lt;br /&gt;&lt;br /&gt;I was also able to glean some tidbits from Omer Trajman on Twitter. Namely that Vertica supports Hadoop “adapters” which allow you to read and write into the database (which is basically the press release). I wish I had more in-depth information about Vertica’s MR functionality but even a basic search for the term on their overly busy website yields zero information and, unless I missed it, I couldn’t find any relevant webcasts either. &lt;br /&gt;&lt;br /&gt;Greenplum were, if I am not mistaken, first to support MapReduce. Greenplum has the best MR resource &lt;a href="http://www.greenplum.com/video/mapreduce-demos.html"&gt;online&lt;/a&gt; if you ask me. It’s clear, detailed and full of insight. Greenplum has a merged/cooperative DBA/programmer approach in their offering. Programmers can write maps and reducers in their language of choice, leveraging DBA generated data sets as (and if) needed, and DBAs can use MR functions along with SQL without (presumably) getting their hands dirty. There isn’t much to add to this excellent resource so I won’t. &lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;So having mapped out all these facts, what can we reduce from it (I’m so funny) and more importantly, should any of this stuff matter to prospects when evaluating ADBMS vendors? IMHO, you might benefit from a MR-enabled ADBMS if: &lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family: Verdana, sans-serif;"&gt;(1) You have petabytes (or more) of data, an MPP architecture, and a search, scientific research, or mining problem a high-performance SQL engine cannot handle.&lt;br /&gt;&lt;br /&gt;(2) You don’t have heavy legacy systems. Integrating (or migrating) existing business and relational code with a new-breed MR-enabled engine can’t be fun, quick or cheap. You might be one of the lucky few with pet projects on the table.&lt;br /&gt;&lt;br /&gt;(3) You’re in Academia and have access to numerous cheap and competent programming resources, lots of metal, plenty of time, and limited pressure to succeed.&lt;br /&gt;&lt;br /&gt;(4) Your organization has a track record of successful projects dependant on symbiotic working relationships between your DBAs and your programmers. In my experience, DBAs and programmers don’t work well together. They have different goals and approaches. And it seems intellectual and political integration of both resources would be a sine qua non condition to success with an MR database product.&lt;br /&gt;&lt;br /&gt;Short of that, I can’t imagine too many people lining up at the MR-ADBMS vendors’ doors simply based on their MapReduce capabilities. And I don’t think vendors make that case either. In my opinion, supporting MR in the product simply says “Hey, look at me, I’m at the forefront of technology. See how smart I am.” But as a buyer, I’d be a little concerned about overreach. &lt;br /&gt;&lt;br /&gt;In fact, I wonder how these vendors spread resources efficiently (and economically!) between database engine building, cloud provisioning (which Aster and Vertica now pitch), and MapReduce integration. I suppose marketing requires less focus than engineering as a discipline but still, that’s a lot on one’s plate.&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-329975621909731189?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/329975621909731189/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/baking-mapreduce-into-database-engines.html#comment-form' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/329975621909731189'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/329975621909731189'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/baking-mapreduce-into-database-engines.html' title='Baking MapReduce into Database Engines - Worth the Reduction Sauce?'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-8997232504736061613</id><published>2009-09-25T13:32:00.000-07:00</published><updated>2009-09-25T13:32:11.384-07:00</updated><title type='text'>SELECT SUM(blessings) FROM working_in_bi</title><content type='html'>&lt;span style="font-family: Arial; font-size: small;"&gt;&lt;span class="Apple-style-span" style="font-size: 13px;"&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;If you’re working in the business intelligence industry, you should really count your blessings.&amp;nbsp; Mind you, you can do that if you’re employed anywhere these days.&amp;nbsp; But there’s something special about what I call the “BI Family”.&amp;nbsp; I’m not referring to a specific segment of BI, and I am not focusing on any particular job function. I’m talking about working in the BI industry as a whole.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;I feel somewhat qualified to comment on this because, in the past twenty years, I’ve worked in numerous industries.&amp;nbsp; To name a few: life sciences, research, semi-conductor, telecom, payroll, accounting, systems integration, consulting, audio-visual, online media, financials, accounting, and insurance.&amp;nbsp; So I have a lot of background to compare from.&amp;nbsp; And believe me, there are worse adoption outcomes than membership in the “BI Family”. &amp;nbsp;Here’s my subjective top ten list of why working in this industry is really cool (no specific order).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#1. &lt;b&gt;Brains&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;People are really smart. I’m not suggesting other industries spawn dummies, but the proportion of high IQs in the BI world always amazes me. &amp;nbsp;It’s often humbling and always stimulating.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#2. &lt;b&gt;Strength&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;As we all know, the BI market is not only huge, but getting larger and growing yearly to the tune of 8-10% a pop. Other industries are not so healthy to say the least.&amp;nbsp;&amp;nbsp; Simply put, BI is clearly not a fad, and no one questions its future.&amp;nbsp; It’s one of the fuels of our free-market system by protecting businesses and providing them with competitive tools. &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&amp;nbsp;#3. &lt;b&gt;Meaning&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;Guy Kawasaki says: “make meaning”.&amp;nbsp; He’s right.&amp;nbsp; Life’s too short to be in a meaningless industry.&amp;nbsp; And if BI isn’t about making meaning then I don’t know what is. &amp;nbsp;The whole purpose of the industry is to make meaning and support critical decision making. &amp;nbsp;This industry yields real-life significant solutions to crucial sectors like health, research, medicine, and defense (to name a very few). &amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#4. &lt;b&gt;Quality of life&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;In light of ominous HR &lt;a href="http://bit.ly/NWDWi"&gt;predictions&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;, news of recurring layoffs, and current employment trends in the IT industry, the BI sector has been relatively spared. Clearly I haven’t done any formal polling but I get the “vibe” that people are generally pretty happy to be in this game. &amp;nbsp;And why not. Compensation is generally good. &amp;nbsp;And location-wise, BI companies are clustered around national funding clusters namely Northern California and the New York/Boston areas.&amp;nbsp; These comprise some of the most magnificent (and most expensive, granted) landscapes and vibrant urban areas in the country. &amp;nbsp;Other industries have centers in, shall we say, less compelling geographic areas.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#5. &lt;b&gt;Funding&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;I lamented the VC situation in my previous post, and clearly this doesn’t apply to all segments of BI, but if you have any sort of compelling BI proposition with the word “cloud” in your business plan, trust me you will get a VC’s attention.&amp;nbsp; Maybe not an official invite to pitch in person, but most likely a phone call.&amp;nbsp; In other industries, that opportunity is long gone.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#6. &lt;b&gt;Gratification&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;BI projects used to take many months (sometimes years) to implement (when they even got completed).&amp;nbsp; But nowadays the industry is in “agile” mode.&amp;nbsp; And those who don’t embrace that won’t likely be in this business much longer. &amp;nbsp;This means you get to build solutions and see results quickly.&amp;nbsp; That’s gratifying.&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#7. &lt;b&gt;Globalism&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;BI is world-wide.&amp;nbsp; True, so are most other industries, but from my experience, there is less of an “us versus them” attitude.&amp;nbsp; It has a fraternal feel to it.&amp;nbsp; Hands and minds seamlessly reach across continents in ways I have not experienced elsewhere. (I’ll go hug a tree now).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#8. &lt;b&gt;Bozos&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;Most people are genuinely nice and unassuming.&amp;nbsp; I know this sounds naïve at best but it’s true.&amp;nbsp; I have not seen the level of ego, axe grinding, or personal animosity frequent in other industries.&amp;nbsp; Every contact I’ve initiated from top analysts to CEOs in this industry has been followed up promptly with courteous, genuine and insightful discussion. I’ve found most people to be more generous with their time and advice than in many other industries. Maybe they fear less for their jobs or fancy titles. In either case, the BI industry is fairly low on the bozo scale.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#9. &lt;b&gt;Passion&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;People in BI are passionate about their field.&amp;nbsp; I’m not saying they get out of bed every morning to go save the world (onward BI soldiers), but overall they value their work and their contribution. &amp;nbsp;Most people I’ve met in this business are workaholics.&amp;nbsp; They know their stuff inside-out and boy do they love to talk about it. &amp;nbsp;Passion signals a great, vibrant industry.&amp;nbsp; Additionally (and this key), there seems to be better customer advocacy in this industry than others.&amp;nbsp; It’s not perfect, but vendors often do listen and react accordingly.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;#10. &lt;b&gt;Innovation&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;I hate to reveal this well-kept secret (don’t tell anyone!), but there isn’t a lot of desire to innovate in the financial, accounting or insurance fields, for example. &amp;nbsp;I’d be preaching to the choir by pointing out the myriad of new-breed ADBMS players out there, but also the multitude of new OLAP, data mining and analysis products, approaches and new (non-relational) ways of looking at data, cloud BI, EC2, etc. &amp;nbsp;We’ve seen orders of magnitude of both hardware and software innovation in the BI world.&amp;nbsp; It is a rich intellectual field teaming with innovation levels typical of a “new frontier” because it is.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;So what’s the point of this apologist diatribe?&amp;nbsp; Just to remind people in this field to count their blessings. There are many worse places and industries to be in. &amp;nbsp;And it’s easy to take things for granted in the heat and excitement of daily business life.&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;In the past eighteen months I’ve met many challenges in this business.&amp;nbsp; From coding to QA, to technical writing, from sales engineering to evangelism, from product management to market analysis. You name it. &amp;nbsp;So I’ve seen a lot of the facets in a very intense, very short amount of time. &amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;And I’ve also been lucky to interact with numerous players in the industry, many of which have generously spent time and resources supporting my self-education efforts with their insight, connections, and advice.&amp;nbsp; You guys (and gals) know who you are and I thank you for the help. There are many &lt;a href="http://bit.ly/1UmCU1"&gt;mensches&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&amp;nbsp;in this business.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Georgia, 'Times New Roman', serif;"&gt;For the first time in my life, I think I can say I’ve found a home here in the BI industry.&amp;nbsp; I’ve never felt this way in the past twenty years, and I’m not exactly sure how to explain it, but like the old pair of shoes my wife keeps insisting I ditch, it just feels right and I’d like to keep it that way.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-8997232504736061613?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/8997232504736061613/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/select-sumblessings-from-workinginbi.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8997232504736061613'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8997232504736061613'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/select-sumblessings-from-workinginbi.html' title='SELECT SUM(blessings) FROM working_in_bi'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-3949291432984817377</id><published>2009-09-22T13:20:00.000-07:00</published><updated>2009-09-22T13:50:34.944-07:00</updated><title type='text'>Please Stop Making More ADBMS Sausage</title><content type='html'>&lt;p class="MsoNormal"&gt;If you’re thinking about building a new startup in the high-performance analytical database (ADBMS) market, hat’s off to you: kudos and respect my brother.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ve been in the kitchen, and I’ve seen the sausage being made. But let me tell you something: &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;you might be a day late and a dollar short to the party. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In the past years, I’ve often pondered where the new-breed high-performance analytical database industry was headed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Will the existing players manage to survive?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And is there a chance in hell for new ones to succeed in this market?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you had asked me to peek into my newly minted BI crystal ball in early 2009, I would have said “no way”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Why? Because at the time I was predicting the demise of a majority of the twelve or so players in this space based on the observation that the field was too crowded and too expensive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I figured, in this cut-throat competitive space, and with tough economic times ahead, we’d be lucky to see two, maybe three survivors come 2010.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Since then, not only have several other significant players, technologies and business models popped up (for example, Groovy, XtremeData, VectorWise, Hadoop/MR and the OSS guys), but we have clearly not seen the level of attrition I was anticipating.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nobody has (officially) gone out of business save Dataupia as best I can tell, and Datallegro got a check from Steve Ballmer.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Sure, some folks are experiencing tougher times than others (I dare say several are hanging by a thread) but overall, resilience has been the name of the game. So what gives?&lt;/p&gt;  &lt;p class="MsoNormal"&gt;From my point of view, there are two types of new-breeders really:&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;those living in “comfortably numb” mode (quietly outliving peers may not be a bad strategy these days), and those kicking it into high-gear with a vengeance. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In the latter category I can’t help but think of Vertica, Netezza, Aster Data and ParAccel.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It takes a lot of “cojones“, cash and luck to build a new ADBMS company. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But even blessed with all these, and given the proper planetary alignments, I would advise anyone considering a start from scratch nowadays to ponder the following points.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;First, it is insanely expensive and complex to develop systems software to produce a complete analytical database engine (and I mean “complete” in a holistic Product Management sense).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Tony Bain highlights some of the database startups challenges in his excellent series starting &lt;a href="http://bit.ly/4flp2q"&gt;here&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;(things are not significantly different between OLTP and OLAP in this respect). But systems software is a different animal than your run-of-the-mill corporate enterprise application.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;On average you’re looking at 200,000 man-hours and anywhere from $60M to $100M to fund such a venture to completion.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is just to get rolling. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Additionally, you cannot just put out a database product and call it a day.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Maintaining (enhancing) a behemoth of Oracle or SQL Server stature runs hundreds of millions of dollars every single year. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Everything from equipment to talent costs more when developing database software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Believe you me, the folks who will work on your SQL optimizer, inter-fabric communications, parallel or compression schemes better not be affordable newbies.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Your development platforms won’t likely be the average oh-hum laptop attached to cheap storage.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;An efficient QA or Performance Group will cost a small fortune in payroll and redundant equipment. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And seasoned performance architects don’t run the streets. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;You cannot assemble a database engine product by cobbling together open source bits and distributed talent like a new Web 2.0 RIA venture.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can’t take a Kia (even a dozen of them) to the Indy 500.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Second, I think it’s no longer possible to find sufficient levels of Venture Capital funding for such endeavors. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;My feelings on this issue are re-enforced when I read articles like &lt;a href="http://bit.ly/3xkd5"&gt;this&lt;/a&gt;,&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;or &lt;a href="http://bit.ly/4BfZ3R"&gt;this&lt;/a&gt;. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I think the writing was on the wall for several years now.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Reports of the VC’s demise are greatly exaggerated, but the funds, the endurance, and the risk acceptance levels are gone. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Small bets on small returns are in. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Large bets on IPO-driven returns are out (for now). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Even if you manage to score a major industry name like Mike Stonebraker on your Board, I think VCs in this space (those that are left and not now engaged in M&amp;amp;A) will say “talk to the hand”.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Investors in numerous existing new-breeders are biting their nails to the bone (or looking for ways out).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So to me, the train has left the station.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And unless you can pony up your own seed money, trying to fund such a project via institutional money is currently, in my opinion, an exercise in futility.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Third, the field is already too crowded and spread out very thin. For a great overview of the major players out there, don’t miss Bloor Research’s competitive analysis &lt;a href="http://bit.ly/1u5s3v"&gt;paper&lt;/a&gt;. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;A lot of existing players do not have sufficient “boots on the ground” to make headway against larger ones, much less established powerhouses.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Heck, even going against Vertica’s deep-pocketed marketing is no piece of cake.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Worse yet, in this business, success is not guaranteed by technical superiority.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I know it sounds heretic saying this about an industry dominated by performance claim testosterone, but it’s true.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Besides technical prowess, you need to get the word out louder than everyone else.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Unfortunately, everyone has the same “word”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In a crowded space this means you have to yell “Fire!” pretty darn loud and relentlessly to get noticed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I think a lot of “database deals” are sealed on the golf course, more so than POC or bake-offs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Mind you, this is probably the case with most enterprise software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But to get your foot in the door, you need BOD and Primadona action.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;BOD are well-connected heavy hitters on your board.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Primadonas are the star sales guys (or gals) currently working for your competition.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Those you’ll have to poach with sweetheart deals to come work for you, a totally unproven new-breeder with a year of runway to go.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Fourth, pricing pressure in this business is relentlessly choking. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This is a consequence of my previous point.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A little over a year ago, word on the street was $100K/TB retail ($50K/TB street price) but now we’re seeing $20K/TB retail (TwinFin land), which probably means you can do $10K/TB on the street.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Aster Data is pitching an appliance for $50K (1TB, includes Dell hardware), &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;and Oracle’s new improved &lt;a href="http://bit.ly/zfzoG"&gt;Exadata V2&lt;/a&gt;  (SATA storage) even touts $5,700/TB so I mean, at these margins, you’re basically talking about giving stuff away, and in a lot of cases, I suspect that’s what’s going on.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So unless you’ve been around the block a bit and have some ammo in the bank, I don’t know how a newcomer can sustain this type of pricing “carpet bombing”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As if that weren’t enough, you have OSS and cloud players breathing down your neck.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Customers expect more for less and perception of BI as a “commodity” is growing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In this pricing environment, survivability for a newbie is improbable at best. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Fifth, several windows of “technology opportunity” for ADBMS are closing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For example, if your great idea for a new ADBMS company involves a columnar approach, you might be too late to the party. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;If your “innovation” hinges on massive parallelism, compression, in-memory caching/cubing schemes, super-fast intra-nodal fabrics, hybrid MPP/SMP, hybrid row-column storage (PAX-like), or yet another SQL chip accelerator or super-duper FPGA, you might have missed the boat (on the other hand, if you figured out how to do analytics on compressed encrypted data, then you might be on to something).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I believe the top new-breeders did all the technical legwork in the past 4-5 years.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It took Mike Stonebraker long enough, but Vertica pretty much put columnar on the map.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And most of that engineering is now mature enough (and well proven) to warrant acquisition interest from the big boys.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Initially, the big guys took a “wait and see” attitude (remember, OLTP butters their bread anyway, not analytical OLAP) but now, having seen results and traction on others’ dime, I think they’re ready to pony up some cash (classic buy vs. build decision) to absorb the bits and pieces suiting their marketing strategies. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;By doing so, the good ole boys re-invent themselves and say “hey look, we have columnar technology as well now!”&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;(How Sybase didn’t corner this market with IQ is beyond me, especially having read Seth Grime’s excellent &lt;a href="http://bit.ly/Y5rs0"&gt;paper&lt;/a&gt; about it).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Better yet, by integrating new technologies into existing code bases, the big dogs can say “hey, we have the best of both worlds for OLTP and OLAP” (Oracle’s latest Exadata comes to mind).  And perhaps “look, we have SMP on the processing side and MPP in storage layer”, or vice-versa, thereby returning to the old “one-size-fits-all” GP-RDBMS paradigm so &lt;a href="http://smartdatacollective.com/Home/21488"&gt;criticized&lt;/a&gt; by Stonebraker (but so convenient for the corporate user).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;And clearly, given the growing popularity of “operational analytics”, an OLTP+OLAP offering is compelling.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I think the “proof-of-concept” window for many new-breed technologies, specifically MPP columnar (but others as well, for instance, acceleration hardware, where Ingres is picking up VectorWise, or MPP where Microsoft snapped up Datallegro), has closed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The winners (and their results) are in and acquirers will likely make their move in 2010. This dovetails nicely with a recent TDWI survey claiming half the respondents plan to replace their DW platform between 2010 and 2012 (apparently this is presented &lt;a href="http://www.tdwi.org/display.aspx?id=9499"&gt;here &lt;/a&gt;on October 7&lt;sup&gt;th&lt;/sup&gt;).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;All this being said, is it possible that a brand new software endeavor currently in stealth-mode development in Nepal might suddenly dominate the analytical database scene within months? How about a revolutionary FPGA/SQL Chip/Flash/Optical hardware contraption that could blow the hinges off industry standards and benchmarks? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Sure why not.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Real innovation is usually unexpected, and often unintended.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But I don’t see it being driven by the classic “VC funds startup makes big database scores big IPO” model much longer. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;When I look at things like &lt;a href="http://bit.ly/axsOW"&gt;Hadoop&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;and current developments in the OSS space for VLDB analytics, I still have trouble grasping the business model, but I clearly see “life force” innovation at work &lt;a href="http://bit.ly/wvoNN"&gt;here&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A year ago I would never have expected a place like Visa to stray from the “Big Threes” but nowadays these guys are &lt;a href="http://bit.ly/NUIqV"&gt;messing&lt;/a&gt; with Hadoop! People are also doing amazing things with MapReduce implementations and BigTable KV types of massive data storage systems. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;How does open source fare against the five points mentioned above?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Pretty darn well if you ask me. Costs are significantly lower, venture capital is not needed or minimal, engineering is crowd-sourced, there’s more breathing room, market entry is viral and massive, distribution and testing self-fueled, and pricing (or lack thereof) better controlled.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Additionally, open source seems shielded from the “Borg Effect”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t see how massive proprietary shops like Oracle, IBM or Microsoft can successfully “absorb” these entities.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t think Larry has a clue what to do with MySQL. He can’t really unload it, but he can’t really integrate it either.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Darn Trojan horse!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In 2008, Infobright went Open Source and raised $10M in the process.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Looking at the results, I think these guys were smart!&lt;/p&gt;&lt;p class="MsoNormal"&gt;So if you’re thinking about building yet another high-performance analytical database engine the classical way (and not going OSS), my advice to you is simple: unless you have $60M in the bank and technology significant (and new) enough to impress people like &lt;a href="http://dbmsmusings.blogspot.com/"&gt;Daniel Abadi&lt;/a&gt; (good luck on that one), you might be climbing up a greased pole.&lt;span style="mso-spacerun:yes"&gt;  I'm not saying it's impossible mind you, but t&lt;/span&gt;here’s been a lot of cooks making the same sausage over the last five to six years to last us a while. Maybe it's time to look at the next curve.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-3949291432984817377?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/3949291432984817377/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/please-stop-making-more-adbms-sausage.html#comment-form' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3949291432984817377'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3949291432984817377'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/please-stop-making-more-adbms-sausage.html' title='Please Stop Making More ADBMS Sausage'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6320278877679631722</id><published>2009-09-17T11:14:00.000-07:00</published><updated>2009-09-17T11:52:29.757-07:00</updated><title type='text'>Vaya Con Dios - The Day I left XSPRADA</title><content type='html'>&lt;p class="MsoNormal"&gt;Effective today, I will no longer be working at XSPRADA.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There, I said it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Catharsis, take me away!&lt;span style="mso-spacerun:yes"&gt; I&lt;/span&gt;t was definitely a tough decision but this is par for the course in the startup world, and God knows I’ve been there done that, but in this case, my close long-time personal relationship with the founders and unwavering worship of the technology for the past ten years make this particularly bittersweet.  &lt;/p&gt;  &lt;p class="MsoNormal"&gt;I was very lucky to work with some of the best, most resilient people this industry has to offer and I don’t claim this lightly.&lt;span style="mso-spacerun:yes"&gt;  As you know, &lt;/span&gt;I am parsimonious with compliments.&lt;span style="mso-spacerun:yes"&gt;  But "t&lt;/span&gt;hey" say that tough times never last, only tough people do.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is why I am still convinced that XSPRADA technology will someday take its proper place in the world come what may.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;A failed endeavor is only one during which you have learned nothing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And in this case, the XSPRADA opportunity has enriched me in personal and professional ways beyond my wildest expectations.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I depart a richer, more experienced man for it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In an industry too often clouded by “smoke and mirrors”, I am always reminded of the following advice I received a while back:&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;ol&gt;&lt;li&gt;Don't profess about things you are not sufficiently familiar with.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/li&gt;&lt;li&gt;Don't assume something isn't true that might be, or is that isn't&lt;/li&gt;&lt;li&gt;Listen at least twice as long as you talk.&lt;/li&gt;&lt;li&gt;Ask twice as many questions as you answer and LISTEN to the answers.&lt;/li&gt;&lt;li&gt;When asked about something and you don't know, say "I don't know."&lt;/li&gt;&lt;li&gt;If there is even the slightest possibility that you could be wrong, acknowledge it....&lt;/li&gt;&lt;li&gt;and don't forget to thank people for their time and advice.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;/p&gt;            &lt;p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;You combine that advice with the one found &lt;a href="http://u.nu/2y993"&gt;here&lt;/a&gt; and believe you me you’ve got yourself one kick-ass sales engineer there.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They don’t run the streets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’d say these points could (or should) constitute the Seven Commandments of the Sales Engineer (and probably any other profession, except perhaps politician).&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;They were given to me when I started by Chris Piedmonte, founder of XSPRADA, and a guy whose courage, integrity and technical brilliance are, in my book, without equal.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Be that as it may, I must soldier on.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Naturally, I will not be able to discuss topics pertaining to XSPRADA technology as an insider from now on, but Lord knows there is sufficient ADBMS/BI material out there that’s interesting enough to cover and discuss on a regular basis.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Namely, the new “PAX Analytica” movement as originally &lt;a href="http://u.nu/5v993"&gt;brought up&lt;/a&gt; by Curt Monash and professionally laid-out by Daniel Abadi as usual in his excellent &lt;a href="http://u.nu/8v993"&gt;post&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Also, the old one-size fits all (OLAP+OLTP+whatever) versus dedicated engines (columnar OLAP) has been revived with the recent ORCL &lt;a href="http://u.nu/6u993"&gt;announcement&lt;/a&gt; touting the new improved Exadata V2 (exit HP, enter Sun).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This deserves addressing in more detail.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It leads to serious implications for current and potential customers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Relevant as well is a discussion about the future.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Will the “big boys” end up swallowing “new-breed” technology and integrating it (what I call the Borg effect, as discussed on Daniel Lemire's excellent &lt;a href="http://u.nu/5ey83"&gt;post&lt;/a&gt;).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or will they become obsolete allowing the new-breeders to survive long-term as independent replacement entities?&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;And then there’s a recent &lt;a href="http://u.nu/55a93"&gt;thread&lt;/a&gt; about analytical speed which is very relevant at this point in time I believe.&lt;span style="mso-spacerun:yes"&gt;  Finally, &lt;/span&gt;a little bird told me there’s about to be some really interesting rumble (again) pertaining to the infamous (or not, depending on which side you’re on) TPC organization.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Indeed, there’s no lack of interesting topics out there!&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Additionally, I think there’s a compelling story behind employment search and provisioning in the BI industry, so I’ll be penning some thoughts about that as I go along. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my experience, you can infer a lot about an industry’s state by checking its recruiting culture and pulse.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Joy Chen claims in a recent blog &lt;a href="http://u.nu/4x993"&gt;posting&lt;/a&gt; that 54% of workers plan to resign after the recession.  If this prediction is correct, the impact on our industry is sure to be felt and that, IMHO, is worth discussing.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;So what’s really happening behind the employment scene in BI? I’ll be sharing some thoughts about that as I embark on the new path the BI Gods have charted for me.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So thanks for sticking around, and as they say where I come from (well, ok maybe a little further South) Vaya con Dios!&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6320278877679631722?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6320278877679631722/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/vaya-con-dios-day-i-left-xsprada.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6320278877679631722'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6320278877679631722'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/09/vaya-con-dios-day-i-left-xsprada.html' title='Vaya Con Dios - The Day I left XSPRADA'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1618064350785025784</id><published>2009-08-26T15:29:00.000-07:00</published><updated>2009-08-26T15:39:31.296-07:00</updated><title type='text'>Mole Whackers Need not Apply</title><content type='html'>&lt;p class="MsoNormal"&gt;Two completely different events caught my attention lately.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;One of them is a post by Curt Monash called &lt;a href="http://u.nu/4wx23"&gt;Bottleneck Whack-A-Mole&lt;/a&gt;,&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;and the other is the much-publicized &lt;a href="http://u.nu/6wx23"&gt;alliance &lt;/a&gt;for “BI in the Cloud” comprising &lt;a href="http://www.rightscale.com"&gt;RightScale&lt;/a&gt;, &lt;a href="http://www.talend.com"&gt;Talend&lt;/a&gt;, &lt;a href="http://www.jaspersoft.com"&gt;Jaspersoft&lt;/a&gt; and &lt;a href="http://www.vertica.com"&gt;Vertica&lt;/a&gt;.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In the post, Curt describes software development (or developing a good software product) as “a process of incremental improvement”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Fair enough.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The analogy he draws is between constantly fixing and improving performance bottlenecks and the annoying (if entertaining) arcade game of &lt;a href="http://360digest.com/wp-content/uploads/2009/06/whack-a-mole.jpg"&gt;Whack-A-Mole&lt;/a&gt; where you have to be fast enough to clobber enough of the critters as they randomly pop up from below.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He then makes the point that “Improving performance in, for example, a database management system has a lot in common with Whack-A-Mole.” &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Having spent most of my life designing, developing, improving and testing commercial and enterprise software applications, I have to say I don’t totally agree with his analogy for several reasons.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;First, call me old-fashioned, but I’m an ardent believer in the fact that software building is deterministic.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Whack-A-Mole engineering is not.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The age-old controversy about software being more of an art than a science may never be resolved, but at the end of the day, I feel software is (should be) a scientific, engineering-driven, deterministic &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;endeavor like any other engineering discipline.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;With Whack-A-Mole engineering, buildings and airplanes fall to the ground.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s not good.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my experience, those who seek to “romanticize” software engineering are typically adverse to proper planning, design and testing as being too “dry” or unworthy an endeavor. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;That’s nonsense. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Second, there is a distinct difference in the way you develop “regular” software from “system software” and I’ve learned this from sitting in the front row the past several years at XSPRADA watching database software being built from the ground up.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s a little bit like the difference between building a tree house and a major commercial skyscraper.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And I believe that playing Whack-A-Mole games while trying to bring up a building is a scary proposition at best (especially for future tenants).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And yet, the example Curt provides involves Oracle’s Exadata, of all products!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He states: “When I spoke to Oracle’s development managers last fall, they didn’t really know how many development iterations would be needed to get the product truly unclogged” – This statement is mind-boggling to me.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Because for one thing, it suggests that Exadata is “clogged” (ouch) but worse, that their engineering people have no clue as to how they might eventually (if ever) snake the blockages out of it!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So, basically it’s a trial and error approach to building a database.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Notwithstanding their “professed optimism” that it wouldn’t take “many iterations at all” to finally figure things out, it certainly doesn’t give me (or any reasonable person) a warm feeling about a multi-million dollar product claiming to be the world's ultimate analytical machine.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I think there’s a lot to be said for sound engineering practices, proper planning and testing, setting expectations and deterministic engineering management practices in the world of system software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That Oracle (or Netezza for that matter, also referenced in the post) might just be going along whacking moles instead is a scary proposition indeed.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Even if this little game is limited to “performance engineering” as Curt suggest (as if there was a more important endeavor in an ADBMS), that’s a serious allegation in my book.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I say leave the arcade games to the kids, and let the real engineers design and implement database and system software please.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s no room for amateurs in this game.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;On to my next point of interest: the new Gang of Four in the Cloud (with apologies to design pattern aficionados) comprising RightScale, Talend, Vertica and Jaspersoft have recently promoted and demonstrated a “bundled” on-demand package for the cloud.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I attended their webcast yesterday and was impressed, but with reservations. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Each of these vendors is impressive on its own, no doubt about it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But it seems to me the bundled proposition might be confusing at best to the unwary customer.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This new offering is billed by the marketing folks as “Instant BI, just add water” which drives me nuts.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Look, it might be simple in theory, and it might take a few minutes to setup the stack on your own (as Yves de Montcheuil from &lt;a href="http://www.talend.com/management/management.php"&gt;Talend &lt;/a&gt;claims) but it’s still a long way to actually accomplishing anything serious in a few simple clicks. Sorry, not going to happen anytime soon.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;You still have to work your way through provisioning and instance management (RightScale), data integration and loading (Talend), feeding and configuring the database (Vertica), and setting up the reports/analytics you might need (Jaspersoft).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All of which can be accomplished just as easily (or not) internally by the way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s true you’d still have to purchase or license Vertica internally, which may or may not match the SaaS pricing I’m not sure (and either way, Vertica has a SaaS offering as well) but the other components are open source so, I’m not sure I see the big advantage there.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;An interesting thing I noticed as well is that some people didn’t seem to understand what RightScale’s role was in the whole offering.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This tells me they don’t really grasp the intricacies of “the cloud” – because instance and infrastructure management for enterprise in the cloud is not trivial and you do need something like RightScale to grease the wheels (it’s an abstraction layer really), but I think many people assume moving to the cloud is “magic” and makes all these issues disappear.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If that were the case, you wouldn’t need RightScale in the mix.  Beware undermanaging expectations I'd say.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Additionally, the pricing model (which is supposed to be so much simpler in the cloud) is confusing at best as each vendor has its own menu.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;The best answer to that I can remember was “starting at $1,700 per month” – I’m not sure what to make of that.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I think from an engineering/technical standpoint, this endeavor is noble, but from a “let’s make things simpler and transparent for the user” perspective, there’s still a lot of work to be done. &lt;span style="mso-spacerun:yes"&gt;  In other words, it's a nice play for the vendors holding hands, but I'm not sure how beneficial it might be to the average enterprise user.&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;As usual, caveat emptor – Beware promises of a holy grail in BI as there is no such thing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s all about work.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Hard, detailed and careful work with proper planning and budgeting.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In that respect, setting up successful BI solutions is a lot like running and implementing software projects.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There are no shortcuts, and it’s not a job for mole whackers.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-1618064350785025784?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/1618064350785025784/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/mole-whackers-need-not-apply.html#comment-form' title='24 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1618064350785025784'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1618064350785025784'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/mole-whackers-need-not-apply.html' title='Mole Whackers Need not Apply'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>24</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-3259306186570857491</id><published>2009-08-17T13:27:00.000-07:00</published><updated>2009-08-17T13:54:25.891-07:00</updated><title type='text'>Oh yeah? Well my database is SMALLER than your database!</title><content type='html'>&lt;p class="MsoNormal"&gt;Contrary to popular edict, smaller is not always better, unless of course you’re talking about analytical database engines.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;In that respect, it’s hard to find an ADBMS that can fit on hard media like a CD or a USB stick.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For example, I don’t think SQL Server, Oracle, DB2, Greenplum, Aster, ParAccel, or the myriad of other ADBMS vendors can fit all their bits in a tight spot.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even in the open source realm, I doubt you can wedge InfoBright (MySQL) or IceBreaker (Ingres) onto a stick, much less shlep their bits around as an email attachment.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;One exception to this is the &lt;a href="http://www.bcsolution.com/2009/07/vstick-coming/"&gt;V-stick&lt;/a&gt; from Vertica.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When I first read about this, I initially thought it was a hoax but apparently not.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s pretty cool too because it includes the O/S, web server, GUI and the engine all together on a 16GB thumb drive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;How an engine like Vertica, designed around distributed MPP, can possibly operate representatively (using terabyte-size data) on a thumb drive is beyond me, and I’ve never heard of anyone actually using this gizmo but I’d sure love to get my hands on one and review it if it’s still available.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The other exception of course is RDM/x, the XSPRADA database engine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The reason is simple: its total deployment footprint is around 10MB.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That includes the 32/64 ODBC drivers and a couple DLLs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The engine itself is currently around 6MB.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Last I looked the installer clocked in at 16,760KB.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means you can actually deploy RDM/x onto a memory stick if you want to.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I tried it, it works. It’s pretty cool.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;But after a while I wondered, why would anyone care about this?&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The reason is two-fold.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;First, it’s really easy to try out software that is small and self-contained without expanding large amounts of time and resources.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes, you can download RDM/x from our &lt;a href="http://www.xsprada.com/"&gt;website &lt;/a&gt;but in many cases (like secured firewalled enterprises), that’s not an option. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Second, it means we’re a good candidate for embedded applications.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because if I can fit my database engine on a stick (or in an email), I can probably embed it in instruments and devices as well either as raw C++ code or libraries.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;But for quick POCs, size and simplicity really does matter.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Say you’re suddenly tasked with evaluating solutions to deploy a BI solution inside your company.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Suppose you’re a Microsoft shop. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Suppose additional capex is not an option, and suppose further you have a week to show results (namely a set of nicely formatted reports, pivot tables or dashboards).&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Now what?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you have significant in-house experience with SQL Server and associated SSAS, SSIS, SSRS, and Excel (and assuming you have a clear and deep understanding of the business scope and goals to begin with) you’re probably going to:&lt;/p&gt;&lt;p class="MsoNormal"&gt;(1) Figure out where your source data is coming from (connection strategies)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;(2) Model your DW (figure out grain on facts, dimensions etc, need to figure out BIDS and SSAS) &lt;/p&gt;  &lt;p class="MsoNormal"&gt;(3) Establish some preliminary ETL process (including incremental loads, need to figure out SSIS)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;(4) Load your warehouse (if you screw it up, then need to drop and do it over)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;(5) Setup an SSAS cube structure (figure out SSAS via SSMS or BIDS then publish the thing)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;(6) Figure out what queries to generate (talk to DW DBA or learn MDX)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;(7) Figure out what BI tool to use (Excel or browser, depends on policies and audience)&lt;/p&gt;&lt;p class="MsoNormal"&gt;(8) Generate the reports (canned or ad-hoc)/dashboards/pivot tables for the POC&lt;/p&gt;&lt;p class="MsoNormal"&gt;Now, if you have no prior experience with the Microsoft BI toolset, and you can whip this little project up in a week, guess what, you need to quit your job and start a consulting company because clearly, as a NYC recruiter once told me “you’re so money”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But if you’re a normal person with little prior BI experience (and the terms ROLAP, MOLAP, SCD and MDX don’t ring a bell), you’re in a bind.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So another thing you can do is download a tiny analytical database (say, the XSPRADA RDM/x engine, for example) and throw, say, 100GB of data at it (this is just a small POC remember?), then plop Excel on top of it and generate some really cool reports or pivot tables to show the boss (in under a week) it can be done.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;How hard is that to do?  This hard:&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Figure out where your source data is coming from.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Yup, that one is pretty universal in the BI world. Difference here is all your data sources will export as CSV to feed the XSPRADA engine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So at least that’s consistent across all sources (be they structured, semi-structured or not).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;CSV is data format lingua-franca so your connection "strategy" is this: get everything out as CSV. Plain and simple.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Model your data warehouse.&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;That’s always a smart thing to do for obvious reasons although the XSPRADA engine is schema-agnostic and you can feed it normalized or star/snowflake models at will. The secret phrase is: “we don’t care”! So for a quick POC, if you find yourself "forced" to feed RDM/x a 3NF model, no worries.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Establish some preliminary ETL process.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;RDM/x runs against initial CSV data islands directly off disk.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Point to the CSV files using the XSPRADA SQL extensions for DDL and you’re done. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You’ll likely be doing this via script or code (C++, &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Java or .NET to the ODBC driver directly or via a JDBC-ODBC bridge).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For incremental loads, just plop the new CSV files on disk and point RDM/x to them using the INSERT INTO…FROM extension.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This process can be done in real time without disruption while other queries are running.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No hassle there.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Load your warehouse.&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;That’s executing a single line of SQL DDL code such as &lt;/p&gt;  &lt;p class="MsoNormal"&gt;CREATE TABLE ….FROM “c:\file1.csv;c:\file2.csv…c:\file32.csv”; or INSERT INTO…FROM “c:\file1.csv;c:\file2.csv…c:\file32.csv”; &lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Made a mistake of want to modify the schema and “reload” real quick? Not a problem. Simply re-issue the same DDL command and the table/schema is instantly updated.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;From a trial and error perspective (which, in a POC situation, is fairly typical), that’s a high-five.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Setup an SSAS cube structure (figure out SSAS via SSMS or BIDS)&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;There is no concept of cubes inside the XSPRADA engine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x automatically slices and dices based on incoming queries in real time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So if you want to “cube” just feed the engine slicing OLAP queries.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x automatically restructures and aggregates in real time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No need to pre-define or pre-load cubes, deal with hierarchies or materialized views.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I &lt;a href="http://jeromepineau.blogspot.com/2009/07/of-views-cubes-patents-and-haystacks.html"&gt;blogged&lt;/a&gt; about this earlier. RDM/x is a lot like Luke 11:9 – Ask and you shall receive.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Figure out what queries to generate (talk to DW DBA)&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;That’s where an external tool using MDX (along with an MDX expert!) can come in handy (most people don’t roll their own SQL for OLAP, although it can certainly be done in POC mode).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One cool thing about RDM/x is its ability to “withstand” poorly-formulated SQL because the queries are optimized against the internal mathematical model.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x is typically more “SQL-forgiving” than most other engines.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And a poorly formulated query is likely transformed internally to still yield optimal performance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So even if you’re no SQL guru, the RDM/x engine is still on your side.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Figure out what BI tool to use (Excel, no brainer)&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Connect Excel to the XSPRADA engine directly via ODBC or connect Mondrian to RDM/x (via bridge) then connect Excel to Mondrian via the SimbaO2X ODBO/XMLA connector.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Alternatively, make the argument that using OSS like Pentaho or Jaspersoft against RDM/x directly is more flexible and accessible (not to mention cheaper!) than messing with Excel.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Depending on your user base and corporate standards, that argument may or may not hold water.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Generate the reports/dashboards/KPI/Pivot Table/ad-hoc queries required by management.&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Exactly the same way you would using any other tool and/or SQL.&lt;/p&gt;&lt;p class="MsoNormal"&gt;At the end of the day (or in our case, the week), it’s all about “time to results” and “pain to results”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In those types of situations, smaller and simpler clearly has a significant advantage over the rest.  And speaking of smaller, I have run over my allocated space for this posting :)&lt;/p&gt;&lt;table&gt;&lt;tbody&gt;                    &lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-3259306186570857491?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/3259306186570857491/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/oh-yeah-well-my-database-is-smaller.html#comment-form' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3259306186570857491'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3259306186570857491'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/oh-yeah-well-my-database-is-smaller.html' title='Oh yeah? Well my database is SMALLER than your database!'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-2500706782556553021</id><published>2009-08-14T13:40:00.000-07:00</published><updated>2009-08-16T10:04:54.771-07:00</updated><title type='text'>Bits &amp; Pieces Summer Posting</title><content type='html'>&lt;p class="MsoNormal"&gt;I thought I would do a “freeform post” today to celebrate the lazy 2009 Summer and the fact that most of the planet (but not the BI world for some reason) seems to be on vacation at the moment.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;As you know I’ve been following ParAccel with interest for a short while now wondering how they would deploy those $22M of Sales &amp;amp; Marketing greenbacks they just scored.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I read this interesting &lt;a href="http://www.b-eye-network.com/view/11151"&gt;article &lt;/a&gt;about them lately and it looks like “customer acquisition” might be part of their strategy.&lt;span style="mso-spacerun:yes"&gt;   Good move.  &lt;/span&gt;Unfortunately, I couldn’t determine who the “other database products” or the other “columnar-MPP database” vendor might refer to.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I can only surmise it might be Vertica.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If anyone knows who else OfficeMax looked at, please share the wealth.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;For all my whining about missing TDWI in my own backyard (San Diego) lately, it seems I didn’t miss much after all according to Merv Adrian who &lt;a href="http://u.nu/68wu"&gt;posted &lt;/a&gt;about the conference shortly thereafter.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;From the looks of it, the highlight might have been a sunset ride on the &lt;a href="http://www.lyzasoft.com/"&gt;Lyzasoft&lt;/a&gt; yacht in the San Diego bay.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Talk about good PR!&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Andy Heyler wrote a good piece about the demise of Dataupia called &lt;a href="http://andyonsoftware.com/2009/08/no-data-utopia/"&gt;No Data Utopia&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In it he refers to the “awkwardly named” Dataupia.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes Dataupia is a weird name.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But so are several others such as Kickfire, Tokutek, or Calpont, for example.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And although XSPRADA is admittedly rather funky, at least it’s an acronym (Extended Set Processing for Rapid Algebraic Data Access).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To me the money quote in there is “&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi- ;font-family:Arial;color:#333333;"&gt;you need to have a clearly differentiated position in such a crowded market&lt;/span&gt;&lt;/span&gt;”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s pretty much what I’ve been saying for a while (and common sense if you ask me).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At the moment, I don’t see any of the players in this market besides &lt;a href="http://www.xsprada.com/"&gt;XSPRADA&lt;/a&gt; with a “clearly differentiated position” on anything. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;At the end of the day, it’s still all about the prisons that are columns and rows.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Netezza announced the long-awaited (not) TwinFin product line prompting a flurry of nasty posts from competitors like &lt;a href="http://u.nu/3b3v"&gt;Kognitio&lt;/a&gt;,&lt;span class="apple-style-span"&gt;&lt;span style="line-height:115%;Arial&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-family:&amp;quot;;font-size:10.0pt;color:black;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;and an interesting Monash &lt;a href="http://u.nu/4bwu"&gt;post&lt;/a&gt; about data warehouse pricing. &lt;span class="apple-style-span"&gt;&lt;span style="line-height:115%; Arial&amp;quot;,&amp;quot;sans-serif&amp;quot;;font-family:&amp;quot;;font-size:10.0pt;color:black;"&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Much like traditional software license pricing, ADBMS prices seem to be reaching for the bottom (stay tuned for $19.95 per terabyte while supplies last!).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And as someone recently said, the bottom is open source.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Should be interesting to see what happens. Personally, I think a lot of this stuff is going to become commoditized.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The play looks a lot like printers or razors, where the actual hardware is sold dirt cheap, but the paper or blades cost a fortune to replenish.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Caveat emptor.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;A relatively new ADBMS vendor called &lt;a href="http://www.xtremedatainc.com/"&gt;XtremeData&lt;/a&gt; has emerged.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It looks like they’re based in the US (Schaumburg, IL to be precise) but the actual brains of the operation are in India somewhere.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They’ve certainly been vocal on several BI blogs (namely, &lt;a href="http://u.nu/9n9p"&gt;DBMS2&lt;/a&gt;).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To me the funniest thing is their “ChalkTalks with Faisal” screencasts.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Faisal is apparently their India-based CTO.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The entire presentation is like a Netmeeting whiteboard session where Faisal keeps talking while drawing stuff on a whiteboard in a Flintstonish manner.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All that’s missing are the stick figures.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It wouldn’t be so bad if they realized the audio is horrible, due to the incessant noise from the marker writing on the board.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It sounds exactly like squealing puppies in the background.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s totally distracting albeit very amusing.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;SQL Server 2008 R2 is &lt;a href="http://u.nu/9mwu"&gt;out&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;but without the Gemini (or Madison née Datallegro) pieces I guess.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s Office 2010, parts of which are going into the cloud if I understand correctly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This whole Madison/Gemini “revolution” in BI is starting to get a little, shall we say, boring for lack of materialization.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not sure what’s going on with Microsoft lately but I’m getting more and more concerned.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Even .NET seems to be taking a &lt;a href="http://u.nu/78au"&gt;backseat&lt;/a&gt; to Java/J2EE.&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It wasn’t like that a year ago.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I am sensing a scary downward spiral.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One thing’s for sure, I have yet to see anything remotely connected to .NET or C# in the BI programming world, save for those .NET C# extensions Aster provided in their engine recently for doing MapReduce (and of course the PushBI initiative but even there…).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To me it seems the entire ADBMS/BI code stack is Java on Linux (SuSE, Red Hat and CetnOS).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m talking about the supporting/ecosystem tools of course, not the underlying engines (those are mostly C/C++ I believe).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I could be wrong but, it’s not looking good for Microsoft.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;My prediction: &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;this behemoth will eventually split off into a myriad of smaller entities, some of which will survive, some of which won’t.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The sum of the parts may be worth more than the whole.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Ingres has actually managed to generate some &lt;a href="http://u.nu/584v"&gt;buzz&lt;/a&gt; in the US lately by &lt;a href="http://www.ingres.com/vectorwise/"&gt;announcing&lt;/a&gt; it is teaming up with a company called VectorWise (well, it’s a research outfit actually) to develop a “project”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No customers yet but a lot of very fancy PhD types in Amsterdam (they did MoneyDB/X100) and a first option to buy VectorWise is rumored should the venture be successful. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Time will tell. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;It always amazes me why Ingres isn’t more of a household name in the US.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In Europe, they’re very popular (at least that’s what my French Ingres experts tell me &lt;g&gt;).&lt;o:p&gt;&lt;/o:p&gt;&lt;/g&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Finally, a recent &lt;a href="http://u.nu/4k2v"&gt;article&lt;/a&gt; is claiming that BI is used by only 8% of employees in the enterprise and that’s, of course, only counting the shops that have implemented it to begin with.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Actually I find that number high and would have guessed more like 5%.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is not surprising in light of the recent economic woes and scandals we’ve witnessed recently.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In most of these cases, it wasn’t a lack of technology or resources at play but rather a conscious choice to ignore reality. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The tools are there. The desire is not.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;In my opinion, this disconnect is very apparent in numerous retail outfits.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Places like &lt;a href="http://www.homedepot.com/"&gt;Home Depot&lt;/a&gt;, &lt;a href="http://www.wholefoodsmarket.com/"&gt;Whole Foods&lt;/a&gt;, AT&amp;amp;T or Circuit City (RIP) for example, who clearly have resources and tools to perform and exploit top-notch business intelligence but still manage to provide mediocre service or product at best consistently. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;At Home Depot, they can’t (or won't) keep track of inventory correctly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I was once told it was because there was too much theft (both internal and external) to update the databases frequently enough!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Consequently, they can’t tell you if they have some items in the store or not. Not all items, just some of them.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And when they can, they’re unable to locate them physically inside the store (as in what aisle and section).&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;That’s shocking to me given what we constantly hear about RFID and data warehousing investments at these large box places.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But Lowes does a much better job of this so clearly, it’s not a technology issue.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;At Whole Foods (at least the one in Irvine where I live) the quality and quantity of their product is inconsistent at best.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Some days the self-serve fish is fresh, and sometimes it is not (and has been sitting out for too long looking like a nice fat food poisoning lawsuit waiting to happen).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Like the proverbial “box of chocolates”, you never know what you’re going to get.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Similarly, their checkouts are never balanced.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You’ll see huge lines at several of them while employees sit idle at empty others.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Every time I go there some happy-go-lucky line manager with a bright idea of the week makes it harder for me to shop and enjoy it there which is why I never set foot in the place any more (instead, I go to an even more expensive supermarket).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Apparently no one is keeping track of this.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Least of all the department managers who seem to frequently act on impulse in trial-and-error fashion.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yet surely Whole Foods has an uber-BI stack running somewhere in Austin giving them a “big picture” on a store by store basis right?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;You’d think someone would be paying attention (or maybe even using it)?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But clearly they are not, or someone would be fixing these problems (or at least genuinely addressing customer complaints, which they won’t because it's "inconvenient" for them, as they like to put it).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I don’t think it’s so much about the difficulty of implementing and leveraging BI as the article suggests.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I think it’s about genuine laziness on the part of upper management.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because, at the end of the day, if you’re the CEO of a place like this, you need to get your highly compensated butt out on the floor to truly see, taste and smell what’s going on.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You need to talk to your employees, your customers, and get in their shoes (incognito if possible).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m always shocked to hear C-level people lamenting the fact that their CRM system isn’t giving them enough visibility into their customers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or better yet, they need to “understand the customer” better. Who’s kidding who?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Fact is they simply don’t give a hoot most of the time.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;And if they’re too lazy or too self-important to do that, they’re not likely to pay much attention to BI tools and warehouses either no matter how fancy or ubiquitous the software might be.  That's really just the nature of what "service" has become in the US lately.  In order to improve BI usage, we will have to improve the quality of Management first and put real folks back in charge.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-2500706782556553021?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/2500706782556553021/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/bits-pieces-summer-posting.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2500706782556553021'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2500706782556553021'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/08/bits-pieces-summer-posting.html' title='Bits &amp; Pieces Summer Posting'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6257023181650489749</id><published>2009-07-28T15:50:00.000-07:00</published><updated>2009-07-28T16:06:57.966-07:00</updated><title type='text'>How I Learned to Love Mondrian (confessions of a WISA guy)</title><content type='html'>&lt;p class="MsoNormal"&gt;I’ve been playing with Pentaho’s &lt;a href="http://mondrian.pentaho.com"&gt;Mondrian &lt;/a&gt;for almost a year now on and off.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I have to say, everytime I mess with that stack I am more and more impressed by its richness and capabilities.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And twelve months ago, when I started learning it, I was what you could call “severely LAMP-challenged”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ve sure made a lot of progress since then and wanted to talk about this as I figured it might help other Microsofties out there needing (or wanting) to put a toe in these mysterious LAMP/OSS waters.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The first thing I ever did with Mondrian was figure out how to install it on a Windows platform.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The reason I did was twofold.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;First, we didn’t have appropriate Linux hardware/software in house at the time, and second, I have way more experience on Windows so it’s a lot easier for me, and third, I wanted to do it locally and avoid dealing with cross-platform bridging at the moment (our ODBC drivers are Windows only as well).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Path of least resistance is an engineering mantra in my book.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Lucky for me I had worked with Java and Apache Tomcat in the dot-com days so I had no trouble pulling and installing the JRE/JDK and the web server itself&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;(which comes as a Windows service).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Next, I deployed the Mondrian WAR file into the Tomcat webapps folder which caused it to be automatically “deployed” as a web application.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Way easier than deploying ASP.NET applications (but you didn’t hear this from me &lt;g&gt;).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Then, I fired up the Mondrian landing page, clicked on the Jpivot link and, of course, kaboom.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes, without a JDBC driver, Mondrian is not a happy camper. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;It took me a little longer to figure out the Sun JDBC-ODBC bridge and how to plug corresponding connection string it into numerous Mondrian files to replace the default connections there (which are all for MySQL if I recall). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;The Mondrian documentation isn’t great but if you Google long enough you can usually find some other poor slob with a similar problem and, with luck, published solutions online.&lt;span style="mso-spacerun:yes"&gt;  [&lt;/span&gt;Side note: I once had the nerve to email Julian Hyde, their Chief Architect, about some technical question.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He abruptly suggested I don’t bother him and use the “community” forums instead]. Unfortunately those forums are often useless for non-enterprise (read: non-paying) users.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There are flurries of unanswered questions and problems up there.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is a generic OSS problem I suppose. You get what you pay for &lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So finally I had my bridge setup, along with a DSN called MondrianFoodMart (the default) pointing at the default Access database (distributed with Mondrian).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And now I was able to fire up Mondrian on top of the database and do a couple drills, run a couple MDX queries.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Bliss.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Next, I took that Access database and exported it to CSV format with corresponding DDL.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is the way to feed the XSPRADA engine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Fired up our RDM/x server and ran the DDL scripts.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then re-defined the MondrianFoodMart DSN to point to us via our ODBC driver (32-bit only, 64-bit won't fly with the bridge - painful lessons learned...).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Reloaded the Mondrian page, and voila! Mondrian was now talking to RDM/x and displaying the Sales cube.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;One point of the exercise was being able to show Mondrian OLAP on top of our database. Another was being able to show our database’s behavior in time as more and more queries come in (hint: it gets faster).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Now, with Mondrian, this is a little tricky because the platform is heavily cache-based.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Mondrian shoots initial queries at a relational system and proceeds to cache heavily as it aggregates results.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So the more you use it, the more it caches.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Obviously the reason for this is that Mondrian is designed to run on top of relational databases, and not “OLAP-intelligent” engines such as ours.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It has to translate the MDX into straight SQL queries every time, as it fills its caches initially.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nevertheless it does re-hit the database as needed when you start slicing and dicing on new dimensions or facts, as one would expect.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So you can actually see our engine’s “dinosaur tail” behavior as I once described in a &lt;a href="http://u.nu/9wxn"&gt;previous post&lt;/a&gt;.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family: Arial;color:black"&gt;Now, the MondrianFoodMart database is fine for setting up the stack, but it’s not particularly interesting in so far as data volume goes if you’re in the VLDB space.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;More recently, I attempted to setup the TPC-H/SSB sample data under Mondrian, meaning I tried to manually create the XML defining some SSB cube.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is a whole fairly complex XML language Mondrian uses to define and connect fact tables (measures) with associated dimensions.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;They have a UX driven tool called Workbench but I could never get it to work on my system (and didn’t have enough time to keep messing with it).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With online help and using the FoodMart.xml sample file, I was able to get a basic cube up in about a day.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nothing fancy, but now I can OLAP into arbitrarily large data sets and that’s a good thing.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family: Arial;color:black"&gt;As cool as it is seeing our stuff run under Mondrian, I always dreamed of doing the same thing under Excel (as in 75% market share, yeah I want to support that please). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Until recently, I thought this would not be possible until we implemented MDX in the engine but then I saw the light.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It is called the &lt;a href="http://www.simba.com/odbo-to-xmla.htm"&gt;SimbaO2X&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family:Arial; color:black"&gt;connector and it rocks!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family: Arial;color:black"&gt;&lt;b&gt;[Start Commercial]&lt;/b&gt; Did I mention how much I love this company Simba? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;They pretty much wrote the book on data connectivity.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Within a day they had me a 30-day trial version of their O2X offering, no questions asked.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And follow-up to boot.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Their stuff works, and they know how to take care of people.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;What a concept! &lt;b&gt;[End C&lt;/b&gt;&lt;b&gt;ommercial]&lt;/b&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family: Arial;color:black"&gt;This SimbaO2X puppy lets ODBO clients (say like Excel) talk to XML/A OLAP servers (say like Mondrian).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Note, there is a similar offering from Pentaho called Pentaho Spreadsheet Services. It carries a small yearly license fee from what I understand.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Supposedly you can email Pentaho sales for additional information and a local contact.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I’m still waiting for their reply.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Hey it’s OSS…Did I mention you get what you pay for? &lt;g&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-font-family: Arial;color:black"&gt;Either way, the relevant fact is that the SimbaO2X connector works without a hitch.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I am finally able to create and manage pivot tables from Excel, talking to Mondrian (via XML/A), talking to RDM/x (via ODBC)!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is the bomb!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I need to really get a deeper understanding of MDX capabilities now.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But the more I learn about it the more impressed I get, and the better demos I can do.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6257023181650489749?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6257023181650489749/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/how-i-learned-to-love-mondrian.html#comment-form' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6257023181650489749'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6257023181650489749'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/how-i-learned-to-love-mondrian.html' title='How I Learned to Love Mondrian (confessions of a WISA guy)'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-2238813884323997079</id><published>2009-07-28T11:30:00.000-07:00</published><updated>2009-07-28T11:38:21.325-07:00</updated><title type='text'>The Canary in the Gold Mine?</title><content type='html'>&lt;p class="MsoNormal"&gt;I’ve been claiming for a while that data mining and predictive analytics (PA) were the new hills to conquer in BI and this morning the news came out that IBM had &lt;a href="http://tinyurl.com/nt4qkv"&gt;plopped down big money for SPSS&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;IBM is also investing R&amp;amp;D dollars in ways to &lt;a href="http://u.nu/79vn"&gt;manipulate data directly&lt;/a&gt; while encrypted and/or compressed.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This particular research fascinates me because I believe it will be key to SaaS acceptance, where security is still a significant push-back for obvious reasons.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means analytics might actually have a future on the cloud.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And this is important IMHO because this allows for significant progress in the UX systems required to use (drive) mining engines efficiently.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The kind of improvements that cannot be generated and deployed quickly enough with fat client implementations.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m thinking of really interesting things like &lt;a href="http://www.spezify.com/"&gt;www.spezify.com&lt;/a&gt; for example.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Another interesting trend is pushing analytical capabilities deep into the database engine either via stored procedures or user-defined functions in one or more programming languages (much like .NET inside SQL Server, for example).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All this leads me to believe that insightful BI players have been turning their guns on solving the next big pain point of BI which is, IMHO, data mining and predictive analytics.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="color:black"&gt;This embedded capability relates to the deep&lt;/span&gt;&lt;/span&gt; kind of analytics I once &lt;a href="http://jeromepineau.blogspot.com/2009/05/mad-about-you.html"&gt;blogged &lt;/a&gt;about in the context of Greenplum’s MAD paper.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So does this mean we’re all done with OLAP?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not likely, but I think a certain peak has been reached where OLAP has become “bearable”. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t really have a 3-5 year “future outlook” on OLAP at this point.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Is it still hard to cube and do MDX? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Yes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Is it still a pain in the behind to setup large SSAS analytics? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;You bet.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Is setting up a production version of Pentaho’s Mondrian ROLAP for the faint of heart?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not exactly. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But there are now multiple alternatives out there in both hardware (faster COTS components, FPGAs, GPUs, MPP) and software (columnar, ALGEBRAIX) realms.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Our own ADBMS at XSPRADA is designed and tuned specifically for OLAP workloads in its present form.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Product such as ours have helped “commoditize” OLAP work by shifting design and pre-structuring efforts (cubing, slicing and dicing) from the user (DBA) to the software itself.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is done automatically and based on queries coming in.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is no need to configure cubes, mixed workloads are supported, and all the user really has to do is ask questions.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s that simple really.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Let the software worry about the darn cubes!&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So I guess my point is, if there are people still struggling (read: losing time and money) with OLAP in the enterprise, I have to say it’s because they’re either poorly advised or simply not opening their eyes to new tools and techniques currently available.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At this point OLAP pain is no longer a necessity. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;It’s an uneducated choice. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;From a technical standpoint, it has been addressed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Let’s move on to the next problem please.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is why I think the industry is poised to tackle another challenge now, namely data mining and predictive analytics.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even Curt Monash in a recent &lt;a href="http://u.nu/56wn"&gt;blog&lt;/a&gt; about the SPSS acquisition writes:&lt;/p&gt;  &lt;p class="MsoNormal"&gt;“&lt;span class="apple-style-span"&gt;&lt;span style="color:black"&gt;So far business intelligence/predictive analytics integration has been pretty minor, because nobody’s figured out how to do it right, but some day that will change. Hmm — I feel another “Future of … ” post coming on”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="color:black"&gt;Sorry Curt, I beat you to it &lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; color:black;mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Mining is a totally different segment of the business intelligence endeavor.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When you do OLAP, you’re asking “tell me what happened and why”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When you do mining, you have no clue what happened and much less why.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In mining you’re asking “tell me what I should be looking at” or “tell me what’s interesting in this data?”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And predictively, you’re asking “tell me what’s likely to happen” – as in, show me the crystal ball.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Mining is not a pre-structured, pre-indexed kind of “cubing” world.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s an ad-hoc discovery process.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s iterative.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Much like the way a human brain functions when discovering information, and trying to make sense of it. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This “human-like” behavior is actually one of QlikView’s usability pitches.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In mining, the relational model is a hindrance, not an asset, because relationships are not necessarily canned or static.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Predictive analytics are more of an art than a science as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These concepts don’t fit nicely in pre-structured, tabulated formats.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Additionally, mining and PA are creative endeavors (whereas OLAP is not).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is why it’s important to let users define their own “stuff” so they can trial-and-error through the problem.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Conventional database engines don’t support this type of workload elegantly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s simply not “structured” nicely like OLTP or OLAP.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can’t easily (or cost-effectively) try, erase and re-start with conventional engines.  They're not forgiving.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So what’s needed are systems that can first intelligently process data upstream in ELT mode because acquiring statistic on incoming data (at varying rates) is an important step for analytics.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;XSPRADA’s engine starts analyzing data statistically upon initial presentation.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;More importantly, it keeps doing so automatically in real time, and continuously via comprehensive optimization.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is a unique feature that causes the system to continuously re-evaluate system resources against queries and data to seek out additional or more effective optimizations.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Next, you need systems that can tell you where NOT to look.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because in this type of work, pertinent data is often clustered in very specific areas (as in 5% of 100TB perhaps).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And user questions tend to hit within small percentages of those clusters.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes there are always exceptions, but generally-speaking, that’s what happens.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So what you DON’T want are systems that spend a lot of time scanning boatloads of data (needle in the haystack).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;What you need is intelligent software that can quickly eliminate vast areas of informational “no-man’s land” based on incoming queries.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In such a problem space, throwing additional monies at ever more powerful metal is a self-defeating approach.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s the software stupid! &lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font: minor-latin;mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;As it turns out, XSPRADA’s ALGEBRAIX technology is very good at eliminating "useless" (read: at a given time) data spaces.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not only that, but it also shines at inferring subtle relationships between different entities.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The kind of relationships a human wouldn’t even think of asking on her own.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s also very good at recognizing patterns (both in queries and targeted result sets).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In a way, you would expect that a system built on pure mathematical foundation would be particularly well suited to data mining workloads. And it sure is.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is the beauty of having a “wide” and rich enough technology that is as easily and readily applicable to a multitude of different BI problems.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It means you don’t need to re-invent the wheel or re-architect your system every time a new problem space opens up.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And that, in the business intelligence technology world is a rare find indeed.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-2238813884323997079?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/2238813884323997079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/canary-in-gold-mine.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2238813884323997079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2238813884323997079'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/canary-in-gold-mine.html' title='The Canary in the Gold Mine?'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6690996964454512909</id><published>2009-07-27T08:10:00.000-07:00</published><updated>2009-07-27T08:17:16.487-07:00</updated><title type='text'>Who leaves a country packed with ponies to come to a non-pony country?</title><content type='html'>&lt;p class="MsoNormal"&gt;Usually I think of myself as a “hot-shot” when it comes to XSPRADA technology and its applications.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is because I’ve been involved with it for ten years, and that kind of history builds bonds.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In a word, having lived and breathed it for so long, I’m severely biased, but at least, I’m aware of it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s a completely different story when you talk to a user who also happens to be biased from experience running the stuff to solve real problems.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s when you hear praise that makes you step back and go “wow, we really do shine here above and beyond”.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;There is nothing sweeter than an adamant customer evangelist.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;One such person we shall call Tim (why not, since it's his real name).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He works for a major DOD contractor out here in California.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Tim asked me to withhold his company’s name for obvious reasons.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He’s been running POC projects using XSPRADA technology for years.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As a matter of fact, Tim once ran a real-time CEP version of our engine (which can handle both real time and historical input) for a demo bid project he needed to put together where no other vendor could come close.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Tim is the ultimate engineer’s engineer and one of the smartest folks in the “information” field I’ve ever met.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He’s got experience galore and has been around the block a few times.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Currently there are other groups in Tim’s shop running the XSPRADA engine for other purposes, and he keeps abreast of those POCs as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I could tell you what they entail but then I’d have to kill you &lt;g&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Suffice to say that the engineering being done there would blow most people’s minds (as in, holy cow, we're actually doing this?!?).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;It turns out Tim is biased as well, but he’s biased from a user perspective.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is "been-there-done-that" advocacy.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And that, in my book, is far more compelling than any argument coming from an insider like myself.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Although I don’t know too many people who can explain and position the technology as well as I can (I’m so modest, no pictures please! &lt;g&gt;), Tim had the following analysis recently and I thought it was so “perfect” I had to reproduce it here (bolding my own):&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;“One of the distinctions I’ve been using lately to explain the difference between &lt;b style="mso-bidi-font-weight:normal"&gt;set based data processing&lt;/b&gt; and most everything else (row based, column based, partition based,…) is that most other DBs are based on defining somewhat &lt;b style="mso-bidi-font-weight:normal"&gt;arbitrary bins of data of fixed size or dimension&lt;/b&gt; (tables with fixed columns in RDBMs, column collections for things like Vertica ,et al, and  chunks in Google’s BigTable designs, to name a few). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Then there is significant &lt;b style="mso-bidi-font-weight: normal"&gt;overhead&lt;/b&gt; to partition the incoming/outgoing data to fit into these fixed containers. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Inevitably, any operation on these artificial partitions will include &lt;b style="mso-bidi-font-weight: normal"&gt;wasted&lt;/b&gt; processing or I/O on irrelevant data that just “happens” to live in the affected partitions. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This is a huge &lt;b style="mso-bidi-font-weight:normal"&gt;waste of time and resources&lt;/b&gt;. In addition, these bins are continually reused by means of destructive updates which require them to be locked during transactions to avoid data corruptions. This is the other main source of &lt;b style="mso-bidi-font-weight:normal"&gt;waste&lt;/b&gt; in that significant delays are now imposed not only on the relevant data involved in the operation, but also on &lt;b style="mso-bidi-font-weight:normal"&gt;collateral&lt;/b&gt; data that might be holding up other operations unnecessarily. These two effects are mutually opposed:  larger partitions would help the I/O problem, but at the expense of exacerbating the locking problem. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And vice-versa. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;By contrast set based data systems, like XSP, use &lt;b style="mso-bidi-font-weight:normal"&gt;completely variable sized containers&lt;/b&gt; (the sets) dynamically partitioned &lt;b style="mso-bidi-font-weight:normal"&gt;based on operational relevance&lt;/b&gt; and not on any predetermined partition sizes. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In this way, the amount of &lt;b style="mso-bidi-font-weight:normal"&gt;irrelevant&lt;/b&gt; data moved across the I/O boundary for any data operation is significantly reduced. And because these sets are &lt;b style="mso-bidi-font-weight:normal"&gt;immutable&lt;/b&gt;, there is no locking interference with other concurrent data operations.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;To put it in Southern California speak, I was like, wow, this dude really gets it!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And there isn’t much I can add to Tim’s conclusions.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I bow to the completeness of his understanding; his analysis stands on its own.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The context of our exchange was about pre-structured or “canned” mechanisms used by other database engines versus the flexibility of our approach.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s what I naively call “bucketizing”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And this topic is of course related to the ADR functionality I was discussing in my last post.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But it also pertains to the “schema agnosticism”, parallelism, and ACID aspects of the database I’ve mentioned in the past.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The XSPRADA engine dynamically adapts to queries and data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It avoids the inherent rigidity prevalent in all other technologies.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes it’s compelling to handle analytics via columns, but that’s only one of many ways you can address the problem.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And if your technology is “columnar” in nature, it’s the ONLY way you can address the problem.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You’ve put all your eggs in one basket.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At the end of the day, you’re still looking at the world in a tabular format (which happens to be vertical).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So it’s what I call the one-trick pony approach.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;There’s nothing wrong with one-trick ponies if you need to solve a very specific business problem quickly and efficiently.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But from a holistic business perspective, it’s a scary proposition.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you’re running an enterprise, you want flexibility. You want the ability to address problems as they come up using all available means at your disposal.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You’re looking for a wide array of tools and methods to win battles, not a single weapon system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And this is what XSPRADA technology offers: “the ability to apply the right &lt;b style="mso-bidi-font-weight:normal"&gt;technique&lt;/b&gt; for any &lt;b style="mso-bidi-font-weight:normal"&gt;question&lt;/b&gt; for any &lt;b style="mso-bidi-font-weight:normal"&gt;data&lt;/b&gt; at any &lt;b style="mso-bidi-font-weight: normal"&gt;time&lt;/b&gt;”.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;If Tim is concerned with waste and inefficiency, it’s because his shop deals with tera and petabyte volumes of data with limited shelf-life.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In other words, waste is not an option.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This isn’t about airline transactions messing up your trip or bank accounts being debited incorrectly by the way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is about national security and people living or dying in real-life tactical situations.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In applications like this, one-trick ponies don’t cut the mustard.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And this is why Tim and several other groups in his company have been looking at XSPRADA technology for years.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There simply isn’t anything out there that can meet their requirements, and believe me they’ve tried all the usual suspects.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To Tim and his colleagues, the unfair competitive advantage XSPRADA can deliver to their company (and clients) is well worth the risk of evaluating technology that’s a little out of the ordinary.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6690996964454512909?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6690996964454512909/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/who-leaves-country-packed-with-ponies.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6690996964454512909'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6690996964454512909'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/who-leaves-country-packed-with-ponies.html' title='Who leaves a country packed with ponies to come to a non-pony country?'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-7394009923060086770</id><published>2009-07-17T12:18:00.000-07:00</published><updated>2009-07-17T12:34:57.978-07:00</updated><title type='text'>ADR and how I got kicked out by Kickfire</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;I had the opportunity to brief &lt;a href="http://mervadrian.wordpress.com/"&gt;Merv Adrian&lt;/a&gt; (major BI industry analyst) recently about XSPRADA and in discussing competitive technological differentiation, I highlighted the three major points that make XSPRADA uniquely stand out in a sea of “new-breed” ADBMS vendors, namely our algebraic engine (aka ALGEBRAIX), adaptive data restructuring (ADR), and temporal invariance. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Those of you who have honored me with their readership in the past will no doubt be familiar with those terms and concepts. (side note: Merv is the one who coined this ADBMS term that I shamelessly borrow constantly now -- thank you Merv!)&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I want to focus a little bit on ADR in this post because in the midst of our discussion, Merv asked me a really good question about it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He said “if you’re busy doing ADR, and more and more queries come in and more and more users get on board, what will the impact be on performance?”.  Excellent point!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Quite honestly, no one had ever asked me this before so after our call, I did a little more research and came up with a few more questions on my own.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All of which I’d like to discuss here.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;First of all, to recap, ADR is the process of adaptively restructuring data both logically (say the structures in RAM for example) and on storage (disk) based on the nature of queries coming in.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;What does this mean?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Well for example if a query is clearly pulling only certain columns from a table (as is typically the case in OLAP), ADR will pull these columns out and optimally lay them out on disk for more efficient access.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;ADR could also include indexing (as in bitmap, in low-cardinality cases) and any other means at its disposal to optimize questions pertaining to these columns.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Sound familiar?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It should as this is basically the principle behind columnar systems.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;However ADR doesn’t stop there.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;It may, for example, decide that sharding row blocks is a more efficient strategy given a particular query pattern and set out to do just that as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It may decide that duplicating certain pieces of information on given disks is more I/O efficient.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In memory, it may decide to implement different indexing schemes depending on the nature of the queries.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In short, ADR has absolute “carte blanche” to take every means at its disposal to optimize the system in real time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Unlike most other ADBMS out there, the XSPRADA engine is a living breathing entity constantly striving for optimal performance. And it has more than a few tools at its disposal (in other words, not a one-trick pony, compliments of the underlying mathematics) -- &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But so the question is indeed legitimate: how does this impact performance if at all?&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The answer, unsurprisingly, is it depends.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;First, it’s important to realize the design principle behind ADR. It is called “&lt;a href="http://en.wikipedia.org/wiki/Crowdsourcing"&gt;crowdsourcing&lt;/a&gt;”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The philosophy is that the more people hit the database from all angles , the better chance there is of being able to optimize the database.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;From a technical perspective, overload is always a possibility, as with any other system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If too much is submitted at a given time, the system could theoretically run out of resources and impact performance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But when properly configured (balanced) with proper amounts of storage, the system should not degrade with additional users and queries coming in.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Note that concurrent queries are also beneficial to the system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is because data streams can be shared between query processes in parallel, thereby reducing disk I/O.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Additionally, ADR is not user or query-specific. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So there is no risk of one ADR action resulting from Query1 from “clobbering” another ADR action from Query2.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When sharing is possible, it is implemented and maintained in the mathematical model (algebraic integrity).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But all queries from all users enter the “algebraic space” together in one big pool.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means everyone’s actions benefit everyone else. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;The XSPRADA engine is a very populist one &lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings;"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings;"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;On a completely different topic, I wanted to relate a funny incident that happened to me on the way to the Forum recently.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Well okay, it wasn’t exactly a forum per say but rather the newly minted Kickfire on-demand trial process.&lt;span style="mso-spacerun:yes"&gt;  They call it &lt;/span&gt;&lt;a href="http://u.nu/99pk"&gt;Cloud-based Trial&lt;/a&gt;&lt;span style="mso-spacerun:yes"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;I enthusiastically signed up for the trial and got an assigned time-slice for today.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Within minutes, I get the following email from Kickfire’s Karl Van den Bergh, who is Kickfire’s Vice President of Marketing and Business Development (phew! Talk about long names to match long titles).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;“Thanks for your interest in Kickfire. Our trialing system is reserved for prospects and partners. As our companies are somewhat competitive we are not able to give access to XSPRADA at this time.” &lt;/p&gt;&lt;p class="MsoNormal"&gt;Wow. Cold man.&lt;/p&gt;&lt;p class="MsoNormal"&gt;But next morning, I get this email from the company’s trial support team:&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;“Congratulations! This email confirms that your Kickfire On-Demand Trial will begin at 10:00am (PST) on 07/17/2009.” &lt;/p&gt;&lt;p class="MsoNormal"&gt;Then thirty minutes later I get this: &lt;/p&gt;&lt;p class="MsoNormal"&gt;“jerome, Your reservation has been deleted. Reservation #sc14a5eb4e82d0dc.” &lt;/p&gt;&lt;p class="MsoNormal"&gt;By that time I’m thinking okay, somebody at Kickfire Trial Support finally got their behinds kicked (pun intended) for daring to confirm my trial session. Apologizing for this confusion is Karl again: &lt;/p&gt;&lt;p class="MsoNormal"&gt;"Apologies for the registration mix up this morning - we have a number of system administrators who weren't in synch."&lt;/p&gt;&lt;p class="MsoNormal"&gt;Yeah, I’d say.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But wait, there’s more if you order now!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Early this morning, after my session was supposed to end, I get a phone call from Kickfire asking me how my trial went!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I couldn’t help but blurt out “you guys are really confused”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then I explained that Karl had branded me persona-non-grata at which point I was promptly dropped like a bad case of H1N1.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Now, don’t get me wrong.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I love this Kickfire appliance concept.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And I have a positive informative relationship with technical folks there who are nothing if not extremely competent (and nice people to boot).&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;And although I cannot play with the Kickfire appliance either locally or remotely, from what I’ve heard and read, these guys have a top notch team, great technology, a seasoned CEO and attractive market positioning. &lt;/p&gt;&lt;p class="MsoNormal"&gt;In thinking about this, my initial reaction was that I was stupidly naïve.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Honestly, it never occurred to me that Kickfire would ban any “competitor” (or anyone else for that matter) from remotely playing with their software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The reason is because we don’t do this at XSPRADA.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As a matter of fact, anyone can pull our bits for free from our website, including Kickfire (which already has). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;So to me this has a strange, secretive, “we have something to hide” feel which, given what I know about Kickfire, really took me by surprise. &lt;/p&gt;&lt;p class="MsoNormal"&gt;But it's true I have this weird concept about openness that many other vendors share, and I’m happy to put our stuff in competitive hands (after all, the customers certainly will!) anytime and get any feedback from the experience, negative or not.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my experience, few competitors will go out and trash another vendor’s offering, much less try to reverse-engineer it (at least not in this industry) -- Maybe I’m foolish.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;However, it’s true I don’t have a “marketing” bone in my body. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As an engineer and evangelist, I’ve always been adamant about transparency, peer review and feedback.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So maybe it’s a good think I don’t handle XSPRADA Marketing, or Kickfire’s for that matter! &lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings;"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings;"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-7394009923060086770?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/7394009923060086770/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/adr-and-how-i-got-kicked-out-by.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7394009923060086770'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7394009923060086770'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/adr-and-how-i-got-kicked-out-by.html' title='ADR and how I got kicked out by Kickfire'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-5429045998122019013</id><published>2009-07-02T14:30:00.000-07:00</published><updated>2009-07-02T14:37:04.922-07:00</updated><title type='text'>Of Views, Cubes, Patents and Haystacks</title><content type='html'>&lt;p class="MsoNormal"&gt;In keeping with a past commitments to do so, I plan on staying purely technical in this posting. The first topic I want to bring up relates to database views.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The second one deals with OLAP “cubing”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I want to discuss those in the context of the XSPRADA analytical engine RDM/x.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Many conventional database engines support view and materialized views and I believe Oracle actually came up with the concept (I am not 100% sure of that) but numerous other vendors support them including Postgres, MySQL, DB2, SQL Server, etc.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In either case, the idea is simple.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A view is basically a logical rendition of a SQL query.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;It looks and behaves like a table.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Views can be logical or materialized.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A logical view is really just a “pointer” to the actual data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A materialized view actually contains data and is implemented as a full-fledged database object.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There are several reasons to use views.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;One is performance enhancement and convenience (caching a nasty JOIN or OLAP summaries dynamically, meaning original data changes trigger recalculations). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;One is abstraction. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Another is security and access control (you can grant specific privileges to views while keeping underlying data protected from users).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;The important thing to remember is that views were conceived in response to a problem, namely that database performance can really suck without them (especially in warehousing applications).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;If you didn’t have inherent performance issues in conventional database engines, you wouldn’t need views to begin with.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And herein lies the reason why the XSPRADA engine doesn’t support classical views: it simply doesn’t need them.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The engine caches and materializes query results dynamically on the fly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As such, “materialized views” are generated on an as-needed basis.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But this is true for everything else going on inside the XSPRADA engine because the adaptive data restructuring (ADR) feature dictates that all forms of optimizations be materialized for optimal logical and I/O performance on the fly as queries come in. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Who determines when this happens?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The optimizer and the algebraic engine (now officially known as XSPRADA ALGEBRAIX &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;under US Patent 11/383,477).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If and when the system detects more than one query involving a particular set of joins, it materializes (and likely caches) the results for subsequent use, assuming that a request pattern has emerged. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Similarly, it is always possible to “store” a complex query into a named result set using the INTO extension.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;For example, you can write something like:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Select p.product_name, s.store_city, sum(f.store_sales) tots from sales_facts f&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Join product p on p.product_id = f.product_id &lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Join store s on s.store_id&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;= f.store_id &lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Group by s.store_city, p.product_name&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Order by tots desc INTO ds;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;At that point, the ‘ds’ table will contain the results and can be used independently and updated at will.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Currently, the XSPRADA engine will not automatically update ‘ds’ should any change occur to any of the underlying tables. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Regardless, ‘ds’ now becomes a regular RDM/x table like any other.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is no distinction between views and tables in RDM/x.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And even though views are defined in RDM/x, this is done for sheer syntactic convenience, and using them yields no technical advantage.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;On the OLAP front, the system behaves in a similar way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is why I always say, with RDM/x you don’t need to pre-structure or “cube” your data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The engine does that automatically internally for the user based on his/her query patterns!&lt;/p&gt;  &lt;p class="MsoNormal"&gt;When you feed RDM/x a query like this:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;SELECT s.store_city, sum(f.total_sales) from store s, sales_fact f&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Where s.store_id = f.store_id&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;GROUP BY s.store_city&lt;/p&gt;  &lt;p class="MsoNormal"&gt;RDM/x automatically starts building a “cube” for it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;What does that mean?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The system creates a set of mathematical expressions relating the detail data to the aggregates you’ve just requested.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In the process, instances of the data in intermediate and final form are realized.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;This city-based slice is “remembered” by the system and any subsequent query against that slice or variation thereof will benefit from immense optimization (and performance increase).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is how subsequent queries just get faster and faster in the XSPRADA system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And if you slice along another dimension next, say products, for example, the same optimization and ADR is performed for that slice.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you ask enough questions involving enough dimensions, you get closer and closer to instantaneous response time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Adding a dimension may cause a slight performance dip on the first shot, but the penalty is nowhere near as painful as having to add another slice or recompute aggregates (cells) in a conventional “manual cubing” system (say like Mondrian or SSAS to name a few).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;People have asked (namely Chris Webb, an MDX guru who runs an excellent &lt;a href="http://u.nu/5wig"&gt;blog&lt;/a&gt;) if there is some sort of internal structure that makes the query run faster over time.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;The answer is that the re-structuring of the data, combined with the algebraic system and the methods for accessing the data internally (in memory and storage) are what makes these queries get faster with time.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Another excellent question was “can you edit these structures?” &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And the answer is no, as the system is entirely automated and requires no user intervention.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Importantly, DML operations (INSERT, UPDATE, DELETE) on the original detail data cause automatic updates of the aggregated data, as long as the aggregates continue to be expressed in terms of the original data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Simply put, the system maintains internal integrity automatically unless, of course, you change the original query.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;MDX and dynamic “view” updates are features likely to be implemented by RDM/x in the near future.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Without MDX, you lack the convenience of being able to talk to RDM/x directly from clients such as Excel (or SAP for that matter, who has apparently adopted MDX now), although you can clearly question RDM/x via Excel using OLAP queries.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;MDX is obviously more powerful and convenient.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This doesn’t take away from the internal “magic” occurring inside RDM/x when handling OLAP workloads.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;And I have just a few more lines to make one last point.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Technically speaking, RDM/x is currently heavily biased in favor of OLAP workloads.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is not a consequence of the technology being used per say but rather reflects marketing priorities.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I sometimes meet people who try out our product to do search.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For example, they’ll try to grab a few select rows from 100,000,000 rows of data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is not where RDM/x currently shines!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I always tell them: if you want to do this, you’re better off using a conventional OLTP system like SQL Server or Oracle.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x is not about finding the proverbial needle in the haystack.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s about telling you what the haystack looks like. &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-5429045998122019013?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/5429045998122019013/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/of-views-cubes-patents-and-haystacks.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5429045998122019013'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5429045998122019013'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/07/of-views-cubes-patents-and-haystacks.html' title='Of Views, Cubes, Patents and Haystacks'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-2396021935094879919</id><published>2009-06-29T14:20:00.000-07:00</published><updated>2009-06-29T14:26:14.765-07:00</updated><title type='text'>ParAccel's Big Ad-Venture</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;I was just about to write a piece about the sad state of venture funding in the analytical database and BI space over the past years when BAM! &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Out of nowhere comes the announcement that ParAccel just scored $22M in a C-round investment.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;More importantly, as Merv Adrian &lt;a href="http://tinyurl.com/lsq7x4"&gt;points out&lt;/a&gt;,&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;“ParAccel’s previous investors participated as well.”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;This last bit of information is even more telling than their ability to raise new money in this zero-point-zero economy.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because if you look at past events surrounding Dataupia (and even Lucidera, in a way), it’s clear to me that there is a lot of “investor fatigue” in the BI industry at the moment.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my book, this is partly due to the fact that a lot, if not most, of these investors had no clue about the data management market or BI as a whole (but at the time, it sounded cool, and the guys next door were touting it so…) or how much it takes to actually build a full-fledged high-performance analytical engine from scratch (which is about 200,000 man-hours just on the engineering and maybe $20-60M if you’re lucky).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So of course when the going gets tough, these guys get jittery and bail out.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No surprise there.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;My prediction is that, by year end, there’s going to be a lot of bodies left “on the carpet” as we say in French (I think it's an old boxing term).&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;ParAccel’s new money, combined with their recent marketing coup involving a 30TB TPC-H benchmark, will surely help them survive these tough times provided they don’t squander the funds and from what I’ve seen so far, these guys tread lightly and wisely.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Not to say this makes the BI capital markets suddenly look better.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The speculation “out there” (meaning in the “connected” unofficial circles) &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;is that the average VC returns for the 2000-2009 decade (which won’t be reported before April 2010) will likely be close to zero or negative.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The spectacular returns achieved in 1999 will drop out.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Clearly VCs would not take kindly to this information being published and the NVCA has not made it public.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nevertheless if you follow VC performance sites like &lt;a href="http://www.pwcmoneytree.com"&gt;MoneyTree&lt;/a&gt; and&lt;span class="Apple-style-span"  style="font-family:Arial;"&gt; the &lt;a href="http://www.nvca.org"&gt;NVCA&lt;/a&gt; &lt;/span&gt;you can see the writing on the wall and it isn’t pretty.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There are a lot of angel and VC investors looking for a way out nowadays at any cost.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And to paraphrase Dr. Evil, when this happens, “people DIE!”&lt;/p&gt;  &lt;p class="MsoNormal"&gt;According to this &lt;a href="http://u.nu/6xpf"&gt;article&lt;/a&gt;, venture capital is getting depleted. It’s hard to raise serious cash these days, imagine that!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ll go one further and speculate that the venture capital system as we know it in the US may not be around another ten years.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At least not in its present form. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;By then, the major source of funding will likely be big government (European-style) as risk will become more and more demonized and “regulated” (and you can’t regulate risk by definition, or it isn’t risk!).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But what do I know? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I’m not a VC.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-2396021935094879919?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/2396021935094879919/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/paraccels-big-ad-venture.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2396021935094879919'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2396021935094879919'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/paraccels-big-ad-venture.html' title='ParAccel&apos;s Big Ad-Venture'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1387000723116664585</id><published>2009-06-29T13:51:00.000-07:00</published><updated>2009-06-29T13:57:15.232-07:00</updated><title type='text'>For full-service SEs, this product is for you.</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;I have not written specifically about my SE job before here at XSPRADA because I try to keep this blog fairly technical, or at least directly related to database and BI issues, but I want to make an exception this time. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Reason being I recently discovered a very interesting SaaS offering called &lt;a href="http://www.teamsupport.com"&gt;TeamSupport&lt;/a&gt; that makes my job easier.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And I figure, if it does that for me, chances are it can also do it for others in the profession.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I felt compelled to share the wealth.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;First, to put things in context, I discovered TeamSupport completely by chance interacting with their COO/VP Sales Eric Harrington on a LinkedIn group.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Eric offered to give a quick and dirty online demo and five minutes later, he produced.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That in itself was impressive to me.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;TeamSupport is a cross between an issue/bug management tracking system (not unlike JIRA, FogBugz, Team System or Bugzilla, all of which I have used in the past) and a CRM system (not unlike Salesforce.com or RightNow.com), although they don’t bill themselves as being CRM per say.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The TeamSupport pitch is about “bridging the gap” between customer service/support, product development, engineering and QA.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Given the nature of my work as full-service SE , this seemed like a pretty compelling tool to me.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Now, typical sales engineers in larger shops ride in tandem with Sales people (AEs) on most accounts and are mandated with “greasing the rails” of sales, as I like to put it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But I typically don’t go out with AEs (nothing personal, we just don’t have any) and always pretty much face clients and prospects (both technical and executive) on my own.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;By necessity, I have a very close and tight relationship with our engineering group, and can typically remember specific tracking issues by JIRA number.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Similarly, as I also handle a lot of the sales/support side of things, I’m often up on Salesforce.com managing and supporting accounts, and mining opportunities and what have you.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;As great as JIRA is and as useful as Salesforce can be, they don’t play nicely together out of the box for this purpose.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;TeamSupport integrates functionality from both sides of the house.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As it is on-demand software, I was able to create an account and log on in no time. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I just love the simple clean UX of this product.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;On the left pane menu are all the entities I can create, edit and manage such as issues, features, tasks, bugs, users, customers and products.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In the middle is the workspace corresponding to the selected menu item.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But make no mistake this software is rich, rich, rich.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So, for example, I use Features to enter feature requests from user and prospects’ wish lists.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If I click on Features, I immediately see my list by ticket number.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I can therefore track those with Engineering but also product management (as in, when can we accomplish this and should we?) and give customers feedback on progress (or at least estimates).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can associate Features with one or more customers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is useful when more than one customer makes a similar request (guess what, that’s common).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I use Tasks to keep track of what I need to handle or resolve on a daily basis. Those too can be associated with multiple customers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s cool.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can subscribe to tasks and get email notifications when they are modified by other users (TeamSupport is free for up to three users, by the way).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In the Bugs section, I can enter pertinent items from our internal JIRA tracking system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I can assign or link those to various corporate groups like Engineering, QA or Sales Engineering.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In the Knowledge section, I like to enter resolutions to past issues or problems.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These are likely to come up for other customers or prospects so it’s a great way to keep track of those for future reference.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In Customers, I entered all my current customers or prospects.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t differentiate on that.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Paying customer or evaluating customer, whether you’ve paid your money or just kicking the tires, you get the same high level of service from me. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Priority to existing customers, clearly, but same level of service&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In Products, I can enter all versions of existing product lines and link those with customers and prospects (as in who bought what or who is currently trying what version and which maintenance release).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can of course track issues and features by product.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;We do maintenance releases fairly often so this is a great way for me to track critical feature enhancements/bug fixes on a per-version level.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;If I click on Dashboard, I get an immediate 30,000 foot picture of where I’m at this point in time. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I can then drill into my tickets or customers at will.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I now have all I need on hand to help me cover everything from the most minute technical details to the most important pain point gleaned from my last interaction with a prospect.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;TeamSupport can also ingest our JIRA database.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All you have to do is export your database and they help you get it loaded.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is great on the ticketing side.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They are also working on a future API to do this automatically.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;On the CRM side, they are integrating with Salesforce as well, and I will be beta-testing that effort shortly to bring in accounts from that side into TeamSupport.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Last but not least, TeamSupport lets you setup support portals for each customer. This is important (and unique) for several reasons. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;First, it makes us look a lot more polished than just handling everything via email (or Twitter).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Second, it creates a history of interaction that can be mined and referenced at will (this in itself is very valuable business intelligence!).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Third it facilitates a push model of interaction with the customer because each side gets change notifications. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means I can respond in real time to questions and issues.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I like real time.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;As most great software, it’s sort of hard to do it justice in a quick write-up.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You kind of just “get it” as soon as you start using it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s elegant, simple, fast, and easy to use.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It just flows and it’s intuitive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I actually enjoy using the darn thing!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Go figure.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I forget what the exact subscription costs are but it’s dirt cheap (ahem, I mean cost-effective) considering the value provided out of the box.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I should conclude by saying that I am in no way connected to this company.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I had never heard of them before last month, and have zero affiliations with anyone there.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I can tell you they built a really valuable product if you’re in technical sales.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Did I mention their support is stellar?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Enough said: if you want to try this product out, shoot my friend Eric a quick &lt;a href="mailto:eharrington@teamsupport.com"&gt;email&lt;/a&gt; or twit him up at &lt;a href="http://www.twitter.com/teamsupport"&gt;TeamSupport&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Tell him I sent you &lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-1387000723116664585?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/1387000723116664585/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/for-full-service-ses-this-product-is.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1387000723116664585'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1387000723116664585'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/for-full-service-ses-this-product-is.html' title='For full-service SEs, this product is for you.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-890932945420116111</id><published>2009-06-25T23:07:00.000-07:00</published><updated>2009-06-25T23:21:08.481-07:00</updated><title type='text'>Pawn to King IV</title><content type='html'>&lt;p class="MsoNormal"&gt;I recently &lt;a href="http://www.legrandbi.com/2009/06/sap-businessobjects-bi-ondemand-jive/"&gt;read&lt;/a&gt; (sorry it's in French &lt;g&gt;) that SAP and &lt;a href="http://www.jivesoftware.com/"&gt;Jive&lt;/a&gt; had partnered up to offer BI in an Enterprise 2.0 context (aka enterprise social networking).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The same happened in France with Dassault Systems and &lt;a href="http://www.bluekiwi-software.com/"&gt;blueKiwi &lt;/a&gt;Software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ve also been following the new “BI for the masses” trend exemplified by offerings like &lt;a href="http://www.pushbi.com/"&gt;PushBI&lt;/a&gt; and &lt;a href="http://www.roambi.com/"&gt;RoamBI&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then several weeks ago I saw the Google Wave presentation at I/O and thought to myself, geez wouldn’t it be cool if people could share and collaborate using data wavelets under management from a BI Wave “bot” dedicated to a specific community of users. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/g&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I think BI is on its way to becoming commoditized.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I see it moving up from the technologists directly into the hands (and mobile devices) of the users. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Years from now, people will look back on monolithic enterprise data warehouses and their infrastructures and wonder “how could people ever have lived like this?”&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I think it’s really crucial for progressive companies in the BI space to ask themselves what their users will look like ten or twenty years down the line.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s obviously important to understand current users, but doing so is relatively easy.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Anticipating what today’s young people will purchase in 10-20 years as senior corporate buyers (and decision makers) is not so straightforward. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But as today’s “kids” are the people you’ll be selling corporate BI to in the future, doesn’t &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;it make sense to (1) analyze their mindset and (2) start reaching out to them now?&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Those of you who read this blog regularly know that I worship &lt;a href="http://www.guykawasaki.com/"&gt;Guy Kawasaki&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In his book&lt;a href="http://www.guykawasaki.com/books/"&gt; Reality Check&lt;/a&gt;, there’s a chapter called “Get a Clue: The Global Youth Market”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In it, he interviews Kathleen Gasperini, the cofounder of &lt;a href="http://labelnetworks.com/"&gt;Label Networks&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These folks analyze “global youth culture” for major corporations.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s a fascinating read (as is most of the book).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;You’re thinking great, what does selling Nike shoes or Levi Strauss jeans to kids have to do with pitching enterprise software to business intelligence users?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I say a lot.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because if you understand the behavior and expectations of upcoming generations of BI buyers, you’ll understand your future BI customer and gain unfair competitive advantage in the process. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Now, I don’t purport to understand current youth behavioral traits, but I do make the following subjective observations based on past experience.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Instant gratification&lt;/b&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Immediacy has become a birth right. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Expectations of “just-in-time” are prevalent in everything they do, purchase or share. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I don’t think these people are the kind who will sit around the office waiting six months for a $4M BI project and associated resources to get provisioned, staffed, configured, and maybe then approved.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These folks are going to want something up and running in days. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Anyone not operating in the same timeframes will be left in the dust.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Social consciousness&lt;/b&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Business is business, but “good” business wins points.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A “good” business does not exist exclusively for pecuniary purposes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Making meaning (as Guy K. would say), as opposed to just making a quick buck, will matter.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Being “green” will matter (not sure what that means but it’s a hot button). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I know this sounds awfully naïve, but young people are experiencing the ability to “make a difference in the world” (something past generations may have missed out on) and respecting those who strive to do so.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They’ll take that to the corporate world as well. Help them do so. They mean it.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;24/7 Connectivity&lt;/b&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Young people are constantly connected to the internet “matrix”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s a 24/7 world for them.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Like an addiction, this isn’t a habit you casually kick with age.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Accessing future BI buyers outside the realm of the “matrix” will be futile.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Accessing or supporting users during “regular business hours” will get you laughed out of the market. Not reaching out to or monitoring social networks will be foolish at best.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The thought of instant, always-on BI following users around on mobile devices 24/7 may make some people laugh, but this is how young people already live. There’s no reason they’ll leave this behind in the context of “the office” in years to come. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Do-it-yourself&lt;/b&gt; is the new mantra.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Self-empowerment is alive and well. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;As Bob Marley used to sing, “when one door is closed, don’t you know, another is open”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Unless you provide the tools and infrastructure to “DIY” you probably won’t get much traction (we see this happening now with open source already).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Empower users because they’ll be savvy and used to it.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The current economic turmoil and its endless revelations, combined with the technological advances that have shifted the way young people see, communicate with, and gauge the world around them, will translate into a new type of enterprise BI buyer in the coming decades. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;If I were running a BI company, I’d want to bring the “good word” into the schools now, and start cultivating my future clients in the dorms, in the research centers, in the classrooms and on their mobile devices.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I could be wrong on this, but my money is on the bet that the upcoming generation of BI buyers is an entirely different animal. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In this business, like chess, it pays to think several moves ahead.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-890932945420116111?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/890932945420116111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/pawn-to-king-iv.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/890932945420116111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/890932945420116111'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/pawn-to-king-iv.html' title='Pawn to King IV'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-8194015985318031688</id><published>2009-06-24T14:40:00.000-07:00</published><updated>2009-06-24T14:46:45.233-07:00</updated><title type='text'>Leave the Gun, Take the Cannoli</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;I grew up in New York City and spent enough time in Jersey back East to have witnessed a couple interesting brawls in my life (I even had neighbors who dug holes for a living or worked in waste management) but it’s been a while since I’ve seen anything like the recent scuffle among industry analysts and vendors regarding the recently published &lt;a href="http://www.paraccel.com"&gt;ParAccel &lt;/a&gt;TPC-H benchmark.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Maron!&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;It all started innocently enough two days ago when Merv Adrian, BI industry analyst emeritus, published the news in his blog &lt;/span&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;titled “&lt;a href="http://tinyurl.com/n2tp74"&gt;ParAccel Rocks the TPC-H – Will See Added Momentum&lt;/a&gt;”.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;Now, it’s not every day that a vendor publishes audited TPC-H benchmarks (“audited” being the key word, as the process runs around $100K from what I understand).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Very few companies besides the Big Three have the deep pockets and technology prowess to accomplish that.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Furthermore, ParAccel did its benchmark based on 30TB which isn’t exactly a small chunk of data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And so Merv made the point that, at the very least, the news should certainly help put ParAccel on the map. To quote him: &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;“This is a coup for ParAccel, whose timing turns out to be impeccable”.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;Immediately, this was picked up by no other than Curt Monash, BI analyst to the stars (and I say that quite seriously), who happens to despise the very concept of TPC benchmarks for reasons he clearly outlines in a recent post &lt;/span&gt;&lt;span style="font-family: &amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;entitled “&lt;a href="http://tinyurl.com/nrhtas"&gt;The TPC-H benchmark is a blight upon the industry&lt;/a&gt;”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To pull a couple of money-quotes from the site:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; mso-bidi-font-family:&amp;quot;Times New Roman&amp;quot;;mso-bidi-theme-font:minor-bidi; color:black;font-weight:normal;mso-bidi-font-weight:bold"&gt;“...the TPC-H is irrelevant to judging an analytic DBMS’ real world performance.&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“In my opinion, this independent yardstick [the TPC-H] is too warped to be worth the trouble of measuring with.&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:#29303B"&gt;“I was suggesting that buyers don’t pay the TPC-H much heed.&lt;/span&gt; (CAM)”&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;color:black"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“TPC-Hs waste hours of my time every year. I generally am scathing whenever they come up&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;Now, notwithstanding the TPC-H issues, I think Curt will concede that he doesn’t particularly appreciate or trust ParAccel as a company either as the following statements will show:&lt;strong&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;mso-bidi-font-family: &amp;quot;Times New Roman&amp;quot;;mso-bidi-theme-font:minor-bidi;color:black"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“I would not advise anybody to consider ParAccel’s product, for any use, except after&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-converted-space"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;color:black"&gt; &lt;/span&gt;&lt;/span&gt;&lt;strong&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;mso-bidi-font-family:&amp;quot;Times New Roman&amp;quot;; mso-bidi-theme-font:minor-bidi;color:black;font-weight:normal;mso-bidi-font-weight: bold"&gt;a proof-of-concept in which ParAccel was not given the time and opportunity to perform extensive off-site tuning.&lt;/span&gt;&lt;/strong&gt;&lt;span class="apple-converted-space"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;color:black"&gt;I tend to feel that way about all analytic DBMS, but it’s a particular concern in the case of ParAccel.&lt;/span&gt;”&lt;/span&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:#29303B"&gt;“I’d categorically advise against including ParAccel on a short list unless the company confirms it is willing to do a POC at the prospect’s location.&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“The system built and run in that benchmark — as in almost all TPC-Hs — is ludicrous. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Hence it should be of interest only to ludicrously spendthrift organizations.&lt;/span&gt;”&lt;/span&gt;&lt;strong&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;mso-bidi-font-family:&amp;quot;Times New Roman&amp;quot;; mso-bidi-theme-font:minor-bidi;color:black"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:#29303B"&gt;“Based on past experience, I’d be very skeptical of ParAccel’s competitive claims, even more than I would be of most other vendors’.&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;The combination of published TPC benchmarks and the originator of the benchmark seem to have created what Curt himself refers to as “the perfect storm”.&lt;span style="mso-spacerun:yes"&gt;  To say h&lt;/span&gt;e doesn’t like either would be a gross understatement :)&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;"&gt;Both blogs immediately started getting “opinionated” comments from the public at large, including ParAccel’s VP of Marketing Kim Stanick and a gentleman named Richard Gostanian who may or may not be connected to Sun Microsystems (depending on which Twits you read).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Sun supplied the hardware for the ParAccel benchmark.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To cite a couple quotes from the comments, Richard Gostanian responds:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“Perusing your website, I detect a certain hostility towards ParAccel.&lt;/span&gt;” – (No kidding!)&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“Indeed you do more to harm your own credibility than raise doubts about ParAccel.&lt;/span&gt;” &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“…TPC-H is the only industry standard, objective, benchmark that attempts to measure the performance, and price-performance, of combined hardware and software solutions for data warehousing.&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“So Curt, pray tell, if ParAccel’s 30 TB result wasn’t “much of an accomplishment”, how is it that no other vendor has published anything even remotely close?&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;Then Kim Stanick says: &lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;color:#29303B"&gt;“It [TPC-H] is the most credible general benchmark to-date&lt;/span&gt;.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:#29303B"&gt;And an anonymous reader chimes in:&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;;color:black"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“After reading Curt’s post about ParAccel and Kim this is obviously personal…I wonder why the little fella didn’t have a fit over Oracles 1TB TPC-H?&lt;/span&gt; Check his bio.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He consults for Oracle.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;To which Curt replies (among other things):&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;“As for your question as to why other vendors don’t do TPC-Hs — perhaps they’re too busy doing POCs for real customers and prospects to bother.&lt;/span&gt;”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;Ouch!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This nasty sudden melee took me by surprise at a time when I was considering blogging about the whole TPC-H system for analytical engines anyway.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ve wondered for quite a while whether or not publishing such metrics actually helped “new breed” startups like ourselves from a marketing and sales standpoint. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Given the high cost and resource drain, what’s the return on this investment? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;What’s more, I have yet to meet a prospect or user who either cares of knows about TPC-H benchmarks.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So far, the only people I’ve ever seen show any interest in the matter are venture capitalists and investors which tells me right there that something is amiss (or, maybe that’s why the small players take the plunge, I don’t know).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;As some of you may know, XSPRADA is a recent member of TPC.org alongside other industry startups like Kickfire, Vertica, ParAccel and Greenplum.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Numerous other startups in the same category are not members.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They don’t seem to fare any worse.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Furthermore, as best I can tell, even some existing members (namely Greenplum or Vertica) don’t publish audited benchmarks.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yet clearly these two vendors don’t seem negatively affected by the lack thereof.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;Although we at XSPRADA have conducted TPC-H benchmarks (and continue to do so) internally, we have never attempted to get them audited and published.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If a prospect asked me about it, I would recommend we help him run those benchmarks in-house on his own hardware anyway!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even if we had $100K to blow on getting audited benchmarks, I’m not sure it would make sense to pursue.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;I’m usually a pretty opinionated black &amp;amp; white guy, but with respect to this TPC-H business, I tend to centerline. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Strangely enough, I identify with both sides of the argument.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One the one hand, I don’t believe the benchmarks to be totally useless.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Having been involved in generating our internal results, I can vouch for the fact that it takes a lot of tedious work and kick-ass engineering to even complete the list.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;By any stretch of the imagination, this is not a small inconsequential feat.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Doing so on anything above 10TB is, in my opinion, nothing to sneeze at.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If nothing else, being able to handle the SQL for all 22 queries is a decent achievement.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And then of course, there’s the notion that even “trying” to do it is noble in itself.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In that sense I tip my hat to the small guys who pulled it off.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;On the other hand, I don’t feel the benchmarks are holistically useful for evaluation purposes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As a prospect looking at several vendors, they might figure in my check-list but not more significantly than others I consider more important. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Namely: how easy is the product to work with, what resources does it consume (human and metal), how does it play in the BI ecosystem as a whole (connectivity), and last but not least, what kind of support and viability will the vendor provide?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m a little weird that way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I tend to evaluate companies based on their people over most everything else.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But that’s just me.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;At the end of the day (and everyone does seem to agree on that), what matters are onsite POCs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nothing can beat running your own data on your own metal.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I want a vendor to hand me the keys and go “ok, have a good ride, call me if you need anything” and mean it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;BMW sells cars this way. Enough said.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is what I drive to when helping people evaluate our offering.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:&amp;quot;Georgia&amp;quot;,&amp;quot;serif&amp;quot;; color:black"&gt;It remains to be seen how much of this brouhaha will benefit ParAccel in the long run.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They say there’s no such thing as bad publicity.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If they end up getting recognition and sales from it, then they have chosen wisely, and no one can take that away from them.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Personally, I wish them the best.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I believe the more numerous we are in this upstart game, the better it is for us, and more importantly, for our customers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I say leave the guns, and take the cannoli. &lt;g&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-8194015985318031688?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/8194015985318031688/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/leave-gun-take-cannoli.html#comment-form' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8194015985318031688'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8194015985318031688'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/leave-gun-take-cannoli.html' title='Leave the Gun, Take the Cannoli'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-3383167017988941112</id><published>2009-06-23T16:48:00.000-07:00</published><updated>2009-06-23T16:52:49.182-07:00</updated><title type='text'>In-House or SaaS? How About Both?</title><content type='html'>&lt;p class="MsoNormal"&gt;Chuck Hollis just penned another interested blog &lt;a href="http://chucksblog.emc.com/chucks_blog/2009/06/monetizing-the-private-cloud-part-1.html"&gt;post&lt;/a&gt; about the economics behind private clouds for the enterprise. There is a lot of talk about on-premise versus on-demand SaaS these days in the BI community (and when I say SaaS I mean either private or public).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;From a financial standpoint, the two models are fairly well established.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Basically, on-premise is budgeted as capital expenditure, while on-demand is budgeted as operational expenditure.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Much like the difference between buying your TV and paying for your electric bill on a monthly basis, or the difference between buying and leasing a car.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;With on-premise you buy a lot of expensive stuff and it depreciates (and loses value) over time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With on-demand you rent a service for a short critical amount of time as needed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In many ways, the parallels are strikingly close to hiring in-house software developers versus outsourcing to consultants.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The business case for either direction is easily conceived.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;In-house developers are a long-term investment.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They will learn the business and be allocated as needed on a per-project basis.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The project is likely to be long-term.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Much like capital equipment, they also need to be “upgraded” periodically – namely allowed and encouraged to keep up with technology so they remain productive and far-sighted.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They also need to be provisioned with tools and resources to do their jobs effectively.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The most progressive shops understand that.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not unlike expensive heavy metal and software licenses, they also tend to get worn out or outdated with time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And they’re expensive to replace and renew.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Consultants (either remote or in-house) are a quick-fix solution, usually applied to a pressing problem when in-house resources are either not available or not competent to handle the pressing business need. They’re expensive, but they (hopefully) get the job done quickly, get you the answers you need when you need them, and ride out into the sunset. Like on-demand offerings, they can be shutdown at will, but they also carry vendor-lock risk.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Having been in the software business for two decades, I’ve been on both sides of this fence numerous times.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my experience, the best shops implement a hybrid approach with strong internal cores supplemented as needed by top-notch “gun slingers”. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In the best of cases, I’ve seen synergy and knowledge transfer occur between the two entities (when the politics were right) with significant benefit to the enterprise.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I think the same thing will probably occur in the BI space.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I would imagine large shops will probably have both in-house staff and equipment, backed up by quickly ramped up SaaS offerings dedicated to what I call “transient data mart needs”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I could be wrong about this, but hybrid approaches (in business and technology) are usually more flexible and not necessarily conflicting. They can also feed off each other in positive ways. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;I don’t believe the choices are going to be 100% on-premise or 100% on-demand.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The trick for the CIOs out there is going to be determining which projects and which needs are better served by internal (strategic) or external (tactical) solutions in an agile way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In that sense there is really no “battle” between the two approaches.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They should be considered complementary parts of an intelligent BI strategy toolbox.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-3383167017988941112?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/3383167017988941112/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/in-house-or-saas-how-about-both.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3383167017988941112'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3383167017988941112'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/in-house-or-saas-how-about-both.html' title='In-House or SaaS? How About Both?'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-8360212102422683429</id><published>2009-06-23T08:51:00.000-07:00</published><updated>2009-06-23T08:56:09.421-07:00</updated><title type='text'>This ACID Leaves No Bitter Taste</title><content type='html'>&lt;p class="MsoNormal"&gt;In my previous post below I received a comment (question) from Swany about inserts in the XSPRADA database engine RDM/x. Specifically, he (or she) asked:&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-size:9.0pt; line-height:115%;color:black"&gt;“What happens if there is an error during the SELECT .. INTO? Are such inserts ACID or will I get partial data in a table if the system crashes?”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;This is of course an excellent question, and I thought it was worth addressing on a wider level beyond just incremental loads.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To place this in context, recall that ACID properties are defined as a set of rules pertaining to transactional database management systems defined as &lt;b style="mso-bidi-font-weight:normal"&gt;A&lt;/b&gt;tomicity, &lt;b style="mso-bidi-font-weight:normal"&gt;C&lt;/b&gt;onsistency, &lt;b style="mso-bidi-font-weight: normal"&gt;I&lt;/b&gt;solation and &lt;b style="mso-bidi-font-weight:normal"&gt;D&lt;/b&gt;urability.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If a transactional database does not meet these conditions, it is not considered “reliable”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I won’t bore the reader with yet another ACID definition.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Suffice to hit Wikipedia for a reasonable &lt;a href="http://en.wikipedia.org/wiki/ACID"&gt;description&lt;/a&gt;.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Now the first question of course is whether analytical database engines supporting OLAP style work can or should meet the same criteria as classical transactional OLTP systems. By definition, analytical systems are biased for read access and updates are supposed to be rare, but inserts certainly occur as incremental loads are performed on warehouses (and data marts) at various intervals (from hours to weeks typically). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In either case, an analytical engine clearly needs to handle data changes in an ACID way or data loss and corruption can occur.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Similarly, data value and integrity need to be protected (locked) from concurrent (possibly conflicting) access patterns.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Internal database structures are vulnerable to corruption during these transactions. So how does XSPRADA technology handle these issues?&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The answer, not surprisingly, lies in XSPRADA’s “magic sauce”, namely, the mathematics of Extended Set Processing (XSP).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To appreciate this, one needs to understand that all entities inside the XSPRADA mathematical model (ie: tables, rows, fields etc.) are represented as extended sets. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And all extended sets by definition are immutable.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This means updates to the system are implemented by creating additional extended sets, and the original ones are never mutated or deleted by subsequent processing. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This ensures that data sets in the system are never corrupted or worse, deleted by mistake.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Internally, the XSPRADA data model is fully contained in a “universe” of extended sets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is the set of all sets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Sets in this universe are related to each other via algebraic relations (hence the “relational” part of Relational Data Miner or RDM/x).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Depending on the state of the system at a specific point in time, sets are either “realized” (materialized) to disk, or “virtual”, meaning they have an internal mathematical representation defined by algebraic expressions involving other sets, but no physical existence. (This has repercussions concerning “materialized views” which I’ll attempt to discuss in a future post).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So when sets are modified by internal operations, both new and old sets remain in existence within the universe. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This means updates and inserts &lt;i style="mso-bidi-font-style:normal"&gt;never actually change information&lt;/i&gt;, but only add to it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To complete the transaction, RDM updates the universe metadata to include knowledge of the newly created sets (if any) along with the algebraic relations linking them to their original brethren. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Genesis is maintained.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is crucial because, unlike in conventional DBMS systems, the original data never needs to be re-generated (or created) to achieve rollback. The universe is only updated once all operations have completed successfully.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If an error occurs, no harm no foul, as the previous state of the universe was maintained and still exists.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In essence, the INSERT, UPDATE, DELETE functionality of the XSPRADA database is merely a logical emulation of conventional DBMS DML.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Each of these statements internally results in an &lt;i style="mso-bidi-font-style:normal"&gt;additive&lt;/i&gt; activity.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In fact, UPDATE is internally implemented as DELETE+INSERT.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So to recover from a failed set of operations (a transaction gone south) the system simply deletes any incomplete sets and does not update the universe metadata!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This mechanism enforces &lt;b style="mso-bidi-font-weight: normal"&gt;atomicity&lt;/b&gt; and &lt;b style="mso-bidi-font-weight:normal"&gt;consistency&lt;/b&gt; natively without any need for additional programming or complexity.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;On the &lt;b style="mso-bidi-font-weight:normal"&gt;isolation&lt;/b&gt; front, lists of in-process and pending operations are maintained in dynamic pipelines for each extended set. The system examines these pipelines and algebraically identifies any potential conflicts between operands.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Again, the mathematics allows this to occur natively.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So if the results of pending operations do not affect in-process operations, the system executes them concurrently and immediately.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Conversely, if the mathematics identify a potential conflict or deadlock, pending operations are queued until conflicting running operations have completed.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;Durability&lt;/b&gt; is the last remaining condition. The system maintains all realized sets in persistent storage (disk drives).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Although sets or parts thereof can be (and often are) kept in cache, any complete set also has an image on disk. This prevents system failures from affecting the durability of realized sets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When the system restarts, it also restarts any operations that were executing at the time of failure.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;For all these reasons, XSPRADA technology is actually superior to conventional database mechanisms for enforcing ACID, as the enforcement is inherently “built-in” via the mathematics underlying the system at all times.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As I mentioned in the last post, the engine is also time-invariant, meaning it can always be queried at a given time point in the past. The ability to do this without any external programming or internal modeling is significant.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One use case that immediately comes to mind (to me anyway) in wake of the recent Wall Street disasters is being able to ask a financial database to yield answers as if it were being queried months or years ago.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Imagine being able to roll back time to analyze or audit results that supported past decisions and the people who signed off on them. What a concept! &lt;/p&gt;&lt;p class="MsoNormal"&gt;If you'd like to take the XSPRADA database out for a spin (and pull the plug in the middle of using it just to see what happens &lt;g&gt;) feel free to get the trial bits from our &lt;a href="http://www.xsprada.com"&gt;website&lt;/a&gt;.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-8360212102422683429?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/8360212102422683429/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/this-acid-leaves-no-bitter-taste.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8360212102422683429'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8360212102422683429'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/this-acid-leaves-no-bitter-taste.html' title='This ACID Leaves No Bitter Taste'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4666791516192884055</id><published>2009-06-22T13:29:00.000-07:00</published><updated>2009-06-22T13:49:00.373-07:00</updated><title type='text'>Can Rover Roll Over Too?</title><content type='html'>&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;In this post, I want to stay in gear-head mode and "demo" a couple of neat tricks from the XSPRADA RDM/x analytical engine puppy.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m going to address quick prototyping capabilities (thanks to schema agnosticism), incremental inserts, and time invariance.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;We’re going to do this with some &lt;i&gt;really&lt;/i&gt; simple tables and data structures just to highlight the concepts behind the engine’s functionality.  So suppose we start with a CSV table called ‘demo.csv’ consisting of the following rows (NOTE: the file must be terminated by CR-LF):&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;1000,12,"now is the time"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;1001,45,"for all good men"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;1002,76,"to come to the aid"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Now we want to load this into RDM/x so we use the XSPRADA SQL extension CREATE TABLE FROM as follows:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;create table demo(id int,c1 int,c2 char(128)) from "C:\ demo.csv"&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Then we do a SELECT * on the table just to make sure all is well (for example, using QTODBC or any other ODBC-compliant SQL front-end tool) and we see:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;1000&lt;span style="mso-tab-count:1"&gt;       &lt;/span&gt;12&lt;span style="mso-tab-count:1"&gt;           &lt;/span&gt;now is the time&lt;span style="mso-spacerun:yes"&gt;                                                                                                               &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-;font-family:Consolas;"&gt;1001&lt;span style="mso-tab-count:1"&gt;       &lt;/span&gt;45&lt;span style="mso-tab-count:1"&gt;           &lt;/span&gt;for all good men&lt;span style="mso-spacerun:yes"&gt;                                                                                              &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt;                &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-;font-family:Consolas;"&gt;1002&lt;span style="mso-tab-count:1"&gt;       &lt;/span&gt;76&lt;span style="mso-tab-count:1"&gt;           &lt;/span&gt;to come to the aid&lt;span style="mso-spacerun:yes"&gt;     &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;And if I look at my schema in the QTOBDC object browser I see this as expected:&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_dCMl0ieMmTw/Sj_qjeu-alI/AAAAAAAACPI/RESTPP_j-uU/s1600-h/6-22-2009+1-28-44+PM.jpg"&gt;&lt;img src="http://1.bp.blogspot.com/_dCMl0ieMmTw/Sj_qjeu-alI/AAAAAAAACPI/RESTPP_j-uU/s320/6-22-2009+1-28-44+PM.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5350252777605851730" style="cursor: pointer; width: 207px; height: 93px; " /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;So this is fine and dandy when you know the actual schema of your data but now what if you don’t or what if you’re not sure or what if you're out to determine what the best schema might be given your application?&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Ok well what we can do it simply load everything as text types.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So now we do:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;create table demo(&lt;span class="Apple-style-span" style="font-family: Consolas; "&gt;id char(128),c1 char(128),c2 char(128)) from "C:\ demo.csv"&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-;font-family:Consolas;"&gt;The original ‘demo’ table is overwritten with the new schema which now becomes:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_dCMl0ieMmTw/Sj_q_9ntMbI/AAAAAAAACPQ/lpe8DlUG4Vk/s1600-h/6-22-2009+1-28-25+PM.jpg"&gt;&lt;img src="http://2.bp.blogspot.com/_dCMl0ieMmTw/Sj_q_9ntMbI/AAAAAAAACPQ/lpe8DlUG4Vk/s320/6-22-2009+1-28-25+PM.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5350253266933199282" style="cursor: pointer; width: 184px; height: 101px; " /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span class="Apple-style-span"  style="font-family:Consolas;"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia; "&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;span class="Apple-style-span"  style="font-family:Consolas;"&gt;&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span&gt;And now we can play around with that until we’re satisfied.&lt;span&gt;  &lt;/span&gt;Assume we suspect the optimal way to work with this data might be using INT for the index, a double precision type for the c1 column, and an 80-char VARCHAR for the last field.&lt;span&gt;  &lt;/span&gt;We might take our existing table and “flip” it into a new schema (called flipped) as such:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;select cast(id as int), cast(c1 as double precision), cast(c2 as varchar(80)) from demo &lt;b&gt;into flipped&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;Note how we convert the schema into a new one directly into a new table on the fly.&lt;span&gt;  &lt;/span&gt;Now if we select from flipped we see:&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;1000&lt;span&gt;       &lt;/span&gt;12.0&lt;span&gt;        &lt;/span&gt;now is the time&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span&gt;1001&lt;span&gt;       &lt;/span&gt;45.0&lt;span&gt;        &lt;/span&gt;for all good men&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span&gt;1002&lt;span&gt;       &lt;/span&gt;76.0&lt;span&gt;        &lt;/span&gt;to come to the aid&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span&gt;&lt;o:p&gt;And the schema for the new table, as expected, is:&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia; "&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_dCMl0ieMmTw/Sj_rPOT2nqI/AAAAAAAACPY/dnEeZZumVKU/s1600-h/6-22-2009+1-28-55+PM.jpg"&gt;&lt;img src="http://4.bp.blogspot.com/_dCMl0ieMmTw/Sj_rPOT2nqI/AAAAAAAACPY/dnEeZZumVKU/s320/6-22-2009+1-28-55+PM.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5350253529111371426" style="cursor: pointer; width: 320px; height: 74px; " /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;Now if we needed to do some querying on numerics for C1, we could. Note we didn’t even have to explicitly create or schema the ‘flipped’ table. RDM/x took care of that automatically, making these types of operations ideal for quick prototyping.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Incidentally, we could have done this directly on the existing table as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This type of flexibility is pretty cool and allows you to “play” with the data in a trial and error mode.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Now suppose we come across some additional data and we wish to INSERT this information into our existing database.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is typically what happens with daily incremental into a data warehouse.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;On a periodic basis, chunks of data are added to (typically) fact tables.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Our incremental CSV looks as such:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;2000,132,"to be "&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;2001,465,"or not to be"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;2002,786,"that is the question"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;There are several ways to handle this using RDM/x.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The most direct one is to simply use the XSPRADA “INSERT INTO FROM” SQL extension as follows:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;INSERT INTO demo FROM “c:\demo_062209.csv”&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;In this case RDM/x is told “hey, there is more data being presented to you so, algebraically, union it with the existing data”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As RDM/x does not “load” data in the conventional sense of the term or support bulk loading (as it doesn’t need to), additional data can be presented virtually in real-time with little or no effect on the ability to query the system simultaneously.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;This is what’s called “non-disruptive live updates” or concurrent load and query capability.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Initially, users are a little surprised that bulk inserts are not supported.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But in fact, all that’s needed is “dropping” the incremental data somewhere on disk and telling RDM/x of its existence.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x can ingest this information as fast as it can be written to disk.&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Another way to do this is by loading the incremental “chunk” into its own table as such:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;create table demo_062209(id int,c1 double precision,c2 varchar(80)) from "C:\ demo_062209.csv"&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-font-family:Consolas"&gt;&lt;o:p&gt;And then doing&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;select * from demo union select * from demo_062209 &lt;b style="mso-bidi-font-weight:normal"&gt;into demo&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-font-family:Consolas"&gt;&lt;o:p&gt;A bit of subtlety there: when you do this, you’re effectively “updating” the existing demo table with the unioned results of the incremental.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So you take the old demo table, add the incremental, then flip that back into the original demo table.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You could have saved the “old” demo table first before doing this as such:&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;select * from demo into demo_current&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Notice how these “dynamic” tables really behave like variables in a loosely-typed programming language. They are conceptually related to conventional database “views” (although RDM/x does support view objects in the database but from a purely semantic way).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As a matter of fact, all tables and views in RDM/x are essentially the same thing and materialized (to disk and/or memory) on a JIT basis anyway.&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Of interest here is that you can always recover past instances of any data RDM/x via a nifty little feature called “time invariance”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This feature is unique in the world of databases to the best of my knowledge. Essentially, you can query RDM/x much like a “time machine”, asking it to yield results as if the question were being asked in the past.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;So suppose you had inserted your incremental into the ‘demo’ table by mistake and suppose your insert had occurred at a given time such as '2009-06-22 15:13:52.775000000' (and you can tell this if you are logging your SQL statements to the database using the usrtrace option of the RDM/x ODBC driver). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;You can always recover the state of ‘demo’ at the time by issuing:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;SELECT * FROM demo AT TIMESTAMP '2009-06-22 15:13:52.775000000' into recover&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;Now your ‘recover’ table will contain ‘demo’ exactly the way it was at that time.&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;span style="mso-bidi-font-family:Consolas"&gt;&lt;o:p&gt;The implications are far-reaching because you can essentially always query the RDM/x database at any point in the past.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x never deletes information UNLESS the information has been explicitly deleted using a DROP TABLE or DELETE type of DML statement AND the garbage collector kicks in. Short of that, the database maintains information and integrity through time.&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;These are just a couple of nifty tricks from one original database engine called RDM/x.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4666791516192884055?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4666791516192884055/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/can-rover-roll-over-too.html#comment-form' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4666791516192884055'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4666791516192884055'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/can-rover-roll-over-too.html' title='Can Rover Roll Over Too?'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_dCMl0ieMmTw/Sj_qjeu-alI/AAAAAAAACPI/RESTPP_j-uU/s72-c/6-22-2009+1-28-44+PM.jpg' height='72' width='72'/><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-3805015757810798503</id><published>2009-06-19T14:28:00.000-07:00</published><updated>2009-06-19T14:36:31.569-07:00</updated><title type='text'>Choose Wisely Grasshopper</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;If you’re been tasked with implementing or supplementing business intelligence offerings in your organization lately (read: a data mart, for example), the choices have gotten quite a bit more complicated.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A while ago, you would have your pick of three or four vendors.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If one of those vendors was already in-house handling your transactional systems, guess what, he was likely to also handle your warehousing needs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or at least try to.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Nowadays, the decision path is a little more involved because the BI world is no longer ruled by a small oligarchy (namely Oracle, Microsoft and IBM).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The proliferation of “new-breed” analytical engine vendors has greatly expanded a buyer’s options.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There are around twenty-some players in this field now.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not only that, but delivery options have expanded as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Nowadays, you can get BI delivered and running in-house sitting on commodity or custom hardware.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can buy canned appliances.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can go proprietary bits.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can go Open Source.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or you can tap the “cloud” with an on-demand subscription-based model.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can pick a columnar vendor, or a row-based one.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can choose an MPP architecture, or an SMP implementation.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can even stick with the “gorillas” if that makes you feel better (and money is no object).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These different paths are all strategically different both technically and economically.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The modern BI strategist is compelled to choose wisely in an unforgiving economy where failure is no longer an option (this time they mean it).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;So for the sake of argument, I’m going to assume the following buyer profile:&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;ol&gt;&lt;li&gt;Money is _definitely_ an object.&lt;/li&gt;&lt;li&gt;IT resources are non-existent, limited, not available, or not inclined to help.&lt;/li&gt;&lt;li&gt;A lengthy proof-of-concept (POC) cycle is not an option.&lt;/li&gt;&lt;li&gt;Project timeline is measured in weeks not months or years.&lt;/li&gt;&lt;li&gt;C-level people want to see incremental results starting today.&lt;/li&gt;&lt;li&gt;A single DBA is left standing in your organization (but next week, maybe zero).&lt;/li&gt;&lt;li&gt;Your ass is on the line.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;/p&gt;              &lt;p class="MsoNormal"&gt;I think this describes a fairly common scenario these days.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Faced with such odds, I think most people opt for the path of least economic and implementation resistance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Most people don’t have $2-6M hanging around including staff and a six-twelve month window to implement a full-blown Oracle, Microsoft or IBM solution. These days are simply gone (good riddance on that).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It would appear at first glance that the only remaining alternatives would be open source (OSS) or on-demand SaaS software.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Now, OSS is attractive from a cost basis, as most freebies tend to be.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes, you do pay for support if pulling down “enterprise” versions of the software but in many cases, some buyers get away using the free versions for a while, at least for quick POCs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In some cases, the free versions are either limited or incomplete in functionality (like &lt;a href="http://tinyurl.com/kqdgfg"&gt;InfoBright&lt;/a&gt;, for example) so that could be a “gotcha” depending on your application needs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Similarly, non-enterprise versions usually depend on community for support.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you’re in a jam and need serious dedicated support on a moment’s notice, you’ll need to pony up for an enterprise version or wait until “the community” comes up with an adequate answer to your problem (if ever).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Additionally, OSS does have hidden costs, not the least of which are installation, setup, configuration, and maintenance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But, if you happen to be a Linux shop and have enough LAMP developers on staff with sufficient expertise and time, and your management happens to be accepting of the whole OSS concept, it might just be a viable option.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Another quick way to get up and running quickly is to go the Cloud (SaaS) service route.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In that scenario, you pay a monthly fee to access a BI platform in the cloud. This hands-off approach is certainly attractive in many cases provided data volumes and security restrictions do not get in the way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Shlepping 10-100TB of data offsite is not something most people consider yet. But, for smaller data sizes, SaaS is certainly an option.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Vertica, Kognitio and Aster Data come to mind as the latest new-breeders to provide cloud-based services to customers (either on proprietary or public cloud platforms like EC2).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is a flurry of other on-demand BI players as described in several of my past &lt;a href="http://tinyurl.com/kpugpm"&gt;posts&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Of course, the downsides there tend to be upload time, limited functionality and vendor lock-in. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Now if I can get on my soapbox for a minute, I’m going to pitch a third option.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At XSPRADA, we’ve developed a 5MB high-performance analytical database running as a Windows service on commodity hardware and Server 2003 or 2008 x64 operating systems.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can install this puppy internally (your data stays nice and safe in-house) in about 2.5 minutes including ODBC drivers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You then point it at your CSV data on disk and start firing off queries immediately.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The more you ask, the faster it gets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And you can show results in minutes, not weeks or months.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No cubing, no indexing, no pre-structuring, no partitioning, none of that nonsense.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s the bottom line in a nutshell.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I promise you business intelligence is not the rocket science so many folks make it out to be!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For those who might be hesitating between open source, SaaS or much more expensive in-house solutions, I think this is a pretty unique proposition.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There really is nothing like it anywhere else on the market, and it just so happens we have a 30 day trial going on right now if you visit our &lt;a href="http://www.xsprada.com"&gt;website &lt;/a&gt;&lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-3805015757810798503?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/3805015757810798503/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/choose-wisely-grasshopper.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3805015757810798503'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3805015757810798503'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/choose-wisely-grasshopper.html' title='Choose Wisely Grasshopper'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-5870008269387248854</id><published>2009-06-04T16:00:00.000-07:00</published><updated>2009-06-04T16:07:51.252-07:00</updated><title type='text'>Yabadabadoo and off to Austin!</title><content type='html'>&lt;div&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I’m headed out to Austin again tomorrow.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Last I checked it was 94F over there with 60% humidity.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Guess I won’t be going on my daily power walk for a little while lest I keel over.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s OK though, I’ll get my workouts at &lt;a href="http://www.bonedaddys.com/"&gt;Bone Daddy’s&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m jazzed up about the trip as usual.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I love that town.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It has a relaxed, genuine, homey feel I have not found in many other US cities.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When people ask “how ya doing?” they actually mean it and expect an answer!&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I have prospects, clients and partners to visit.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Now that our pre-release RDM/x software is available, I’m getting to see how people actually use these bits in real-life situations and that’s fairly exciting for many reasons. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Product claims are one thing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But it’s quite another to actually see people’s reaction when they discover the software.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When folks install our software the usual reaction is “huh? Is this all there is to it? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;What did I miss?” – The answer is “you missed nothing” – RDM/x deploys and installs as a 5 Meg Windows Service.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Read my lips: a 5 megabyte Windows Service EXE is all you need to query TB-level data sets faster and easier than you’re used to.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Install the bits, present data, connect to ODBC driver, and ask questions.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Done.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Naturally, given users’ past experience and expectations with classic analytical engines, it’s hard to swallow initially. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;And speaking of shockers, here a doozy of a &lt;a href="http://www.tpc.org/results/FDR/tpch/HP_BladeSystem128P_090603_TPCH_FDR.pdf"&gt;benchmark &lt;/a&gt;you can find on the TPC website.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;That’s right folks, $6 Million dollars with Godzilla metal and full-blown enterprise software will buy you queries into a mere 1TB of data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or, you could install and start a 5MB Windows Service on a $20,000 server.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;What else am I picking up? People are incredulous about the fact that there’s no need for partitioning, indexing, pre-structuring, worrying about rigid schemas and data models.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In one case, I was actually able to load 18GB of data by pulling everything in as strings.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These particular data sets were CSV clickstreams for a web analytics application (conveniently, they’re often text log files which suits us well as CSV is the only format we currently ingest).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The ETL attempts had become tedious due to poor data quality and uncertain schemas.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It wasn’t clear initially what the best schema might be for the type of data at hand so experimentation was needed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So we took more of an ELT approach and just DDL’d everything in as text.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;We then quickly flipped internally to another schema, casting columns as needed on the fly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And if that schema turned out to be less efficient or interesting than originally thought, we just flipped the whole dataset to another one in one line of SQL.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This type of just-in-time flexibility in presenting and transforming data is seldom (if ever) found in more ponderous solutions.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For web analytics type of applications (where DQ is typically poor and schemas fairly dynamic) this is a huge competitive advantage.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;Another sweet spot is OLAP cubes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or lack thereof I should say.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In typical OLAP applications, building cubes is lengthy, complicated and expensive (as anyone who ever footed an analytical consultant’s bill will attest to).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Re-aggregating or adding dimensions is also painfully slow and tedious as data volumes increase (and they always do).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;A lot of times these processes are setup to run nightly as batch processes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you’re lucky and if you’ve done your engineering properly (lots of ifs), you show up in the morning and it’s done.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But you can’t add a dimension in real-time without impact on the entire system.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And you need to plan for your questions up-front, as in “what shall I slice &amp;amp; dice this year?”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With RDM/x, you don’t need to setup dimensions explicitly.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The very act of querying in a dimensional way alerts the engine to that effect and it starts building “cubes” internally and aggregating as needed, optimizing for specified dimensions.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then adding a dimension is painless – all you need to do is actually query against it more than once.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With RDM/x, once is an event, twice is a pattern.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;This “anticipatory” self-adjusting behavior is actually at the heart of RDM/x.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is one area where the product distinguishes itself from the competition.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because the software continuously re-structures both data and queries as more and more questions come in. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This means you see response time decrease as quantity and complexity of queries increase.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;This is fundamentally different from conventional database behavior where query response time “flat-lines” up to a given number (not including load time) and pretty much hovers there until and unless an “expert” can optimize either code or configuration (if possible at all).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And when you add to or complicate your query patterns, performance suffers.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;The RDM/x response-time profile is more of what I call the “dinosaur tail” effect: big initial hump, then exponential taper down over time.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Here’s my Fred Flintstone rendition of the effect using MS Paint (nostalgia?):&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_dCMl0ieMmTw/SihSJVcH6xI/AAAAAAAACPA/5sDqWdvtfyg/s1600-h/dinosaur.JPG"&gt;&lt;img src="http://3.bp.blogspot.com/_dCMl0ieMmTw/SihSJVcH6xI/AAAAAAAACPA/5sDqWdvtfyg/s320/dinosaur.JPG" border="0" alt="" id="BLOGGER_PHOTO_ID_5343611278202497810" style="cursor: pointer; width: 320px; height: 200px; " /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-font-family:Arial"&gt;It is always a blast observing people as they first witness this unique behavior. It’s like discovering a new life form &lt;/span&gt;&lt;span style="font-family:Wingdings; mso-ascii-font-family:Calibri;mso-ascii-theme-font:minor-latin;mso-hansi-font-family: Calibri;mso-hansi-theme-font:minor-latin;mso-bidi-font-family:Arial;mso-char-type: symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type:symbol; mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;span style="mso-bidi-font-family: Arial"&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- And I’m looking forward to repeating the experience in Austin next week and throughout the country soon enough.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you care for a head-start, pull down the bits from our &lt;/span&gt;&lt;a href="http://www.xsprada.com"&gt;website&lt;/a&gt;&lt;span style="mso-bidi-font-family: Arial"&gt; and give it a whirl!&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-5870008269387248854?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/5870008269387248854/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/yabadabadoo-and-off-to-austin.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5870008269387248854'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5870008269387248854'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/06/yabadabadoo-and-off-to-austin.html' title='Yabadabadoo and off to Austin!'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_dCMl0ieMmTw/SihSJVcH6xI/AAAAAAAACPA/5sDqWdvtfyg/s72-c/dinosaur.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-7315443075105437383</id><published>2009-05-25T17:55:00.000-07:00</published><updated>2009-05-26T15:28:47.521-07:00</updated><title type='text'>Data Warehousing for Dummies</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Every so often I’ll be talking to someone implementing our solution at a customer or POC site and the question comes up “what’s so special about your database anyway?” and “So, is it really different from MySQL, SQL Server or Oracle?” or “I don’t understand why your database talks SQL since it’s not a normal database like Oracle”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Better yet: “What else do I need to do after installation to get this working?”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Usually these questions come from non-LOB folks who are tasked with implementing a particular solution using our product.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Typically these people tend to be experienced software developers or DBAs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;People who, as &lt;a href="http://www.joelonsoftware.com/"&gt;Joel Spolsk&lt;/a&gt;y likes to say, are “smart and get shit done”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The type of folks you can throw a problem at and say “Ok, go solve it using this new tool.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;But as often happens in large organizations, they may not have been briefed fully by management on the features/functionality of the new tool needing evaluation.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or maybe this is their first exposure to BI.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They may also never have encountered or worked with an analytical database product.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I know that, several years ago, if you’d asked me what the difference was between OLTP and OLAP I would have blurted something like “one if for transactional stuff, the other for reporting” and been in the right ballpark but no cigar.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;So when these questions come up, I am always ecstatic to be able to share what I’ve learned with the guys in the trenches doing the real work! &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;The first thing I do is give I very general view of the differences between transactional (operational) and analytical use cases.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then I’ll try and give a 30,000 foot picture of data warehousing and its history. I‘ll mention Kimball and Inmon, of course, then several books and a series of blogs, websites and youtube videos for further exploration. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;But this weekend, I discovered the holy grail of data warehousing 101.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I was hanging out at my local Borders looking to trade my 40% off coupon in exchange for yet another good data warehousing/BI book when I noticed the yellow “&lt;a href="http://tinyurl.com/rbadjk"&gt;Data Warehousing for Dummies&lt;/a&gt;” on the bottom shelf.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Obviously I couldn’t resist picking it up, especially since I have yet to meet anyone remotely “dumb” in this business. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;To my great surprise, I noticed the author was none other than Tom Hammergren, the owner of &lt;a href="http://www.balancedinsight.com/"&gt;Balanced Insigh&lt;/a&gt;t, one of the top BI Software Innovator firms in the country.  To say that Tom is a warehousing and BI guru is an understatement.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This much I knew.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But I had NO idea he was also an accomplished writer who could present this complicated subject in clear, simple terms anyone can understand and relate to!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;From now on, whenever someone asks me for the quick low-down about BI and data warehousing, I’ll be referring him or her to Tom’s book.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Now to answer the above questions about our own product RDM/x.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is nothing magical about our analytical database, at least from a usage standpoint.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;RDM/x talks and walks like any other database product out there on the market using ODBC.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The magic is on the inside, certainly not in the interface (thankfully) as it supports a significant subset of the SQL-92 standard (minus TCL and DCL).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Is RDM/x “really” different from SQL Server, Oracle, MySQL or DB2 though?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You bet.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-spacerun:yes"&gt;&lt;/span&gt;RDM/x is designed for data analysis, not transactional processing.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As a matter of fact, RDM/x is the smallest, nimblest, on-premise solution available on the market that will let you query terabytes of data in minutes from installation.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And that’s why I believe most people are a little confused from the get-go.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because they’re used to large footprint multi-module database clients with 500 page installation and setup manuals, followed by complicated tuning and optimization techniques involving indexing, partitioning, and all that “good” stuff. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;When they see a 5 megabyte piece of software installing as a Windows Service, ready willing and able to handle queries on giga or terabytes of data within minutes, they think they’re missing something.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;How can BI be this simple?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Well it can.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The proof is in the pudding and since we’re allowing you to download a fully functional 30-day evaluation from our website effective now, the best I can do is recommend you take me up on that assertion by visiting our &lt;a href="http://www.xsprada.com/"&gt;website&lt;/a&gt;.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-7315443075105437383?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/7315443075105437383/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/data-warehousing-for-dummies.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7315443075105437383'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7315443075105437383'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/data-warehousing-for-dummies.html' title='Data Warehousing for Dummies'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4410595001033380651</id><published>2009-05-21T12:03:00.000-07:00</published><updated>2009-05-21T12:15:28.459-07:00</updated><title type='text'>Running RDM/x on Amazon EC2 is DICEE!</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I want to keep this post fairly brief because there is so much stuff going on at &lt;a href="http://www.xsprada.com"&gt;XSPRADA &lt;/a&gt;lately that I find myself pressed for time from 6AM to midnight on a typical day which usually also includes weekends, but that’s the price you pay for building a revolution. Ask &lt;a href="http://www.argentour.com/images/che_guevara_fidel_castro.jpg"&gt;Fidel&lt;/a&gt;, he knows.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;First of all, I finally had time last Sunday to record a &lt;a href="http://xsprada.com/info/support/installation-video/"&gt;screencast &lt;/a&gt;explaining how to install and setup our RDM/x software. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;In the process, I discovered that &lt;a href="http://camstudio.org/"&gt;Camstudio &lt;/a&gt;and &lt;a href="http://tinyurl.com/2wdzgt"&gt;Microsoft Windows x64 decoder&lt;/a&gt; were my friends, reducing a 1.2GB video to 35Megs (phew!).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Second, I want to talk about my recent epiphany with EC2.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Early this week I decided to see if I could install and run our Windows-based database engine RDM/x on some sort of cloud platform because I don’t think anyone in their right minds in enterprise software can afford to ignore this trend any longer.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;My purpose was certainly not to setup a full-fledged production system up there, but rather to setup a quick and dirty demonstration system so people could either duplicate or use it on the fly to test-drive our software, for example.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;After poking around a bit I settled on &lt;a href="http://aws.amazon.com/ec2/"&gt;Amazon’s EC2&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They seemed like the only “big-time” player supporting WinTel boxes (our software runs on 64-bit Windows Server 2003 and 2008) and they have enough credibility and market “karma” at this point to alleviate most basic concerns about reliability and security. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;So after checking out possible configuration tools and hitting our CEO up for some plastic, I signed up for EC2 and started exploring this brave new world.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;It turns out EC2 is really several “platform” components comprising: the actual EC2 O/S instance (a VM blade) known as an AMI (Amazon Machine Instance), persistent storage called EBS (Elastic Block Store) which presents as “volumes” you mount onto the AMI, and persistent (hot/cold) storage (also used for EBS snapshots) called S3.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s also a queuing system called SQS but that didn’t enter my mix.&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;Confused yet? It’s not that bad once you get used to it &lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:Wingdings;mso-ascii-font-family:Calibri;mso-ascii-theme-font: minor-latin;mso-hansi-font-family:Calibri;mso-hansi-theme-font:minor-latin; mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;&lt;span style="mso-char-type: symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;For the configuration tooling, you can use command line tools (which I suspect most *NIX/LAMP people prefer), a FireFox pluggin, or the web-based AWS (Amazon Web Services) Console.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I used both of the latter to compare.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I brought up one of the standard Windows AMI as a Windows 2003 R2 64-bit datacenter server.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I actually tried two different instance types. One extra-large standard with 15GB of RAM and 4 cores, and one large high-CPU with 7GB of RAM and 8 cores.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I found better performance on the 4 core box with twice the RAM so I ended up sticking to that one.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;For credentials, you need to generate a key-pair, and then plug in the private part into a dialog box which then spits out an admin password for the new instance. You then connect remotely to your instance using Remote Desktop Connection (or SSH if you’re talking to a *nix instance).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Right off the bat my instance came with four attached 500GB “hard drives”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These volumes are more like flash drives I think. This is not persistent storage but it’s pretty darn fast.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For “real” storage you need to create Elastic Block Storage (EBS) “volumes” and attach them to your instance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I did just that and slapped 4 additional 500GB drives to my box, and then converted each drive to an NTFS mount point (because this is best practice for our particular application). Unfortunately, I extracted a maximum 22-25MB/sec I/O to and from these volumes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I had read somewhere that these dynamic “block devices” were more like instant SANs but in fact, Amazon Silver Support (another pay-for service but well worth it if you ask me) stated the following to me in an email:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;“Even though it presents a block interface, EBS isn't intended to be equivalent to fibre-channel SAN storage. The performance you should expect from EBS would more closely align with a NAS device. You can stripe several EBS devices together for higher I/O rates, but your rates will be limited by various shared components in the system, including the network between your instance and the storage servers. Larger instance types will typically see better performance than smaller instance types.”&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;“More closely align with a NAS device”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Huh oh. That means gating at the NIC level.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The systems are clearly not setup for intense I/O data processing needs, at least not using the standard EC2 configuration models currently available.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nevertheless, I was still able to do sufficient work with sufficient data to build a reasonable “functional demo” machine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And that was my goal from the onset. Given this took me about 1.5 days to figure out, at an average cost of a&lt;span class="apple-style-span"&gt;round $16/day (not including the additional Silver support fees) I am very impressed with this cloud platform to say the least and I’m sure Jeff Bezos is basking in the bliss of my endorsement :)&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;Quite honestly, this cloud business is no joke.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I haven’t seen, heard and felt such a buzz around a new “platform” in the industry since I got my hands on Windows 3.0 in the early nineties.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It was the same “oh my God” emotional feeling at the time, or DICEE as &lt;a href="http://blog.guykawasaki.com/"&gt;Guy Kawasaki&lt;/a&gt; likes to put it (Deep, Intelligent, Complete, Elegant and Emotive).&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoPlainText"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4410595001033380651?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4410595001033380651/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/running-rdmx-on-amazon-ec2-is-dicee.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4410595001033380651'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4410595001033380651'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/running-rdmx-on-amazon-ec2-is-dicee.html' title='Running RDM/x on Amazon EC2 is DICEE!'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-2352504867677002943</id><published>2009-05-16T11:57:00.001-07:00</published><updated>2009-05-18T22:49:19.295-07:00</updated><title type='text'>Software that Sucks</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span class="apple-style-span"&gt;Here’s a classic from the tech press that really caught my attention recently: &lt;/span&gt;&lt;span style="color:#383838;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="color:#383838;"&gt;“In the context of software, the word “Enterprise” has now officially come to mean software that sucks. Enterprise Software hit the nadir of suckitude (sic) at the launch of “Enjoy SAP”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is like the American Dental Association launching “Enjoy Root Canal”. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;SAP is certainly an easy target, but let’s face it, “Enterprise Software” is generally a poorly integrated mess. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Working with Enterprise Software feels a bit like walking through an industrial landfill or an airport hangar. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Nothing is built to human scale.”&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;This was written on the SOA Center &lt;a href="http://www.soacenter.com/?p=185"&gt;blog &lt;/a&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;by no other than Software AG’s Chief Strategist Miko Matsumura.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;His use of the techo-political term “suckitude” is one for the annals of our new post-TARP technology world.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If nothing else, the current situation seems to be facilitating proverbial “paradigm shifts” (namely, on-demand software) while encouraging more anti-status-quo “frank-speak” from industry figureheads.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m all for that.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Because, notwithstanding all the pain, suffering and incertitude in the economy lately, one of the really brilliant consequences of this world-wide mess is that people are starting to say out loud what everyone’s been thinking silently for years.&lt;span style="mso-spacerun:yes"&gt;  Even in the sacrosanct enterprise software glass mansions, people who matter are starting to throw stones.  &lt;/span&gt;When major industry players start talking straight and using technical terms like “sucking”, you know the BS gloves are off. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I think established players, platforms and ways of doing business and thinking about customers are all up for questioning at this point.&lt;span style="mso-spacerun:yes"&gt;  Sunshine is the best disinfectant.&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;And speaking of gloves off, SAP and industry shifts, this old &lt;a href="http://news.cnet.com/8301-13953_3-9911262-80.html"&gt;article &lt;/a&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;from April 2008 refers to a slug match between two industry titans at the Churchill Club.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One is Marc Benioff from Salesforce.com and the other Dr. Hasso Plattner of SAP fame.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s a &lt;a href="http://www.youtube.com/watch?v=RU_47sW1ziE"&gt;video &lt;/a&gt;of the exchange on Youtube.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I know it’s a long one, but I assure you it’s worth watching entirely if you care &lt;span class="Apple-style-span" style="font-style: italic;"&gt;anything&lt;/span&gt; about the on-demand versus on-premise religious wars of late.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I am not going to comment at length on the video as anyone can draw their own conclusions, but I did want to point out what I consider some key points, and throw in a few gold nuggets.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;First, the body language between those two guys is simply priceless.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It is more than obvious from the get-go that they can’t stand each other.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can catch the vibe even in that one picture in the article (and throughout the video). &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Benioff’s looking away from Dr. Plattner constantly (he fidgets with his wedding band incessantly), and Dr. Plattner is reflective in his own world as in “why the hell am I here”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;To my amazement, at the end of the video, they both reveal that this is their very first in-person meeting!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Incidentally, one audience member does ask Dr. Plattner at the end why he accepted to do this.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;His answer: “for the challenge”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not sure what that means.&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Second, the verbal jousting between the two is fairly aggressive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t think these guys have much respect for each other notwithstanding their pseudo-polite claims to the contrary.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you asked me whether Benioff hates Microsoft or SAP more, I’d be tempted to say SAP. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;At one point Benioff states: "We have been passionate about moving obstacles out of the way of the old enterprise software companies.”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I guess this is one major tenet of the on-demand adepts.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Power to the users!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my opnion, &lt;span class="apple-style-span"&gt;Dr. Plattner &lt;/span&gt;really &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;i style="mso-bidi-font-style:normal"&gt;does&lt;/i&gt; buy the on-demand proposition but not “religiously”, and either way, he can’t say it in public.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He knows SAP screwed it up in the past. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I’m not sure he believes in SAP’s ability to execute such a shift internally.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And I bet he wouldn’t mind buying Salesforce outright with one check.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He implies as much several times but then claims he doesn’t want to get into a bidding war with Oracle.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Hogwash.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Throughout the video, both contestants score evenly, in my opinion, on the arrogance meter.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I guess they can both afford to be that way, but it does take a certain piece of the “human” side away from each. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;For Dr. Plattner, I think the Germanic personality comes through more than genuine arrogance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;After all, he doesn’t need it at this point.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The guy built and ran a $40B company.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Enough said.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Benioff often has this “do the right thing” Google-ish “morality” in several other interviews and videos.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But when you watch him in action here, the only thing that comes out is ruthless self-convinced warrior (it’s no coincidence his favorite read is Sun Tzu’s The Art of War).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Although conviction and the ability to back it up is noble (and key to business success), I’ve always feared people immutably driven by their own dogma (mind you, I actually buy into on-demand big time).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But as my high-school math teacher used to say “you can never shelter yourself from a surprise”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Finally, as I was lauding “frank-speak” earlier, I did want to point out that Dr. Plattner uses the term “shit” several times during the exchange.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Initially, referring to Salesforce grabbing Dupont from them he states “Why did he win DuPont?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Because we had a shitty CRM system, and he had a much better one.”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then later, referring to a customer still using code Dr. Plattner himself wrote: “…Shit! There is a customer in America still using the code I wrote.” Then referring to SAP’s earlier attempt at on-demand CRM: “…Shit, yeah!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It was better than our CRM on-demand.”&lt;span style="mso-spacerun:yes"&gt;   &lt;/span&gt;I find that endearing.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;To conclude, if you truly want to understand the ongoing (and upcoming) battles between the SaaS and on-premise proponents of the enterprise software industry, you owe it to yourself to watch this video or, at the very least, pull down the &lt;a href="http://www.armapartners.com/files/admin/uploads/W17_Field_2_27630.pdf"&gt;transcript&lt;/a&gt;. And bring some popcorn!&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-2352504867677002943?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/2352504867677002943/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/software-that-sucks.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2352504867677002943'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/2352504867677002943'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/software-that-sucks.html' title='Software that Sucks'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-5858419207910322948</id><published>2009-05-07T09:18:00.000-07:00</published><updated>2009-05-07T09:27:57.411-07:00</updated><title type='text'>Tidbits and Check this Guy out</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I read this article a couple of weeks ago and thought about one of our field test partners (telecom) because they had some political issues shipping us some data due to (very legitimate) privacy concerns – as in their CSO going “are you guys out of your f$##$ing minds?!?”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;As it turns out, there are several data obfuscation tools out there on the market, including DMSuite’s offering as described in this &lt;a href="http://www.byteandswitch.com/document.asp?doc_id=175990"&gt;article&lt;/a&gt;. I’m curious if most companies’ privacy policies make an exception for data that’s been altered by such a tool and if so, is there some sort of standard or certification these tools must meet?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you know anything about that, I’d appreciate some insight.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I didn’t know until last night that there actually is a CIQP Certification.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;What is CIQP you ask? Come on, get with the program!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Everyone knows what a Certified Information Quality Professional is!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is a whole &lt;a href="http://idqcert.iaidq.org"&gt;website &lt;/a&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;dedicated to DQ as well.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I had never heard about this professional category.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If anyone reading this happens to be in that category and/or CIQP Certified, I’d love to chat with you and learn more about it.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;For those of you who think the economy really sucks, your deduction is likely valid.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nevertheless, BI and on-demand software market indexes seem pretty healthy to me as this &lt;a href="http://datadoghouse.typepad.com/data_doghouse/2009/05/business-intelligence-bi-on-demand-software-indexes-sprinting-ahead-of-the-pack.html"&gt;article&lt;/a&gt; &lt;/span&gt;&lt;span class="apple-style-span"&gt;demonstrates.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;My conclusion: I’d rather be in the “avant-garde” BI enterprise software sector than working for SAP or Oracle at this point &lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:Wingdings;mso-ascii-font-family: Calibri;mso-ascii-theme-font:minor-latin;mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin;mso-char-type:symbol;mso-symbol-font-family: Wingdings"&gt;&lt;span style="mso-char-type:symbol;mso-symbol-font-family:Wingdings"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I discovered Guy Kawasaki’s Entrepreneurial Lectures delivered at Stanford in 2003-2004 via this &lt;a href="http://ecorner.stanford.edu/authorMaterialInfo.html?author=24"&gt;videocast&lt;/a&gt; series and sat there mesmerized listening to every single clip for hours.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Guy (who now runs this &lt;a href="http://blog.guykawasaki.com"&gt;blog &lt;/a&gt;&lt;/span&gt;and this &lt;a href="http://alltop.com"&gt;company&lt;/a&gt;) &lt;span class="apple-style-span"&gt;successfully evangelized the Mac in the mid-80s and now runs a VC firm called &lt;a href="http://www.garage.com"&gt;Garage Technology Ventures&lt;/a&gt;.&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-spacerun:yes"&gt;  His &lt;/span&gt;reputation and track record are legendary.  In the clips, he lectures young Stanford engineers-to-be on how to become successful entrepreneurs, change the world, and keep their soul in the process.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;These are the points (or quotes) from his lectures that were etched on my mind:&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Make meaning and make the world a better place.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Don’t write a mission statement, write a Mantra.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;If your product is not unique and adds no value, you’re doing something stupid.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Don’t ask people [customers] to do things you wouldn’t do.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Be a &lt;a href="http://en.wikipedia.org/wiki/Mensch"&gt;Mensch&lt;/a&gt;.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Hire infected people.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Suck down.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The higher you go in the enterprise, the thinner the oxygen.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;A milestone is something that increases the valuation of your company.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;The valuation formula for a startup is: add $500,000 per engineer and subtract $250,000 per MBA.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;                  &lt;p class="MsoNormal"&gt;All these points are perfectly in line with my personal experience.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But I’d never heard anyone formalize them in such an entertaining way before! And I don’t know that any comment can properly decorate any of these either. It’s one of those “you either get it or you don’t” kind of things.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It can’t be taught or inculcated by anything else than passion-driven experience.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-5858419207910322948?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/5858419207910322948/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/tidbits-and-check-this-guy-out.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5858419207910322948'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5858419207910322948'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/tidbits-and-check-this-guy-out.html' title='Tidbits and Check this Guy out'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-7102167652604573785</id><published>2009-05-05T09:32:00.000-07:00</published><updated>2009-05-05T15:11:24.646-07:00</updated><title type='text'>MAD About You.</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;I haven’t had a minute to sit down and blog lately.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Our upcoming “pre-release” software slated for Cinco de Mayo has absorbed all my efforts.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;I just flew back from Austin, TX last week to participate in final engineering and release touches.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;At the same time, one of our major Defense prospects just re-activated a huge project so it’s all hands on deck at XSPRADA these days.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;I love it when a plan comes together.&lt;?xml:namespace prefix = o /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Nevertheless, I recently picked up on a &lt;a href="http://databeta.wordpress.com/2009/03/20/mad-skills/"&gt;paper &lt;/a&gt;called “MAD Skills: New Analysis Practices for Big Data” referenced on Curt Monash’s &lt;a href="http://www.dbms2.com/"&gt;blog&lt;/a&gt; by one of its authors &lt;a href="http://db.cs.berkeley.edu/jmh"&gt;Joe Hellerstein&lt;/a&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;Joe is a CS professor at UC Berkeley, if I understand correctly, and a consultant for Greenplum. I expected to read a lot of pro-Greenplum content in there but I don’t feel the major arguments presented are specifically tied to this vendor’s architecture or features per say.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;What makes this paper really valuable, in my opinion, is that it “&lt;/span&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;resulted from a fairly quick, iterative discussion among data-centric people with varying job descriptions and training.&lt;/span&gt;&lt;span class="apple-style-span"&gt;” – In other words, it is user-driven (and not just a bunch of theoretical PhD musings) and as such, probably an accurate reflection of what I consider to be a significant shift in the world of BI lately.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Namely, that the “user” base is shifting from the IT type to an analyst and business person type.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Note I didn’t say “end-user”, because the end user is still the business consumer. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;But what’s really changing is the desire (and ability) to cut out the middle man (read: the IT “gurus”), in essence, at every step of the way from data ingestion to results production.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;To put it in terms Marx would have liked, the means of production are shifting from dedicated technical resources to actual consumers &lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="FONT-FAMILY: Wingdings; mso-ascii-font-family: Calibri; mso-ascii-theme-font: minor-latin; mso-hansi-font-family: Calibri; mso-hansi-theme-font: minor-latin; mso-char-type: symbolfont-family:Wingdings;" &gt;&lt;span style="mso-char-type: symbol;font-family:Wingdings;" &gt;J&lt;/span&gt;&lt;/span&gt; -- This is particularly true in the on-demand world, but probably pervasive throughout as well.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;I think the paper outlines and defines this change.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;“The conceptual and computational centrality of the EDW makes it a mission-critical, expensive resource, used for serving data-intensive reports targeted at executive decision-makers. It is traditionally controlled by a dedicated IT staff that not only maintains the system, but jealously controls access to ensure that executives can rely on a high quality of service.”&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span"  style="font-size:x-small;"&gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;The starting premise of the paper is that the industry is changing in its concept and approach to EDW. Whereas in the past an “Inmonish” view of the EDW was predicated on one central repository containing a single comprehensive “true” view of all enterprise data, the new model (and practical reality) is really more “Kimbalish” in the sense that the whole of EDW comprises the sum of its parts.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;And its parts are disparate, needing to be integrated in real time, without expectations of “perfect data” (DQ) in an instant-gratification world.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;This new model is premised on a MAD model: &lt;b style="mso-bidi-font-weight: normal"&gt;Magnetic, Agile and Deep. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;Magnetic because the new architecture needs to “attract” a multitude of data sources naturally and dynamically.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Agile because it needs to adapt flexibly to fast-shifting business requirements and technical challenges without losing a beat.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;And deep because it needs to support exploratory and analytical endeavors at all altitudes (detail/big picture) and for numerous user types simultaneously.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;“Traditional Data Warehouse philosophy revolves around a disciplined approach to modeling information and rocesses in an enterprise. In the words of warehousing advocate Bill Inmon, it is an “architected environment" [12]. This view of warehousing is at odds with the magnetism and agility desired in many new analysis settings.”&lt;/span&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span class="Apple-style-span"  style="font-family:CMR9;"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;“The EDW is expected to support very disparate users, from sales account managers to research scientists. These users' needs are very different, and a variety of reporting and statistical software tools are leveraged against the warehouse every day.”&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;Fulfilling this new reality requires a new type of analytical engine, in my opinion, because the old ones are premised on a model which, apparently, has not delivered successfully or sufficiently over time for BI.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;So the question beckons, what set of architectural features does a new analytical engine need to have in order to support the MAD model as described in this paper?&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;And more importantly (from where I stand), is the XSPRADA analytical engine MAD enough?&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;To support &lt;b style="mso-bidi-font-weight: normal"&gt;magnetism&lt;/b&gt;, an engine should make it easy to ingest any type of data.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;“Given the ubiquity of data in modern organizations, a data warehouse can keep pace today only by being “magnetic": attracting all the data sources that crop up within an organization regardless of data quality niceties.”&lt;/span&gt;&lt;/span&gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;The “physical” data presented to the XSPRADA engine must consist of CSV text files.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Currently CSV is the only possible way of presenting source data to the database. CSV is the least common data format denominator so this means the engine can handle pretty much any data source provided it can be morphed to CSV.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Since the vast majority of enterprise data lends itself to CSV export, that covers a fairly wide array of data sources.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;How easy is it to “load” data into the engine?&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Very easy, especially since the engine doesn’t “load” data per say but works off the bits on disk directly.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;All it takes is a CSV file (one per table) and a SQL DDL statement such as “CREATE TABLE…FROM &lt;filepath&gt;” to “present” data to the engine.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;“The central philosophy in MAD data modeling is to get the organization's data into the warehouse as soon as possible.” – I’d say this fulfills that philosophy.&lt;br /&gt;&lt;/filepath&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;To support &lt;b style="mso-bidi-font-weight: normal"&gt;agility&lt;/b&gt;, an engine shouldn’t dictate means and methods of usage and must be flexible:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;“Given growing numbers of data sources and increasingly sophisticated and mission-critical data analyses, a modern warehouse must instead allow analysts to easily ingest, digest, produce and adapt data at a rapid pace. This requires a database whose physical and logical contents can be in continuous rapid evolution… we take the view that it is much more important to provide agility to analysts than to aspire to an elusive ideal of full integration”&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;&lt;o:p&gt;The external representation of data on disk and the internal “logical” modeling of the data changes dynamically based on incoming queries. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;The XSPRADA engine is “adaptive” in that sense and constantly looks at incoming queries and data on disk to determine the most optimal way of storing and rendering it internally.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;This feature is called Adaptive Data Restructuring (ADR).&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;On the flexibility side, the XSPRADA engine is schema-agnostic.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;This is a fairly unique feature that allows users to “flip” schemas on the fly. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;&lt;o:p&gt;For example, it’s possible to present entire data sets to the engine based on an all VARCHAR schema (ie: make every column VARCHAR).&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Maybe you don’t know the real schema at the time, or perhaps you don’t care about it or perhaps the optimal schema can only be determined after some analysis is performed.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Or perhaps there are inconsistencies in the data or DQ issues preventing a valid “load” based on a rigid schema.&lt;span style="mso-spacerun: yes"&gt;  Or maybe it was just easier and quicker to export all the data as string types in the CSV.  &lt;/span&gt;In either case, the XSPRADA engine will happily ingest that data.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Later on, you can CAST each field as needed into a new table on the fly and run queries against the new model, or try others as needed.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Similarly, ingestion validation is kept to a minimum by design.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;For example, it’s quite possible to load an 80-char string into a CHAR(3) field.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;This is not possible with conventional databases.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;The implications of this from a performance and flexibility angle are impressive.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;The XSPRADA database lends itself to internal transformation; hence it favors an ELT model, minus the “L”.&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;&lt;o:p&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;“&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;In recent years, there is increasing pressure to push the work of transformation into the DBMS, to enable parallel execution via SQL transformation scripts. This approach has been dubbed &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="FONT-FAMILY: CMTI9;font-family:CMTI9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;ELT &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;since transformation is done after loading.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="mso-bidi-: ;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;”&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;And finally, to support &lt;b style="mso-bidi-font-weight: normal"&gt;depth&lt;/b&gt;, an engine should allow rich analytics, provide an ability to “focus” in and out on the data, and provide a holistic un-segmented view of the entire data set:&lt;br /&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="FONT-FAMILY: CMR9;font-family:CMR9;" &gt;&lt;span style="FONT-STYLE: italic" class="Apple-style-span"&gt;&lt;span class="Apple-style-span"  style="font-size:small;"&gt;“Modern data analyses involve increasingly sophisticated statistical methods… analysts often need to see both the forest and the trees in running these algorithms… The modern data warehouse should serve both as a deep data repository and as a sophisticated algorithmic runtime engine.”&lt;/span&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;A salient feature of the XSPRADA engine is its ability to handle multiple “types” of BI work at the same time.&lt;span style="mso-spacerun: yes"&gt;   &lt;/span&gt;For example, it’s possible to mix OLAP, data mining, reporting and ad-hoc workloads simultaneously on the same data (and all of it) without resorting to “optimization” tricks for each mode.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Similarly, the need for logical database partitioning doesn’t exist in the XSPRADA engine.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;Duplicating and re-modeling data islands on separate databases (physical or logical) for use by different departments is neither necessary nor recommended.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;In an OLAP use case, there is no need to pre-define or load multidimensional cubes.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;The very act of querying consistently (meaning more than once) based on fact and dimension axes causes the engine to realize that this particular section of data is being accessed “multi-dimensionally”. &lt;span style="mso-spacerun: yes"&gt; &lt;/span&gt;It then starts cubing information internally, aggregating as indicated (if needed) by incoming queries.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;In this mode, perhaps the engine will decide a columnar storage approach is optimal and will re-structure the data accordingly.&lt;span style="mso-spacerun: yes"&gt;  &lt;/span&gt;In a data mining use case, the approach is likely different because incoming queries are “incremental” (often pinpointed) and results are used to generate new queries without pre-determined patterns. The engine will likely start by eliminating vast “wasteland” areas of the data (the forest) from consideration as needed, then proceed to optimize specific islands of interest (the trees) as they become more relevant in the &lt;span class="Apple-style-span"  style="font-size:medium;"&gt;queries&lt;/span&gt;.&lt;span style="mso-spacerun: yes"&gt; &lt;span style="font-family:CMR9;"&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="LINE-HEIGHT: normal; MARGIN-BOTTOM: 0pt; mso-layout-grid-align: none" class="MsoNormal"&gt;&lt;span style="mso-spacerun: yes"&gt;&lt;span style="font-family:CMR9;"&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;So overall, I think the XSPRADA analytical engine was indeed designed with “MAD-ness” from the get-go, even if the term didn’t exist years ago.&lt;/span&gt;&lt;span style="mso-spacerun: yes"&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;It’s the approach and the philosophy that really matters.&lt;/span&gt;&lt;span style="mso-spacerun: yes"&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span"  style="font-size:medium;"&gt;In that respect, we’re definitely headed for the MAD-house :)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-7102167652604573785?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/7102167652604573785/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/mad-about-you.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7102167652604573785'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/7102167652604573785'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/05/mad-about-you.html' title='MAD About You.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-5468262343645536352</id><published>2009-04-13T13:28:00.000-07:00</published><updated>2009-04-13T14:07:56.400-07:00</updated><title type='text'>On-demand BI beyond SMB</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;The more I read, test and learn about on-demand BI every week, the more surprised I am to realize how many players there are on the market.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And this isn’t even just a small &amp;amp; medium size (SMB) market anymore as I originally assumed.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s a whole slew of on-demand BI companies out there targeting serious size enterprises with giga and terabyte size warehouses.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;It seems one of the main arguments against the performance (latency) issues for BaaS is that the penalty is imposed only once at load time. In other words, yes it’s time-consuming and fairly slow to upload warehouse data (sometimes up to weeks) due to current network and pipeline bandwidth, but it’s something that is done only once and subsequent pushes are essentially incremental and consequently much quicker.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I guess I can buy that argument provided the “done once” endeavor is resilient enough to resist catastrophic errors.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For example, I better be able to just pick up and continue where I left off if I spent 9 of 10 days waiting for my upload to finish before my network connection dropped or my server exploded.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;But even the “big guys” are dipping toes in the SaaS pool lately.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;As this &lt;a href="http://www.cxotoday.com/cxo/jsp/article.jsp?article_id=100387&amp;amp;cat_id=912"&gt;article &lt;/a&gt;points out, SAS is investing in the cloud big time.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;Vertica seems to have tacked from an “appliance” model to a hosted or cloud-based one (Vertica for the Cloud) as evidenced in their latest webinar as well, &lt;a href="http://www.vertica.com/_pdf/VerticaWebinarStonebrakerApril2009.pdf"&gt;The Cloud and the Future of DBMSs&lt;/a&gt; in which they pretty much repeat their usual marketing litany.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;Kognitio, who claims to have put the “D” in DaaS, just announced a cloud &lt;a href="http://www.kognitio.com/news/pressreleases/index.php?id=77"&gt;deal &lt;/a&gt;with Kelkoo.com, "Europe’s largest e-commerce website after Amazon and eBay”. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;They also have a cloud-based DaaS implementation with British Telecom (BT), who is about to &lt;a href="http://layofftracker.blogspot.com/2009/04/british-telecom-to-cut-10000-jobs.html"&gt;lay off 10,000 people&lt;/a&gt; incidentally.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/span&gt; Kognitio has &lt;span style="color:#333333"&gt;been one of the most cloud-aggressive companies out there.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;And of course behemoth Microsoft is breathing down everyone’s neck (discretely at the moment) via things like &lt;a href="http://www.sqlserverdatamining.com/cloud"&gt;www.sqlserverdatamining.com/cloud&lt;/a&gt; and the entire Madison and Azure platforms. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;Pretty much all the big players have some sort of stake in the “cloud” one way or another.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;No one wants to be left out, just in case. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But even beyond these well-known players you also have folks like these here, some of which address specific analytical niches:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="color:#333333"&gt;&lt;a href="http://www.deciphertech.com/"&gt;www.deciphertech.com&lt;/a&gt; – sales analytics with Salesforce.com&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.hostanalytics.com/"&gt;www.hostanalytics.com&lt;/a&gt; – financial analytics (budgeting/revenue planning) niche.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I initially thought these guys might be connected to &lt;a href="http://www.i-lluminate.com/"&gt;www.i-lluminate.com&lt;/a&gt; by the nature of the audio on their website. &lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.adaptiveplanning.com/"&gt;www.adaptiveplanning.com&lt;/a&gt; - Budgeting, forecasting and reporting analytics.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.quantivo.com/"&gt;www.quantivo.com&lt;/a&gt; – Customer behavior analytics.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.1010data.com/"&gt;www.1010data.com&lt;/a&gt; &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;- I believe they use tenbase on the backend, are columnar in architecture and have an ODBC as well as an Excel plug-in connector.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.shsna.com/"&gt;www.shsna.com/&lt;/a&gt; &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Nutricia North America (owned by Danon) makes baby food and now also runs Pentaho over MySQL in the EC2 cloud for its internal BI needs as described &lt;a href="http://bi.cbronline.com/news/nutricia_north_america_deploys_pentaho_bi_suite_on_the_amazon_ec2_130209"&gt;here&lt;/a&gt;.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.kpiweb.com/"&gt;www.kpiweb.com&lt;/a&gt; – Some French startup focusing on (I’m willing to bet) KPI metrics.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.limtree.com/"&gt;www.limtree.com&lt;/a&gt; – Another French startup. In fact, just a QlikView integrator.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Speaking of French sites, if you can read French then check out this BI blog at &lt;a href="http://www.legrandbi.com/"&gt;www.legrandbi.com&lt;/a&gt; – If you can’t read French, head over to &lt;a href="http://translate.google.com/"&gt;translate.google.com&lt;/a&gt; and read it anyway because it’s a precious resource full of interesting “in-your-face” BI insight and informational tidbits I have not found elsewhere.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;a href="http://www.pivotlink.com/"&gt;www.pivotlink.com&lt;/a&gt; which I mentioned in a previous post, is geared exclusively at large enterprise warehouses in the cloud (small players need not apply) and backed by Trident Capital from what I gather.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Even &lt;a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&amp;amp;articleId=334189"&gt;data integration&lt;/a&gt; seems to have made some inroads&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;in the cloud realm.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I often read that ETL and integration consume 70-80% of a typical BI project.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I don’t know if the actual proportion is this huge, but I _do_ know from experience that integration tends to get grossly underestimated.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That being said, it’s clearly a huge BI pain point and I was initially surprised to see anyone trying to do this on a hosted/cloud basis but the guys at &lt;a href="http://www.boomi.com/"&gt;www.boomi.com&lt;/a&gt; are pitching just that.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;And speaking of boomi, I have to hand it to them for trying this approach which I believe has merit but they have about the worse webinar I have _ever_ attended. I think they’ve managed to do absolutely everything that a company should avoid doing in a webinar namely:&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Advertize a webinar lasting more than 60 minutes. Right off the bat, that doesn’t give me a warm feeling.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;How could they possibly need that much time when everyone else is managing to stay at or under an hour?&lt;/li&gt;&lt;li&gt;Make it an expensive pain in the ass for people to connect.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s a toll-full number to call in the US. Fine. Then there’s also a 9 digit access code. Then there’s an audio pin, then another 9-digit webinar code. What the heck?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Display what seems like an interactive question-answer widget during the session but in fact have no one managing it on the other end.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Hate when that happens.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Hire a consultant presenter who is obviously quite astute technically but sporting a monotonically depressive voice.&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Take 35 minutes to explain how to take an FTP input, transform some rows, and output it to a folder, all on a local machine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Fascinating, but I think most people could have groked this rocket science in say, five minutes. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;No wonder they need 90 minutes to get through all this!&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;        &lt;p&gt;&lt;/p&gt;        &lt;p class="MsoNormal"&gt;The point being, I couldn’t stand more than 35 minutes of this treatment and decided to bail out and try it out later on my own.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Basically these guys have an agent-based architecture that allows you to connect to their “AtomSphere” (get it? Like atmosphere. Yeah) and have agents manipulate your data via connectors and transformations. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;You can setup and “send” agents onto other boxes and platforms. It did sound interesting technically, and I tried to pull their demo from the site to no avail. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I got a message saying they welcomed me.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Great.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Where’s the bits at? &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I emailed back to support. I was assigned a “request for assistance” case number. Wow. Finally I got another message saying my account was active and I could login from the main page (which contradicts the initial instructions claiming you’ll get some email with a link in it) – Oh, and as for the suggestion that since I was new to Boomi, I should register for one of their training webinars, thanks but no thanks.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I will definitely try it out though.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s too compelling to pass up based on a few minor glitches from weak marketing or customer support departments.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Moving right along, I did want to mention &lt;a href="http://www.lyzasoft.com/"&gt;www.lyzasoft.com&lt;/a&gt;, even though their offering is not “on-demand” per say, it kind of is in a “local” way.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Basically you download their java thick client application. You then plop a bunch of connectors onto a workbook, bring in some data, and start graphing or analyzing it within minutes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;All point-and-click, drag and drop.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes I know this sounds like a “so-what” scenario but you don’t understand: I actually had _fun_ using this thing, yet it’s far from a toy! &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;I wasn’t planning on spending more than 30 minutes with the product initially but ended up messing around with it for a couple hours. You can pull in anything from ODBC to flat-file connections, then you graphically describe relationships among tables (ie: joins) then you can merge that output with other data sources into a graph or statistical “component” where you then drag measures and attributes into corresponding axis “boxes” (much like the Excel pivot table designer interface).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Amazingly enough (to me) this stuff just worked.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s really cool how you can try stuff out and then back out or delete, then start from scratch or add/delete relationships and data at will.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s very intuitive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s some performance and UI quirks (not surprising running Java UI code on Windows) and I doubt you can bring in significant (read terabytes) amounts of data at this stage. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Yet Lyzasoft claims 175-200 maximum number of data input sources with largest customer databases at a “few million rows” of around 250-300 columns (I think that’s probably around 30-50GB of data roughly?).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But overall it’s an impressive beginning and, quite honestly, probably easy enough to adapt to a hosted model. Add to that excellent and efficient real-time customer support, and you have a winner worth looking at here that could, in my opinion, pose a serious challenge to someone like QlikTek, with the right analytical engine behind them.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-5468262343645536352?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/5468262343645536352/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/on-demand-bi-beyond-smb.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5468262343645536352'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/5468262343645536352'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/on-demand-bi-beyond-smb.html' title='On-demand BI beyond SMB'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-6576263810639344691</id><published>2009-04-07T14:05:00.000-07:00</published><updated>2009-04-07T14:20:14.431-07:00</updated><title type='text'>And up through the ground came a bubblin' crude</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Anyone not convinced that “doing BI” is both very hard and very expensive need only consult this &lt;a href="http://tinyurl.com/clkex6"&gt;link&lt;/a&gt;&lt;b&gt;&lt;span style="font-size:7.5pt;line-height:115%;font-family:&amp;quot;Verdana&amp;quot;,&amp;quot;sans-serif&amp;quot;; color:black"&gt;.&lt;/span&gt;&lt;/b&gt;&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes it’s from 2006, but have things really changed since then? Probably the opposite.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In his postings, SSAS guru Dave Rodabaugh describes the hiring process for BI/DW architects in a really compelling five-part story.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;He starts off by stating “I admit to being a real hammer when conducting an interview.” – Quite honestly, I don’t blame him given these folks are charging anywhere north of $250/hr these days.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;At that price, you _better_ damn well know your stuff!&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Here are a couple other posts on the trials and tribulations of hiring good DBAs: &lt;a href="http://tinyurl.com/ct75os"&gt;http://tinyurl.com/ct75os&lt;/a&gt; and &lt;span class="apple-style-span"&gt;&lt;a href="http://tinyurl.com/dksywb"&gt;http://tinyurl.com/dksywb&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;And finally, here’s what I thought was an interesting set of SQL Server interview &lt;a href="http://vyaskn.tripod.com/iq.htm"&gt;questions&lt;/a&gt; (understanding them alone is challenging).&lt;/p&gt;  &lt;p class="MsoNormal"&gt;What’s the point of all this?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;DW/BI is indeed rocket science.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Its High Priests are DBA and DW Architects. And with time, they become irreplaceable in the enterprise.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If a programmer, a software architect, or even an enterprise architect makes a mistake, you can usually catch it in time and if not, there is usually an opportunity to fix the problem downstream – it’s like losing one of several engines in flight.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s not fun, it can get bumpy, but chances are no one is going to die.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;On the other hand, a database BI/DW management mistake can be fatal because data can get whacked, irreversibly. &lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Or it can get stolen, or corrupted for example, with dramatic legal repercussions for the business.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is like an explosion in flight. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;It’s not something you can fix in time, and survival is not likely.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s why competent DBA and DW architects rule the BI/DW world.&lt;span style="mso-spacerun:yes"&gt;  If y&lt;/span&gt;ou want to see what these guys really do for a living and why they commend the big bucks, check out the following for starters:&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://dbscience.blogspot.com/"&gt;http://dbscience.blogspot.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://prodlife.wordpress.com/"&gt;http://prodlife.wordpress.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://www.fulltablescan.com/"&gt;http://www.fulltablescan.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://jesseorosz.spaces.live.com/"&gt;http://jesseorosz.spaces.live.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://optimaldba.blogspot.com/"&gt;http://optimaldba.blogspot.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://www.petefinnigan.com/weblog/entries/"&gt;http://www.petefinnigan.com/weblog/entries/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNoSpacing"&gt;&lt;a href="http://www.sqlservercentral.com/"&gt;http://www.sqlservercentral.com/&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt;So I have two questions about the existing state of BI.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;First, why did it get so complicated and second, is it good or bad for business?&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;I think the first question is simple enough.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Things are like this because the darn products are simply too hard to use.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You shouldn’t need a PhD to properly setup, configure, deploy, tune and question a database engine.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Forgive me for liking simplicity, but last I checked, operations manuals for behemoths like DB2, Oracle and SQL Server were in the thousands of pages combined. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;To me, when a set of product manuals have more pages than the &lt;a href="http://government.zdnet.com/?p=3620"&gt;Federal Budget&lt;/a&gt;, that’s cause for concern.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’m an avid follower of the &lt;a href="http://en.wikipedia.org/wiki/KISS_principle"&gt;KISS &lt;/a&gt;principle.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Recently I purchased this &lt;a href="http://tinyurl.com/c4nuqa"&gt;book&lt;/a&gt; which is essentially a Microsoft SQL Server Analysis “bible” covering every part of that ecosystem including SSAS, SSIS, SSRS and Excel.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It runs 624 pages.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And quite honestly, unless you have significant mastery of each subject matter, you’re not going to be very effective doing BI at the enterprise level (at least on the Microsoft stack, but the others are typically hairier anyway, let’s face it).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In my opinion, this book represents the _minimum_ one needs to know to be effective in this business.&lt;span style="mso-spacerun:yes"&gt;  And believe me t&lt;/span&gt;hat’s not for the casual weekend DBA.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The second question is a bit more subtle.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;How do you define “bad for business”?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;For the past forty years, business leaders (CIOs) and bean counters (CFOs) have computed the total costs of owning and exploiting these databases (salaries, hardware, software, licenses, power, time wasted, failure rates etc.) and determined that the rewards justified the costs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;But do they really?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In light of the well-documented failure rates among business intelligence projects in the past years, can they possibly be right?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It isn’t too challenging to pull up a myriad of articles such as these decrying and documenting the dismal success rates of BI project all over the planet: &lt;span class="apple-style-span"&gt;&lt;span style="color:black; mso-bidi-font-weight:bold"&gt;&lt;a href="http://tinyurl.com/cr8lt8"&gt;http://tinyurl.com/cr8lt8&lt;/a&gt; or &lt;a href="http://tinyurl.com/cnc36h"&gt;http://tinyurl.com/cnc36h&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;And in some cases, you also need a PhD to even figure out _how_ to calculate ROI on a BI endeavor as this 2002 44-slide &lt;a href="http://tinyurl.com/dkok6m"&gt;PowerPoint presentation &lt;/a&gt;from Jonathan Wu will attest to.&lt;/p&gt;&lt;p class="MsoNormal"&gt;So from the looks of it, countless millions of dollars and hours of intellectual effort have been spent in the past decades achieving mostly failure.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The same argument could be made (and has often times) about software development in general, but it looks like BI is faring even worse!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;When’s the last time you heard about a successful enterprise BI project that wasn’t over budget, over schedule or over-complicated?&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;We know what happens when complexity is used to hide questionable assumptions, processes and results.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Ask anyone at AIG. If it’s too complex, too obfuscated, and too inaccessible, then chances are it’s bad business.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;And so it is in light of this that I have recently started to think differently about this “cloud” offering in the BI world sometimes referred to as BaaS, BIaaS or even DaaS.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Having been in software technology for twenty years, I’ve seen my share “new improved”, world-changing, and “revolutionary” concepts come and go.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;But people are missing the point about this one as it applies to BI.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;The BI “cloud” isn’t about technology, delivery, security or governance hurdles per say. It’s about abstraction.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And abstraction is the antidote to complexity. This is why I feel many players in today’s BI world are threatened by this concept, and why there’s so much gold in them thar hills when applied to enterprise business intelligence in judicious ways.&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-6576263810639344691?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/6576263810639344691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/and-up-through-ground-came-bubblin.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6576263810639344691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/6576263810639344691'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/and-up-through-ground-came-bubblin.html' title='And up through the ground came a bubblin&apos; crude'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1535916767982220897</id><published>2009-04-01T09:10:00.000-07:00</published><updated>2009-04-01T09:38:03.796-07:00</updated><title type='text'>Mind your own business [intelligence] Part II</title><content type='html'>First, I want to thank Curt Monash (&lt;a href="http://www.dbms2.com"&gt;www.dbms2.com&lt;/a&gt;) for mentioning my blog in his &lt;a href="http://tinyurl.com/c4h7wp"&gt;postings &lt;/a&gt;this morning -- that's really nice of him.  He is also reviewing several BI players at the moment.  God knows there's a lot of them.  Go read his reviews.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Since my last post about that market, I emailed with Gooddata's CEO and Founder Roman Stanek who was kind enough to put up with my questioning back &amp;amp; forth emails.  He suggested several other names I might be interested in exploring so I set out to do just that.  In the process, I discovered &lt;a href="http://www.indicee.com/"&gt;Indicee &lt;/a&gt;who totally blew me away.  They have a clean, fast, operational UX. Not only was I able to upload data immediately, but they were nice enough to kick up my account size so I could push up more data. Next, I have to say their customer support response time is just awesome.  For me, Mr. instant gratification, this says a lot about a company. And finally, believe it or not, you can ask questions of the data in English!  This has been one of my "dreams" for a long time and I had actually envioned the same kind of interface Outlook has to setup mail rules, but theirs is even cooler.  As you probe the data, they show you what the question is like _IN ENGLISH_ -- Nirvana.  Their stuff just works.  These Canadians can write software I tell you!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Not so impressive was &lt;a href="http://www.pivotlink.com/"&gt;PivotLink&lt;/a&gt;.  Their website clearly displays a link marked "Start Free Trial".  Stupid me, I thought I might be able to get a free trial.  No such luck.  Instead, I was contacted by a Sales Development person who offered to schedule a "demo/webinar" for me and "go over my requirements" and determine if their "solution is a fit for the company".  Are they kidding? Why don't you let ME the customer decide for myself?  Thanks but no thanks.  Then, they offered to "send me over" to their Chief Marketing Office.  Needless to say, by that time I had already scratched off their name from my list of potentials.  Disingenous and condescending. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Next in line was &lt;a href="http://www.lucidera.com/"&gt;Lucidera&lt;/a&gt;. I contacted the company to see if a trial version was available.  I'm not even going to write one word about this little experience.  Instead, I'm going to post the email response I received -- you decide:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: arial; font-size: 13px; "&gt;We actually have somewhat of a different "trial" experience than most&lt;br /&gt;organizations. Since we are an on-demand model and analytics tends to be&lt;br /&gt;pretty extensive, we have what is called a Pipeline Healthcheck. What we&lt;br /&gt;do with the Pipeline Healthcheck after having a call with you to go over&lt;br /&gt;your analytic requirements is we access your Salesforce.com data and&lt;br /&gt;bring that into our application via a read-only log in and we analyze&lt;br /&gt;your sales data, focusing on your sales people, sales processes and your&lt;br /&gt;pipeline while highlighting areas of opportunity or potential risks that&lt;br /&gt;we find based on our best practice analytics. Customers and prospects&lt;br /&gt;are typically finding hundreds of thousands to millions of dollars in&lt;br /&gt;revenue opportunity. The Pipeline Healthcheck also serves as a business&lt;br /&gt;case with quantifiable value based on your sales data if you think the&lt;br /&gt;application makes sense for your business. If you'd like to set up a&lt;br /&gt;call to discuss further and your organization is a Salesforce.com&lt;br /&gt;customer, please let me know of a few good times and a number to reach&lt;br /&gt;you and I'd be happy to go over this further as well as discuss your&lt;br /&gt;interest in sales analytics. Thank you!&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Huh?!? I don't know about you, but the last thing I need these days is to be "pipeline checked". It just doesn't sound appropriate on a first date.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-1535916767982220897?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/1535916767982220897/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/mind-your-own-business-intelligence.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1535916767982220897'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1535916767982220897'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/04/mind-your-own-business-intelligence.html' title='Mind your own business [intelligence] Part II'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-3361307677256775321</id><published>2009-03-31T15:13:00.000-07:00</published><updated>2009-03-31T15:43:51.358-07:00</updated><title type='text'>2 = 2</title><content type='html'>&lt;p class="MsoNoSpacing"&gt;Trawling through various “sales engineering” blogs recently, I happened to come across this one &lt;span class="apple-style-span"&gt;&lt;b&gt;&lt;span style="color:black;"&gt;&lt;a href="http://tinyurl.com/dz8nma"&gt;http://tinyurl.com/dz8nma&lt;/a&gt; &lt;/span&gt;&lt;/b&gt;&lt;span style="mso-bidi-font-weight:bold;color:black;"&gt;by Xavier Petit who is apparently a fellow “paisan” from France.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Xavier seems like a sharp guy with an interesting manipulation of written English but nevertheless full of excellent insight, in my opinion.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Not the least of which is his reference to this other site here &lt;/span&gt;&lt;/span&gt;&lt;a href="http://www.morenewmath.com/"&gt;http://www.morenewmath.com&lt;/a&gt;.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Part of my job sometimes involves trying to explain or demo esoteric mathematical concepts to people who want to “drill-down” into our product’s technology (rare, but it happens) so you can imagine I was eager to learn more about this “new math” approach to performing successful demos.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Little did I realize initially that these were just mathematically-expressed (terse) philosophical principles (equations about life) – some of which are really pertinent to what XSPRADA is about. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;For instance:&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZG2vqLaI/AAAAAAAACLw/TGfESlYaeQo/s1600-h/slide1.png"&gt;&lt;img src="http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZG2vqLaI/AAAAAAAACLw/TGfESlYaeQo/s320/slide1.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319482452931653026" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;I wish I had a dollar (ok, in this economy, make that a Jackson) for every time I’d heard someone say “you guys can’t do this, it’s impossible”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Now we have working software to prove them wrong.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZnXvMNDI/AAAAAAAACMA/tkHhVJYnLO0/s1600-h/slide2.png"&gt;&lt;img src="http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZnXvMNDI/AAAAAAAACMA/tkHhVJYnLO0/s320/slide2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319483011543872562" style="cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;A lot of folks initially said we were crazy for even attempting to disrupt the relational “status-quo”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Five years later, here we are, and in good company I might add.&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZUNCl7yI/AAAAAAAACL4/Ja34UlUWbYM/s1600-h/slide2.png"&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 0, 0); "&gt;&lt;/span&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKZ0d3WyqI/AAAAAAAACMI/Fk0t_7JRNsQ/s1600-h/slide3.png"&gt;&lt;img src="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKZ0d3WyqI/AAAAAAAACMI/Fk0t_7JRNsQ/s320/slide3.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319483236527032994" style="cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;The concept of uniqueness is very hard to define mathematically.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Our ability to do just that is one pillar of our math-based technology.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKZ92KsUUI/AAAAAAAACMQ/IrPDgPjReb8/s1600-h/slide4.png"&gt;&lt;img src="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKZ92KsUUI/AAAAAAAACMQ/IrPDgPjReb8/s320/slide4.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319483397669409090" style="cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;Why disappoint people (and the market) by making overhyped promises and under-delivering.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;We went the other route.&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_dCMl0ieMmTw/SdKaFWcLBsI/AAAAAAAACMY/0sr-K1JA2Cw/s1600-h/slide5.png"&gt;&lt;img src="http://1.bp.blogspot.com/_dCMl0ieMmTw/SdKaFWcLBsI/AAAAAAAACMY/0sr-K1JA2Cw/s320/slide5.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319483526591743682" style="cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;When perception matches reality over time, you have a successful product (think Rolex or BMW). People often ask me “what’s your value proposition”? I say “You tell me, here’s the software, here’s what it does, but don’t take my word, try it out.”&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They’ve all heard the “better, faster, cheaper” pitch before, so why waste their time when it’s better spent actually using the software?&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKaQpaF1nI/AAAAAAAACMg/_6JTBmsZZvI/s1600-h/slide6.png"&gt;&lt;img src="http://4.bp.blogspot.com/_dCMl0ieMmTw/SdKaQpaF1nI/AAAAAAAACMg/_6JTBmsZZvI/s320/slide6.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5319483720661849714" style="cursor: pointer; width: 320px; height: 208px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;And my personal favorite, which has nothing to do with software, databases, or XSPRADA but fulfills me as a wine lover. &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;A votre santé! &lt;span style="font-family:Wingdings; mso-ascii-font-family:Calibri;mso-ascii-theme-font:minor-latin;mso-hansi-font-family: Calibri;mso-hansi-theme-font:minor-latin;mso-char-type:symbol;mso-symbol-font-family:Wingdings;"&gt;&lt;span style="mso-char-type:symbol;mso-symbol-font-family:Wingdings;"&gt;J&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p class="MsoNoSpacing"&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-3361307677256775321?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/3361307677256775321/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/trawling-through-various-sales.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3361307677256775321'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/3361307677256775321'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/trawling-through-various-sales.html' title='2 = 2'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_dCMl0ieMmTw/SdKZG2vqLaI/AAAAAAAACLw/TGfESlYaeQo/s72-c/slide1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-4244377576397081493</id><published>2009-03-29T16:44:00.000-07:00</published><updated>2009-03-30T07:56:58.438-07:00</updated><title type='text'>Mind your own business (intelligence)</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;As I mentioned previously, I’m not a BI power user.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Until twelve months ago, I couldn’t have told you the difference between HOLAP, ROLAP or MOLAP, much less explained what a slowly-changing dimension was, or what the difference was between star and snowflake schemas.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Nevertheless, I do have a technical background and I’ve learned a boatload about BI in the past year.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In the past months, I’ve been trying to put myself in the mid-market LOB user’s shoe by using all and any available tool I could find to “do BI” using our ODBC-talking analytical database as the backend (I wanted to do this from a non-enterprise perspective).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I tried to get my hands on every product I could get from front-end analysis packages (either thick client or web-based) to ETL tools, both proprietary or open source.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Those are part of what I call the BI “ecosystem”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;My testing had several goals:&lt;span class="Apple-style-span" style="font-family: Georgia; "&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-mso-bidi-theme-font: minor-latin;font-family:Calibri;color:black;"&gt;&lt;span style="mso-list:Ignore"&gt;&lt;span style="font:7.0pt &amp;quot;Times New Roman&amp;quot;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;/p&gt;&lt;ol&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: Arial; "&gt;How easy is the tool to install, setup and use without direction, manuals or training.&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: Arial; "&gt;How easy is it to connect to our database engine via ODBC.&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: Arial; "&gt;How easy is it to extract data from our engine and do some very basic analysis.&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;      &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;So I loaded the small Pentaho Mondrian sample database (FoodMart) into the XSPRADA engine and went to work.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Microsoft Excel&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.microsoft.com/excel"&gt;www.microsoft.com/excel&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;With Excel 2007, I generated an ODBC connection and quickly pulled in a data subset using Microsoft Query.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Then, I inserted a pivot table based on that data island in a new sheet, and did basic slice/dice analysis on the data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Total time spent: all of maybe 5 minutes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There’s a small instructional video on how to do this at &lt;/span&gt;&lt;/span&gt;&lt;a href="http://tinyurl.com/ckfwp6"&gt;&lt;b&gt;http://tinyurl.com/ckfwp6&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color:black;"&gt;. &lt;/span&gt;&lt;/b&gt;&lt;span style="mso-bidi-font-weight: bold;color:black;"&gt;&lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;Not rocket science.&lt;b&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;QlikView&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.qliktech.com/"&gt;www.qliktech.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;You can download evaluation copies of QlikView from the website. When I first ran it, I thought, hey, this is MDX for dummies with an Excel twist (workbook paradigm).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;QlickTech actually bills itself as “Excel on steroids”.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;You can get results with QlikView fairly quickly out of the box.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Yes it takes a bit of training and practice to do anything really significant, as the product is very rich (and can do some serious ETL in the process) but overall it’s very visual and intuitive.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And if you run into any issues, their sales engineers are readily available and quite effective.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Tableau -- &lt;a href="http://www.tableausoftware.com/"&gt;www.tableausoftware.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I pulled Tableau evaluation bits from their website and also attended one of their webinars.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Unfortunately, Tableau doesn’t talk to ODBC. I’m not sure why.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That pretty much ended my evaluation right there.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;AltoSoft&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.altosoft.com/"&gt;www.altosoft.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I tried Insight Studio but I couldn’t extract table schemas from our system via ODBC.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Quite honestly, I think it’s because our driver probably doesn’t support some level 1 API the tool is using.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So then I tried pulling from my FoodMart database inside SQL Server 2008.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even then, I never “got” the tool.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It requires some training and/or patience I simply don’t have.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It’s too complicated right off the bat.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I see I can create some PKIs, but then the UI isn’t really obvious or smooth.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;It just doesn’t “flow” and there’s some web-based dashboard stuff too which itself isn’t exactly self-explanatory and pops up tons of little Cassini web server on my system.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Panorama -- &lt;a href="http://www.panoramasoftware.com/"&gt;www.panoramasoftware.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;For obvious reasons, these folks only talk to Microsoft SQL Server and SAP back ends.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Sharp UI and good usability, but, in my case, of little use.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Talend&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.talend.com/"&gt;www.talend.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Talend is a French open source (OSS) company. (Incidentally, in this economy, they managed to recently score a $12M round --&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;not too shabby).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I tried their Open Studio tool and was able to do some very basic ETL to generate and convert CSV files to XSPRADA CVS specifications. This is important because at the moment, this is the only format you can present to our engine. So I wanted to see how easily someone could extract data from say Oracle or SQL and convert it to our specs.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The answer is: easily. That tool has a lot of knobs and switches and deep functionality but I was able to do simple stuff fairly quickly without diving too deep into Java code.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Given my OSS biases, I was impressed.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Pentaho&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.pentaho.com/"&gt;www.pentaho.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Pentaho distributes an open source OLAP platform called Mondrian that talks SQL via MDX to numerous underlying databases, including ours.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I’ve been using Mondrian on Windows with the built-in JDBC-ODBC bridge to our database for months now without a problem.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With Mondrian’s web-based UI you can slice and dice FoodMart data any which way you like.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Where it gets a little hairy is actually setting up the cubing structures for your own data.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That does take expertise and is not for the casual LOB user (think: lots of trial and error) . But with enough effort and community support, numerous people and organizations have apparently done just that.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Birst&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;-- &lt;a href="http://www.birst.com/"&gt;www.birst.com&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I was very excited to hear about this new company because it seemed they had a really new approach to hosted on-demand BI.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;They have a subscription-based service.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I quickly created an account and tried to upload some simple data (SQL’s from my ODBC connection) and see immediate results. Unfortunately that was about a week ago and since then, I still haven’t been able to either load or analyze data on their site due to various technical glitches.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;This is really unfortunate because Birst’s claim to fame is “ease of use” but if they can work the quirks out (and find some really good UX people) I think there’s huge potential in their “hands-off” approach which is basically what I was looking for, namely “here’s some data, here’s a question, show me something quickly without my having to become a cube expert”.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight: normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Gooddata &lt;span style="mso-spacerun:yes"&gt; &lt;/span&gt;-- &lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;a href="http://www.gooddata.com/"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;"&gt;www.gooddata.com&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;span class="apple-style-span"&gt;&lt;b style="mso-bidi-font-weight:normal"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;This company is another “on-demand” hosted BI offering.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;The application is in beta mode and it shows.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nevertheless, I was able to upload data and run slice/dice charts in a very intuitive manner within minutes.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;One of the major issues I see is performance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even with small data sets, upload and processing operations are very slow (I believe they run off of Amazon cloud services).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I am hopeful the UI/performance/usability quirks will be resolved in a production version because this is the only on-demand player I’ve seen that seems to “get it” so far.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Ironically enough, they offer the Mondrian sample data set as a “default” project to play with when you create an account.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Felt right at home.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;I draw the following conclusions/recommendations from this little experiment:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoListParagraphCxSpFirst" style="text-indent:-.25in;mso-list:l0 level1 lfo2"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="font-family:Symbol;mso-fareast-font-family: Symbol;mso-bidi-font-family:Symbol;color:black;"&gt;&lt;span style="mso-list:Ignore"&gt;·&lt;span style="font:7.0pt &amp;quot;Times New Roman&amp;quot;"&gt;        &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Doing BI is not easy (duh).&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Although some of these tools are more intuitive than others, we’re still a long way from “easy” BI.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Even with Excel you still have to know something about pivot tables, cubes (if you’re connecting to SSAS) and connections.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;With Mondrian, it really helps to know MDX.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In that context, the on-demand solutions are simpler.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And of course with hosted approaches, you don’t need a license key for Office.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoListParagraphCxSpMiddle"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;&lt;o:p&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;LOB users can’t use OSS tools without “expert” help.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;There is no way a typical business person can suffer the slings and arrows of Mondrian or Talend quirks and “gotchas” on his or her own.&lt;/span&gt;&lt;/span&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoListParagraphCxSpMiddle"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;A lot of vendors are really very confused about what constitutes good UX.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;In some cases, I truly wonder if these folks ever run usability studies.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Please take the app home to a spouse or parent and have them use it.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I swear that’s helpful.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;My mom is an Excel wizard yet can barely plug in a USB wireless mouse.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoListParagraphCxSpMiddle"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;Unless you have a tool that is easier to setup and use than Excel, don’t bother!&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;I mean what’s the point? Excel has massive market penetration.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Nobody is going to care about a new application unless it can beat Excel hands down in usability and performance.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Also, you can share Excel spreadsheets and data in the “cloud” nowadays using various providers (like google) so again, unless you have some compelling collaboration features to beat that, think again.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;On the enterprise side, people are pegged into lock-in situation driven by licensing discounts and vendor pressure.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;If you’re an Oracle, IBM or Microsoft shop, chances are you will be using a certain set of BI tools no matter what.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;And I don’t think a Microsoft shop will be running Cognos, or an IBM shop SSAS.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;Companie like QlikTech have a really tough time penetrating the large enterprise I believe.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span class="apple-style-span"&gt;&lt;span style="mso-bidi-;font-family:Arial;color:black;"&gt;But everywhere else, new approaches are not only available, but also politically possible.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;So I think the commoditization of BI will have to come from the bottom up. In that respect, the foundation currently being laid by on-demand and hosted BI services is much more important to the future of BI than meets the eye.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-4244377576397081493?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/4244377576397081493/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/mind-your-own-business-intelligence.html#comment-form' title='15 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4244377576397081493'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/4244377576397081493'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/mind-your-own-business-intelligence.html' title='Mind your own business (intelligence)'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>15</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1714233062789465624</id><published>2009-03-12T16:09:00.000-07:00</published><updated>2009-03-12T16:17:45.878-07:00</updated><title type='text'>There are only two types of people I cannot stand: people who are intolerant of other peoples' cultures, and the dutch.</title><content type='html'>In my quest to mine the business intelligence market to the last drop, when I’m not busy doing other things like talking to people, setting up POCs, coding, looking at product features or evaluating what I call “ecosystem” products like BI and ETL platforms, I spend a heck of a lot of time online reading blogs and scanning websites. &lt;br /&gt;&lt;br /&gt;This market is very dynamic and it’s important to keep up and spot trends and pain points early on.  This isn’t unlike an NSA type of endeavor really.  There are gazillions of “signals” out there and it’s up to me to pick up the pieces and make sense out of them as best I can. &lt;br /&gt;&lt;br /&gt;When I do this successfully, I can not only enhance our competitive edge, but also discuss intelligently how we fit into the business intelligence “big picture”.  In this business, someone who is not constantly educating himself is in a world of hurt.&lt;br /&gt;&lt;br /&gt;To do this, I scan several dozen blogs and websites an almost daily basis (see my partial list following this posting). This activity, combined with talking to folks in the BI world, product specialists, consultants, being involved in coding and testing a full-fledged database product, and attending webinars for the past year has given me a certain outlook on the market that carries “no baggage”.  Let me explain.&lt;br /&gt;&lt;br /&gt;I don’t come at this with the perspective of someone who has been immersed exclusively in the BI world for decades, or someone who has the hands-on background designing and implementing data warehouses his entire career.  I’m just a software engineer who happens to have worked for a multitude of different businesses and in numerous domains, not just BI.  So I feel this gives me a more “detached” view of the DW/BI market than an industry expert with BI-specific experience might have.  That being said, the following are nothing more than observation-based personal opinions about the warehousing and BI market developed over the past 12 months.  I’d love to hear your thoughts as well – at the risk of being branded a mere neophyte.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;And then there was light…&lt;br /&gt;&lt;/strong&gt;Acceptance of “non-standard” (read: not from Microsoft, IBM or Oracle) approaches to analytical databases (including non-relational approaches) is going mainstream in the warehousing realm.  Four to six years ago, if you talked to anyone in the enterprise about trying a “non-relational” database, they’d flip the bozo bit on you instantly. Nowadays, although relational “bigots” still (and always will) exist, BI people are more open-minded, mostly thanks to dozens of “new-breed” players in the market with well-documented successful implementations (not the least of which are Netezza, Teradata, Vertica and SybaseIQ).  To use a phrase I detest, people are finally “thinking outside the Big Three box”.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Gimme a little OLTP with that OLAP would you?&lt;br /&gt;&lt;/strong&gt;The distinction between operational (transactional) and warehousing (analytical) business activity is blurring.  Operational and analytical business efforts are often integrated.  Warehousing and analytics is no longer “the crazy uncle in the attic” project. Another sign of this is the recent desire to enable ever more frequent insert/updates to warehouse stores than in the past.  Not content with infrequent batch updates, people are now looking at efficiently pushing update/inserts real time into their warehouse.  The updates often come from both an ODS and external data sources. Unfortunately most analytical databases are designed around the assumption that warehousing is mostly read-only (InfoBright ICE doesn’t even support DML) and optimize as such.  I think they’re up against hard times unless they can “transactionalize” quickly.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Open source: cheaper than free?&lt;br /&gt;&lt;/strong&gt;Open source is having a large impact in analytics for both economic and practical reasons.  Nowadays, businesses can set up data marts using engines like InfoBright (MySQL), for example, and use BI/Integration tools like Talend’s Open Studio or Pentaho’s Mondrian. Clearly a lot of this is driven by current economics, but open source deployment and licensing models are competing head-on with proprietary solutions on many levels.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Hey buddy, want some good BI?&lt;br /&gt;&lt;/strong&gt;Everyone’s talking about BI.  Microsoft is running TV ad spots about it during prime-time and pushing the concept in a very public manner.  After buying up Datallegro last year, they announced the Madison project, put out best-practice configurations for Dell and HP “appliances” hosting SQL Server, (&lt;a href="http://www.intelligententerprise.com/channels/information_management/showArticle.jhtml;jsessionid=GSPUKPDEIPLJCQSNDLPSKHSCJUNN2JVN?articleID=214502509"&gt;http://www.intelligententerprise.com/channels/information_management/showArticle.jhtml;jsessionid=GSPUKPDEIPLJCQSNDLPSKHSCJUNN2JVN?articleID=214502509&lt;/a&gt;) and have been discretely adding warehouse-oriented features to SQL since 2007 (&lt;a href="http://msdn.microsoft.com/en-us/library/cc278097.aspx"&gt;http://msdn.microsoft.com/en-us/library/cc278097.aspx&lt;/a&gt;).  What’s more, they own the BI desktop with Excel and Office 14 promises a significant OLAP functional push. Vaporware?  Maybe, but I think they’re gunning to eat a lot of BI folks’ lunches out there in the coming months. If anyone can commoditize DW and BI technology, it’s Microsoft.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Performance, shperformance&lt;br /&gt;&lt;/strong&gt;“Performance” is a big mystery.  No one really knows how to define it in the DW/BI space.  Is it load time?  Is it query response time? Is it data presentation to result time?  Does it include backup time? How about data recovery?  Opinions differ.  Service level expectations are often misguided or unrealistic.  People don’t seem to care much about TPC-H or TPC-DS performance metrics.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;The proof is in the concept&lt;/strong&gt;&lt;br /&gt;POC POC POC! People want to see numbers on their data on their hardware up close and personal.  That’s the way it should be. There’s too much hype in the industry, and people tune out the bullshit &lt;a href="http://www.biblogs.com/2009/03/12/rejecting-stale-tech-marketing-words/"&gt;http://www.biblogs.com/2009/03/12/rejecting-stale-tech-marketing-words/&lt;/a&gt; -- Who can blame them. These folks have been abused and lied to for decades now.  As I’ve been on the other side of the fence many times, I can relate.  That’s why I always keep it real simple when talking about our product.  Everyone’s heard the “better, faster, simpler” shpiel a million times, so why insult their intelligence. Give them a set of keys and let them test-drive the darn product!  POCs should be run as described at &lt;a href="http://www.altosoftcommunity.com/"&gt;http://www.altosoftcommunity.com/&lt;/a&gt; and &lt;a href="http://www.altosoftcommunity.com/?p=61"&gt;http://www.altosoftcommunity.com/?p=61&lt;/a&gt; and &lt;a href="http://www.dbms2.com/2009/03/02/ideas-for-bi-pocs/"&gt;http://www.dbms2.com/2009/03/02/ideas-for-bi-pocs/&lt;/a&gt;.  There isn’t a single solution in this market that applies perfectly to every customer across the board and any vendor claiming otherwise is either naïve or disingenuous.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Cloud cool-aid&lt;br /&gt;&lt;/strong&gt;“Cloud computing” madness has taken hold in the DW/BI industry as well.  Vertica and Kognitio are big pushers of warehouse cloud hosting solutions.  The buzzword now is DaaS (data as a service).  I find it hard to get excited about this recent trend.  I don’t mean to Andy Rooney the whole concept, but in enterprise warehousing and BI, given the amounts of data involved and the security, governance, availability, and SLA issues, I just don’t “grok” it.  Maybe for small segments of “cold” data? I don’t know.  I see the craze, I feel the buzz, I get the marketing upside, and I notice the traction but…I am not of the body.  The power of clouds doesn’t compel me.&lt;br /&gt;&lt;br /&gt;Here’s my list of DW/BI industry blogs:&lt;br /&gt;&lt;a href="http://www.altosoftcommunity.com/?feed=rss2"&gt;http://www.altosoftcommunity.com/?feed=rss2&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.b-eye-network.com/rss/content.php"&gt;http://www.b-eye-network.com/rss/content.php&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.biblogs.com/"&gt;http://www.biblogs.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://bibrain.blogspot.com/"&gt;http://bibrain.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.byteandswitch.com/"&gt;http://www.byteandswitch.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://chucksblog.emc.com/chucks_blog/"&gt;http://chucksblog.emc.com/chucks_blog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://customerexperiencematrix.blogspot.com/"&gt;http://customerexperiencematrix.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.dbms2.com/"&gt;http://www.dbms2.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://prodlife.wordpress.com/"&gt;http://prodlife.wordpress.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.informationweek.com/"&gt;http://www.informationweek.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.intelligententerprise.com/movabletype/blog/msmith.html/blog/"&gt;http://www.intelligententerprise.com/movabletype/blog/msmith.html/blog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.intelligententerprise.com/movabletype/blog/skemsley.html/blog/"&gt;http://www.intelligententerprise.com/movabletype/blog/skemsley.html/blog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.intelligententerprise.com/movabletype/blog/dhensche.html/blog/"&gt;http://www.intelligententerprise.com/movabletype/blog/dhensche.html/blog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.intelligententerprise.com/movabletype/blog/nraden.html/blog/"&gt;http://www.intelligententerprise.com/movabletype/blog/nraden.html/blog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://kevinclosson.wordpress.com/"&gt;http://kevinclosson.wordpress.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.petefinnigan.com/weblog/entries"&gt;http://www.petefinnigan.com/weblog/entries&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.sqlservercentral.com/"&gt;http://www.sqlservercentral.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://storagezilla.typepad.com/storagezilla/"&gt;http://storagezilla.typepad.com/storagezilla/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.databasecolumn.com/"&gt;http://www.databasecolumn.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://thestorageanarchist.typepad.com/weblog/"&gt;http://thestorageanarchist.typepad.com/weblog/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://andyonsoftware.com/"&gt;http://andyonsoftware.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.b-eye-network.com/blogs/imhoff/"&gt;http://www.b-eye-network.com/blogs/imhoff/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.b-eye-network.com/blogs/business_integration/"&gt;http://www.b-eye-network.com/blogs/business_integration/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://bisqlserver.blogspot.com/"&gt;http://bisqlserver.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://biforbusinesspeople.blogspot.com/"&gt;http://biforbusinesspeople.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://kognitio.blogspot.com/"&gt;http://kognitio.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://dbscience.blogspot.com/"&gt;http://dbscience.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.fulltablescan.com/"&gt;http://www.fulltablescan.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://optimaldba.blogspot.com/"&gt;http://optimaldba.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://optimaldba.blogspot.com/"&gt;http://optimaldba.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.dwoptimize.com/"&gt;http://www.dwoptimize.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://oraclestorageguy.typepad.com/oraclestorageguy/"&gt;http://oraclestorageguy.typepad.com/oraclestorageguy/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://qlikviewmaven.blogspot.com/"&gt;http://qlikviewmaven.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://ianalyze.net/"&gt;http://ianalyze.net/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.gillespol.nl/"&gt;http://www.gillespol.nl/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://talkdig.blogspot.com/"&gt;http://talkdig.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://biguru.wordpress.com/"&gt;http://biguru.wordpress.com/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://360degreeinsight.com/"&gt;http://360degreeinsight.com/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-1714233062789465624?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/1714233062789465624/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/there-are-only-two-types-of-people-i.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1714233062789465624'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/1714233062789465624'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/there-are-only-two-types-of-people-i.html' title='There are only two types of people I cannot stand: people who are intolerant of other peoples&apos; cultures, and the dutch.'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-8084640156207803662</id><published>2009-03-10T13:20:00.000-07:00</published><updated>2009-03-10T13:30:20.760-07:00</updated><title type='text'>"Out of nothing I have created a strange new universe". [Janos Bolyai]</title><content type='html'>Right off the bat, let me refer you to an interesting discussion of extended set theory on the web at &lt;a href="http://www.xprogramming.com/xpmag/xstSomeThoughts.htm"&gt;http://www.xprogramming.com/xpmag/xstSomeThoughts.htm&lt;/a&gt; written by by Ron Jeffries.  It's not a bad conceptual introduction but I'm going to try to do it one better in simplicity below.&lt;br /&gt;&lt;br /&gt;In our last post we discussed the notion of “distinguishability” among classical set theory members, and how XSPRADA extended set theory allowed one to clearly differentiate between and uniquely identify extended set members.  These extended set members become values in extended set “couplets” and an XSPRADA “couplet” is an XSPRADA “ordered pair”.  Each member is defined by a scope and a constituent piece, either of which is interchangeable at will, but neither of which can exist without the other.  Now, in cases where order and identity are irrelevant, classical set theory can be used to manipulate these members.  But in cases where we do need to identify, discern or order set members, we apply extended set theory accordingly as such:&lt;br /&gt;&lt;br /&gt;Suppose you have a set of elements {A, B, C}.  We want to represent both mathematically and physically (in a computer) so we can do calculations on it using the mathematics to answer questions, and then bring answers back to the computer.  For example, we may want to count members in a set (cardinality), or calculate the intersection of two sets.  So typically you store this information on the computer in one way or another.  Then you “read” back the physical data (via disk I/O), map it to your mathematical ontology (in the software), do your calculations (count or intersect operation), then bring results back to the “real world” namely as a SQL result.  The faster you can do this, the better off everyone is.&lt;br /&gt;&lt;br /&gt;In a computer, you can’t simply store {A,B,C} because all possible permutations of the triplet {A,B,C} can be represented in many different ways (well, six actually, or factorial of 3) in memory (or storage).  So which one do you pick? Clearly you can’t store every possible combination efficiently.  Even if you wanted to, you’d still have to somehow indicate which combination you meant in a given context.  So engineers came up with a hack called “indexing”.  But now you also have to store the index values along with the data, even though the index itself is not native to the information.   Back on the math side, when you load this record back into the model, there is still no inherent notion of order.  To the model, it makes no difference if you load back A B C or C B A or B A C because they are all equivalent.  So you have to load the index as well. Then you need to do some math “magic” to tie the two, and promulgate the magic all the way back to a result, which you must then move back to the physical world as well.  It’s the quintessential anvil to the ankle problem.&lt;br /&gt;&lt;br /&gt;Extended set theory models physical storage (RAM or disk) using newly defined set members and so right off the bat we would express our initial set as {1.A, 2.B, 3.C}.  This cannot be confused with an entirely different set known as {1.B, 2.C, 3.A}.  When “materializing” this abstraction to disk or RAM, the representation is clearly as follows:&lt;br /&gt;&lt;br /&gt;ADDRESS     VALUE&lt;br /&gt;0x002200    A&lt;br /&gt;0x002201    B&lt;br /&gt;0x002202    C&lt;br /&gt;&lt;br /&gt;In other words, we have now modeled computer addressing using extended set scopes. This is one of the many possible applications of extended set theory for information management.&lt;br /&gt;&lt;br /&gt;Now take a different arrangement, for example:&lt;br /&gt;&lt;br /&gt;ADDRESS    VALUE&lt;br /&gt;0x002200   C&lt;br /&gt;0x002201   A&lt;br /&gt;0x002202   B&lt;br /&gt;&lt;br /&gt;When you move this back to the extended math model (in the software), you end up with {1.C, 2.A, 3.B}.  No need for slight-of-hand indexing or other “unnatural” steps.  There is zero ambiguity.&lt;br /&gt;&lt;br /&gt;Now suppose you have 2 different sets and wish to determine if they are “equivalent” as follows:&lt;br /&gt;&lt;br /&gt;ADDRESS    VALUE&lt;br /&gt;0x002200   A&lt;br /&gt;0x002201   B&lt;br /&gt;0x002202   C&lt;br /&gt;&lt;br /&gt;ADDRESS    VALUE&lt;br /&gt;0x002200   C&lt;br /&gt;0x002201   A&lt;br /&gt;0x002202   B&lt;br /&gt;&lt;br /&gt;Using classical set theory, you don’t have enough information (without indexing hacks) to mathematically make the call.  In extended set theory, you end up comparing two sets {1.A, 2.B, 3.C} and {1.C, 2.A, 3.B}.  You can see 1.A and 1.C are clearly distinct, as are 2.B and 2.A and 3.C and 3.B.  Why? Because extended set theory says two members are equal only if both their scopes and constituents are equal!  Consequently, you can say with certainty “these two sets are different”.&lt;br /&gt;&lt;br /&gt;Let’s apply this to finding intersection between these two sets. In classical set theory intersecting sets {A, B, C} and {C, B, A} yields {A, B, C}.  Intuitively, both sets seem to contain the same elements. But do they really?  It depends.  In extended set theory, the question is: do {1.A, 2.B, 3.C} and {1.C, 2.B, 3.A} have anything in common? And the answer is a resounding no as none of the elements has matching scope and constituents.  In this example, if order matters, classical set theory cannot help us.&lt;br /&gt;&lt;br /&gt;If you’ve been following me so far and thinking, this is really stupidly simple, you’re catching on. Simple in concept? Yes.  As Einstein once said, “Make everything as simple as possible, but not simpler”. Simple in implementation? That’s another story.  But the fact is, the implications for information management are mind-boggling. Why?  Six reasons:&lt;br /&gt;&lt;br /&gt;First, since the mathematics formally models order and relationships, and can manipulate these internally via a formal algebra, humans don’t need to pre-index or model information before exploiting it.  The software extracts that information from the data directly.  More interestingly, it does so by examining the bits and bytes right off storage with no loading or pre-canned processing logic needed.&lt;br /&gt;&lt;br /&gt;Second, every piece of information presented to the system is internalized in the math engine.  This means the original format of the information (be it relational or unstructured, CSV, XML, or video) becomes moot once inside the math engine.  Once modeled, any information (bits and bytes) becomes sets, which can be manipulated and operated on consistently.&lt;br /&gt;&lt;br /&gt;Third, because the mathematical model being maintained is complete and closed, the integrity of the “universe” is maintained at all times. This means information is immutable.  It cannot be changed or deleted.  This feature is called “temporal invariance”.  Consequently, you can actually query the database as if you were asking questions weeks or months ago, not unlike a time machine.&lt;br /&gt;&lt;br /&gt;Fourth, every query (question) fed to the system is transformed into its mathematically equivalent set of algebraic expressions.  As such it can be optimized mathematically.  Expressions and relations are also cached and exploited in real time.  This lets the system discover existing relationships that a human user could not have foreseen (and much less asked about).  It also lets the system adapt to “poor SQL” or poorly formulated questions without impacting results or performance.&lt;br /&gt;&lt;br /&gt;Fifth, the math engine partitions queries.  As a matter of fact, it can be shown mathematically that the number of possible query “types” in a given business intelligence domain is finite.  No matter how complex or convoluted a user or machine-generated query is, the engine can determine without ambiguity which partition “bucket” it belongs to.  Because this taxonomy is well-defined and finite, spectacular optimization techniques can be applied.&lt;br /&gt;&lt;br /&gt;Sixth, it can be mathematically demonstrated that a finite number of operations suffice to handle any possible workload presented to the engine.  The mathematics prove that any query can be answered using a small set of well-defined operations such as union, intersect and complement, for example.  Consequently, the XSPRADA math engine need only implement a fairly small number of operations, and no more.  As these operations can be threaded and parallelized in the software, this makes for a very highly optimizable high-performance system.&lt;br /&gt;&lt;br /&gt;So from a technical perspective, XSPRADA technology and its application to database software is a genuine breakthrough.  But more importantly, for business intelligence purposes (reporting, OLAP, decision support, or data mining) the rigorous application of formal mathematics to the software implementation of an analytical database engine offers unequaled competitive advantage.  I’ll describe those in real-world terms in a subsequent post, and I'll also dig up a couple places and instances where this technology was actually succesfully applied in the past as well!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9173851091724725518-8084640156207803662?l=jeromepineau.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jeromepineau.blogspot.com/feeds/8084640156207803662/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/out-of-nothing-i-have-created-strange.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8084640156207803662'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9173851091724725518/posts/default/8084640156207803662'/><link rel='alternate' type='text/html' href='http://jeromepineau.blogspot.com/2009/03/out-of-nothing-i-have-created-strange.html' title='&quot;Out of nothing I have created a strange new universe&quot;. [Janos Bolyai]'/><author><name>Jerome Pineau</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh3.googleusercontent.com/-nwiKI_LvL9Y/AAAAAAAAAAI/AAAAAAAADM4/61o4i8R6qGI/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9173851091724725518.post-1470443760260548119</id><published>2009-03-05T10:03:00.000-08:00</published><updated>2009-03-05T10:04:40.046-08:00</updated><title type='text'>Smashing baby!</title><content type='html'>&lt;p class="MsoNormal"&gt;In the last post I promised to dive a little deeper into the math-based XSPRADA technology without putting readers to sleep.&lt;span style="mso-spacerun:yes"&gt;  &lt;/span&gt;That’s a tall order, as most people flee at the mere ment
