<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; SAS</title>
	<atom:link href="http://www.texttechnologies.com/category/vendors/sas/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Sun, 28 Feb 2010 05:30:01 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Predictive analytics vendors&#8217; text mining sophistication</title>
		<link>http://www.texttechnologies.com/2007/09/18/predictive-analytics-vendors-text-mining-sophistication/</link>
		<comments>http://www.texttechnologies.com/2007/09/18/predictive-analytics-vendors-text-mining-sophistication/#comments</comments>
		<pubDate>Tue, 18 Sep 2007 06:33:24 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[SAS]]></category>
		<category><![CDATA[SPSS]]></category>
		<category><![CDATA[Sentiment analysis]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[kxen]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/09/18/predictive-analytics-vendors-text-mining-sophistication/</guid>
		<description><![CDATA[Steve Gallant of KXEN contacted me over the summer to show me KXEN&#8217;s new text mining capability.  It was pretty basic bag-of-words stuff, which is still a lot better than nothing, and actually fits pretty well with KXEN&#8217;s general simplicity-centric strategy.
This inspired me to check whether there had been any big changes in text [...]]]></description>
			<content:encoded><![CDATA[<p>Steve Gallant of <a href="http://www.monashreport.com/2006/10/04/kxen-and-verix-try-to-disrupt-the-data-mining-market/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monashreport.com');">KXEN</a> contacted me over the summer to show me KXEN&#8217;s new text mining capability.  It was pretty basic bag-of-words stuff, which is still a lot better than nothing, and actually fits pretty well with KXEN&#8217;s general simplicity-centric strategy.</p>
<p>This inspired me to check whether there had been any big changes in text mining capabilities at SAS or SPSS.  It turned out there hadn&#8217;t.  SAS is also still on the bag-of-words level.  SPSS, however, does do sentiment analysis (pretty obvious, considering their focus on surveys and the like) and negation.</p>
<p>Thanks go out to Mary Crissey and Olivier Jouve for getting back to me when I asked, along with apologies for taking a while to post what they told me.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/09/18/predictive-analytics-vendors-text-mining-sophistication/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Text analytics marketplace trends</title>
		<link>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/</link>
		<comments>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/#comments</comments>
		<pubDate>Sun, 22 Jul 2007 09:44:49 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Factiva/Dow Jones]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Spam and antispam]]></category>
		<category><![CDATA[Text Analytics Summit]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>
		<category><![CDATA[nStein]]></category>
		<category><![CDATA[Assentor]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[Dow Jones]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[Factiva]]></category>
		<category><![CDATA[fraud detection]]></category>
		<category><![CDATA[reputation management]]></category>
		<category><![CDATA[Reuters]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[StreamBase]]></category>
		<category><![CDATA[Text analytics]]></category>
		<category><![CDATA[warranty analysis]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/</guid>
		<description><![CDATA[It was tough to judge user demand at the recent Text Analytics Summit because, well, very few users showed up.  And frankly, I wasn&#8217;t as aggressive at pumping vendors for trends as I am some other times.  That said, I have talked with most text analytics vendors recently,* and here are my impressions [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">It was tough to judge user demand at the recent Text Analytics Summit because, well, very few users showed up.  And frankly, I wasn&#8217;t as aggressive at pumping vendors for trends as I am some other times.  That said, I <em>have </em>talked with most text analytics vendors recently,* and here are my impressions of what&#8217;s going on.  Any contrary – or confirming! &#8212; opinions would be most welcome.</p>
<p style="margin-bottom: 0in"><em><span>*Factiva is the most significant exception.   Hint, hint.</span></em></p>
<p style="margin-bottom: 0in">If you think about it, text analytics is a <span style="font-style: normal"><strong>“secret ingredient” in search, antispam, and data cleaning,</strong></span>* and this dominates all other uses of the technology.  A significant minority of the research effort at companies that do any kind of text filtering is – duh &#8212; text analytics.  Cold comfort for specialist text analytics vendors, to be sure, but that&#8217;s the way it is.</p>
<p style="margin-bottom: 0in"><em>*I.e., part of the “T” in “ETL” (Extract/Transform/Load).</em></p>
<p style="margin-bottom: 0in">Text-analytics-enhanced <span style="font-style: normal"><strong>custom publishing</strong></span> will surely at some point become a  must-have for business and technical publishers.  However, it appears that we&#8217;re not quite there yet, as large publishers make do with simple-minded search and the like.  In what I suspect is a telling market commentary, there&#8217;s no headlong rush among vendors to dump text mining for custom publishing, notwithstanding the examples of nStein and (sort of) ClearForest.  I don&#8217;t want to be overly negative – either my friends at Mark Logic are doing just fine or else they&#8217;re putting up a mighty brave front – but I don&#8217;t think the nonspecialist publishing market is there yet.<span id="more-119"></span></p>
<p style="margin-bottom: 0in">Two business publishers who have made major investments in owning text analytics technology are Dow Jones (now sole owners of Factiva) and Reuters (recent purchaser of ClearForest).  Beyond that, however, I don&#8217;t yet see a lot of activity in the <strong>investor/trading</strong> market, although ClearForest reported some activity last year and StreamBase reports that one customer is using them for text filtering, presumably alongside the ticker-munching traders usually use StreamBase for.</p>
<p style="margin-bottom: 0in">Obviously, the <span style="font-style: normal"><strong>intelligence</strong></span> market is what fueled the start of the text analytics business, and still provides the majority of revenue at multiple companies.  Certainly it&#8217;s still going strong.  But it&#8217;s tough to gauge the growth potential from here, especially since the details of usage are typically classified.</p>
<p style="margin-bottom: 0in">Similar things could be said about <span style="font-style: normal"><strong>pharmaceutical research.</strong></span><em> </em> Text analytics is totally accepted in that market, but what&#8217;s the growth potential from here?  And “here” isn&#8217;t actually very big (much smaller than intelligence).  The related category of <span style="font-style: normal"><strong>patient records analysis</strong></span> looks very promising, but is basically still at the research-project stage.  (In general, an explosion in biological IT can be expected when research methods are adapted for clinical use.)</p>
<p style="margin-bottom: 0in">The <span style="font-style: normal"><strong>warranty analysis</strong></span> market, so promising early on, is not showing a lot of growth and depth.   The same thing has happened many times before with innovative technologies sold to manufacturing companies&#8217; engineers.  It seems to be happening again now.</p>
<p style="margin-bottom: 0in"><span style="font-style: normal"><strong>Voice of the customer*</strong></span> is pretty much the same thing, but for service industries.   And the text analytics market for VotC is evidently stronger right now than that for warranty analysis.  This makes sense, because the obvious alternative to text analytics – multiple-choice coded forms – is less appealing, due to two application differences:</p>
<ul>
<li>
<p style="margin-bottom: 0in">VotC looks for opinion as well as fact.</p>
</li>
<li>
<p style="margin-bottom: 0in">VotC looks for input from people 	under no obligation to share it, and who hence can&#8217;t be compelled to play along with a structured form – let alone trained to fill it in accurately.</p>
</li>
</ul>
<p style="margin-bottom: 0in"><em>*Definitional note: </em><span style="font-style: normal">Voice of the customer </span><em>is when customers or prospects communicates with you directly, e.g. via a survey form or an angry email. </em><span style="font-style: normal">Reputation management </span><em>is when you web-scrape and find out what they&#8217;re saying to everybody else.  At least, I think marketers are still using the terms that way pretty consistently.</em></p>
<p style="margin-bottom: 0in"><span style="font-style: normal"><strong>Reputation management</strong></span><em> </em><span style="font-style: normal">is surely</span><em> </em>becoming a standard application for the biggest consumer brands.  How deep that market turns out to be, however, remains to be seen.</p>
<p>Text analytics for <span style="font-style: normal"><strong>fraud discovery</strong></span> seems poised to sweep the insurance industry, and then the rest of financial services.  Current activity, however, while decent, still seems to consist of more poising than sweeping.</p>
<p><span style="font-style: normal"><strong>Compliance</strong></span> is a minimum-acceptable-efforts kind of activity in most markets.  Accordingly search/clustering seems to be the preferred text-checking approach.  Where that&#8217;s not the case, the market seems to have gone to specialized products like Assentor (stock brokerage).</p>
<p style="margin-bottom: 0in"><strong>Human resources</strong> is a good area to sell follow-on applications, at least to enterprises with so many employees that they want to automate the reading of employee feedback.  I&#8217;m not aware of it being the first-sale app to very many enterprises, however.</p>
<p style="margin-bottom: 0in">SAS used to speak glowingly of text mining used directly for <span style="font-style: normal"><strong>ETL.</strong></span> However, nobody else has talked about this, and even from SAS I get the sense that some of the glow has worn off.  As noted above, text analytics is an important ingredient to the transformation part of ETL, but it I think it rarely would be the best option for doing the transformations directly.</p>
<p style="margin-bottom: 0in"><em></em></p>
<p style="margin-bottom: 0in"><em><br />
</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>(A little) more on Business Objects/Inxight</title>
		<link>http://www.texttechnologies.com/2007/05/23/a-little-more-on-business-objectsinxight/</link>
		<comments>http://www.texttechnologies.com/2007/05/23/a-little-more-on-business-objectsinxight/#comments</comments>
		<pubDate>Wed, 23 May 2007 10:39:23 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Attensity]]></category>
		<category><![CDATA[Business Objects and Inxight]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/05/23/a-little-more-on-business-objectsinxight/</guid>
		<description><![CDATA[After missing what seems to have been an uninformative press conference anyway, I hooked up later with the Business Objects folks on the phone.  I say that it was probably uninformative because in the short call, it was pointed out to me that they really weren&#8217;t at liberty to say much anyway. Here are [...]]]></description>
			<content:encoded><![CDATA[<p>After <a href="http://www.texttechnologies.com/2007/05/22/business-objects-inxight/" >missing</a> what seems to have been an uninformative press conference anyway, I hooked up later with the Business Objects folks on the phone.  I say that it was probably uninformative because in the short call, it was pointed out to me that they really weren&#8217;t at liberty to say much anyway. Here are a couple of tidbits I picked up even so.</p>
<ul>
<li><em>Business Objects&#8217; text mining partnerships have been more demo/sales-cycle than actual sales up until now. </em>That said, they have a few deals each with Attensity and Inxight (but not with ClearForest, which <a href="http://www.texttechnologies.com/2007/04/30/clearforest-reuters-acquisition/" >pulled in its horns</a> prior to being acquired by Reuters).   I still think they&#8217;re the leading BI vendor in integrating with text mining, SAS perhaps aside (who if nothing else have a lot of fun using text mining for data cleaning).  The working Inxight partnership, by the way, was all about the specific app of email compliance, with the demo being based on the publicly available Enron corpus.</li>
<li><em>Inxight&#8217;s visualization technology is in the form of an SDK anyway.  So integrating it into BOBJ&#8217;s product line should be straightforward. </em>Note: Through the Excelsius acquisition, BOBJ has been trying to gain competitive advantage in the cool-visualization area.</li>
<li><em>Inxight&#8217;s &#8220;federation&#8221; capability for search is pretty primitive</em> (my term and opinion of course, not theirs).  It takes in search result sets from various sources, then clusters and/or refilters them.  What it does NOT do is the much harder task of taking actual relevancy rankings from various engines and somehow arbitrating between them.  Nor, I&#8217;m guessing, does it even assign higher or lower weights to various corpuses or anything like that.  Thus, it does not sound terribly competitive with the distributed search capabilities built into any state-of-the-art enterprise search engine.</li>
</ul>
<p><em></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/05/23/a-little-more-on-business-objectsinxight/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Text Analytics Summit marketing panel:  Membership firmed up</title>
		<link>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/</link>
		<comments>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/#comments</comments>
		<pubDate>Wed, 21 Mar 2007 08:42:41 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Attensity]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Text Analytics Summit]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[nStein]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/</guid>
		<description><![CDATA[We&#8217;ve now solidified the membership of the Text Analytics Summit marketing panel.  It is:

Curt Monash, President, Monash Information Services
Dave Kellogg, CEO, Mark Logic Corporation
Michelle De Haaff, VP Marketing, Attensity Corporation
Michel Lemay, VP Marketing, nstein Technologies
Mary Crissey, SAS Analytics Marketing Manager, SAS Institute

Michelle, Michel, and Mary are all obvious choices, responsible for marketing at leading [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve now solidified the membership of the <a href="http://www.texttechnologies.com/2007/03/07/three-crucial-issues-in-text-analytics/" >Text Analytics Summit marketing panel</a>.  It is:</p>
<ul>
<li>Curt Monash, President, Monash Information Services</li>
<li>Dave Kellogg, CEO, Mark Logic Corporation</li>
<li>Michelle De Haaff, VP Marketing, Attensity Corporation</li>
<li>Michel Lemay, VP Marketing, nstein Technologies</li>
<li>Mary Crissey, SAS Analytics Marketing Manager, SAS Institute</li>
</ul>
<p>Michelle, Michel, and Mary are all obvious choices, responsible for marketing at leading text mining vendors.  In addition, Mary has excelled on the same panel in the past, Michel sent me e-mail with some brilliant thoughts on the panel subject, and Attensity has <a href="http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/" >one of the most interesting strategies in the text analytics market</a>.</p>
<p>As for Dave &#8212; he&#8217;s simply one of the most astute marketing theorists working in software today.  And he runs a very interesting text technology company.  And he used to be most senior marketing guy in all of business intelligence, when he was SVP at Business Objects.  In his copious free time, he writes a <a href="http://marklogic.blogspot.com/2006/04/ingres-can-you-ever-go-back.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/marklogic.blogspot.com');">really cool blog</a>.</p>
<p><em></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Why the BI vendors are integrating with Google OneBox</title>
		<link>http://www.texttechnologies.com/2006/09/01/why-the-bi-vendors-are-integrating-with-google-onebox/</link>
		<comments>http://www.texttechnologies.com/2006/09/01/why-the-bi-vendors-are-integrating-with-google-onebox/#comments</comments>
		<pubDate>Fri, 01 Sep 2006 15:34:01 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[BI integration]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[FAST]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Natural language processing (NLP)]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Search engines]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/09/01/why-the-bi-vendors-are-integrating-with-google-onebox/</guid>
		<description><![CDATA[I&#8217;m hearing the same thing from multiple BI vendors, with SAS being the most recent and freshest in my mind &#8212; customers want them to &#8220;integrate&#8221; with Google OneBox.   Why Google rather than a better enterprise search technology, such as FAST&#8217;s?  So far as I&#8217;ve figured out, these are the reasons, in [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m hearing the same thing from multiple BI vendors, with SAS being the most recent and freshest in my mind &#8212; customers want them to &#8220;integrate&#8221; with Google OneBox.   Why Google rather than a better enterprise search technology, such as FAST&#8217;s?  So far as I&#8217;ve figured out, these are the reasons, in no particular order:</p>
<ul>
<li>Price.</li>
<li>Ease of installation (real or imagined).</li>
<li>The familiar Google brand name.</li>
<li>The familiar Google UI.</li>
<li>Google OneBox&#8217;s ability to search relational records, reports, etc. along with more tradtional record types.</li>
</ul>
<p>The last point, I think, is the most interesting.  Lots of people think text search is and/or should be the dominant UI of the future.  Now, I&#8217;ve been a big fan of natural language command line interfaces ever since the days of Intellect and Lotus HAL.  But judging by the market success of those products &#8212; or for that matter of voice command/control &#8212; I was in a very small minority.  Maybe the even simpler search interface &#8212; words jumbled together without grammatical structure &#8212; will win out instead.</p>
<p>Who knows?  Progress is a funny thing.  Maybe the ultimate UI will be one that responds well to grunts, hand gestures, and stick-figure drawings. We could call it NeanderHAL, but that would wrong &#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/09/01/why-the-bi-vendors-are-integrating-with-google-onebox/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The current state of text mining/analytics marketing?</title>
		<link>http://www.texttechnologies.com/2006/06/23/the-current-state-of-text-mining-analytics-marketing/</link>
		<comments>http://www.texttechnologies.com/2006/06/23/the-current-state-of-text-mining-analytics-marketing/#comments</comments>
		<pubDate>Fri, 23 Jun 2006 09:15:46 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[About this blog]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[SPSS]]></category>
		<category><![CDATA[Text Analytics Summit]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/06/23/the-current-state-of-text-mininganalytics-marketing/</guid>
		<description><![CDATA[One thing that didn&#8217;t go so well at the Text Analytics Summit was the marketing panel.  Indeed, when we wracked our brains afterward, Mary Crissey (who was on the panel) and I could only think of a single observation that was actually made about marketing.  Namely, she referred to a core truth of [...]]]></description>
			<content:encoded><![CDATA[<p>One thing that didn&#8217;t go so well at the Text Analytics Summit was the marketing panel.  Indeed, when we wracked our brains afterward, Mary Crissey (who was on the panel) and I could only think of a single observation that was actually made about marketing.  Namely, she referred to a core truth of marketing:  Just selling features doesn&#8217;t work (nobody cares).  Just selling benefits doesn&#8217;t work (you&#8217;re not differentiated).  What you have to do is <strong>sell the <em>connection</em> between your features and desirable benefits.</strong></p>
<p>So I&#8217;m going to try to gather some useful observations on marketing here, filling the gap that the panel left.  Key questions I&#8217;d love input on include:</p>
<p>1.  Which feature-benefit connections do you see customers easily accepting?</p>
<p>2.  Which feature-benefit connections is it harder to get them to believe?</p>
<p>3.  How are customers defining text analytics market segments?</p>
<p>4.  What do they see as the key issues in each segement?</p>
<p>5.  Which application areas are showing growth even beyond that of the market overall?</p>
<p>I&#8217;m particularly interested in comments from the larger vendors that are selling into multiple parts of the text mining and text analytics market.  But everybody else&#8217;s input would be warmly appreciated too.</p>
<p>The comment thread to this post is open for business!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/06/23/the-current-state-of-text-mining-analytics-marketing/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
