<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; Autonomy</title>
	<atom:link href="http://www.texttechnologies.com/category/vendors/autonomy/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Wed, 18 Jan 2012 17:02:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>How good does e-discovery search need to be?</title>
		<link>http://www.texttechnologies.com/2008/09/01/how-good-does-e-discovery-search-need-to-be/</link>
		<comments>http://www.texttechnologies.com/2008/09/01/how-good-does-e-discovery-search-need-to-be/#comments</comments>
		<pubDate>Mon, 01 Sep 2008 04:44:58 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[E-discovery]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Search engines]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=277</guid>
		<description><![CDATA[Two years ago, CEO Mike Lynch of Autonomy tried to persuade me that Autonomy was and would remain dominant in the e-discovery search market because: The essence of the buying decision was that enterprises wanted to fulfill obligations to make their information available in a way that would would satisfy the courts. Autonomy had some [...]]]></description>
			<content:encoded><![CDATA[<p>Two years ago, CEO Mike Lynch of Autonomy tried to persuade me that Autonomy was and would remain dominant in the e-discovery search market because:<span id="more-277"></span></p>
<ul>
<li>The essence of the buying decision was that enterprises wanted to fulfill obligations to make their information available in a way that would would satisfy the courts.</li>
<li>Autonomy had some high-profile traction (e.g., the Enron case) that made it the default decision, and hence in particular a choice that met the requirement.</li>
</ul>
<p>Recently, I ran that theory by David Ferris, whose firm <a href="http://www.ferris.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.ferris.com');">Ferris Research</a> has long been a/the leading small analyst firm covering e-mail and related technologies.  He wasn&#8217;t buying.  David believes courts are getting <a href="http://www.ferris.com/2008/07/22/courts-will-tolerate-search-inaccuracies/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.ferris.com');">more sophisticated in their understanding of search technology</a>.  Even more to the point, David cited several other buying motivations that would lead enterprises to want best-available rather than just-good-enough e-discovery search technology, such as:</p>
<ul>
<li>Enterprises want to know what information is available to be discovered against them.</li>
<li>Enterprises want to discover the information that will best aid their legal defense.</li>
<li>If they&#8217;re archiving the material for one purpose (e-discovery) anyway, enterprises want to get the most possible value out of it for other purposes while they&#8217;re at it.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/09/01/how-good-does-e-discovery-search-need-to-be/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Event stream processors active in text filtering</title>
		<link>http://www.texttechnologies.com/2007/07/26/event-stream-processors-active-in-text-filtering/</link>
		<comments>http://www.texttechnologies.com/2007/07/26/event-stream-processors-active-in-text-filtering/#comments</comments>
		<pubDate>Thu, 26 Jul 2007 11:08:35 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[Business Objects and Inxight]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/07/26/event-stream-processors-active-in-text-filtering/</guid>
		<description><![CDATA[OK. I secured permission to actually quote the details on something I&#8217;d previously dropped a small hint about &#8212; stream processing for text messages. Traditionally, that&#8217;s been the province of enterprise search companies. A decade ago, Verity had a kernel group of 6-7 engineers under Phil Nelson. They managed to produce not only a decent [...]]]></description>
			<content:encoded><![CDATA[<p>OK.  I secured permission to actually quote the details on something I&#8217;d previously dropped <a href="http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/" >a small hint</a> about &#8212; stream processing for text messages.   Traditionally, that&#8217;s been the province of enterprise search companies.   A decade ago, Verity had a kernel group of 6-7 engineers under Phil Nelson.  They managed to produce not only a decent search engine, but a search engine &#8220;turned on its side&#8221; as well.  I.e., instead of running one query against a corpus, they could run many queries each against documents as they arrived, one document at a time.  Subsequently, the same idea has been implemented by most enterprise search providers, at least those that are serious about the intelligence market.</p>
<p>Well, the event-processing guys are active in that market too.  At least StreamBase is.    <span id="more-120"></span>It was an obvious guess to ask if they were, and over the past few months I&#8217;ve gotten confirmation (including that they partner w/ Inxight just like almost everybody else does).   Here, quoted with permission and lightly edited, is what StreamBase VP Marketing Bill Hobbib has to say on the subject</p>
<blockquote><p>Regarding text filtering and processing, we&#8217;ve done work in financial services  for a hedge fund and extensive work with the federal government and intelligence  community.  The existing StreamBase schema and message/field structure supports  a wide array of message types, and text parsing can be done in either the SMTP  adapter or the StreamBase engine, where the text processing occurs.   Multiplexing onto different streams/schemas is supported.</p>
<p>In terms of text  processing, StreamBase can process emails, documents, sentences, individual  words or letters using StreamSQL&#8217;s existing capabilities, including a variety of  built-in standard string operations, or user-defined functions, custom  operators, and user-defined aggregates.  We can also partner with text-mining  companies.</p>
<p>Though you didn&#8217;t ask about processing media such as audio or video  data, the new BLOB datatype is designed for this purpose.  We partner with  feature extraction vendors (e.g. speech to text) where necessary, as we don&#8217;t  process the native audio or video data&#8211;just the metadata or the extracted  features and converted files.</p></blockquote>
<p>More on StreamBase can be found over on <em><a href="http://www.dbms2.com/category/products-and-vendors/streambase/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">DBMS2</a></em>, e.g. in <a href="http://www.dbms2.com/2007/06/07/streambase-and-truviso/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">this post</a> or <a href="http://www.dbms2.com/2007/07/18/streambase-rebuts/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">this one</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/07/26/event-stream-processors-active-in-text-filtering/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Analyst reports about enterprise search</title>
		<link>http://www.texttechnologies.com/2006/07/29/analyst-reports-about-enterprise-search/</link>
		<comments>http://www.texttechnologies.com/2006/07/29/analyst-reports-about-enterprise-search/#comments</comments>
		<pubDate>Sat, 29 Jul 2006 12:16:33 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[Convera]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[FAST]]></category>
		<category><![CDATA[Search engines]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/07/29/analyst-reports-about-enterprise-search/</guid>
		<description><![CDATA[Gartner and Forrester have high opinions of FAST. Not coincidentally, you can download both those firms’ recent search industry survey reports from almost any page of www.fastsearch.com. Of the two, Forrester’s is both better and more recent. Summarizing brutally, the big firms’ consensus seems to be: FAST and Autonomy are the clear leaders. Endeca has [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">Gartner and Forrester have high opinions of FAST.  Not coincidentally, you can download both those firms’ recent search industry survey reports from almost any page of <a href="http://www.fastsearch.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.fastsearch.com');">www.fastsearch.com</a>.  Of the two, Forrester’s is both better and more recent.</p>
<p class="MsoNormal">Summarizing brutally, the big firms’ consensus seems to be:</p>
<ul type="disc" style="margin-top: 0in">
<li class="MsoNormal">FAST      and Autonomy are the clear leaders.</li>
<li class="MsoNormal">Endeca      has great technology and is coming on strong.</li>
<li class="MsoNormal">Everybody      else is a niche player, at least for now.</li>
<li class="MsoNormal">Convera      is in deep yogurt.</li>
</ul>
<p class="MsoNormal">Forrester is particularly harsh on <a href="http://www.texttechnologies.com/2006/07/29/convera-aka-excalibur-aka-conquest/" >Convera</a>.  Presumably this has much to do with the fact that Convera did not cooperate well with the survey process.  I shall not speculate as to which way the causality runs there – but I should note that Convera was quite cooperative with my research last week.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/07/29/analyst-reports-about-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Autonomy on text mining</title>
		<link>http://www.texttechnologies.com/2006/07/23/autonomy-on-text-mining/</link>
		<comments>http://www.texttechnologies.com/2006/07/23/autonomy-on-text-mining/#comments</comments>
		<pubDate>Sun, 23 Jul 2006 05:19:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[About this blog]]></category>
		<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/07/23/autonomy-on-text-mining/</guid>
		<description><![CDATA[I asked Mike Lynch (Autonomy CEO) about text mining. He responded with an example: A very well-known company &#8220;mines&#8221; its incoming emails for signs of trouble, not via any linguistics-driven approach, but just by clustering them. If a cluster changes size anomalously over time, it bears close investigation.]]></description>
			<content:encoded><![CDATA[<p>I asked Mike Lynch (Autonomy CEO) about text mining.  He responded with an example:</p>
<p>A very well-known company &#8220;mines&#8221; its incoming emails for signs of trouble, not via any linguistics-driven approach, but just by clustering them.  If a cluster changes size anomalously over time, it bears close investigation.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/07/23/autonomy-on-text-mining/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Update:  Autonomy/Verity merger</title>
		<link>http://www.texttechnologies.com/2006/07/23/autonomy-verity-merger/</link>
		<comments>http://www.texttechnologies.com/2006/07/23/autonomy-verity-merger/#comments</comments>
		<pubDate>Sun, 23 Jul 2006 05:12:32 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[About this blog]]></category>
		<category><![CDATA[Autonomy]]></category>
		<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Search engines]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/07/23/autonomy-verity-merger/</guid>
		<description><![CDATA[I had a couple of very interesting calls with Autonomy last week. One message I got was that they do not want to be pigeonholed in search, which they think on the whole is a primitive way of dealing with &#8220;unstructured information.&#8221; Nonetheless, my first post based on those calls will indeed focus on text [...]]]></description>
			<content:encoded><![CDATA[<p>I had a couple of very interesting calls with Autonomy last week.  One message I got was that they do not want to be pigeonholed in search, which they think on the whole is a primitive way of dealing with &#8220;unstructured information.&#8221;  Nonetheless, my first post based on those calls will indeed focus on text indexing and search.  You see, <a href="http://www.monashreport.com/2005/11/04/autonomyverity-merger/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monashreport.com');">I wrote</a> <a href="http://www.texttechnologies.com/2005/11/04/autonomy-verity-so-what/" >quite skeptically</a> about the Autonomy/Verity merger when it was announced, and I&#8217;d like to amend that with an updated opinion.   Autonomy&#8217;s claims can be summarized in part by the following:<span id="more-31"></span></p>
<p><strong>1.  Long before acquiring Verity, Autonomy supposedly offered full plug-compatibility with Verity&#8217;s text indexing product. </strong>(At varous times the Verity product was called &#8220;Topic&#8221; and &#8220;K2&#8243;.)  This claim is almost credible. After all, there was no &#8220;secret sauce&#8221; in getting the result set; it was all tokenization and boolean operations and proximity and so on.   The reason I&#8217;m a little skeptical about the full claim is that the same can&#8217;t be said for <em>ranking</em> the results.  There, each vendor indeed had/has some secrets which aren&#8217;t so easy to reverse-engineer.</p>
<p><strong>2.  At the time of the acquisition and for a while before, Autonomy&#8217;s underlying engine (IDOL) was simply better than Verity&#8217;s.  At least, that&#8217;s the way the kernels stacked up. </strong> When I was closest to Verity, which was for a couple of years including 1997, the kernel group consisted of about 6 guys, led by Phil Nelson.  All except one left, and the remaining one wasn&#8217;t Phil.  Verity never recovered.  I bet Dennis McEvoy would have fixed things given enough time, but if memory serves he was there less than a year.  (In general, Verity seemed to suffer from brutal turnover.)</p>
<p>The most specific and obvious point is that IDOL is multithreaded, while K2 Classic is singlethreaded.  Beyond that, IDOL and K2 have similar architectures (inverted index on words, etc.), although IDOL overall tracks more info (all the Bayesian stuff, and maybe it also was more advanced in recognizing parts of documents, headers, etc. &#8212; this is another point where I&#8217;d be curious to get clarification.)</p>
<p><strong>3.  IDOL is now the upgrade path for K2 users.</strong> Well, in light of the above &#8212; duh.  Of course it is.  And even if plug-compatibility wasn&#8217;t perfect when the companies were independent, it surely was easy to achieve once Autonomy could learn the ranking functions used in Verity&#8217;s product.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/07/23/autonomy-verity-merger/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

