<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; Mark Logic</title>
	<atom:link href="http://www.texttechnologies.com/category/vendors/marklogic/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Wed, 18 Jan 2012 17:02:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>Ike Pigott on the future of reporting</title>
		<link>http://www.texttechnologies.com/2010/04/04/ike-pigott-on-the-future-of-reporting/</link>
		<comments>http://www.texttechnologies.com/2010/04/04/ike-pigott-on-the-future-of-reporting/#comments</comments>
		<pubDate>Sun, 04 Apr 2010 13:47:06 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Blogosphere]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Online media]]></category>
		<category><![CDATA[Social software and online media]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=392</guid>
		<description><![CDATA[Ike Pigott argues that, as the number of conventional journalists plummets, corporations will have to hire their own &#8220;embedded&#8221; journalists to fill the void. As he puts it: The embeds of the future will work for the company, and be paid by the company to provide news about the company in a multitude of formats. [...]]]></description>
			<content:encoded><![CDATA[<p>Ike Pigott argues that, as the number of conventional journalists plummets, corporations will have to hire their own <a href="http://www.mediabullseye.com/mb/2010/04/dear-journalist.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.mediabullseye.com');">&#8220;embedded&#8221; journalists</a> to fill the void. <span id="more-392"></span>As he puts it:</p>
<blockquote><p>The embeds of the future will work for the company, and be paid by the company to provide news about the company in a multitude of formats. Print, newsletter, video, blog, podcast, moving billboards, tattoos — whatever it takes. Because the bits and pieces of Corporate America that have a story to tell will still have their stories – just no ready outlets.</p>
<p>How is this different than what you have today? Surely there are corporate PR departments and external agencies already doing these things, right?</p>
<p>No.</p>
<p>What is required is an internal producer who writes in external voice — like the neutral point-of-view so often described by Wikipedia. People can smell marketing and propaganda coming around the corner, and they know when the pitches and puff pieces are missing that edge of neutrality. An accurate and fair piece is accurate and fair, no matter who writes it.</p></blockquote>
<p>It&#8217;s an interesting theory, but it seems to presuppose dual marketing communication efforts, with separate departments of &#8220;Straightforwardness&#8221; and &#8220;Hype&#8221;. That may work at some companies, but in most cases I think it will be more practical to try to infuse straightforwardness through multiple parts of the marcom effort.</p>
<p>My more specific quick responses include:</p>
<ul>
<li>That sure sounds a lot like Robert Scoble in his Microsoft days.</li>
<li>It also sounds like &#8220;community managers&#8221; at MMO game companies. (Both of the MMOs I&#8217;ve played have had great ones.) They often only use one or two channels (forums and the associated general website), but otherwise they fit the bill.</li>
<li>Ike&#8217;s views fit very well with mine on <a href="http://www.texttechnologies.com/2009/03/29/where-i-think-the-information-ecosystem-is-headed/" >the future of the information ecosystem</a>.</li>
<li>I&#8217;m getting ever more sympathetic to the idea that you need people whose main job is external communication of a straightforward kind. Reasons include:
<ul>
<li>Senior executives who write great blogs commonly don&#8217;t keep them up. And even when they&#8217;re active, the blogging is pretty sparse. E.g., among companies I follow closely, <a href="http://databasecolumn.vertica.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/databasecolumn.vertica.com');">Vertica</a>, <a href="http://www.asterdata.com/blog/index.php/category/statements/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.asterdata.com');">Aster Data</a>, and <a href="http://www.netezzacommunity.com/people/pfrancisco?view=overview" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.netezzacommunity.com');">Netezza</a> have all done some outstanding blogging in the past, but do very little of it now. Only <a href="http://www.kellblog.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.kellblog.com');">Dave Kellogg</a> at Mark Logic really keeps going.</li>
<li>It&#8217;s not obvious that senior executives are wrong to spend their time at something other than blogging. One of the greatest vendor blogs ever was <a href="http://blogs.sun.com/jonathan/" onclick="javascript:pageTracker._trackPageview('/outbound/article/blogs.sun.com');">Jonathan Schwartz&#8217;s</a> at Sun.  Umm &#8212; how sure are we that he actually did much good for his company with that effort?</li>
<li> I frequently tell vendors &#8220;If you tell Story X in your own words, I&#8217;ll gladly point to it or post it for you.&#8221; They usually agree this is a wonderful idea &#8212; but then usually don&#8217;t free up the rather limited resources that would be required to take me up on it.</li>
</ul>
</li>
<li>That said, the kinds of people who provide customer support (pre- or post-sales) are often very well suited to fill the role Ike is describing. At least, that&#8217;s the case in enterprise tech companies.</li>
<li>The media mix isn&#8217;t really as complex as Ike was suggesting. It basically falls into two groups: Text, and audio/video.</li>
<li>That said, text/graphics and audio/video media people are increasingly the same. (Just think of sports media, where the newspaper folks make their big bucks on radio or TV. That&#8217;s a harbinger of the future. Or think again of Scoble.)</li>
<li>One flaw of Ike&#8217;s idea is that in its pure form it only makes sense for companies large enough to have multi-person PR staffs. Other firms would have to use part-timers, or outsource.  And if you&#8217;re going to do that, might it not make more sense to pay part of the cost of sponsoring, you guessed it, an independent blog?</li>
<li>I know that&#8217;s text/graphics-only, or at least text/graphics-mainly, but I happen to think audio/visual business news/PR is minor anyway. People may give enough attention to, for example, listen to audio from a company if it purports to teach them something. But news ABOUT a company? Who&#8217;s so interested in that to sit still for audio/video, unless they happen to be employees, or investors in its stock?</li>
</ul>
<p><strong>Bottom line:</strong> I think he&#8217;s wrong about some of his detailed views, but Ike Pigott is directionally very right in suggesting that <strong>newsmakers will increasingly become content creators</strong> for news about themselves.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2010/04/04/ike-pigott-on-the-future-of-reporting/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>More website weirdness</title>
		<link>http://www.texttechnologies.com/2008/11/19/more-website-weirdness/</link>
		<comments>http://www.texttechnologies.com/2008/11/19/more-website-weirdness/#comments</comments>
		<pubDate>Thu, 20 Nov 2008 03:27:27 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Search engines]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=298</guid>
		<description><![CDATA[Here&#8217;s something longer-lasting and weirder than Vertica&#8217;s &#8220;We sell turkeys&#8221; theme: Mark Logic, whose product is used primarily to help enterprises make their content more acceptable, doesn&#8217;t have a search engine on its own website.* *Or if it does, it&#8217;s VERY well-hidden. I looked at the home page and site map alike. I wanted to [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s something longer-lasting and weirder than <a href="http://www.dbms2.com/2008/11/18/silly-website-tricks/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">Vertica&#8217;s &#8220;We sell turkeys&#8221; theme</a>: Mark Logic, whose product is used primarily to help enterprises make their content more acceptable, doesn&#8217;t have a search engine on its own website.*<span id="more-298"></span></p>
<p><em>*Or if it does, it&#8217;s VERY well-hidden. I looked at the home page and site map alike.</em></p>
<p>I wanted to refresh my memory as to Mark Logic&#8217;s history of working with specific text mining vendors, beyond what&#8217;s on the official <a href="http://www.marklogic.com/partners/open-enrichment-framework.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.marklogic.com');">partner page</a>. No luck.  Normally when site search is inadequate, one goes to Google.   But that&#8217;s problematic too.  Marklogic.com pages come up pretty low on Google&#8217;s search results, suggesting that:</p>
<ol>
<li>Mark Logic doesn&#8217;t put a lot of effort into SEO (or else doesn&#8217;t do it very well).</li>
<li>One can&#8217;t be confident all the site&#8217;s significant pages are findable by Google.</li>
</ol>
<p>Looking to other companies&#8217; sites for clues isn&#8217;t conclusive either.  E.g., <a href="http://clearforest.com/Partners/PartnerDetails.asp?id=11" onclick="javascript:pageTracker._trackPageview('/outbound/article/clearforest.com');">Clearforest lists Mark Logic as a partner</a>, but Mark Logic doesn&#8217;t return the compliment.  (If memory serves, Mark Logic and Clearforest have worked together both on national security deals and custom publishing deals &#8212; but don&#8217;t hold me to that.)</p>
<p>When it comes to making its own information conveniently available, Mark Logic is quite the unshod cobbler.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/11/19/more-website-weirdness/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>When just-in-time electronic documentation is a really good idea</title>
		<link>http://www.texttechnologies.com/2008/06/08/when-just-in-time-electronic-documentation-is-a-really-good-idea/</link>
		<comments>http://www.texttechnologies.com/2008/06/08/when-just-in-time-electronic-documentation-is-a-really-good-idea/#comments</comments>
		<pubDate>Sun, 08 Jun 2008 04:49:08 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Mark Logic]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=231</guid>
		<description><![CDATA[Mark Logic basically makes an XML DBMS – confusingly called Marklogic without a space &#8211; optimized for document processing (including text search). Mark Logic&#8217;s main market is custom publishing – assembling documents on the fly, whether based on search or some other starting point. Airlines put Marklogic to an interesting use: They create “electronic flight [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">Mark Logic basically makes an XML DBMS – confusingly called Marklogic without a space &#8211; optimized for document processing (including text search). Mark Logic&#8217;s main market is custom publishing – assembling documents on the fly, whether based on search or some other starting point.</p>
<p style="margin-bottom: 0in;">Airlines put Marklogic to an interesting use:  They create “electronic flight bags.” Apparently, flight crews typically carry a whole satchel of documents (flight bags) onto a plane, the precise contents of which frequently vary.  Marklogic lets these be automatically generated in electronic form.</p>
<p style="margin-bottom: 0in;">Well, in recent news it turns out that a $1.4 billion B-1 bomber crashed because a known prudent take-off/maintenance procedure hadn&#8217;t been followed.  (Something about heating the components to evaporate water that otherwise destroyed the electronics.)  This plane-saving had been discovered, but not propagated to all bases and maintenance crews responsible for the B-1.  You think something like Marklogic might have helped?<span id="more-231"></span></p>
<p style="margin-bottom: 0in;">To quote <a href="http://www.cnn.com/2008/US/06/06/crash.ap/index.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.cnn.com');">CNN.com</a>,</p>
<blockquote><p>However, a technique learned by some two years ago that had gone widely unknown and unadopted probably would have prevented the crash, Carpenter said. The technique essentially heats the sensors and evaporates any moisture before data calibrations.</p>
<p>&#8220;This technique was never formalized in a technical order change or captured in &#8216;lessons learned&#8217; reports. Hence, only some pilots and some maintenance technicians knew of the suggestion,&#8221; according to Carpenter&#8217;s executive summary of the accident.</p>
<p>The report said, &#8220;The human factor of communicating critical information was a contributing factor to this mishap.&#8221;</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/06/08/when-just-in-time-electronic-documentation-is-a-really-good-idea/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Mark Logic viewed as a different kind of text search technology vendor</title>
		<link>http://www.texttechnologies.com/2008/04/29/mark-logic-viewed-as-a-different-kind-of-text-search-technology-vendor/</link>
		<comments>http://www.texttechnologies.com/2008/04/29/mark-logic-viewed-as-a-different-kind-of-text-search-technology-vendor/#comments</comments>
		<pubDate>Tue, 29 Apr 2008 12:44:40 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Mark Logic]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=217</guid>
		<description><![CDATA[I&#8217;m putting up two posts this morning on Mark Logic and its MarkLogic product family. The main one, over on DBMS2, outlines the technical architecture &#8212; focusing on MarkLogic as an XML database management system &#8212; and provides a bit of overall context. This post attempts to position MarkLogic against alternative kinds of text analytics [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m putting up two posts this morning on Mark Logic and its MarkLogic product family.  The main one, over on <em><a href="http://www.dbms2.com/?p=412" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">DBMS2,</a> </em>outlines the technical architecture &#8212; focusing on MarkLogic as an XML database management system &#8212; and provides a bit of overall context. This post attempts to position MarkLogic against alternative kinds of text analytics engine.</p>
<p style="margin-bottom: 0in;">For the most part, MarkLogic is indeed sold (and bought) for the storage, manipulation, and retrieval of text.  (One long-confidential exception to this rule is scheduled to be unveiled at the June user conference.)   Most applications seem to fit a custom publishing/enhanced search paradigm:</p>
<ol>
<li>
<p style="margin-bottom: 0in;">Ingest text.</p>
</li>
<li>
<p style="margin-bottom: 0in;">Enhance it.</p>
</li>
<li>
<p style="margin-bottom: 0in;">Serve it up in chunks, typically 	via a sophisticated search interface.</p>
</li>
</ol>
<p style="margin-bottom: 0in;">Differences vs. conventional search engines include:</p>
<ul>
<li>
<p style="margin-bottom: 0in;">Documents are indexed on the fly, 	and available for query immediately upon ingestion.</p>
</li>
<li>
<p style="margin-bottom: 0in;">MarkLogic is a real, 	ACID-compliant DBMS.  So everything else – such as a user tag or 	comment &#8212; is also available for immediate query.  Mark Logic says 	customers are making a lot of use of this feature.</p>
</li>
<li>
<p style="margin-bottom: 0in;">MarkLogic has a real programming 	language – specifically XQuery.  (Note:  XQuery is a much fuller 	language than, say, standard SQL, with conditional logic, 	arithmetic, try/catch, and so on.)</p>
</li>
<li>
<p style="margin-bottom: 0in;">MarkLogic handles fielded 	information, document chunks, and whole documents in a completely 	integrated fashion.  Truth be told, I don&#8217;t know exactly to what 	extent Autonomy or FAST do or don&#8217;t fall short of this standard, but 	it&#8217;s never seemed to be as much of a priority on their part as I&#8217;ve 	felt it should be.</p>
</li>
</ul>
<p style="margin-bottom: 0in;">Mark Logic also claims huge advantages in corpus administration.  Scalability seems good too; there&#8217;s a national-intelligence customer with a 200 terabyte database.  And they&#8217;re proud of a feature called <a href="http://xqzone.marklogic.com/columns/smallchanges/2006-06-23.xqy" onclick="javascript:pageTracker._trackPageview('/outbound/article/xqzone.marklogic.com');">lexicons</a>, although it seems so obvious to me that I&#8217;ve so far failed to muster what they&#8217;d probably regard as the proper level of excitement about it.  (In SQL terms, it seems to be a combination of SELECT and COUNT DISTINCT, both of which are capabilities I&#8217;d think would be in XQuery anyway.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/04/29/mark-logic-viewed-as-a-different-kind-of-text-search-technology-vendor/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>What TEMIS is seeing in the marketplace</title>
		<link>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/</link>
		<comments>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/#comments</comments>
		<pubDate>Thu, 01 Nov 2007 09:49:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Competitive intelligence]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[IBM and UIMA]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[TEMIS]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>
		<category><![CDATA[nStein]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[uima]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/</guid>
		<description><![CDATA[CEO Eric Bregand of Temis recently checked in by email with an update on text mining market activity. Highlights of Eric&#8217;s views include: Yep, Voice Of The Customer is hot, in &#8220;many markets&#8221;; Eric specifically mentioned banking, car, energy, food, and retail. He further sees IBM backing VotC as text&#8217;s &#8220;killer app.&#8221; (Note: Temis has [...]]]></description>
			<content:encoded><![CDATA[<p>CEO Eric Bregand of Temis recently checked in by email with an update on text mining market activity.  Highlights of Eric&#8217;s views include:</p>
<ul>
<li>Yep, <strong>Voice Of The Customer</strong> is hot, in &#8220;many markets&#8221;; Eric specifically mentioned banking, car, energy, food, and retail.  He further sees IBM backing VotC as text&#8217;s &#8220;killer app.&#8221;  (Note:  Temis has a history of partnering with IBM, most notably via its <a title="http://www.texttechnologies.com/2007/04/04/temis-overview/" href="http://www.texttechnologies.com" >unusually strong commitment to UIMA</a>.)</li>
<li>Specifically, THE hot topics in the European market these days are <strong>competitive intelligence</strong> and <strong>sentiment analysis.</strong> (Note:  I&#8217;ve always thought Temis got serious about competitive analysis a little earlier than most other text mining vendors did.)</li>
<li><strong>Life sciences</strong> is an ever growing focus for Temis.</li>
<li>I confused him a bit with how I phrased my question about <strong>custom publishing</strong> and Temis&#8217; Mark Logic partnership.   But he did express favorable views of the market, specifically in the area of integrating text mining and native XML database management, and even volunteered that nStein appears to be doing well.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Text analytics marketplace trends</title>
		<link>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/</link>
		<comments>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/#comments</comments>
		<pubDate>Sun, 22 Jul 2007 09:44:49 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Factiva/Dow Jones]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Spam and antispam]]></category>
		<category><![CDATA[Text Analytics Summit]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>
		<category><![CDATA[nStein]]></category>
		<category><![CDATA[Assentor]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[Dow Jones]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[Factiva]]></category>
		<category><![CDATA[fraud detection]]></category>
		<category><![CDATA[reputation management]]></category>
		<category><![CDATA[Reuters]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[StreamBase]]></category>
		<category><![CDATA[Text analytics]]></category>
		<category><![CDATA[warranty analysis]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/</guid>
		<description><![CDATA[It was tough to judge user demand at the recent Text Analytics Summit because, well, very few users showed up. And frankly, I wasn&#8217;t as aggressive at pumping vendors for trends as I am some other times. That said, I have talked with most text analytics vendors recently,* and here are my impressions of what&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">It was tough to judge user demand at the recent Text Analytics Summit because, well, very few users showed up.  And frankly, I wasn&#8217;t as aggressive at pumping vendors for trends as I am some other times.  That said, I <em>have </em>talked with most text analytics vendors recently,* and here are my impressions of what&#8217;s going on.  Any contrary – or confirming! &#8212; opinions would be most welcome.</p>
<p style="margin-bottom: 0in"><em><span>*Factiva is the most significant exception.   Hint, hint.</span></em></p>
<p style="margin-bottom: 0in">If you think about it, text analytics is a <span style="font-style: normal"><strong>“secret ingredient” in search, antispam, and data cleaning,</strong></span>* and this dominates all other uses of the technology.  A significant minority of the research effort at companies that do any kind of text filtering is – duh &#8212; text analytics.  Cold comfort for specialist text analytics vendors, to be sure, but that&#8217;s the way it is.</p>
<p style="margin-bottom: 0in"><em>*I.e., part of the “T” in “ETL” (Extract/Transform/Load).</em></p>
<p style="margin-bottom: 0in">Text-analytics-enhanced <span style="font-style: normal"><strong>custom publishing</strong></span> will surely at some point become a  must-have for business and technical publishers.  However, it appears that we&#8217;re not quite there yet, as large publishers make do with simple-minded search and the like.  In what I suspect is a telling market commentary, there&#8217;s no headlong rush among vendors to dump text mining for custom publishing, notwithstanding the examples of nStein and (sort of) ClearForest.  I don&#8217;t want to be overly negative – either my friends at Mark Logic are doing just fine or else they&#8217;re putting up a mighty brave front – but I don&#8217;t think the nonspecialist publishing market is there yet.<span id="more-119"></span></p>
<p style="margin-bottom: 0in">Two business publishers who have made major investments in owning text analytics technology are Dow Jones (now sole owners of Factiva) and Reuters (recent purchaser of ClearForest).  Beyond that, however, I don&#8217;t yet see a lot of activity in the <strong>investor/trading</strong> market, although ClearForest reported some activity last year and StreamBase reports that one customer is using them for text filtering, presumably alongside the ticker-munching traders usually use StreamBase for.</p>
<p style="margin-bottom: 0in">Obviously, the <span style="font-style: normal"><strong>intelligence</strong></span> market is what fueled the start of the text analytics business, and still provides the majority of revenue at multiple companies.  Certainly it&#8217;s still going strong.  But it&#8217;s tough to gauge the growth potential from here, especially since the details of usage are typically classified.</p>
<p style="margin-bottom: 0in">Similar things could be said about <span style="font-style: normal"><strong>pharmaceutical research.</strong></span><em> </em> Text analytics is totally accepted in that market, but what&#8217;s the growth potential from here?  And “here” isn&#8217;t actually very big (much smaller than intelligence).  The related category of <span style="font-style: normal"><strong>patient records analysis</strong></span> looks very promising, but is basically still at the research-project stage.  (In general, an explosion in biological IT can be expected when research methods are adapted for clinical use.)</p>
<p style="margin-bottom: 0in">The <span style="font-style: normal"><strong>warranty analysis</strong></span> market, so promising early on, is not showing a lot of growth and depth.   The same thing has happened many times before with innovative technologies sold to manufacturing companies&#8217; engineers.  It seems to be happening again now.</p>
<p style="margin-bottom: 0in"><span style="font-style: normal"><strong>Voice of the customer*</strong></span> is pretty much the same thing, but for service industries.   And the text analytics market for VotC is evidently stronger right now than that for warranty analysis.  This makes sense, because the obvious alternative to text analytics – multiple-choice coded forms – is less appealing, due to two application differences:</p>
<ul>
<li>
<p style="margin-bottom: 0in">VotC looks for opinion as well as fact.</p>
</li>
<li>
<p style="margin-bottom: 0in">VotC looks for input from people 	under no obligation to share it, and who hence can&#8217;t be compelled to play along with a structured form – let alone trained to fill it in accurately.</p>
</li>
</ul>
<p style="margin-bottom: 0in"><em>*Definitional note: </em><span style="font-style: normal">Voice of the customer </span><em>is when customers or prospects communicates with you directly, e.g. via a survey form or an angry email. </em><span style="font-style: normal">Reputation management </span><em>is when you web-scrape and find out what they&#8217;re saying to everybody else.  At least, I think marketers are still using the terms that way pretty consistently.</em></p>
<p style="margin-bottom: 0in"><span style="font-style: normal"><strong>Reputation management</strong></span><em> </em><span style="font-style: normal">is surely</span><em> </em>becoming a standard application for the biggest consumer brands.  How deep that market turns out to be, however, remains to be seen.</p>
<p>Text analytics for <span style="font-style: normal"><strong>fraud discovery</strong></span> seems poised to sweep the insurance industry, and then the rest of financial services.  Current activity, however, while decent, still seems to consist of more poising than sweeping.</p>
<p><span style="font-style: normal"><strong>Compliance</strong></span> is a minimum-acceptable-efforts kind of activity in most markets.  Accordingly search/clustering seems to be the preferred text-checking approach.  Where that&#8217;s not the case, the market seems to have gone to specialized products like Assentor (stock brokerage).</p>
<p style="margin-bottom: 0in"><strong>Human resources</strong> is a good area to sell follow-on applications, at least to enterprises with so many employees that they want to automate the reading of employee feedback.  I&#8217;m not aware of it being the first-sale app to very many enterprises, however.</p>
<p style="margin-bottom: 0in">SAS used to speak glowingly of text mining used directly for <span style="font-style: normal"><strong>ETL.</strong></span> However, nobody else has talked about this, and even from SAS I get the sense that some of the glow has worn off.  As noted above, text analytics is an important ingredient to the transformation part of ETL, but it I think it rarely would be the best option for doing the transformations directly.</p>
<p style="margin-bottom: 0in"><em></em></p>
<p style="margin-bottom: 0in"><em><br />
</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Text Analytics Summit marketing panel:  Membership firmed up</title>
		<link>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/</link>
		<comments>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/#comments</comments>
		<pubDate>Wed, 21 Mar 2007 08:42:41 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Attensity]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Text Analytics Summit]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[nStein]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/</guid>
		<description><![CDATA[We&#8217;ve now solidified the membership of the Text Analytics Summit marketing panel. It is: Curt Monash, President, Monash Information Services Dave Kellogg, CEO, Mark Logic Corporation Michelle De Haaff, VP Marketing, Attensity Corporation Michel Lemay, VP Marketing, nstein Technologies Mary Crissey, SAS Analytics Marketing Manager, SAS Institute Michelle, Michel, and Mary are all obvious choices, [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve now solidified the membership of the <a href="http://www.texttechnologies.com/2007/03/07/three-crucial-issues-in-text-analytics/" >Text Analytics Summit marketing panel</a>.  It is:</p>
<ul>
<li>Curt Monash, President, Monash Information Services</li>
<li>Dave Kellogg, CEO, Mark Logic Corporation</li>
<li>Michelle De Haaff, VP Marketing, Attensity Corporation</li>
<li>Michel Lemay, VP Marketing, nstein Technologies</li>
<li>Mary Crissey, SAS Analytics Marketing Manager, SAS Institute</li>
</ul>
<p>Michelle, Michel, and Mary are all obvious choices, responsible for marketing at leading text mining vendors.  In addition, Mary has excelled on the same panel in the past, Michel sent me e-mail with some brilliant thoughts on the panel subject, and Attensity has <a href="http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/" >one of the most interesting strategies in the text analytics market</a>.</p>
<p>As for Dave &#8212; he&#8217;s simply one of the most astute marketing theorists working in software today.  And he runs a very interesting text technology company.  And he used to be most senior marketing guy in all of business intelligence, when he was SVP at Business Objects.  In his copious free time, he writes a <a href="http://marklogic.blogspot.com/2006/04/ingres-can-you-ever-go-back.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/marklogic.blogspot.com');">really cool blog</a>.</p>
<p><em></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/03/21/text-analytics-summit-marketing-panel-membership-firmed-up/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Telling Attensity and ClearForest apart</title>
		<link>http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/</link>
		<comments>http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/#comments</comments>
		<pubDate>Wed, 27 Dec 2006 19:02:26 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Attensity]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[TEMIS]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/</guid>
		<description><![CDATA[So far as I can tell, Attensity’s strategy when the company was originally founded was rather like ClearForest’s strategy today – and vice-versa. That said, here’s where they seem to stand at this time: Attensity wants to make text analytics very easy to integrate into business intelligence and data mining – at the moment, they’re [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">So far as I can tell, Attensity’s strategy when the company was originally founded was rather like ClearForest’s strategy today – and vice-versa.  That said, here’s where they seem to stand at this time:</p>
<ul>
<li class="MsoNormal">Attensity      wants to make text analytics very easy to integrate into business      intelligence and data mining – at the moment, they’re not too focused on      the differences between those two disciplines – and is trying to deliver      the best possible fact extraction consistent with that charter.</li>
<li class="MsoNormal">ClearForest      wants to provide really great information extraction &#8212; to the limits of      what can be done without excessive knowledge engineering – and is trying      to integrate as well as possible with other technologies, the better to      serve the customers who need what they offer.</li>
</ul>
<p><span id="more-62"></span>The guy I usually talk with at ClearForest, Jay Henderson, believes that text analytics is a collection of dozens of niche markets.  Not coincidentally, a lot of ClearForest’s customers are in the publishing sector (I’ve remarked on ClearForest’s synergy with Mark Logic before).  Attensity obviously is trying a broader play.  In Jay’s view, Inxight and TEMIS are more analogous to ClearForest than Attensity is, except that Inxight is focused on different markets (e.g. OEM and/or search), and he thinks ClearForest is just better than Temis except in a couple of specific kinds of understanding (e.g., life sciences, sentiment).</p>
<p class="MsoNormal">That said, both Attensity and ClearForest credibly claim to do large fractions of what the other one does.  ClearForest, as the currently nichier player, takes the traditional stance “We do everything they do, and more.  Most of our customers are ones who really appreciate the difference.”  Attensity conveys the equally traditional attitude “We do most of what they do, and a bunch of other stuff besides.  And it’s better-packaged too.  As for what they do that we don’t – not a lot of customers really have a need for it.”</p>
<p class="MsoNormal">Frankly, most enterprises that have a need for this technology should put both <a href="http://www.texttechnologies.com/2006/07/27/more-on-attensity/" >Attensity</a> and <a href="http://www.texttechnologies.com/2006/07/23/introduction-to-clearforest/" >ClearForest</a> on their short lists.  But here’s one technical note that may help predict who you’ll wind up actually selecting:  Attensity’s lead strategy for integration is to dump everything into relational tables, for conventional analytics-stack products like Business Objects’ and Teradata’s to manipulate.  ClearForest’s lead strategy for integration has more of an SOA/XML flavor, grown out of conventional OO.  If one of those sounds like an obviously better fit to your situation than the other, then that’s the vendor you absolutely, positively should not leave out of your evaluation process.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/12/27/telling-attensity-and-clearforest-apart/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Two own-dogfood text-based bug-tracking applications</title>
		<link>http://www.texttechnologies.com/2006/10/03/two-own-dogfood-text-based-bug-tracking-applications/</link>
		<comments>http://www.texttechnologies.com/2006/10/03/two-own-dogfood-text-based-bug-tracking-applications/#comments</comments>
		<pubDate>Wed, 04 Oct 2006 00:10:46 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/10/03/two-own-dogfood-text-based-bug-tracking-applications/</guid>
		<description><![CDATA[Last July I wrote about Google&#8217;s text-based project management system. Dave Kellogg of Mark Logic offers links to discussion of a related Google project, and adds news of his own &#8212; Mark Logic built a text-based bug tracking system in its own MarkLogic technology.]]></description>
			<content:encoded><![CDATA[<p>Last July I wrote about Google&#8217;s <a href="http://www.texttechnologies.com/2006/07/11/google-project-knowledge-management/" >text-based project management system</a>.  Dave Kellogg of Mark Logic offers links to discussion of a related Google project, and adds news of his own &#8212; Mark Logic built a <a href="http://marklogic.blogspot.com/2006/10/deconstructing-databases.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/marklogic.blogspot.com');">text-based bug tracking system</a> in its own MarkLogic technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/10/03/two-own-dogfood-text-based-bug-tracking-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mark Logic and the custom publishing business</title>
		<link>http://www.texttechnologies.com/2006/08/26/mark-logic-and-the-custom-publishing-business/</link>
		<comments>http://www.texttechnologies.com/2006/08/26/mark-logic-and-the-custom-publishing-business/#comments</comments>
		<pubDate>Sat, 26 Aug 2006 09:48:14 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/08/26/mark-logic-and-the-custom-publishing-business/</guid>
		<description><![CDATA[I talked again with Mark Logic, makers of MarkLogic Server, and they continue to have an interesting story. Basically, their technology is better search/retrieval through XML. The retrieval part is where their major differentiation lies. Accordingly, their initial market focus (they’re up to 46 customers now, including lots of big names) is on custom publishing. [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">I talked again with Mark Logic, makers of MarkLogic Server, and they continue to have an interesting story.  Basically, their technology is better search/retrieval through XML.  The retrieval part is where their major differentiation lies.  Accordingly, their initial market focus (they’re up to 46 customers now, including lots of big names) is on custom publishing.  And by the way, they’re a good partner for fact-extraction companies, at least in the case of <a href="http://www.texttechnologies.com/2006/07/23/introduction-to-clearforest/" >ClearForest</a>.</p>
<p class="MsoNormal">Here, as best I understand, is the story of the custom publishing business. <span id="more-50"></span> Its core market is publishers of high-cost material sold to people with high-priced time – i.e., scientific/engineering/medical/legal/business/etc. Other markets are general publishing, internal document preparation (e.g., intelligence community), and of course maintenance manuals (maintenance/repair has been a flagship market for just about everything, from expert systems to generic text search to, of course, text mining now as well).</p>
<p class="MsoNormal">The phrase “custom publishing,” however, obscures the distinction between two different paradigms.  One of these is what we might call <em>true custom publishing</em> – assembling paragraphs, articles, chapters whatever from various sources, in an assembly customized for specific reader needs, roles, or preferences.  On the revenue side, that’s a fascinating subject.  But technically, I’m more interested in the other view:  <em>search results plus.</em></p>
<p class="MsoNormal">We all know lots of problems with search engines.  One of the many is this:  Except on rare occasions, getting the benefit of a successful search involves a whole lot of link-clicking and scrolling.  But what if the relevant passages were all assembled together for you?  Link-clicking would be eliminated, and scrolling might be minimized as well.  The potential is huge.  But I don’t know what level of precision is needed before the theoretical benefits become real.</p>
<p class="MsoNormal">The two paradigms can be blended, of course.  A publishing product or dashboard or personal web page with a topic filter might get the results in custom-document rather than link-of-lists form.   Once again, the problem is conciseness.  The more concise the “complete” results can be, the more useful this kind of technology will ultimately prove.</p>
<p class="MsoNormal">For more on Mark Logic, and more insight about the industry in general, see CEO <a href="http://marklogic.blogspot.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/marklogic.blogspot.com');">Dave Kellogg’s blog.</a> For a technical discussion of MarkLogic, see my <a href="http://www.dbms2.com/2006/08/26/mark-logic-and-the-marklogic-server/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">DBMS2.com write-up</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/08/26/mark-logic-and-the-custom-publishing-business/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

