<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; Investment research and trading</title>
	<atom:link href="http://www.texttechnologies.com/category/text-analytics-applications/investment-research-and-trading/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Wed, 18 Jan 2012 17:02:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>The state of the art in text analytics applications</title>
		<link>http://www.texttechnologies.com/2010/12/01/state-of-the-art-text-analytics-mining-applications/</link>
		<comments>http://www.texttechnologies.com/2010/12/01/state-of-the-art-text-analytics-mining-applications/#comments</comments>
		<pubDate>Thu, 02 Dec 2010 02:06:54 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Attensity]]></category>
		<category><![CDATA[BI integration]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[SPSS]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=443</guid>
		<description><![CDATA[Text analytics application areas typically fall into one or more of three broad, often overlapping domains: Understanding the opinions of customers, prospects, or other groups. This can be based on any combination of documents the user organization controls (email, surveys, warranty reports, call center logs, etc.) &#8212; in which case &#8212; or public-domain documents such [...]]]></description>
			<content:encoded><![CDATA[<p>Text analytics application areas typically fall into one or more of three broad, often overlapping domains:</p>
<ul>
<li><strong>Understanding the opinions of customers, prospects, or other groups.</strong> This can be based on any combination of documents the user organization controls (email, surveys, warranty reports, call center logs, etc.) &#8212; in which case &#8212; or public-domain documents such as blogs, forum posts, and tweets. The former is usually called <strong>Voice of the Customer (VotC),</strong> while the latter is <strong>Voice of the Market (VotM).</strong></li>
<li><strong>Detecting and identifying problems.</strong> This can happen across many domains &#8212; VotC, VotM, diagnosing equipment malfunctions, identifying bad guys (from terrorists to fraudsters), or even getting early warnings of infectious disease outbreaks.</li>
<li><strong>Aiding text search, custom publishing, and other electronic document-shuffling use cases,</strong> often via document <a href="http://www.dbms2.com/2010/11/29/data-that-is-derived-augmented-enhanced-adjusted-or-cooked/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">augmentation</a>.</li>
</ul>
<p>For several years, I&#8217;ve been distressed at the lack of progress in text analytics or, as it used to be called, text mining. Yes, the rise of <a href="../../../../../category/text-mining/sentiment-analysis/">sentiment analysis</a> has been impressive, and higher volumes of text data are being processed than were before. But otherwise, there&#8217;s been a lot of the same old, same old. Most actual deployed applications of text analytics or text mining go something like this:</p>
<ul>
<li>A bunch of documents are analyzed to ascertain the ideas expressed in them.</li>
<li>A count is made as to how many times each idea turns up.</li>
<li>The application user notices any surprisingly large numbers, and as result of noticing pays attention to the corresponding ideas.</li>
</ul>
<p>Often, it seems desirable to integrate text analytics with business intelligence and/or predictive analytics tools that operate on tabular data is. Even so, such<strong> integration is most commonly weak or nonexistent. </strong>Apart from the usual reasons for silos of automation, I blame this lack on a mismatch in precision, among <a href="../../../../../2008/10/24/text-mining-data-warehousin/">other reasons</a>. A 500% increase in mentions of a subject could be simple coincidence, or the result of a single identifiable press article. In comparison, a 5% increase in a conventional business metric might be much more important.</p>
<p>But in fairness, <strong>the text analytics innovation picture hasn&#8217;t been quite as bleak as what I&#8217;ve been painting so far. </strong><span id="more-443"></span>While standalone, passively-reported text analytics is indeed the baseline, there are some interesting exceptions. For example:</p>
<ul>
<li>I once confirmed that SPSS customer <a href="http://www.spss.com/press/template_view.cfm?PR_ID=1059" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.spss.com');">Cablecom</a>&#8216;s statistical models for churn and the like absolutely included text data; Cablecom even assigned different weights to the same apparent level of emotion depending on whether the text was in German, French, or Italian. Vertica recently told me of a <a href="http://www.dbms2.com/2010/10/12/vertica-hadoop-connector-integration/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">Vertica/Hadoop</a> customer doing something similar, except for the multilingual aspect. And the end of a <a href="http://www2.sas.com/proceedings/forum2008/123-2008.pdf" onclick="javascript:pageTracker._trackPageview('/outbound/article/www2.sas.com');">2008 SAS-based paper</a> makes similar claims.</li>
<li>There long* have been some examples of fact extraction that don&#8217;t really fit into my three buckets above. For example, researchers mine collections of articles to try to determine biochemical or biological pathways that would not be apparent from examining single research studies alone.</li>
<li>It also has long* been the case that some bad-guy-finding applications &#8212; especially in the anti-terrorism area &#8212; used text analytics to populate state-of-the-art <a href="http://www.dbms2.com/2009/08/21/social-network-analysis-aka-relationship-analytics/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">graph-oriented data analysis tools</a>.</li>
</ul>
<p><em>*When it comes to text analytics, &#8220;long&#8221; means &#8220;at least for the past several years.&#8221;</em></p>
<p>In more recent examples:</p>
<ul>
<li><a href="http://www.dbms2.com/category/products-and-vendors/greenplum/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">Greenplum</a> built a document recommender for law firms that does hard-core statistical analysis to determine which .1% of a document set lawyers might actually want to see, and which then learns from users&#8217; feedback after they respond to initial result sets.</li>
<li><a href="../../../../../2008/09/19/low-latency-text-mining-in-the-investment-market/">Information extracted from investment news</a> gets included into automated trading algorithms. This was unusual technology a couple of years ago, but is more common today.</li>
<li>After a series of mergers, <a href="../../../../../2009/04/20/the-new-attensity-deal-overview/">Attensity</a> now uses marketing-oriented text analytics in at least three different ways:
<ul>
<li>Attensity text analytics feeds marketing dashboards just as it always did.</li>
<li>Attensity text analytics triggers alerts, as I wish dashboards and business intelligence tools more often did, <a href="http://www.dbms2.com/2010/07/25/alerts-metrics-dashboards/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">the false positives problem</a> notwithstanding.</li>
<li>Attensity text analytics triggers concrete workflows, for example <a href="http://www.attensity.com/2010/10/05/attensity-announces-respond-for-social-media/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.attensity.com');">routing specific social media hits for priority response</a>.</li>
<li>And in one example that did not actually get into production, a very large social networking company correlated word usage (e.g., choice among different synonyms) against user characteristics such as age and gender.</li>
</ul>
</li>
</ul>
<p>Finally there are some applications that, while fitting the standard template, just strike me as getting to unusually sophisticated levels of analysis. For example, Vertica told me of another Vertica/Hadoop case where VotM document analysis is carried out to the level of observing which order brand names appear in, and adjusting that for whether or not it was just an alphabetical list.</p>
<p>I suspect <strong>text analytics is about to become more interesting again.</strong></p>
<p><strong><em>Related links</em></strong></p>
<ul>
<li>The enabling <a href="../../../../../2006/06/24/attensity-extractive-exhaustion-and-the-frn/">technology for text/tabular data integration</a> has existed for years.</li>
<li>In 2006, I listed <a href="http://www.monashreport.com/2006/09/08/where-does-data-mining-succeed-and-why/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monashreport.com');">major application areas for data mining/predictive analytics</a>. It overlaps pretty closely with the similar list for text mining/text analytics.</li>
<li>Before being acquired by IBM, <a href="../../../../../2008/06/17/spss-update/">SPSS boasted a rather large text mining user base</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2010/12/01/state-of-the-art-text-analytics-mining-applications/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Low-latency text mining in the investment market</title>
		<link>http://www.texttechnologies.com/2008/09/19/low-latency-text-mining-in-the-investment-market/</link>
		<comments>http://www.texttechnologies.com/2008/09/19/low-latency-text-mining-in-the-investment-market/#comments</comments>
		<pubDate>Fri, 19 Sep 2008 09:15:58 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Sentiment analysis]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=282</guid>
		<description><![CDATA[I&#8217;m not at Gartner&#8217;s Event Processing conference, but there seem to be some interesting posts and articles coming out of it. Seth Grimes has one on Reuters&#8217; integration of text mining and event processing, including sentiment analysis. Well worth reading. Lots more detail than I&#8217;ve ever posted on similar applications.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m not at Gartner&#8217;s Event Processing conference, but there seem to be some interesting posts and articles coming out of it.  Seth Grimes has one on <a href="http://www.intelligententerprise.com/blog/archives/2008/09/event_processin_1.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.intelligententerprise.com');">Reuters&#8217; integration of text mining and event processing</a>, including sentiment analysis.  Well worth reading.  Lots more detail than I&#8217;ve ever posted on <a href="http://www.texttechnologies.com/2006/12/27/text-analytics-is-finally-being-used-for-investment-analysis/" >similar</a> <a href="http://www.texttechnologies.com/2007/08/03/more-on-text-processing-in-cep/" >applications</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/09/19/low-latency-text-mining-in-the-investment-market/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Investment text mining job listing</title>
		<link>http://www.texttechnologies.com/2008/04/25/investment-text-mining-job-listing/</link>
		<comments>http://www.texttechnologies.com/2008/04/25/investment-text-mining-job-listing/#comments</comments>
		<pubDate>Fri, 25 Apr 2008 11:18:50 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=215</guid>
		<description><![CDATA[As per this job listing, at least one &#8220;major NYC investment bank&#8221; plans to do text mining on a proprietary trading desk. The successful candidate will mine text data from numerous news sources and incorporate the information the proprietary trading systems.]]></description>
			<content:encoded><![CDATA[<p>As per this <a href="http://www.emii.com/CareerCenter.aspx?CCKey=job-4000000000146225&amp;LS=EMS176387" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.emii.com');">job listing</a>, at least one &#8220;major NYC investment bank&#8221; plans to do text mining on a proprietary trading desk.</p>
<blockquote><p>The successful candidate will mine text data from numerous news sources and incorporate the information the proprietary trading systems.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/04/25/investment-text-mining-job-listing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The text mining vendors continue to lack constructive vision</title>
		<link>http://www.texttechnologies.com/2007/12/18/the-text-mining-vendors-continue-to-lack-constructive-vision/</link>
		<comments>http://www.texttechnologies.com/2007/12/18/the-text-mining-vendors-continue-to-lack-constructive-vision/#comments</comments>
		<pubDate>Wed, 19 Dec 2007 02:39:05 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Factiva/Dow Jones]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/12/18/the-text-mining-vendors-continue-to-lack-constructive-vision/</guid>
		<description><![CDATA[I&#8217;ve been thinking for a long time that the various text mining companies doing sentiment analysis should try some public-facing (or at least multi-customer) services. Investors might love such a thing. So might marketing managers (actually, Factiva claims to be active there, at least as per their web site). And as a key part of [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been thinking for a long time that the various text mining companies doing sentiment analysis should try some public-facing (or at least multi-customer) services.  Investors might love such a thing.  So might marketing managers (actually, Factiva claims to be active there, at least as per their <a href="http://factiva.com/products/solutions/jobrole.asp?node=menuElem1488&amp;from=homepage_jobrole_apr2006&amp;segment=General" onclick="javascript:pageTracker._trackPageview('/outbound/article/factiva.com');">web site</a>).  And as a key part of the strategy, text mining companies selling to enterprises might brand such a site and gain massive awareness accordingly.  Well, it seems that public-facing sentiment analysis sites are springing up.   At least, <a href="http://www.summize.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.summize.com');">Summize</a> has.  (Hat tip to <a href="http://www.techcrunch.com/2007/12/17/summize-a-sentiment-engine-for-the-reviewosphere/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.techcrunch.com');">TechCrunch.) </a> And the text mining vendors are nowhere to be seen.</p>
<p>So what else is new?<span id="more-152"></span> The leading text mining vendors also aren&#8217;t active in text search, except to some extent in the custom-publishing vertical, despite the huge <a href="http://www.texttechnologies.com/2006/11/11/text-mining-and-search-joined-at-the-hip/" >reliance of search vendors on text mining technologies</a>.   They aren&#8217;t getting traction in the archiving/compliance area.   There don&#8217;t seem to be significant efforts to develop next-generation <a href="http://www.texttechnologies.com/2005/12/11/the-text-technologies-market-4-requirements-for-an-industry-altering-ontology-management-system/" >integrated servers</a>.  In fact, there doesn&#8217;t seem to be much of anything except:</p>
<p>1.  Custom publishing.</p>
<p>2.  The same old, same old <a href="http://www.texttechnologies.com/2006/07/27/application-processes-in-text-mining-%e2%80%93-finding-warning-signs/" >failure analysis and threat detection</a>.</p>
<p>Jay Henderson, when he was at ClearForest, told me he tracked 80 text mining companies whose aggregate revenue he estimated at under $100 million.  He also didn&#8217;t think any text mining company would get over $20 million in revenue.   Now, I think the latter was a little pessimistic.  But directionally, unless the industry seriously gets its act together, he was correct.</p>
<p>Right now, I think the most likely outcome is that the text mining industry pretty much gets merged out of existence in a few years.  Consolidation is a pretty safe way to bet in most software sectors these days, and I don&#8217;t see the kind of energy in the text mining sector that is required to beat the consolidation trend.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/12/18/the-text-mining-vendors-continue-to-lack-constructive-vision/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Attivio tries to do it all</title>
		<link>http://www.texttechnologies.com/2007/12/12/attivio-tries-to-do-it-all/</link>
		<comments>http://www.texttechnologies.com/2007/12/12/attivio-tries-to-do-it-all/#comments</comments>
		<pubDate>Wed, 12 Dec 2007 04:38:55 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Attivio]]></category>
		<category><![CDATA[BI integration]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Open source text analytics]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/12/12/attivio-tries-to-do-it-all/</guid>
		<description><![CDATA[When Andrew McKay was at FAST, I grumped about his search/BI integration story. Now that he&#8217;s trying to do the same thing at a startup called Attivio, it sounds more plausible. Attivio is having a house party and product rollout in the latter part of January, and details are scarce in the mean time. But [...]]]></description>
			<content:encoded><![CDATA[<p>When Andrew McKay was at FAST, I grumped about his <a href="http://www.texttechnologies.com/2007/02/01/what%e2%80%99s-interesting-about-the-fast-venture-in-bi/" >search/BI integration story</a>.   Now that he&#8217;s trying to do the same thing at a startup called <a href="http://www.attivio.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.attivio.com');">Attivio</a>, it sounds more plausible.</p>
<p>Attivio is having a house party and product rollout in the latter part of January, and details are scarce in the mean time.  But here are some highlights.</p>
<ul>
<li>Attivio was founded in August.  It has 21 people and 1 VC.  The VC has invested &gt;$6 million and committed &gt;$12 million total.</li>
<li>Attivio has ambitious plans for a fully integrated data management/real-time BI stack.  It&#8217;s currently called the &#8220;Active Intelligence Engine.&#8221;<span id="more-151"></span></li>
<li>The data management part combines tabular, text, and XML data.  The tabular part is some kind of bitmap.  The text part is fairly traditional, and based on Lucene.</li>
<li>One point of this architecture is that one can more or less seamlessly join different kinds of data.</li>
<li>Another point is surely that &#8212; with everything being more or less like a column or bitmap &#8212; memory management and administration are manageable issues.</li>
<li>Despite containing all these wonders, the code is under 10 megs total.  At least right now.  But then &#8212; how much code can one write in a few months?</li>
<li>Andrew didn&#8217;t want me to repeat everything he said about target markets, but clearly Wall Street is one of the top possibilities.</li>
</ul>
<p>Stay tuned.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/12/12/attivio-tries-to-do-it-all/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>What TEMIS is seeing in the marketplace</title>
		<link>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/</link>
		<comments>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/#comments</comments>
		<pubDate>Thu, 01 Nov 2007 09:49:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Competitive intelligence]]></category>
		<category><![CDATA[Custom publishing]]></category>
		<category><![CDATA[IBM and UIMA]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[TEMIS]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>
		<category><![CDATA[nStein]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[uima]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/</guid>
		<description><![CDATA[CEO Eric Bregand of Temis recently checked in by email with an update on text mining market activity. Highlights of Eric&#8217;s views include: Yep, Voice Of The Customer is hot, in &#8220;many markets&#8221;; Eric specifically mentioned banking, car, energy, food, and retail. He further sees IBM backing VotC as text&#8217;s &#8220;killer app.&#8221; (Note: Temis has [...]]]></description>
			<content:encoded><![CDATA[<p>CEO Eric Bregand of Temis recently checked in by email with an update on text mining market activity.  Highlights of Eric&#8217;s views include:</p>
<ul>
<li>Yep, <strong>Voice Of The Customer</strong> is hot, in &#8220;many markets&#8221;; Eric specifically mentioned banking, car, energy, food, and retail.  He further sees IBM backing VotC as text&#8217;s &#8220;killer app.&#8221;  (Note:  Temis has a history of partnering with IBM, most notably via its <a title="http://www.texttechnologies.com/2007/04/04/temis-overview/" href="http://www.texttechnologies.com" >unusually strong commitment to UIMA</a>.)</li>
<li>Specifically, THE hot topics in the European market these days are <strong>competitive intelligence</strong> and <strong>sentiment analysis.</strong> (Note:  I&#8217;ve always thought Temis got serious about competitive analysis a little earlier than most other text mining vendors did.)</li>
<li><strong>Life sciences</strong> is an ever growing focus for Temis.</li>
<li>I confused him a bit with how I phrased my question about <strong>custom publishing</strong> and Temis&#8217; Mark Logic partnership.   But he did express favorable views of the market, specifically in the area of integrating text mining and native XML database management, and even volunteered that nStein appears to be doing well.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Business Objects-Inxight update</title>
		<link>http://www.texttechnologies.com/2007/10/17/business-objects-inxight-update/</link>
		<comments>http://www.texttechnologies.com/2007/10/17/business-objects-inxight-update/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 19:24:22 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[BI integration]]></category>
		<category><![CDATA[Business Objects and Inxight]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Voice of the Customer]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/10/17/business-objects-inxight-update/</guid>
		<description><![CDATA[I&#8217;m at the Business Objects annual user conference, and had a couple of chances to talk with Inxight/text analytics folks. When I asked about areas of commercial application traction, answers were similar to those I got from Attensity and Clarabridge, but not quite the same. Specifically: Voice of the Customer is definitely tops. Some of [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">I&#8217;m at the Business Objects annual user conference, and had a couple of chances to talk with Inxight/text analytics folks. When I asked about areas of commercial application traction, answers were similar to those I got from <a href="http://www.texttechnologies.com/2007/10/05/text-mining-applications-as-per-attensity-and-clarabridge/" >Attensity and Clarabridge</a>, but not quite the same.  Specifically:</p>
<ul>
<li>Voice of the Customer is definitely tops.</li>
<li>Some of the other applications Attensity and Clarabridge mentioned appear as well (e.g., antifraud).</li>
<li>Business Objects also has a couple of customers looking at text mining as an aid to medical records, e.g. by helping to catch errors in tabular-field coding.</li>
<li>There are some projects in actual investment research/analysis/trading, e.g. in correlating news announcements and stock price movements.</li>
</ul>
<p style="margin-bottom: 0in">The Business Objects/Inxight folks also made a couple of interesting general technical points.  <span id="more-134"></span>When I challenged the usefulness of text analytics in dashboards, they pointed out how it can at least be a good drill-down. (Example: You&#8217;re getting unusually many of customer complaints in a particular time frame; you drill down into a text mining-based graphic to see which particular areas of complaint have spiked.) Also, when I mentioned exhaustive extraction, Ian Hersey pointed out that in many cases the intermediate results of Inxight tagging and so on happen to be stored in an RDBMS.</p>
<p style="margin-bottom: 0in">Perhaps most important, I got a general feeling that Business Objects is serious about integrating Inxight into its general product offerings, which is not good news for independent text mining vendors such as Attensity or Temis (except insofar as it heats up the acquisition market for same). On the other hand, I&#8217;ve gotten nothing but confirmation of my view that Business Objects plans to remain a good OEM partner, even to competitors such as SAS.</p>
<p style="margin-bottom: 0in"><em>This is my first post from the conference.  There surely will be more soon on </em><a href="http://www.dbms2.com/category/products-and-vendors/business-objects/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');"><span style="font-style: normal">DBMS2</span></a><em> and the</em><span style="font-style: normal"> <a href="http://www.monashreport.com/category/vendors/business-objects/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monashreport.com');">Monash Report</a>.</span></p>
<p style="margin-bottom: 0in">&nbsp;</p>
<p style="margin-bottom: 0in"><em>Get great research about text mining, data warehouse appliances, and other hot analytics-related topics! Subscribe to our comprehensive (if not exhaustive) <a href="http://www.monash.com/blogs.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monash.com');">feed</a>, by RSS/Atom or e-mail! We recommend taking the integrated feed for all our blogs, but blog-specific ones are also easily available.</em></p>
<p style="margin-bottom: 0in">&nbsp;</p>
<p><em><p>Technorati Tags: <a href="http://technorati.com/tag/Business+Objects" onclick="javascript:pageTracker._trackPageview('/outbound/article/technorati.com');" rel="tag">Business Objects</a>, <a href="http://technorati.com/tag/Inxight" onclick="javascript:pageTracker._trackPageview('/outbound/article/technorati.com');" rel="tag"> Inxight</a>, <a href="http://technorati.com/tag/text+analytics" onclick="javascript:pageTracker._trackPageview('/outbound/article/technorati.com');" rel="tag"> text analytics</a></p></em></p>
<p style="margin-bottom: 0in">&nbsp;</p>
<p style="margin-bottom: 0in">&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/10/17/business-objects-inxight-update/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Text mining applications as per Attensity and Clarabridge</title>
		<link>http://www.texttechnologies.com/2007/10/05/text-mining-applications-as-per-attensity-and-clarabridge/</link>
		<comments>http://www.texttechnologies.com/2007/10/05/text-mining-applications-as-per-attensity-and-clarabridge/#comments</comments>
		<pubDate>Sat, 06 Oct 2007 03:37:59 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Attensity]]></category>
		<category><![CDATA[Clarabridge]]></category>
		<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Competitive intelligence]]></category>
		<category><![CDATA[Factiva/Dow Jones]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Text mining]]></category>
		<category><![CDATA[Voice of the Customer]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/10/05/text-mining-applications-as-per-attensity-and-clarabridge/</guid>
		<description><![CDATA[Besides asking them technical questions, I surveyed Attensity and Clarabridge last week about text mining application trends, getting generously detailed answers from Michelle De Haaff of Attensity and Justin Langseth of Clarabridge. Perhaps the most important point to emerge was that it&#8217;s not just about particular apps. Enterprises are doing text mining POCs (Proofs of [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">Besides asking them technical 	questions, I surveyed Attensity and Clarabridge last week about text 	mining application trends, getting generously detailed answers from 	Michelle De Haaff of Attensity and Justin Langseth of Clarabridge.  	Perhaps the most important point to emerge was that it&#8217;s not just 	about particular apps.  Enterprises are doing text mining POCs 	(Proofs of Concept) around specific apps, commonly in the CRM area, 	but immediately structuring the buying process in anticipation of a 	rollout across multiple departments in the enterprise.</p>
<p style="margin-bottom: 0in">Other highlights of what they said included:<span id="more-131"></span></p>
<ul>
<li><strong>Voice of the Customer</strong> remains hot, hot, hot.</li>
<li>Closely allied with <strong>Voice of the Customer,</strong> and also hot, is <a href="http://www.texttechnologies.com/2007/10/05/nice-new-phrase-voice-of-the-market/" ><strong>Voice of the Market</strong></a> and/or more direct <strong>competitive intelligence</strong>.</li>
<li><strong>Classical </strong><strong>warranty analysis</strong> is quiet but not wholly dead.  Attensity, historically strong in that application, sees it as merging into Voice of the Customer.  Clarabridge, previously not so strong there (if I recall correctly), is getting at least a little of the traditional-style warranty business.</li>
<li><strong>Human resources</strong> (especially <strong>Voice of the Employee</strong> – I detect a trend in application-naming here) gets mentioned a fair amount.  It&#8217;s usually not the first text mining application an enterprise deploys, but it&#8217;s a common follow-on.</li>
<li><strong>Antifraud</strong> isn&#8217;t just for insurance companies.  Retailing and money-laundering also got mentioned as areas where text mining helped combat fraud.</li>
<li><strong>Insurance industry</strong> use of text mining for claims analysis, I gather, goes well beyond just fraud detection.</li>
<li><strong>Intelligence </strong>is obviously a huge market for Attensity (not so much for Clarabridge), but I didn&#8217;t focus on the classified stuff.  That said, I was reminded of Attensity&#8217;s awkward phrase<em> link analysis,</em> which has nothing to do with hypertext, but instead is the detection of relationships between entities.  This lies at the heart of a non-empty set of civilian <strong>law enforcement</strong> applications and the like.</li>
<li><strong>Investment research</strong> applications of text mining still seem nascent and experimental, at least if one talks with Clarabridge and Attensity.  That said, Factiva is a large subsidiary of Dow Jones now, and ClearForest a smaller one of Reuters, and they&#8217;re doing something or other.  Apparently, it&#8217;s much more document tagging for the sake of readers or search-style filters than it is for use in any kind of business intelligence/statistical mining kind of application.</li>
</ul>
<p>All this isn&#8217;t too different from <a href="http://www.texttechnologies.com/2007/07/22/text-analytics-marketplace-trends/" >what I posted back in July</a>, but I think text mining application trends is a subject that bears frequent revisiting.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/10/05/text-mining-applications-as-per-attensity-and-clarabridge/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>More on text processing in CEP</title>
		<link>http://www.texttechnologies.com/2007/08/03/more-on-text-processing-in-cep/</link>
		<comments>http://www.texttechnologies.com/2007/08/03/more-on-text-processing-in-cep/#comments</comments>
		<pubDate>Sat, 04 Aug 2007 01:21:34 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Progress and EasyAsk]]></category>
		<category><![CDATA[Sentiment analysis]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/08/03/more-on-text-processing-in-cep/</guid>
		<description><![CDATA[StreamBase isn&#8217;t the only complex event/stream processing (CEP) vendor doing text processing. Progress Apama is as well. Stemming, fuzzy matching, and so on seem to happen all the time. But there&#8217;s also at least one case where they flat-out do sentiment analysis.  Edit:  I presume this is in the investment market, as that&#8217;s where most [...]]]></description>
			<content:encoded><![CDATA[<p>StreamBase isn&#8217;t the only complex event/stream processing (CEP) vendor <a href="http://www.texttechnologies.com/2007/07/26/event-stream-processors-active-in-text-filtering/" >doing text processing</a>.  <a href="http://www.dbms2.com/2007/08/03/a-deeper-dive-into-apama/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">Progress Apama</a> is as well.   Stemming, fuzzy matching, and so on seem to happen all the time.  But there&#8217;s also at least one case where they flat-out do sentiment analysis.  <em>Edit:  I presume this is in the investment market, as that&#8217;s where most of Progress Apama&#8217;s business is. </em><span id="more-121"></span></p>
<p>That&#8217;s about all the detail I could muster.  When we discussed this, we talked past each other a fair amount.  It&#8217;s clear they&#8217;ve thought about direct analytic integration and also about how text and tabular data could work together side-by-side.  But I didn&#8217;t get the impression their otherwise top-of-the-line technology-consuming customers were doing everything with text analytics that the technology makes available.</p>
<p>Perhaps a partnership with a tokenization OEM would stand Apama in good stead.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/08/03/more-on-text-processing-in-cep/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Text analytics is finally being used for investment analysis</title>
		<link>http://www.texttechnologies.com/2006/12/27/text-analytics-is-finally-being-used-for-investment-analysis/</link>
		<comments>http://www.texttechnologies.com/2006/12/27/text-analytics-is-finally-being-used-for-investment-analysis/#comments</comments>
		<pubDate>Wed, 27 Dec 2006 19:06:24 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[ClearForest/Reuters]]></category>
		<category><![CDATA[Investment research and trading]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2006/12/27/text-analytics-is-finally-being-used-for-investment-analysis/</guid>
		<description><![CDATA[Jay Henderson of ClearForest tells me that hedge funds are one of their more interesting growth areas. It&#8217;s about time. I think a lot of the reason for investment firms not making more use of text analytics has been structural &#8212; Factiva, the (relatively speaking) mammoth joint venture of Reuters and Dow Jones, is forbidden [...]]]></description>
			<content:encoded><![CDATA[<p>Jay Henderson of ClearForest tells me that hedge funds are one of their more interesting growth areas.  It&#8217;s about time.</p>
<p>I think a lot of the reason for investment firms not making more use of text analytics has been structural &#8212; Factiva, the (relatively speaking) mammoth joint venture of Reuters and Dow Jones, is forbidden by its parent companies from meeting investment firms&#8217; needs.  And that&#8217;s kind of a pity, as it&#8217;s probably the best-positioned firm to do so.  It&#8217;s good to hear that the little guys are finally filling the gap.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2006/12/27/text-analytics-is-finally-being-used-for-investment-analysis/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

