July 26, 2006

Megaputer on the text mining market

Sergei Ananyan is president of Megaputer, which is not one of the easier companies to get information about. They’re an essentially Russian firm based in Bloomington, Indiana. Their website is, to put it kindly, not up to date. And I wound up speaking with Sergei while he was at his rural vacation house, located somewhere between the Black and Aral Seas.

However, Sergei followed up by email with his views of the marketplace, and I think they’re interesting enough to share below. I really like his focus on analytic business processes, something that generally doesn’t get enough consideration.

(Emphasis mine. Also, for context, please note that Megacomputer started out as a data mining generalist, but has increasingly focused on text mining.)

I believe that the Text Mining market is currently characterized by three main features:

1) This is an emerging and highly fragmented market. So far, only early adopters have incorporated text mining systems as an integral part of their business processes. Most customers are evaluating the effectiveness of a text mining solution by comparing it to the effectiveness of their existing manual data analysis processes, more than to the solutions from other vendors. Different customers are focused on different tasks, and thus have to seek tools from vendors that have good offerings of the respective capabilities. This leads to market fragmentation. The situation will be gradually changing as best practices are worked out for various standard application domains. But so far, relatively few case studies with proven ROI have been reported; correspondingly, best practices are yet to be formulated.

2) End consumers of results generated in Text Mining are not data analysts or statisticians, as in Data Mining, but rather the upper management of a company. These people need to interact with the results of text analysis in order to make decisions based on text mining efforts and substantiate these decisions. They have no time or skills to mess with developing analysis scenarios; rather, they need a very simple interface for viewing and manipulating the results of the analysis. They need dashboards featuring results obtained through the execution of nontrivial data analysis scenarios developed by their colleagues, the data analysts.

3) Documents that require analysis are frequently linked with some structured attributes. For example, for drug safety reports structured fields can embrace date and time of the report, drug name, and age, gender, type and location of the reporter. Values of these attributes provide vital context for correctly interpreting the related narratives. Customers expect a text mining system to be able to perform joint analysis of information extracted from report narratives and associated structured attributes.

Categories: Megaputer, Text mining
Subscribe to our complete feed!

Comments

One Response to “Megaputer on the text mining market”

Text Technologies»Blog Archive » Application processes in text mining – finding warning signs on July 27th, 2006 5:32 am

[…] Sergei Ananyan’s claim that analytic business processes involving text are still very primitive is absolutely correct. Indeed, analytic business processes have a lot of maturing to do overall. Still, there’s one area where the industry has devoted a lot of thought over the past few years, and some notion of process has emerged. This is in the finding of warning signs. […]

Leave a Reply

Name (required)

Email Address(required)

Website

Subscribe to the Monash Research feed via RSS or email:
Login

Search our blogs and white papers

Monash Research blogs

DBMS 2 covers database management, analytics, and related technologies.

Text Technologies covers text mining, search, and social software.

Strategic Messaging analyzes marketing and messaging strategy.

The Monash Report examines technology and public policy issues.

Software Memories recounts the history of the software industry.

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.

Recent posts

The future of search

SOPA’s potentially chilling effect on public debate

Freemium journalism business models, or the Launch of the Spawn of TechCrunch

Social technology in the enterprise

The Text Analytics Summit needs to be replaced

Categories

About this blog

Application areas

Competitive intelligence

Custom publishing

E-discovery

Investment research and trading

Voice of the Customer

BI integration

Categorization and filtering

Censorship

Directories

Ontologies

Spam and antispam

Website filtering

Companies and products

Attensity

Attivio

Autonomy

Baynote

Business Objects and Inxight

Clarabridge

ClearForest/Reuters

Convera

Coveo

Endeca

Expert System S.p.A.

Factiva/Dow Jones

FAST

Google

IBM and UIMA

InQuira

Lexalytics

Lucene

Mark Logic

Megaputer

Mercado

Microsoft

MuseGlobal

nStein

Nuance

ODP and DMOZ

Powerset

Progress and EasyAsk

QL2

SAP

SAS

SPSS

Sybase

TEMIS

Twitter

Yahoo

Fun stuff

Babelfish game

Humor

Jobs and careers

Language recognition

Natural language processing (NLP)

Speech recognition

Online marketing

Search engine optimization (SEO)

Open source text analytics

Search engines

Audio and video search

Enterprise search

Specialized search

Structured search

Social software and online media

Blogosphere

Microblogging

Online media

Software as a Service (SaaS)

Text mining SaaS

Text Analytics Summit

Text mining

Comprehensive or exhaustive extraction

Sentiment analysis

Date archives

Links

Monash Research

White Papers

Admin

Log in