July 26th, 2007 Curt Monash
OK. I secured permission to actually quote the details on something I’d previously dropped a small hint about — stream processing for text messages. Traditionally, that’s been the province of enterprise search companies. A decade ago, Verity had a kernel group of 6-7 engineers under Phil Nelson. They managed to produce not only a decent search engine, but a search engine “turned on its side” as well. I.e., instead of running one query against a corpus, they could run many queries each against documents as they arrived, one document at a time. Subsequently, the same idea has been implemented by most enterprise search providers, at least those that are serious about the intelligence market.
Well, the event-processing guys are active in that market too. At least StreamBase is. Read the rest of this entry »
Posted in Autonomy and Verity, Business Objects and Inxight, Enterprise search, Search and text storage, Text mining | 2 Comments »
July 22nd, 2007 Curt Monash
It was tough to judge user demand at the recent Text Analytics Summit because, well, very few users showed up. And frankly, I wasn’t as aggressive at pumping vendors for trends as I am some other times. That said, I have talked with most text analytics vendors recently,* and here are my impressions of what’s going on. Any contrary – or confirming! — opinions would be most welcome.
*Factiva is the most significant exception. Hint, hint.
If you think about it, text analytics is a “secret ingredient” in search, antispam, and data cleaning,* and this dominates all other uses of the technology. A significant minority of the research effort at companies that do any kind of text filtering is – duh — text analytics. Cold comfort for specialist text analytics vendors, to be sure, but that’s the way it is.
*I.e., part of the “T” in “ETL” (Extract/Transform/Load).
Text-analytics-enhanced custom publishing will surely at some point become a must-have for business and technical publishers. However, it appears that we’re not quite there yet, as large publishers make do with simple-minded search and the like. In what I suspect is a telling market commentary, there’s no headlong rush among vendors to dump text mining for custom publishing, notwithstanding the examples of nStein and (sort of) ClearForest. I don’t want to be overly negative – either my friends at Mark Logic are doing just fine or else they’re putting up a mighty brave front – but I don’t think the nonspecialist publishing market is there yet.
Read the rest of this entry »
Posted in ClearForest and Reuters, Factiva and Dow Jones, Mark Logic, SAS, Search and text storage, Spam and antispam, Text Analytics Summit, Text mining, Voice of the Customer, nStein | 1 Comment »
July 20th, 2007 Curt Monash
TEMIS is a French company, with US headquarters in the US, as befits a company whose strongest vertical market is pharmaceuticals. I offered to put up a couple of job postings for them. (Nice of me — TEMIS isn’t even a client yet!) Here goes. Read the rest of this entry »
Posted in TEMIS, Text mining | Comments Off
July 16th, 2007 Curt Monash
I dropped by Progress a couple of weeks ago for back-to-back briefings on Apama and EasyAsk. EasyAsk is Larry Harris’ second try at natural language query, after the Intellect product fell by the wayside at Trinzic, the company Artificial Intelligence Corporation grew into.* After a friendly divorce from the company he founded, if my memory is correct, Larry was able to build EasyAsk very directly on top of the Intellect intellectual property.
*Other company or product names in the mix at various times include AI Corp and English Wizard. Not inappropriately, it seems that Larry has quite an affinity for synonyms …
EasyAsk is still a small business. The bulk is still in enterprise query, but new activity is concentrated on e-commerce applications. While Larry thinks that they’ve solved most of the other technical problems that have bedeviled him over the past three decades, the system still takes too long to implement.
Read the rest of this entry »
Posted in BI integration, Mercado, Natural language and speech recognition, Natural language processing (NLP), Progress and EasyAsk, Speech recognition | No Comments »
July 14th, 2007 Curt Monash
When a company announces an acquisition, it usually does a round of limited-content briefings, in no small part because the antitrust lawyers won’t let them do anything else. Once the deal closes, antitrust restrictions are lifted, and they do another round of briefings. These, typically, are vague and platitudinous.
Business Objects/Inxight have now reached that point. Even so, my briefing yesterday had some aspects worth writing up.
Read the rest of this entry »
Posted in BI integration, Business Objects and Inxight, Enterprise search, Search and text storage | 2 Comments »