August 31st, 2007 Curt Monash
Give or take a corrected typo, here’s a challenge to DMOZ bashers I just wrote in the flame war thread.
If you want to do something that is:
A. Correct
B. Credible
C. Potentially useful
just go find a specific category with terrible listings, and publicize the fact with overwhelmingly clear proof of your assessment.
If that’s not EASY for you to do … then maybe DMOZ isn’t so bad after all, eh?
In particular, I’d encourage you to post a version of the category that is clearly better than what is currently there.
Technorati Tags: DMOZ, ODP
Posted in Directories, Directories and filtering, ODP and DMOZ, Social software and media | 1 Comment »
August 31st, 2007 Curt Monash
My latest thoughts about DMOZ and the ODP may be found in this blog comment thread.
The gist is:
- DMOZ has many problems, such as categories that are at least five years out of date.
- Newly, corruptly listed sites are NOT high on the list of problems.
- In fact, the attention paid to avoiding such corruption is a terrible drain on ODP resources.
- There are a lot of liars and/or idiots bashing DMOZ in the website owner community.
- robjones is a sarcastic jerk, but he’s our sarcastic jerk.
Or something like that. As I said, it’s a flame war …
Anyhow, I’m flying off on a two-week snorkeling trip Saturday, and should be much mellower soon.
Posted in Directories, Directories and filtering, ODP and DMOZ, Search engine optimization (SEO), Social software and media | 8 Comments »
August 3rd, 2007 Curt Monash
I’ve been pretty skeptical about Inxight’s Awareness Server. My theory is that ordinary enterprise search engines can index remotely anyway, and they offer much better search functionality. Inxight’s Ian Hersey was kind enough to write in and offer two counter-arguments.
First, Ian points out that there are circumstances when, due to security and permissions, you can’t really index everything via one search engine. Specifically, he offers the government as an example. OK, I can see that in the government, with its classified and/or regulated silos. However, I have trouble thinking of many more examples. While there certainly are plenty of instances where a variety of organizations share information on a somewhat arms-length basis, it’s tough to think of such cases where federated text search would come into play.
Second, Ian in essence disputes my claim of inferior functionality. While implicitly conceding — as well he should! — that Inxight’s Awareness Server doesn’t do some things full-featured search engines do, he points out analytic features that may not be found in conventional search engine offering. The big one he calls out is faceted search — which of course was the core of Intelliseek,the acquisition Awareness Server came from. Hmm. Faceted search has a checkered history, with Excite and Northern Light being perhaps the most visible among many failures. On the other hand, it’s a great idea that keeps being tried, and some versions — notably Endeca’s — have turned out well.
I guess I’ll have to reserve judgment on that part until I look at Inxight’s product and see what they do and don’t actually have.
Want to continue getting great research about search, BI, and other subjects related to text analytics? Then get a FREE subscription to our research, by RSS/Atom or e-mail! We strongly recommend taking the integrated feed for all our blogs, but blog-specific ones are also easily available.
Posted in BI integration, Business Objects and Inxight, Endeca, Enterprise search, Search and text storage | No Comments »
August 3rd, 2007 Curt Monash
StreamBase isn’t the only complex event/stream processing (CEP) vendor doing text processing. Progress Apama is as well. Stemming, fuzzy matching, and so on seem to happen all the time. But there’s also at least one case where they flat-out do sentiment analysis.
That’s about all the detail I could muster. When we discussed this, we talked past each other a fair amount. It’s clear they’ve thought about direct analytic integration and also about how text and tabular data could work together side-by-side. But I didn’t get the impression their otherwise top-of-the-line technology-consuming customers were doing everything with text analytics that the technology makes available.
Perhaps a partnership with a tokenization OEM would stand Apama in good stead.
Keep getting great research about text analytics, data management and related technologies. Get a FREE subscription by RSS/Atom or e-mail
Posted in Progress and EasyAsk, Text mining | No Comments »