<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; QL2</title>
	<atom:link href="http://www.texttechnologies.com/category/vendors/ql2/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Wed, 18 Jan 2012 17:02:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
		<item>
		<title>QL2 &#8211; web text extraction and more</title>
		<link>http://www.texttechnologies.com/2007/12/07/ql2-web-text-extraction-and-more/</link>
		<comments>http://www.texttechnologies.com/2007/12/07/ql2-web-text-extraction-and-more/#comments</comments>
		<pubDate>Fri, 07 Dec 2007 21:18:01 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Application areas]]></category>
		<category><![CDATA[Competitive intelligence]]></category>
		<category><![CDATA[QL2]]></category>
		<category><![CDATA[Text mining]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/12/07/ql2-web-text-extraction-and-more/</guid>
		<description><![CDATA[Here are some highlights of the QL2 story, per exec Mike McDermott. QL2&#8242;s main business is scraping price and other product offering data from the web for high-speed competitive analysis. For example, of their 250ish customers overall, over 90 are airlines. Online retailers are another big chunk of their customer base. QL2 also commonly partners [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">Here are some highlights of the QL2 story, per exec Mike McDermott.</p>
<ul>
<li>QL2&#8242;s main business is scraping price and other product offering data from the web for high-speed competitive analysis.  For example, of their 250ish customers overall, over 90 are airlines.  Online retailers are another big chunk of their customer base.</li>
<li>QL2 also commonly partners with <a href="http://www.texttechnologies.com/2007/12/23/text-mining-myths-realities/" >text mining</a> companies in applications such as Voice of the Market or competitive intelligence.  E.g., QL2 has been brought into a few deals each by Attensity, Clarabridge, and especially <a href="http://www.texttechnologies.com/2007/11/01/what-temis-is-seeing-in-the-marketplace/" >Temis</a>.</li>
<li>QL2 goes well beyond basic crawling.  Notably, the system fills in forms with parameters.  And of course it monitors pages for changes.</li>
<li>QL2&#8242;s scripting language is, Mike tells me, very SQL-like.  Hence the “QL” in the name.</li>
<li>QL2 rolls its own filters, rather than using INSO or whoever.  (Actually, what are the main file-reading filter choices these days?  I&#8217;ve lost track.)  Indeed, Mike fondly believes QL2 does a better job with PDFs than Adobe does.</li>
<li>QL2 doesn&#8217;t want to be thought of as web-only.  Rather, Mike likes my formulation of “text data ETL, web or otherwise.”  That said, he freely admits QL2&#8242;s strength is in <em>Extract</em> rather than in <em>Transform</em> or <em>Load.</em></li>
</ul>
<p style="margin-bottom: 0in; font-style: normal"> <span id="more-147"></span>This all sounds very much in line with a post I made about the <a href="http://blogs.computerworld.com/node/324" onclick="javascript:pageTracker._trackPageview('/outbound/article/blogs.computerworld.com');">smart scraping</a> market 2 ½ years ago.</p>
<p style="margin-bottom: 0in; font-style: normal">&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/12/07/ql2-web-text-extraction-and-more/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

