<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Text Technologies &#187; Specialized search</title>
	<atom:link href="http://www.texttechnologies.com/category/storage-search/specialized-search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Sat, 05 Jun 2010 04:23:24 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Data marts in the world of text</title>
		<link>http://www.texttechnologies.com/2009/09/20/data-marts-in-the-world-of-text/</link>
		<comments>http://www.texttechnologies.com/2009/09/20/data-marts-in-the-world-of-text/#comments</comments>
		<pubDate>Sun, 20 Sep 2009 09:08:53 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>
		<category><![CDATA[Structured search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=334</guid>
		<description><![CDATA[CMS/search (Content Management System) expert Alan Pelz-Sharpe recently decried &#8220;Shadow IT&#8221;, by which he seems to mean departmental proliferation of data stores outside the control of the IT department. In other words, he&#8217;s talking about data marts, only for documents rather than tabular data.
Notwithstanding the manifest virtues of centralization, there are numerous reasons you might [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in;">CMS/search (Content Management System) expert Alan Pelz-Sharpe recently <a href="http://www.intelligententerprise.com/blog/archives/2009/08/shadow_it_and_e.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.intelligententerprise.com');">decried &#8220;Shadow IT&#8221;</a>, by which he seems to mean departmental proliferation of data stores outside the control of the IT department. In other words, he&#8217;s talking about data marts, only for documents rather than tabular data.</p>
<p style="margin-bottom: 0in;">Notwithstanding the manifest virtues of centralization, there are numerous reasons you might want data marts,  in the tabular and document worlds alike.  For example:</p>
<ul>
<li><strong>Price/performance.</strong> Your 	main/central data manager might be too expensive to support 	additional large specialized databases. Or different databases and 	applications might have sufficiently different profiles so as to get 	great price/performance from different kinds of data managers. This 	is particularly prevalent in the relational world, where each of 	column stores, sequentially-oriented row stores, and random 	I/O-oriented row stores have compelling use cases.</li>
<li><strong>Different SLAs</strong> (Service-Level Agreements). Similarly, different applications may 	have very different requirements for uptime, response time, and the 	like.  (In the relational world, think of operational data stores.)</li>
<li><strong>Different security 	requirements.</strong> Different subsets of the data may need different 	levels of security. This is particularly prevalent in the document 	world, where security problems are not as well-solved as in the 	tabular arena, and where it&#8217;s common for a search engine to index 	across different corpuses with radically different levels of 	sensitivity.</li>
<li><strong>Integrated application and user 	interfaces.</strong> In the relational world, there&#8217;s a pretty clean 	separation between data management and interface logic; most serious 	business intelligence tools can talk to most DBMS. The document 	world is quite different. Some search engines bundle, for example, 	various kinds of faceted or parameterized search interfaces. What&#8217;s 	more, in public-facing search, a major differentiator is the 	facilities that the product offers for skewing search results.</li>
<li><strong>Different text applications 	require different thesauruses or taxonomy management systems</strong>. 	Ideally, those should all be integrated &#8212; but <a href="../2005/12/11/the-text-technologies-market-3-heres-whats-missing/">the 	requisite technology still doesn&#8217;t exist</a>.</li>
</ul>
<p style="margin-bottom: 0in;">Bottom line: <strong>Text data marts, much like relational data marts, are almost surely here to stay.</strong></p>
<p style="margin-bottom: 0in;"><em><strong>Related link</strong></em></p>
<ul>
<li>
<p style="margin-bottom: 0in;"><a href="http://www.dbms2.com/2009/06/08/the-future-of-data-marts/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">The 	future of data marts</a></p>
</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2009/09/20/data-marts-in-the-world-of-text/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Twitter shows some directions for growth</title>
		<link>http://www.texttechnologies.com/2009/03/31/twitter-shows-some-directions-for-growth/</link>
		<comments>http://www.texttechnologies.com/2009/03/31/twitter-shows-some-directions-for-growth/#comments</comments>
		<pubDate>Tue, 31 Mar 2009 04:24:30 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Microblogging]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Social software and online media]]></category>
		<category><![CDATA[Specialized search]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=311</guid>
		<description><![CDATA[TechCrunch pointed out a Twitter jobs page. The specific job TechCrunch mentioned* isn&#8217;t up there any more, but at the moment I write this, 18 others are (copied below). That&#8217;s considerable growth, given that the same page says Twitter has fewer than 30 current employees. Note the emphasis on search and the mention of Japan.
*Care [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.techcrunch.com/2009/03/30/twitter-hiring-a-concierge-to-pamper-celebrities-requires-schmoozing/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.techcrunch.com');">TechCrunch</a> pointed out a <a href="http://twitter.com/jobs" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.com');">Twitter jobs page</a>. The specific job TechCrunch mentioned* isn&#8217;t up there any more, but at the moment I write this, 18 others are (copied below). That&#8217;s considerable growth, given that the same page says Twitter has fewer than 30 current employees. Note the emphasis on search and the mention of Japan.</p>
<p><em>*Care and feeding of celebrity tweeters. Celebrity tweeting is actually a subject </em><em>I&#8217;ve <a href="http://www.networkworld.com/community/node/38811" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.networkworld.com');">written</a> and even been <a href="http://www.networkworld.com/community/node/39695?ts" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.networkworld.com');">interviewed</a> about</em><em> several times.</em></p>
<p>As of this writing, the full list is:<span id="more-311"></span></p>
<blockquote><p><!--/rounded_590_top--></p>
<table class="table" border="0" cellspacing="0" cellpadding="6" width="579">
<tbody>
<tr class="table-header" valign="top">
<td class="tableheaders"><strong><a href="http://twitter.jobscore.com/list?iframe=1&amp;code=twitter&amp;sort_obj%5B0%5D=ASC" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');">Job Title</a></strong> <img src="http://twitter.jobscore.com/images/ats_arrow_sort_filler.gif?1236920666" border="0" alt="Sort" /></td>
<td class="tableheaders"><strong><a href="http://twitter.jobscore.com/list?iframe=1&amp;code=twitter&amp;sort_obj%5B1%5D=ASC" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');">Location</a></strong> <img src="http://twitter.jobscore.com/images/ats_arrow_sort_filler.gif?1236920666" border="0" alt="Sort" /></td>
<td class="tableheaders_last"><strong><a href="http://twitter.jobscore.com/list?iframe=1&amp;code=twitter&amp;sort_obj%5B2%5D=DESC" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');">Department</a></strong> <img src="http://twitter.jobscore.com/images/ats_arrow_sort_down.gif?1236920666" border="0" alt="Sort" /></td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/operationsengineer/avDI5iwryr3BNRaaWPp1Hh" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Operations Engineer</a></td>
<td>Technical Operations</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/operationsengineersearch/dfxY047Mer3Ar9aaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Operations Engineer &#8211; Search</a></td>
<td>Technical Operations</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/productmanager/a-BUjk1RSr3ydFaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Product Manager</a></td>
<td>Product</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/productmanagerplatformconsumerinternet/c75Ngqe_Kr3OlOaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Product Manager &#8211; Platform (Consumer Internet)</a></td>
<td>Product</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/productmanagersearch/ajh3ske_Kr3Q59aaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Product Manager &#8211; Search</a></td>
<td>Product</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/uidesignerdeveloper/aFvJiKNZ0r3AY2aaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">UI Designer/Developer</a></td>
<td>Product</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/userexperiencedesigner/bCQ7Ga9TCr3zjTaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">User Experience Designer</a></td>
<td>Product</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/communicationsassociatecontracttofte/atlFzmhbCr3O1XaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Communications Associate (Contract to FTE)</a></td>
<td>G&amp;A</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/founderassociate/bQJwJGQfOr3zBkaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Founder Associate</a></td>
<td>G&amp;A</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineerbusinessintelligence/aUAP9egU0r3OzfaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Business Intelligence</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineerfrontend/bYjQISwryr3BNRaaWPp1Hh" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Front-End</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineerrubyonrails/cDBkkCaser3R9qaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Ruby on Rails</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineersearch/d2F42M8wer3BU_aaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Search</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineersystems/a5ZcUawryr3BbgaaWPp1Hh" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Systems</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/softwareengineersystemsspam/cmdvhKhx4r3P-aaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Software Engineer &#8211; Systems Spam</a></td>
<td>Engineering</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/directorstrategicpartnerships/c94ZAANZir3AGpaaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Director, Strategic Partnerships</a></td>
<td>Business Development</td>
</tr>
<tr class="tablerows" valign="top">
<td><a href="http://twitter.jobscore.com/jobs/twitter/japancountrymanager/bzvJowbdqr3PE7aaWP50_m" onclick="javascript:pageTracker._trackPageview('/outbound/article/twitter.jobscore.com');" target="jobscore_jobapp">Japan Country Manager</a></td>
<td>Business Development</td>
</tr>
</tbody>
</table>
</blockquote>
<p><strong><em>Related link</em></strong></p>
<ul>
<li><a href="http://www.dbms2.com/2009/03/31/twitter-is-considering-using-mapreduce/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.dbms2.com');">Twitter&#8217;s analytics job opening</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2009/03/31/twitter-shows-some-directions-for-growth/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Where &#8220;semantic&#8221; technology is or isn&#8217;t important</title>
		<link>http://www.texttechnologies.com/2008/12/29/where-semantic-technology-is-or-isnt-important/</link>
		<comments>http://www.texttechnologies.com/2008/12/29/where-semantic-technology-is-or-isnt-important/#comments</comments>
		<pubDate>Tue, 30 Dec 2008 00:59:55 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>
		<category><![CDATA[Structured search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=301</guid>
		<description><![CDATA[At Lynda Moulton&#8217;s behest, I spoke a couple of times recently on the subject of where &#8220;semantic&#8221; technology is or isn&#8217;t likely to be important.  One was at the Gilbane conference in early December.  The slides were based on my previously posted deck for a June talk I gave on a text analytics market overview. [...]]]></description>
			<content:encoded><![CDATA[<p>At Lynda Moulton&#8217;s behest, I spoke a couple of times recently on the subject of where &#8220;semantic&#8221; technology is or isn&#8217;t likely to be important.  One was at the Gilbane conference in early December.  The slides were based on my previously posted deck for a June talk I gave on a <a href="http://www.texttechnologies.com/2008/06/19/text-analytics-marketplace-competitive-landscape-trends/" >text analytics market overview</a>. The actual Gilbane slides may be found <a href="http://www.monash.com/uploads/Gilbane-December-2008.ppt" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monash.com');">here</a>.</p>
<p>My opinions about the applicability of semantic technology include:</p>
<ul>
<li>The big bucks in web search are for &#8220;transactional&#8221; web search, and semantics isn&#8217;t the issue there. <em>(Slides 3-4)</em></li>
<li>When UIs finally go beyond the simple search box &#8212; e.g. to clusters/facets or to voice &#8212; semantics should have a role to play. <em>(Slide 5)</em></li>
<li>Public-facing site search depends &#8212; more than any other area of text analytics &#8212; on hand-tagging. <em>(Slide 7)</em></li>
<li>&#8220;Enterprise&#8221; search that searches specialized external databases could benefit from semantic technologies. <em>(Slide <img src='http://www.texttechnologies.com/wp-includes/images/smilies/icon_cool.gif' alt='8)' class='wp-smiley' /> </em></li>
<li>True enterprise search could benefit from semantic technologies in multiple ways, but has other problems as well. <em>(Slides 10-11)</em></li>
<li>Semantics &#8212; specifically extraction &#8212; is central to custom publishing. <em>(Slide 12 &#8212; upon review I regret using the word &#8220;sophisticated&#8221;)</em></li>
<li>Semantics is central to text mining. <em>(Slide 18)</em></li>
<li>Semantics could play a big role in all sorts of exciting future developments. <em>(Slide 19)</em></li>
</ul>
<p>So what would your list be like?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/12/29/where-semantic-technology-is-or-isnt-important/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Lukewarm review of Yahoo mobile search</title>
		<link>http://www.texttechnologies.com/2008/11/11/review-yahoo-mobile-search/</link>
		<comments>http://www.texttechnologies.com/2008/11/11/review-yahoo-mobile-search/#comments</comments>
		<pubDate>Tue, 11 Nov 2008 23:01:36 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Language recognition]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>
		<category><![CDATA[Speech recognition]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=293</guid>
		<description><![CDATA[Stephen Shankland reviewed Yahoo&#8217;s mobile voice search, which works by taking voice input and returning results onscreen (in his case on his Blackberry Pearl).  He found:

There are plenty of times when voice is a more convenient form of input than typing.
Voice recognition was good but far from perfect.
Editing search strings was annoyingly difficult.
Search results [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://news.cnet.com/8301-1023_3-10092659-93.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/news.cnet.com');">Stephen Shankland</a> reviewed Yahoo&#8217;s mobile voice search, which works by taking voice input and returning results onscreen (in his case on his Blackberry Pearl).  He found:</p>
<ul>
<li>There are plenty of times when voice is a more convenient form of input than typing.</li>
<li>Voice recognition was good but far from perfect.</li>
<li>Editing search strings was annoyingly difficult.</li>
<li>Search results themselves aren&#8217;t 100% perfect.</li>
</ul>
<p>No big surprises there. <img src='http://www.texttechnologies.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/11/11/review-yahoo-mobile-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google and the Author&#8217;s Guild establish an ASCAP for books</title>
		<link>http://www.texttechnologies.com/2008/10/28/google-and-the-authors-guild-establish-an-ascap-for-books/</link>
		<comments>http://www.texttechnologies.com/2008/10/28/google-and-the-authors-guild-establish-an-ascap-for-books/#comments</comments>
		<pubDate>Wed, 29 Oct 2008 00:40:25 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Social software and online media]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=290</guid>
		<description><![CDATA[Most of the coverage of the Google/Authors Guild settlement today seems to focus on Google&#8217;s side of things.  But I think the authors&#8217; side is much more important. This deal paves the way for traditional publishers to become quaint and useless &#8212; and not a moment too soon.
Below are some quotes &#8212; fair use!! [...]]]></description>
			<content:encoded><![CDATA[<p>Most of the coverage of the Google/Authors Guild settlement today seems to focus on Google&#8217;s side of things.  But I think the authors&#8217; side is much more important. This deal paves the way for <strong>traditional publishers to become quaint and useless</strong> &#8212; and not a moment too soon.</p>
<p>Below are some quotes &#8212; fair use!! <img src='http://www.texttechnologies.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8212; from the Authors Guild official statement on the deal (emphasis mine): <span id="more-290"></span></p>
<blockquote><p>Our proposal to Google back in May 2006 was simple:  while we don’t approve of your unauthorized scanning of our books and displaying snippets for profit, if you’re willing to do something far more ambitious and useful, and you’re willing to cut authors in for their fair share, then it would be our pleasure to work with you.</p>
<p>&#8230;</p>
<p>The payments will flow through the Book Rights Registry, a new independent entity that can be thought of as <strong>the writers’ equivalent of ASCAP. </strong> Much as ASCAP tracks the uses of songs and collects royalties for songwriters and musicians, the Registry will serve the interests of authors and others who own the rights to books appearing online as a result of this settlement. The Registry will be controlled by a board of authors and publishers; as part of the settlement, Google will pay $34.5 million to get the Registry up and running, notify rightsholders of the settlement, and process claims.</p>
<p>Readers are also big winners under the settlement of Authors Guild v. Google.  Readers will be able to browse from their own computers an enormous collection of books.  We hope this will encourage some readers to buy full online access to some of the books.  Readers wanting to view books online in their entirety for free need only reacquaint themselves with their participating local public library: <strong> every public library building is entitled to a free, view-only license to the collection. </strong> College students working on term papers will be able to point their computers to resources other than Wikipedia, if they’re so inclined:<strong> students at subscribing institutions will be able to read and print out any books in the collection.</strong></p></blockquote>
<p>This is what writers have been &#8212; or at least should have been &#8212; awaiting for over a decade, ever since it became clear that the Web would transform media and publishing.  With print-on-demand plus an online book registry, authors get complete access (starting in the US, at least) to readers, paying and otherwise.</p>
<p>So let&#8217;s review what publishers are good for.  In some order, their role is:</p>
<ul>
<li>Marketing through branding/imprimatur.</li>
<li>Marketing through advertising, book tours, and the like.</li>
<li>Marketing/sales/distribution through the physical supply chain.</li>
<li>Editing, art, and other actual contributions to the quality of the product.</li>
</ul>
<p>It&#8217;s a publishing industry open secret that advertising and such like are pretty useless, doing more for the egos of all concerned than they do for actual book sales.  Amazon is obsoleting most physical book stores, airport locations (for impulse purchases) and the like perhaps excepted.</p>
<p>As for branding/imprimatur: The backing of a major publisher can be worth a few thousand hard copy sales to libraries.  But where else does it matter? I was going to suggest that it might in the academic world. But then I looked over at the math books on my shelves, a number of which are bound in familiar Springer Verlag yellow and white. Is there one book there I wouldn&#8217;t own if it weren&#8217;t a Springer Verlag publication? Probably not.  I suspect that your initial reputation boost in academic publishing comes from your institution and peers much more than from an actual academic press.</p>
<p>So for the most part, book publishers and music publishers are left with just one marketing function &#8212; getting the ball rolling.  Published products ultimately sell through word of mouth, but if you don&#8217;t start out with listeners or readers, how can the word of mouth build? The answer, of course, will increasingly be online promotion. If your natural audience is scattered around the country or the world, without being concentrated in one particular geographic location, online marketing is the obvious way to go. And in this world of search engines, YouTube, blogs, and the like, ever more channels for marketing are opening up.</p>
<p>Other than promotion and aggregation (the latter applying more to news/blog publishers than books/music), I do see one other role for publishers &#8212; actually creating product.  Movies/TV and video games are both far bigger businesses than book publishing, and in both cases products are produced by large teams of people. Music is generally produced by small teams of people. And by the way, books can spin off from other kinds of entertainment (I suspect that a large fraction of all science fiction book sales at this point are Star Wars/Star Trek/etc.). Or vice-versa &#8212; some day the economic model for trying an ambitious new comic book project may factor in hoped-for movie and other spin-offs from the getgo.</p>
<p>But traditional publishers aren&#8217;t generally set up to do anything that ambitious. As for lesser but still important functions &#8212; editing, art direction, etc. &#8212; who needs large firms for that?  The music industry seems to get by just fine with small recording studios and independent record producers; book producers could spring up just as well.  Indeed, they exist already, but suffer from the problem that nobody wants to pay them much because book sales overall are so weak.</p>
<p>If I were a best-selling author, I&#8217;d hire away my favorite editor, put her on the payroll directly, and then send her and my agent to squeeze a few extra percentage points out of the big publishers, more than covering the cost of her paycheck.  At least, I would if I were a best-selling author and also had my current personality.  Now, I&#8217;m not, and if I were that kind of author I&#8217;d probably be quite fixated on being left alone to write, and might find it easier to leave money on the table than to take that kind of business responsibility myself. That kind of consideration is probably a big reason why traditional publishers are allowed to stay in business.</p>
<p>But the times sure are a&#8217;changin&#8217;.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/10/28/google-and-the-authors-guild-establish-an-ascap-for-books/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Health spoof</title>
		<link>http://www.texttechnologies.com/2008/07/09/google-health-spoof/</link>
		<comments>http://www.texttechnologies.com/2008/07/09/google-health-spoof/#comments</comments>
		<pubDate>Wed, 09 Jul 2008 15:13:02 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Fun stuff]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Humor]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/?p=261</guid>
		<description><![CDATA[FutureFeedForward is on a roll:
MOUNTAIN VIEW&#8211;Information search giant Google, Inc. announced Thursday the release of Google Body, a search service aiming to index the internal and external anatomy of every living creature on the planet. &#8230;
Early testers have remarked upon a fuzzy-logic &#8220;match my organ&#8221; feature, which helps users get in touch with the nearest, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://futurefeedforward.com/front.php?fid=104" onclick="javascript:pageTracker._trackPageview('/outbound/article/futurefeedforward.com');">FutureFeedForward is on a roll</a>:</p>
<blockquote><p>MOUNTAIN VIEW&#8211;Information search giant Google, Inc. announced Thursday the release of Google Body, a search service aiming to index the internal and external anatomy of every living creature on the planet. &#8230;</p>
<p>Early testers have remarked upon a fuzzy-logic &#8220;match my organ&#8221; feature, which helps users get in touch with the nearest, most suitable donor for multiple organ systems.  &#8230;</p>
<p>Responding to criticism from privacy groups, Google&#8217;s Hind pointed to the program&#8217;s opt-out policy. &#8220;We are very concerned about user privacy, and that&#8217;s why we will not make publicly available any information about anybody who let&#8217;s us know they do not want to participate by wearing an Opt-Out headband when in public. Google archives information about those individuals, but does not make it searchable.&#8221; The yellow and black vinyl headbands can be requested free of charge by writing to the company at its Mountain View headquarters.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/07/09/google-health-spoof/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google could dominate single-site search</title>
		<link>http://www.texttechnologies.com/2008/03/05/google-could-dominate-single-site-search/</link>
		<comments>http://www.texttechnologies.com/2008/03/05/google-could-dominate-single-site-search/#comments</comments>
		<pubDate>Thu, 06 Mar 2008 03:07:43 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Enterprise search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2008/03/05/google-could-dominate-single-site-search/</guid>
		<description><![CDATA[Google has begun to introduce a feature whereby, if your search obviously leads you to a single site (e.g., you searched on a company name), you get a second search box to search only within that site.  More details at Google and Search Engine Land.  Basically, this is Google Site Search made a [...]]]></description>
			<content:encoded><![CDATA[<p>Google has begun to introduce a feature whereby, if your search obviously leads you to a single site (e.g., you searched on a company name), you get a second search box to search only within that site.  More details at <a href="http://googleblog.blogspot.com/2008/03/search-within-site-tale-of.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/googleblog.blogspot.com');">Google</a> and <a href="http://googleblog.blogspot.com/2008/03/search-within-site-tale-of.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/googleblog.blogspot.com');">Search Engine Land</a>.  Basically, this is Google Site Search made a lot easier to use.</p>
<p>I think this could be a really big deal.  <span id="more-201"></span>On most sites, it is already the case that Google gives much better results than the native search engine. (I generally use the site name or URL rather than the &#8220;site:&#8221; command to get this effect, but I imagine the difference is small.) Now that it&#8217;s easy to use, I expect increasing numbers of Web users to, like me, use the system that works better.</p>
<p>What&#8217;s more, there&#8217;s nothing stopping Google from co-opting the one way native site search is commonly made to work well, which is by hand-tagging of pages. Most metatags are devalued in search when used to compare pages from <em>different</em> sites, for very good reasons of adversarial information retrieval.  But there&#8217;s little reason not to take metatags into account <em>within</em> a site, albeit with a little care to avert the worst potential abuses.</p>
<p>Large e-commerce sites running something like Inquira, Endeca, or Mercado will do better with their specialized search tools than with Google search. For most of the rest, however, Google is likely to win on quality, relevance, and utility.</p>
<p>Yes, there&#8217;s an obvious problem about ad placement and revenue, as <a href="http://www.google.com/url?sa=t&amp;ct=res&amp;cd=3&amp;url=http%3A%2F%2Ftwitter.com%2Fnazgul&amp;ei=I1_PR4SJApaqgATQhOHvAg&amp;usg=AFQjCNFcBxmeurp2L2sSawP6FLWjCkVByQ&amp;sig2=3-2qkg4fRV6lksEvHh78hg" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.google.com');">Kee Hinckley</a> recently noted to me. But:</p>
<ul>
<li>Site owners don&#8217;t really have a choice about cooperating with Google, as the searcher sees Google before she sees the target site.  Going nuclear and removing oneself from the Google index isn&#8217;t a realistic option.</li>
<li>Most large site owners do a lot of SEO (Search Engine Optimization) for Google, and get a lot of benefit from Google already.</li>
<li>Google could cut special ad deals for single-site search, changing their usual policies to let site owners buy all placements, for a suitable fee.</li>
</ul>
<p><em><strong>Please <a href="http://www.monash.com/signup.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.monash.com');">sign up</a> for our feed!</strong></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/03/05/google-could-dominate-single-site-search/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Code search options</title>
		<link>http://www.texttechnologies.com/2008/02/28/code-search-options/</link>
		<comments>http://www.texttechnologies.com/2008/02/28/code-search-options/#comments</comments>
		<pubDate>Fri, 29 Feb 2008 03:04:56 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2008/02/28/code-search-options/</guid>
		<description><![CDATA[Questions come up here from time to time about code search engines, a subject I have not researched.  Well, here&#8217;s a quick link listing some leading code search engines, both Web (guess who?) and internal.  Most interesting may be that the list is so short.
]]></description>
			<content:encoded><![CDATA[<p>Questions come up here from time to time about code search engines, a subject I have not researched.  Well, here&#8217;s a quick link listing some leading <a href="http://knightknetwork.com/2007/09/13/5-great-code-search-engines/" onclick="javascript:pageTracker._trackPageview('/outbound/article/knightknetwork.com');">code search engines</a>, both Web (guess who?) and internal.  Most interesting may be that the list is so short.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2008/02/28/code-search-options/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Danny Sullivan thinks blended vertical search is a game-changer</title>
		<link>http://www.texttechnologies.com/2007/12/02/danny-sullivan-thinks-blended-vertical-search-is-a-game-changer/</link>
		<comments>http://www.texttechnologies.com/2007/12/02/danny-sullivan-thinks-blended-vertical-search-is-a-game-changer/#comments</comments>
		<pubDate>Mon, 03 Dec 2007 00:33:37 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Search engine optimization (SEO)]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Specialized search]]></category>
		<category><![CDATA[Structured search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/12/02/danny-sullivan-thinks-blended-vertical-search-is-a-game-changer/</guid>
		<description><![CDATA[Danny Sullivan thinks blended vertical search &#8212; which he&#8217;s calling Search 3.0 &#8212; is a game changer.  (In this context, &#8220;vertical&#8221; search denotes alternate result types such as video, image, map coordinates, or product listings.)   In saying that, he&#8217;s focused on search marketers, who now have a lot more ways to try [...]]]></description>
			<content:encoded><![CDATA[<p>Danny Sullivan thinks <a href="http://searchengineland.com/071127-091128.php" onclick="javascript:pageTracker._trackPageview('/outbound/article/searchengineland.com');">blended vertical search &#8212; which he&#8217;s calling Search 3.0 &#8212; is a game changer</a>.  (In this context, &#8220;vertical&#8221; search denotes alternate result types such as video, image, map coordinates, or product listings.)   In saying that, he&#8217;s focused on search marketers, who now have a lot more ways to try to get their messages onto Google searchers&#8217; top result pages.   But I presume what he&#8217;s really saying is that there will be a feedback effect &#8212; if Google tells all web searchers about videos and product listings, then internet marketers will be more motivated to post videos and product listings, and hence there will be more interesting choices of videos and product listings &#8212; which Google will naturally wind up featuring more prominently in its search results.  And so on.</p>
<p>Given the Youtube explosion, I find it hard to argue with his claim.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/12/02/danny-sullivan-thinks-blended-vertical-search-is-a-game-changer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wise Crowds of Long-Tailed Ants, or something like that</title>
		<link>http://www.texttechnologies.com/2007/04/30/baynote-buzzwords/</link>
		<comments>http://www.texttechnologies.com/2007/04/30/baynote-buzzwords/#comments</comments>
		<pubDate>Tue, 01 May 2007 02:03:16 +0000</pubDate>
		<dc:creator>Curt Monash</dc:creator>
				<category><![CDATA[Baynote]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Search engine optimization (SEO)]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Social software and online media]]></category>
		<category><![CDATA[Software as a Service (SaaS)]]></category>
		<category><![CDATA[Specialized search]]></category>

		<guid isPermaLink="false">http://www.texttechnologies.com/2007/04/30/baynote-buzzwords/</guid>
		<description><![CDATA[Baynote sells a recommendation engine whose motto appears to be “popularity implies accuracy.” While that leads to some interesting technological ideas (below), Baynote carries that principle to an unfortunate extreme in its marketing, which is jam-packed with inaccurate buzzspeak. While most of that is focused on a few trendy meme-oriented books, the low point of [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">Baynote sells a recommendation engine whose motto appears to be “popularity implies accuracy.”<span> </span>While that leads to some interesting technological ideas (below), Baynote carries that principle to an unfortunate extreme in its marketing, which is jam-packed with inaccurate buzzspeak.<span> </span>While most of that is focused on a few trendy meme-oriented books, the low point of my briefing today was the probably the insistence against pushback that “95%” of Google’s results depend on “PageRank.” <span> </span>(I think what Baynote really meant is “all off-page factors combined,” but anyhow I sure didn’t get the sense that accuracy was an important metric for them in setting their briefing strategy.<span> </span>And by the way, one reason I repeat the company’s name rather than referring to Baynote by a pronoun is that on-page factors DO matter in search engine rankings.)</p>
<p class="MsoNormal">That said, here’s the essence of Baynote’s story, as best I could figure it out. <span id="more-105"></span></p>
<ul style="margin-top: 0in" type="disc">
<li class="MsoNormal">Baynote’s secret sauce is      a set of 20+ behavioral metrics to identify whether, if somebody clicks on      a page, they are SATISFIED with the content.</li>
<li class="MsoNormal">Based on that, Baynote      provides a “content recommendation” engine. (For now, the distinction      between “content” and “web page” is not important, but the concepts are in      my opinion diverging over time.) <span> </span>This      is manifested in two forms (a typical installation uses both).<span> </span>One is just a list of      recommendations.<span> </span>The other is in a      search engine – “social search” with an “implicit folksonomy” &#8212; and its      results pages.<span> </span>Both sit on web      pages as boxes/widgets.</li>
<li class="MsoNormal">Baynote’s first markets were online support and eMarketing.  The company is now rolling out eCommerce as well.   I didn’t get clarity about what was      different in the nature of the recommendations, if anything, that      underlies any small separation between these apps.<span> </span>(Baynote was clear about saying that the      differences were indeed small.)</li>
<li class="MsoNormal">The whole thing is SaaS,      built on a LAMP stack.  MySQL      4.something seems to suffice, which makes sense given that Baynote’s      system is not handling any significant transactions directly.<span> </span>That said, I didn’t push to understand      what it means for a search engine to be built on MySQL.<span> </span>This wasn’t the kind of conversation in      which one could elicit substantive detail.</li>
<li class="MsoNormal">Baynote claims that a      sample size of as few as 7-10 visitors liking a particular piece of      content suffices to provide a good basis for predicting who else will like      it.<span> </span>I’m not in a position to assess      the credibility or, more to the point, limitations of this claim.</li>
<li class="MsoNormal">Baynote has the philosophy      that they try to watch a user’s behavior on a site and map that to a “context.”<span> </span>I like that approach.</li>
<li class="MsoNormal">The company cites tested      stats of 20% net lift (revenue increase), with 50% of sales being touched      by its recommendations.<span> </span>Those      numbers don’t sound terribly impressive, perhaps unless they’re truly      additive to those provided by, say, Endeca, which is an announced partner.</li>
</ul>
<p class="MsoNormal"><em> </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.texttechnologies.com/2007/04/30/baynote-buzzwords/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
