<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: How is YouTube relating videos?</title>
	<atom:link href="http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/</link>
	<description>Understanding technology ... in both senses of the phrase</description>
	<lastBuildDate>Thu, 19 Jan 2012 17:11:01 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.3</generator>
	<item>
		<title>By: Susheel</title>
		<link>http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/#comment-53969</link>
		<dc:creator>Susheel</dc:creator>
		<pubDate>Thu, 13 Nov 2008 07:12:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.texttechnologies.com/?p=221#comment-53969</guid>
		<description>There is a paper published on this:
http://www.esprockets.com/papers/adsorption-yt.pdf</description>
		<content:encoded><![CDATA[<p>There is a paper published on this:<br />
<a href="http://www.esprockets.com/papers/adsorption-yt.pdf" onclick="javascript:pageTracker._trackPageview('/outbound/comment/www.esprockets.com');" rel="nofollow">http://www.esprockets.com/papers/adsorption-yt.pdf</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lawrence Miao</title>
		<link>http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/#comment-39656</link>
		<dc:creator>Lawrence Miao</dc:creator>
		<pubDate>Tue, 20 May 2008 14:55:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.texttechnologies.com/?p=221#comment-39656</guid>
		<description>I read a paper today solving the same problem in a similar site. The model is built in a bipartite graph, one set is video set, another set is keyword set. The keywords are parsed by natural language processing tools from tags provided by user, and also the title. The paper co-clustered the graph by using some information theory based metrics. After getting the co-clusters, by ranking, we can get the &#039;hot&#039; topics and by traversal property or connectivity of the bipartite graph, we can get related videos. 

according to the tech:
The tags of Material Girl are:
self-parody  economic  metaphor  high  camp  golddigga  bling  ice  cashmoney  holla  OG  madonna  material  girl  

The tags of Girls Just Want to Have Fun:
Girls  Just  Want  to  Have  Fun  Cyndi  Lauper  Music 

The tags of Diamonds Are A Girl&#039;s Best Friend:
marilyn  monroe  diamonds  girl  best  friend  gentlemen  prefer  blondes  song  songs  movie  movies 

All of them have &#039;girl&#039;, from this, all of the three are connected in the bipartite graph, other techniques can be used to do ranking. There are lots of ranking schemes available. For example, if in the ranking, Youtube use data mining techs like (sequence) frequent patterns of user click sequence history, there&#039;ll be high probability, user like all of the three and watch them all in a sequence.

This is the first time, I made comment here. Your site is very useful for me. Thanks for your work. : )

btw, the paper title is 
Web video topic discovery and tracking via bipartite graph reinforcement model, 
by
Lu Liu, Lifeng Sun, Yong Rui, Yao Shi, Shiqiang Yang,
appeared in WWW 2008</description>
		<content:encoded><![CDATA[<p>I read a paper today solving the same problem in a similar site. The model is built in a bipartite graph, one set is video set, another set is keyword set. The keywords are parsed by natural language processing tools from tags provided by user, and also the title. The paper co-clustered the graph by using some information theory based metrics. After getting the co-clusters, by ranking, we can get the &#8216;hot&#8217; topics and by traversal property or connectivity of the bipartite graph, we can get related videos. </p>
<p>according to the tech:<br />
The tags of Material Girl are:<br />
self-parody  economic  metaphor  high  camp  golddigga  bling  ice  cashmoney  holla  OG  madonna  material  girl  </p>
<p>The tags of Girls Just Want to Have Fun:<br />
Girls  Just  Want  to  Have  Fun  Cyndi  Lauper  Music </p>
<p>The tags of Diamonds Are A Girl&#8217;s Best Friend:<br />
marilyn  monroe  diamonds  girl  best  friend  gentlemen  prefer  blondes  song  songs  movie  movies </p>
<p>All of them have &#8216;girl&#8217;, from this, all of the three are connected in the bipartite graph, other techniques can be used to do ranking. There are lots of ranking schemes available. For example, if in the ranking, Youtube use data mining techs like (sequence) frequent patterns of user click sequence history, there&#8217;ll be high probability, user like all of the three and watch them all in a sequence.</p>
<p>This is the first time, I made comment here. Your site is very useful for me. Thanks for your work. : )</p>
<p>btw, the paper title is<br />
Web video topic discovery and tracking via bipartite graph reinforcement model,<br />
by<br />
Lu Liu, Lifeng Sun, Yong Rui, Yao Shi, Shiqiang Yang,<br />
appeared in WWW 2008</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Curt Monash</title>
		<link>http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/#comment-39618</link>
		<dc:creator>Curt Monash</dc:creator>
		<pubDate>Tue, 20 May 2008 06:54:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.texttechnologies.com/?p=221#comment-39618</guid>
		<description>Better theory than any I came up with.  Thanks!

CAM</description>
		<content:encoded><![CDATA[<p>Better theory than any I came up with.  Thanks!</p>
<p>CAM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rtl</title>
		<link>http://www.texttechnologies.com/2008/05/19/how-is-youtube-relating-videos/#comment-39591</link>
		<dc:creator>rtl</dc:creator>
		<pubDate>Tue, 20 May 2008 02:50:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.texttechnologies.com/?p=221#comment-39591</guid>
		<description>almost certainly normalized co-occurrences of favorites or views.

The tags are completely different so it&#039;s not mutual tags.  No freaking way on the visual comparison.  There&#039;s no place to manually enter the &quot;historical&quot; connection and that&#039;s obviously not scalable.  Mining the comments text is messy.  One other option would be show candidate videos randomly in that space and rank them by eCTR; but the candidate list would anyways have to be generated using one of these techniques.</description>
		<content:encoded><![CDATA[<p>almost certainly normalized co-occurrences of favorites or views.</p>
<p>The tags are completely different so it&#8217;s not mutual tags.  No freaking way on the visual comparison.  There&#8217;s no place to manually enter the &#8220;historical&#8221; connection and that&#8217;s obviously not scalable.  Mining the comments text is messy.  One other option would be show candidate videos randomly in that space and rank them by eCTR; but the candidate list would anyways have to be generated using one of these techniques.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

