<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.3.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: LOD cloud shows surprisingly lumpy structure</title>
	<link>http://blog.larkc.eu/?p=1941</link>
	<description></description>
	<pubDate>Wed, 08 Sep 2010 22:00:41 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.3</generator>
		<item>
		<title>By: Tom Morris</title>
		<link>http://blog.larkc.eu/?p=1941#comment-120421</link>
		<dc:creator>Tom Morris</dc:creator>
		<pubDate>Mon, 17 May 2010 20:36:05 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-120421</guid>
		<description>Any chance of scalable version (SVG?) of these graphics?  The Gephi version is better, but some sections are still too dense to read labels.

The assumption that starting data table is accurate may not be valid.  For the spot checking I did, it was missing significant chunks of stuff.  For example, Freebase and DBpedia are basically fully bidrectionally connected for the things they have in common (80%+ of DBpedia?), but this isn't represented at all.  Freebase also links to MusicBrainz and a number of other data sources.

One other thing worth noting is that it appears the diameter of the nodes now represents in-degree rather than number of triples stored.</description>
		<content:encoded><![CDATA[<p>Any chance of scalable version (SVG?) of these graphics?  The Gephi version is better, but some sections are still too dense to read labels.</p>
<p>The assumption that starting data table is accurate may not be valid.  For the spot checking I did, it was missing significant chunks of stuff.  For example, Freebase and DBpedia are basically fully bidrectionally connected for the things they have in common (80%+ of DBpedia?), but this isn&#8217;t represented at all.  Freebase also links to MusicBrainz and a number of other data sources.</p>
<p>One other thing worth noting is that it appears the diameter of the nodes now represents in-degree rather than number of triples stored.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jose Manuel Gomez-Perez</title>
		<link>http://blog.larkc.eu/?p=1941#comment-120281</link>
		<dc:creator>Jose Manuel Gomez-Perez</dc:creator>
		<pubDate>Mon, 17 May 2010 13:10:30 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-120281</guid>
		<description>Nice work. IMHO, this kind of visualizations can provide useful insight in order to improve LOD from an architectural viewpoint so to say, e.g. to detect scalability issues, maximize domain coverage, support detection of redundancies, etc. 

On the other hand, something that can be drawn from this particular work is a notion of the domains where LOD is having a greater impact currently: Bio and Biblio, complemented with general purpose knowledge from the DBPedia cluster.  However, this can be seen the other way around: probably these domains are the best represented ones because their respetive communities have traditionally had a deeper involvement in LOD.</description>
		<content:encoded><![CDATA[<p>Nice work. IMHO, this kind of visualizations can provide useful insight in order to improve LOD from an architectural viewpoint so to say, e.g. to detect scalability issues, maximize domain coverage, support detection of redundancies, etc. </p>
<p>On the other hand, something that can be drawn from this particular work is a notion of the domains where LOD is having a greater impact currently: Bio and Biblio, complemented with general purpose knowledge from the DBPedia cluster.  However, this can be seen the other way around: probably these domains are the best represented ones because their respetive communities have traditionally had a deeper involvement in LOD.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kingsley Idehen</title>
		<link>http://blog.larkc.eu/?p=1941#comment-109661</link>
		<dc:creator>Kingsley Idehen</dc:creator>
		<pubDate>Sun, 28 Mar 2010 22:10:12 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-109661</guid>
		<description>Belated comments.

Nice post.

The Linked Open Data realm is really a lumpy galaxy. It certainly isn't a DBpedia based Solar System :-)

Links:

1. http://bit.ly/90tUKJ -- old post titled State of the Linked Data Web.</description>
		<content:encoded><![CDATA[<p>Belated comments.</p>
<p>Nice post.</p>
<p>The Linked Open Data realm is really a lumpy galaxy. It certainly isn&#8217;t a DBpedia based Solar System <img src='http://blog.larkc.eu/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>Links:</p>
<p>1. <a href="http://bit.ly/90tUKJ" rel="nofollow">http://bit.ly/90tUKJ</a> &#8212; old post titled State of the Linked Data Web.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hugh Glaser</title>
		<link>http://blog.larkc.eu/?p=1941#comment-105071</link>
		<dc:creator>Hugh Glaser</dc:creator>
		<pubDate>Fri, 12 Mar 2010 10:33:27 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-105071</guid>
		<description>Nice work indeed.
Hopefully Anja (and others) won't have to spend their time doing the cloud by hand any more - thanks for the efforts over the years!

Looks like it is showing how disconnected RKB is from the rest of the world.

However, it isn't actually quite that bad, you will be pleased to hear. There are quite a lot of point-wise links from different RKB stores to dbpedia and others. It is just that I haven't found the time to put in small numbers of links (and didn't want to make the table overloaded with RKB stuff).
And actually there is a whole unlocode RKB site missing (with links to geonames and datagovuk I think), and probably some others.

A great thing about these pictures is that it is now easy for me to see such things, and dataset publishers will feel encouraged to keep the data up to date.</description>
		<content:encoded><![CDATA[<p>Nice work indeed.<br />
Hopefully Anja (and others) won&#8217;t have to spend their time doing the cloud by hand any more - thanks for the efforts over the years!</p>
<p>Looks like it is showing how disconnected RKB is from the rest of the world.</p>
<p>However, it isn&#8217;t actually quite that bad, you will be pleased to hear. There are quite a lot of point-wise links from different RKB stores to dbpedia and others. It is just that I haven&#8217;t found the time to put in small numbers of links (and didn&#8217;t want to make the table overloaded with RKB stuff).<br />
And actually there is a whole unlocode RKB site missing (with links to geonames and datagovuk I think), and probably some others.</p>
<p>A great thing about these pictures is that it is now easy for me to see such things, and dataset publishers will feel encouraged to keep the data up to date.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pieter De Leenheer</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104891</link>
		<dc:creator>Pieter De Leenheer</dc:creator>
		<pubDate>Thu, 11 Mar 2010 16:23:37 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104891</guid>
		<description>the formation of criticial hubs is typical to scale-free networks. Amazing that knowledge networks organise according to the same pattern as social networks. I would be interesting to compare both networks here and make correlations. This would indeed lead to the conclusion is the main champion driving discussions within the initiative. Where is the SIOC graph :-) ?</description>
		<content:encoded><![CDATA[<p>the formation of criticial hubs is typical to scale-free networks. Amazing that knowledge networks organise according to the same pattern as social networks. I would be interesting to compare both networks here and make correlations. This would indeed lead to the conclusion is the main champion driving discussions within the initiative. Where is the SIOC graph <img src='http://blog.larkc.eu/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joshua Shinavier</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104571</link>
		<dc:creator>Joshua Shinavier</dc:creator>
		<pubDate>Wed, 10 Mar 2010 20:56:11 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104571</guid>
		<description>Also check out Marko Rodriguez's analysis from March 2009: http://arxiv.org/abs/0903.0194
This paper likewise describes several distinct clusters of linkage.  Some other observations:
1) the LOD graph is not strongly connected
2) a diameter of 8 is large given relatively small size of the cloud
3) data sets in the cloud have nearly identical incoming and outgoing link patterns (which indicates that the majority of links are reciprocal owl:sameAs)</description>
		<content:encoded><![CDATA[<p>Also check out Marko Rodriguez&#8217;s analysis from March 2009: <a href="http://arxiv.org/abs/0903.0194" rel="nofollow">http://arxiv.org/abs/0903.0194</a><br />
This paper likewise describes several distinct clusters of linkage.  Some other observations:<br />
1) the LOD graph is not strongly connected<br />
2) a diameter of 8 is large given relatively small size of the cloud<br />
3) data sets in the cloud have nearly identical incoming and outgoing link patterns (which indicates that the majority of links are reciprocal owl:sameAs)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Brickley</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104561</link>
		<dc:creator>Dan Brickley</dc:creator>
		<pubDate>Wed, 10 Mar 2010 20:12:02 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104561</guid>
		<description>Re the 'two orgs' vulnerability; since this is all open shared data, you would hope that the data would still be useful and used even if the original source drops off the 'net. Of course RDF and the Linked Data model especially put particular weight on domain-name based HTTP URIs. So I guess we should take care to pick 'em carefully, cos changing them afterwards is going to be tricky...</description>
		<content:encoded><![CDATA[<p>Re the &#8216;two orgs&#8217; vulnerability; since this is all open shared data, you would hope that the data would still be useful and used even if the original source drops off the &#8216;net. Of course RDF and the Linked Data model especially put particular weight on domain-name based HTTP URIs. So I guess we should take care to pick &#8216;em carefully, cos changing them afterwards is going to be tricky&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PaulGroth</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104501</link>
		<dc:creator>PaulGroth</dc:creator>
		<pubDate>Wed, 10 Mar 2010 16:39:43 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104501</guid>
		<description>Interesting observation by @Richard, two of the major clusters of the LOD (scholarly data and bio data ) are run by two organizations. So if Southampton and Bio2rdf collapse these chunks die. Would nice to see more organizations hosting or make sure that these organizations have enough support so that they can ensure uptime.</description>
		<content:encoded><![CDATA[<p>Interesting observation by @Richard, two of the major clusters of the LOD (scholarly data and bio data ) are run by two organizations. So if Southampton and Bio2rdf collapse these chunks die. Would nice to see more organizations hosting or make sure that these organizations have enough support so that they can ensure uptime.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pieter De Leenheer</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104491</link>
		<dc:creator>Pieter De Leenheer</dc:creator>
		<pubDate>Wed, 10 Mar 2010 16:33:06 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104491</guid>
		<description>Great to see the network effect here !</description>
		<content:encoded><![CDATA[<p>Great to see the network effect here !</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: FrankVanHarmelen</title>
		<link>http://blog.larkc.eu/?p=1941#comment-104481</link>
		<dc:creator>FrankVanHarmelen</dc:creator>
		<pubDate>Wed, 10 Mar 2010 16:24:30 +0000</pubDate>
		<guid>http://blog.larkc.eu/?p=1941#comment-104481</guid>
		<description>Christophe Gueret's script for generating the .net file is at http://bit.ly/aho1Ut</description>
		<content:encoded><![CDATA[<p>Christophe Gueret&#8217;s script for generating the .net file is at <a href="http://bit.ly/aho1Ut" rel="nofollow">http://bit.ly/aho1Ut</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>
