<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.sindice.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Sindice Blog</title>
	
	<link>http://blog.sindice.com</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Fri, 09 Jul 2010 01:04:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.sindice.com/SindiceBlog" /><feedburner:info uri="sindiceblog" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Sindice now supports Efficient Data discovery and Sync</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/AytyvlV4Efs/</link>
		<comments>http://blog.sindice.com/2010/07/09/sindice-now-supports-efficient-data-discovery-and-sync/#comments</comments>
		<pubDate>Fri, 09 Jul 2010 01:04:10 +0000</pubDate>
		<dc:creator>Giovanni Tummarello</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=264</guid>
		<description><![CDATA[So far semantic web search engines and semantic aggregation services have been inserting datasets by hand or have been based on &#8220;random walk&#8221; like crawls with no data completeness or freshness guarantees. After quite some work, we are happy to announce that Sindice is now supporting effective large scale data acquisition with *efficient syncing* capabilities based on [...]]]></description>
			<content:encoded><![CDATA[<p>So far semantic web search engines and semantic aggregation services have been inserting datasets by hand or have been based on &#8220;random walk&#8221; like crawls with no data completeness or freshness guarantees.</p>
<p>After quite some work, we are happy to announce that Sindice is now supporting effective large scale data acquisition with *efficient syncing* capabilities based on already existing standards (a specific use of  the sitemap protocol).</p>
<p>For example if you publish 300000 products using RDFa or whatever you want to use (microformats,  303s etc), by making sure you comply to the proposed method, Sindice will now guarantee you</p>
<p>a) to crawl your dataset completely (might take some time since we do this &#8220;politely&#8221;)</p>
<p>b) ..but only crawl you once and then get just the updated URLs on a daily bases! (so timely data update guarantee)</p>
<p>So this is not &#8220;Crawling&#8221; anymore, but rather a live &#8220;DB like&#8221; connection between remote, diverse dataset all based on http. in our opinion this is a *very* important step forward for semantic web data aggregation infrastructures.</p>
<p>The specification we support (and how to make sure you&#8217;re being properly indexed) are published here  (pretty simple stuff actually!)</p>
<p><a href="http://sindice.com/developers/publishing" target="_blank">http://sindice.com/developers/publishing</a></p>
<p>and results can be seen from websites which are already implementing these (you might be already doing that indeed without knowing..)</p>
<p><a href="http://sindice.com/search?q=domain:www.scribd.com+date:last_week&amp;qt=term" target="_blank">http://sindice.com/search?q=domain:www.scribd.com+date:last_week&amp;qt=term</a></p>
<p>Why not make sure that your site can be effectively kept in sync today?</p>
<p>As always  we look forward for comments, suggestions and ideas on how to serve better your data needs (e.g. yes, we&#8217;ll also support Openlink dataset sync proposal once the specs are finalized). Feel free to ask specific questions about this or any other Sindice related issue on our dev forum <a href="http://sindice.com/main/forum" target="_blank">http://sindice.com/main/forum</a></p>
<p>Giovanni,</p>
<p>on behalf of the team <a href="http://sindice.com/main/about" target="_blank">http://sindice.com/main/about</a>. Special credits for this to Tamas Benko and Robert Fuller.</p>
<p>p.s. we&#8217;re still interested in hiring selected researchers and developers</p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F&amp;title=Sindice+now+supports+Efficient+Data+discovery+and+Sync" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F&amp;title=Sindice+now+supports+Efficient+Data+discovery+and+Sync" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F&amp;title=Sindice+now+supports+Efficient+Data+discovery+and+Sync" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F&amp;headline=Sindice+now+supports+Efficient+Data+discovery+and+Sync" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F&amp;title=Sindice+now+supports+Efficient+Data+discovery+and+Sync&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F07%2F09%2Fsindice-now-supports-efficient-data-discovery-and-sync%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=AytyvlV4Efs:IpWSddODL-8:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=AytyvlV4Efs:IpWSddODL-8:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=AytyvlV4Efs:IpWSddODL-8:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=AytyvlV4Efs:IpWSddODL-8:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=AytyvlV4Efs:IpWSddODL-8:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/AytyvlV4Efs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/07/09/sindice-now-supports-efficient-data-discovery-and-sync/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/07/09/sindice-now-supports-efficient-data-discovery-and-sync/</feedburner:origLink></item>
		<item>
		<title>Sindice planned downtime this weekend</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/_sTd7NC_IOk/</link>
		<comments>http://blog.sindice.com/2010/06/09/sindice-planned-downtime-this-weekend/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 09:58:17 +0000</pubDate>
		<dc:creator>smulcahy</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=258</guid>
		<description><![CDATA[Hi. Due to an expansion of one of our datacentres (and the electrical work that this implies), Sindice and related services such as sig.ma will be down from 1730 GMT+1, 11-Jun-2010 (Friday) to 1730 GMT+1, 12-Jun-2010 (Saturday). This major upgrade will give us increased room to grow the Sindice infrastructure over time. On 27-May-2010 we [...]]]></description>
			<content:encoded><![CDATA[<p>Hi. Due to an expansion of one of our datacentres (and the electrical work that this implies), Sindice and related services such as <a href="http://sig.ma">sig.ma</a> will be <strong>down  from 1730 GMT+1, 11-Jun-2010 (Friday) to 1730 GMT+1, 12-Jun-2010  (Saturday)</strong>. This major upgrade will give us increased room to grow the Sindice infrastructure over time. On 27-May-2010 we hit a major milestone in Sindice of having indexed 100 million documents (over 6.5 billion  triples) from the semantic web. At current rates of data acquisition, we expect to hit the 200 million document mark before Christmas &#8211; so we&#8217;ll need that extra room!</p>
<p>If you have any further queries on this downtime, please leave a  comment or contact us via the <a href="http://groups.google.com/group/sindice-dev">Sindice Developers  group</a>.</p>
<p><strong>Update:</strong> Thanks to the efforts of NUIG&#8217;s ISS team sindice.com is back online ahead of schedule. All services (including <a href="http://sig.ma">sig.ma</a> should now be operational).</p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F&amp;title=Sindice+planned+downtime+this+weekend" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F&amp;title=Sindice+planned+downtime+this+weekend" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F&amp;title=Sindice+planned+downtime+this+weekend" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F&amp;headline=Sindice+planned+downtime+this+weekend" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Sindice+planned+downtime+this+weekend&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Sindice+planned+downtime+this+weekend&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Sindice+planned+downtime+this+weekend&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Sindice+planned+downtime+this+weekend&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Sindice+planned+downtime+this+weekend&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F&amp;title=Sindice+planned+downtime+this+weekend&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F06%2F09%2Fsindice-planned-downtime-this-weekend%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=_sTd7NC_IOk:LtP2xhObEiI:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=_sTd7NC_IOk:LtP2xhObEiI:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=_sTd7NC_IOk:LtP2xhObEiI:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=_sTd7NC_IOk:LtP2xhObEiI:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=_sTd7NC_IOk:LtP2xhObEiI:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/_sTd7NC_IOk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/06/09/sindice-planned-downtime-this-weekend/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/06/09/sindice-planned-downtime-this-weekend/</feedburner:origLink></item>
		<item>
		<title>Any23 v0.4.0 Released</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/YIqp4noTuqg/</link>
		<comments>http://blog.sindice.com/2010/05/27/any23-0-4-0-released/#comments</comments>
		<pubDate>Thu, 27 May 2010 12:41:10 +0000</pubDate>
		<dc:creator>micmos</dc:creator>
				<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=240</guid>
		<description><![CDATA[Dear All, the Sindice FBK team is proud to announce the Any23 0.4.0 release. In this new release we paid particular attention in data validation and correction, in  particular  we can claim  to extract the  Open Graph Protocol[1]  metadata also whether affected by syntactical errors[2]. We&#8217;ve also added full support for the N-Quads[3] format. As [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright" src="http://developers.any23.org/images/logo-any23.png" alt="The Any23 logo" width="254" height="156" /></p>
<div>Dear All,</div>
<div>the Sindice FBK team is proud to announce the Any23 <strong>0.4.0</strong> release.</div>
<p></p>
<div>In this new release we paid particular attention in data validation and correction,</div>
<div>in  particular  we can claim  to extract the  Open Graph Protocol[1]  metadata also</div>
<div>whether affected by syntactical errors[2].</div>
<p></p>
<div>We&#8217;ve also added full support for the N-Quads[3] format.</div>
<p></p>
<div>As usual everybody is invited to adopt this new release and</div>
<div>report any encountered bug[4].</div>
<p></p>
<div>A live demo is running at [5], please feel free to try it.</div>
<p></p>
<div>We’re planning the milestone 0.5.0, so if you are waiting for the fix of</div>
<div>a particular improvement please submit it to us using our issue tracker[4].</div>
<p></p>
<div>Below an extract of the 0.4.0 release note [6]:</div>
<div>
<ul>
<li>The any23-service module has been separated from the any23-core module, the Ant build system has been dropped. <strong>[Issue 44]</strong></li>
<li>Added support for HTML metadata (RDFa / Microformats) validation and correction (validator). <strong>[Issue 77]</strong></li>
<li>Added flag to disable the nesting relationship property enrichment.<strong> [Issue 67]</strong></li>
<li>Improved coverage of Microformat tests.<strong> [Issue 65]</strong></li>
<li>Improved documentation. <strong>[Issue 44]</strong></li>
<li>Various code consolidation. <strong>[Issues 68, 69, 70, 71, 72, 73, 74, 77]</strong></li>
</ul>
</div>
<p></p>
<div>Thanks for supporting our work.</div>
<p></p>
<div>The Any23 Developers Team</div>
<p></p>
<div>[1] <a href="http://opengraphprotocol.org/">http://opengraphprotocol.org/</a></div>
<div>[2] <a href="http://developers.any23.org/ill-formed-rdfa.html">http://developers.any23.org/ill-formed-rdfa.html</a></div>
<div>[3] <a href="http://sw.deri.org/2008/07/n-quads/">http://sw.deri.org/2008/07/n-quads/</a></div>
<div>[4] <a href="http://code.google.com/p/any23/issues/list">http://code.google.com/p/any23/issues/list</a></div>
<div>[5] <a href="http://any23.org/">http://any23.org/</a></div>
<div>[6] <a href="http://any23.googlecode.com/svn/trunk/RELEASE-NOTES.txt">http://any23.googlecode.com/svn/trunk/RELEASE-NOTES.txt</a></div>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F&amp;title=Any23+v0.4.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F&amp;title=Any23+v0.4.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F&amp;title=Any23+v0.4.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F&amp;headline=Any23+v0.4.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Any23+v0.4.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Any23+v0.4.0+Released&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Any23+v0.4.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Any23+v0.4.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Any23+v0.4.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F&amp;title=Any23+v0.4.0+Released&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F05%2F27%2Fany23-0-4-0-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=YIqp4noTuqg:uJSOAvvES2Y:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=YIqp4noTuqg:uJSOAvvES2Y:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=YIqp4noTuqg:uJSOAvvES2Y:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=YIqp4noTuqg:uJSOAvvES2Y:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=YIqp4noTuqg:uJSOAvvES2Y:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/YIqp4noTuqg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/05/27/any23-0-4-0-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/05/27/any23-0-4-0-released/</feedburner:origLink></item>
		<item>
		<title>Any23 v0.3.0 Released</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/S4--pBzsHBQ/</link>
		<comments>http://blog.sindice.com/2010/04/23/any23-v0-3-0-release/#comments</comments>
		<pubDate>Fri, 23 Apr 2010 17:05:23 +0000</pubDate>
		<dc:creator>micmos</dc:creator>
				<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=225</guid>
		<description><![CDATA[Dear All, we&#8217;re pleased to announce the Any23 0.3.0 release. Please keep in mind this is a beta, so everybody using Any23 in a development session is invited to migrate to this latest version and report in our issue tracker [1] any eventual bug. As usual we have a live demo running at [2], please feel [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright" src="http://developers.any23.org/images/logo-any23.png" alt="The Any23 logo" width="254" height="156" /></p>
<p>Dear All,</p>
<p>we&#8217;re pleased to announce the Any23 0.3.0 release.</p>
<p>Please keep in mind this is a beta, so everybody using Any23 in a development<br />
session is invited to migrate to this latest version and report in our issue tracker [1]<br />
any eventual bug.</p>
<p>As usual we have a live demo running at [2], please feel free to try it.</p>
<p>We&#8217;re planning the Milestone 0.4, so if you are waiting for the fix of a<br />
particular issue please verify that it is open, and eventually add a comment to ask more priority.</p>
<p>To end, below you&#8217;ll find an extract of the 0.3.0 release note [3]:</p>
<ol>
<li>Added detection and enrichment of nested microformats. [Issue #61]</li>
<li>Added detection and support of N-Quads as input and output format. [Issue #7]</li>
<li>General Improvements in RDFa extraction. [Issue #12, Issue #14]</li>
<li>Added support of Turtle embedded in HTML script tag. [Issue #62]</li>
<li>Improvement in encoding support. [Issue #43]</li>
<li>Improvement in Core API. [Issue #27]</li>
<li>Improved support for Species Microformat. [Issue #63]</li>
</ol>
<p>Thanks for supporting our work.</p>
<p>The Any23 Developers Team</p>
<p>[1] <a href="http://code.google.com/p/any23/issues/list" target="_blank">http://code.google.com/p/any23/issues/list</a><br />
[2] <a href="http://any23.org/" target="_blank">http://any23.org/</a><br />
[3] <a href="http://any23.googlecode.com/svn/trunk/any23-core/RELEASE-NOTES.txt" target="_blank">http://any23.googlecode.com/svn/trunk/any23-core/RELEASE-NOTES.txt</a></p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F&amp;title=Any23+v0.3.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F&amp;title=Any23+v0.3.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F&amp;title=Any23+v0.3.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F&amp;headline=Any23+v0.3.0+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Any23+v0.3.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Any23+v0.3.0+Released&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Any23+v0.3.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Any23+v0.3.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Any23+v0.3.0+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F&amp;title=Any23+v0.3.0+Released&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F04%2F23%2Fany23-v0-3-0-release%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=S4--pBzsHBQ:w93COgE_WK8:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=S4--pBzsHBQ:w93COgE_WK8:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=S4--pBzsHBQ:w93COgE_WK8:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=S4--pBzsHBQ:w93COgE_WK8:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=S4--pBzsHBQ:w93COgE_WK8:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/S4--pBzsHBQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/04/23/any23-v0-3-0-release/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/04/23/any23-v0-3-0-release/</feedburner:origLink></item>
		<item>
		<title>Any23 v0.2 Released</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/os5HytlFHQQ/</link>
		<comments>http://blog.sindice.com/2010/02/19/any23-v0-2-released/#comments</comments>
		<pubDate>Fri, 19 Feb 2010 14:41:17 +0000</pubDate>
		<dc:creator>micmos</dc:creator>
				<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=190</guid>
		<description><![CDATA[We are proud to announce a new release of Any23 &#8211; Anything to Triples http://developers.any23.org/ Any23 is a Java library that parses RDF from a variety of Web document formats. The currently supported input formats are RDFa, RDF/XML, Turtle, N3, N-Triples, and a number of Microformats. Any23 is an Open Source project originated from the code created within [...]]]></description>
			<content:encoded><![CDATA[<div>We are proud to announce a new release of <em>Any23</em> &#8211; <em><strong>Anything to Triples</strong></em></div>
<div><a title="http://developers.any23.org/" href="http://developers.any23.org/">http://developers.any23.org/</a></div>
<div style="text-align: left">Any23 is a Java library that parses RDF from a variety of Web document formats.</div>
<div style="text-align: left">The currently supported input formats are RDFa, RDF/XML, Turtle, N3, N-Triples,</div>
<div style="text-align: left">and a number of Microformats.</div>
<div><em>Any23</em> is an Open Source project originated from the code created within the Sindice project</div>
<div>and now used both inside sindice and in related projects e.g. <a href="http://sig.ma/">Sig.Ma</a> .</div>
<div><em>Any23</em> comes with a handy command-line tool for parsing RDF and converting between formats.</div>
<div>We have also set up a demo service where you can try any23 online and use a REST API to convert</div>
<div>between different RDF formats, similar in spirit to triplr.org:</div>
<div><a href="http://any23.org/">http://any23.org/</a></div>
<div>The major new features in this release are:</div>
<div>
<ul>
<li>Redesigned Java API</li>
<li>-  Input from string, stream, file, or URI</li>
<li>-  Allow choosing which extractors to use</li>
<li>-  Report origin of triples (document/extractor) to client processors</li>
<li>-  Various processors/serializers for extracted triples</li>
<li>Added flexible command-line tool for easy testing</li>
<li>Vastly improved website and documentation</li>
<li>Media type and encoding detection via Apache Tika</li>
<li>Switched RDF library from Jena to Sesame</li>
<li>Added Maven build</li>
<li>Better RDF extraction from Microformats</li>
<li>Extractors come with example file to document typical in- and output</li>
<li>Major refactoring</li>
<li>Lots and lots of bugfixes</li>
</ul>
</div>
<div>The following people have contributed to this release:</div>
<div>Michele Mostarda and Davide Palmisano (FBK, Trento, Italy, Web of Data Unit (WED) );</div>
<div>Richard Cyganiak and Jurgen Umbrich (DERI, NUI Galway, Ireland);</div>
<div>Michele Catasta (EPFL, Lausanne, Switzerland), Giovanni Tummarello.</div>
<div>This release is the first result of the joint effort between Fondazione Bruno Kessler and DERI,</div>
<div>that recently started working together on Sindice. We strongly believe that Any23 could benefit from the wide</div>
<div>Open Source community, especially considering the license under which it has been released.</div>
<div>We think that the new Any23 v0.2, now integrated in the Sindice ingestion pipeline,</div>
<div>will impact on the quality of the indexed data.</div>
<div>This release adds other pieces of Open Sources in Sindice, notably the Semantic Information Retrieval Index (SIREN) available at <a href="http://siren.sindice.com">http://siren.sindice.com</a> .</div>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F&amp;title=Any23+v0.2+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F&amp;title=Any23+v0.2+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F&amp;title=Any23+v0.2+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F&amp;headline=Any23+v0.2+Released" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Any23+v0.2+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Any23+v0.2+Released&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Any23+v0.2+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Any23+v0.2+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Any23+v0.2+Released&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F&amp;title=Any23+v0.2+Released&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F19%2Fany23-v0-2-released%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=os5HytlFHQQ:Za0riDnTPXc:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=os5HytlFHQQ:Za0riDnTPXc:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=os5HytlFHQQ:Za0riDnTPXc:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=os5HytlFHQQ:Za0riDnTPXc:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=os5HytlFHQQ:Za0riDnTPXc:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/os5HytlFHQQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/02/19/any23-v0-2-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/02/19/any23-v0-2-released/</feedburner:origLink></item>
		<item>
		<title>Sindice downtime notice</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/iMEeZs14M3I/</link>
		<comments>http://blog.sindice.com/2010/02/08/sindice-downtime-notice/#comments</comments>
		<pubDate>Mon, 08 Feb 2010 15:17:32 +0000</pubDate>
		<dc:creator>smulcahy</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Sindice]]></category>
		<category><![CDATA[Availability]]></category>
		<category><![CDATA[downtime]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=184</guid>
		<description><![CDATA[We&#8217;re talking about downtime again on the Sindice project but this time it&#8217;s not due to climate change or some other unforeseen event. Our datacentre cooling system is getting a major upgrade. Unfortunately, due to the scale of this upgrade (the existing cooling system which takes up a significant area of the datacentre needs to [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re talking about downtime again on the Sindice project but this time it&#8217;s not due to climate change or some other unforeseen event. Our datacentre cooling system is getting a major upgrade. Unfortunately, due to the scale of this upgrade (the existing cooling system which takes up a significant area of the datacentre needs to be removed, and a new set of units need to be installed) &#8211; we need to bring down everything running in the datacentre.</p>
<p>While technically possible to host a second Sindice site in a second datacentre (we do have access to such a datacentre) &#8211; it&#8217;s not a scenario we have examined in detail to date due to the overall high availability of our current datacentre. In addition, Sindice currently runs on 14 servers &#8211; meaning we&#8217;d have to have 14 servers sitting around in standby mode. While we&#8217;d like to have 100% availability through fail-over, we prefer to offer the higher performance on a day to day basis using those 14 servers in other ways (including a new Hadoop cluster which we&#8217;re in the process of bringing online &#8211; more about that in a future posting).</p>
<p>In summary, the current schedule for our datacentre is to be <strong>down from 0000 GMT, 05-Mar-2010 (Friday) to 2000 GMT, 08-Mar-2010 (Monday)</strong>. We&#8217;re hoping the people working on the datacentre upgrades can get things done sooner but the schedule is pretty aggressive already. Obviously if we get things back up sooner, we&#8217;ll let you know.</p>
<p>If you have any further queries on this downtime, please leave a comment or contact us via the <a href="http://groups.google.com/group/sindice-dev">Sindice Developers group</a>.</p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F&amp;title=Sindice+downtime+notice" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F&amp;title=Sindice+downtime+notice" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F&amp;title=Sindice+downtime+notice" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F&amp;headline=Sindice+downtime+notice" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Sindice+downtime+notice&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Sindice+downtime+notice&amp;u=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Sindice+downtime+notice&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Sindice+downtime+notice&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Sindice+downtime+notice&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F&amp;title=Sindice+downtime+notice&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2010%2F02%2F08%2Fsindice-downtime-notice%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=iMEeZs14M3I:2sTEuxa41z4:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=iMEeZs14M3I:2sTEuxa41z4:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=iMEeZs14M3I:2sTEuxa41z4:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=iMEeZs14M3I:2sTEuxa41z4:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=iMEeZs14M3I:2sTEuxa41z4:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/iMEeZs14M3I" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2010/02/08/sindice-downtime-notice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2010/02/08/sindice-downtime-notice/</feedburner:origLink></item>
		<item>
		<title>Sindice outage</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/PsGlMSaTENs/</link>
		<comments>http://blog.sindice.com/2009/11/25/sindice-outage/#comments</comments>
		<pubDate>Wed, 25 Nov 2009 10:47:08 +0000</pubDate>
		<dc:creator>smulcahy</dc:creator>
				<category><![CDATA[Sigma]]></category>
		<category><![CDATA[Sindice]]></category>
		<category><![CDATA[Availability]]></category>
		<category><![CDATA[DERI]]></category>
		<category><![CDATA[Flooding]]></category>
		<category><![CDATA[Outage]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=178</guid>
		<description><![CDATA[Since I started worked in DERI in January of this year, one of my jobs has been to improve the Sindice infrastructure to make it more robust and fault-tolerant. The Sindice infrastructure consists of a total of 15 servers (including two small hadoop clusters). We&#8217;ve taken various actions to improve availability including adopting a basic [...]]]></description>
			<content:encoded><![CDATA[<p>Since I started worked in <a href="http://www.deri.ie">DERI</a> in January of this year, one of my jobs  has been to improve the <a href="http://sindice.com">Sindice</a> infrastructure to make it more robust  and fault-tolerant. The Sindice infrastructure consists of a total of 15  servers (including two small <a href="http://hadoop.apache.org/">hadoop</a> clusters). We&#8217;ve taken various  actions to improve availability including adopting a basic release  process, production-hardening the code, monitoring individual subsystems  and tuning systems which exhibited higher failure rates. We&#8217;ve been  using <a href="http://www.zabbix.com/">Zabbix</a> as our monitoring framework since April. It indicates that  the overall availability of Sindice&#8217;s search functionality since  20-Apr-2009 has been 94.2%. This means that Sindice has been unavailable  for a total of about 12 days. For a system built on bleeding edge  components and developed as a set of research projects first, and a  coherent production infrastructure second, this isn&#8217;t bad but we are continuously working on improving this (as to whether we&#8217;ll ever achieve  the mythical <a href="http://www.continuitycentral.com/feature0267.htm">five nines</a>, that&#8217;s a discussion  for another day). About half of that downtime is accounted for by  planned outages for maintenance, system upgrades and infrastructure  maintenance.</p>
<p>The most recent outage we suffered (for about 2.5 days) started on  18-Nov-2009 and the source of the outage was somewhat more serious than  a compnent or server failure. Those of you from Ireland will be aware  that after very heavy rain, the country experienced <a href="http://www.irishtimes.com/newspaper/breaking/2009/1120/breaking3.htm">severe flooding in  various areas</a>.  Unfortunately, this excessive rainfall caused part of the <a href="http://blog.deri.ie/index.php?id=452&amp;tx_ttnews[tt_news]=594&amp;tx_ttnews[year]=2009&amp;tx_ttnews[month]=11&amp;tx_ttnews[day]=20&amp;cHash=79ad8209af">ground floor  of the DERI building to flood</a>.  Thanks to the quick response from DERI staff members and <a href="http://www.nuigalway.ie">NUI Galway</a> facilities people, all systems in our data centre were shutdown before  any damage was caused (water and electricity don&#8217;t mix all that well).  The Sindice infrastructure is entirely located in the DERI building at  this time (while we share some facilities with the main NUI Galway data  centre, we don&#8217;t currently have a fully replicated infrastructure for  the Sindice project). Once the cause of the flooding had been addressed  and the data centre had been fully dried out, we took some time to  verify that all of the electrical infrastructure was intact before we  proceeded to restart the Sindice systems on Friday morning. I&#8217;m happy to  report that all systems came back up without problems and Sindice (and  related projects including <a href="http://sig.ma">Sig.ma</a>) resumed operations.</p>
<p>Obviously, we&#8217;ve learned some lessons during this outage &#8211; we&#8217;re  currently identifying various measures we can put in place to avoid  similar problems in the future and we&#8217;re also taking the opportunity to  review the overall <a href="http://en.wikipedia.org/wiki/Disaster_recovery">disaster recovery</a> and <a href="http://en.wikipedia.org/wiki/Business_continuity_planning">business continuity</a> measures we  have in place for DERI and the services provided by DERI such as  Sindice. As a result of this incident, we believe we&#8217;ll be able to  deliver a more robust and reliable service and maybe even reach two or  three nines availability for Sindice in 2010!</p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F&amp;title=Sindice+outage" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F&amp;title=Sindice+outage" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F&amp;title=Sindice+outage" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F&amp;headline=Sindice+outage" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Sindice+outage&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Sindice+outage&amp;u=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Sindice+outage&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Sindice+outage&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Sindice+outage&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F&amp;title=Sindice+outage&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F11%2F25%2Fsindice-outage%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=PsGlMSaTENs:LVyOmGuzrrE:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=PsGlMSaTENs:LVyOmGuzrrE:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=PsGlMSaTENs:LVyOmGuzrrE:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=PsGlMSaTENs:LVyOmGuzrrE:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=PsGlMSaTENs:LVyOmGuzrrE:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/PsGlMSaTENs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2009/11/25/sindice-outage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2009/11/25/sindice-outage/</feedburner:origLink></item>
		<item>
		<title>New: Inspector, Full Cache API – all with Online Data Reasoning</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/QS1vYteC7Ew/</link>
		<comments>http://blog.sindice.com/2009/10/12/new-inspector-full-cache-api-all-with-online-data-reasoning/#comments</comments>
		<pubDate>Mon, 12 Oct 2009 23:25:41 +0000</pubDate>
		<dc:creator>Giovanni Tummarello</dc:creator>
				<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=128</guid>
		<description><![CDATA[We&#8217;re happy to release today 2 distinct yet interplaying features in Sindice: The Sindice Inspector and the Sindice Cache API (both including Sindice&#8217;s Online Data Reasoning). A) Sindice Inspector - Takes anything with structured data on (RDF, RDFa, Microformats), and provides several handy ways: in a &#8220;Sigma&#8221; based view a novel card/frame based view a [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re happy to release today 2 distinct yet interplaying features in Sindice: The <em>Sindice Inspector</em> and the <em>Sindice Cache API</em> (both including Sindice&#8217;s <em>Online Data Reasoning).</em></p>
<p><strong>A) Sindice Inspector</strong></p>
<p>- Takes anything with structured data on (RDF, RDFa, Microformats), and provides several handy ways:</p>
<ul>
<li>in a &#8220;Sigma&#8221; based view</li>
<li>a novel card/frame based view</li>
<li>a SVG based interactive graph view (a la google map)</li>
<li>sortable triples, with prettyprint namespace support</li>
<li>full ontology tree view for Online Data Reasoning  debugging</li>
</ul>
<p>- Does live <em>Online Data Reasoning</em>: allows a data publisher to see which ontologies are implicitly or explicitly (directly or indirectly, via other ontologies) and</p>
<ul>
<li>visualizes the full closure of inferred statements using different colors.</li>
<li>provides a tree of the ontologies in use and their dependencies.</li>
</ul>
<p><em><strong>Ways to use it:</strong></em></p>
<ul>
<li>a tool from Sindice.com (the inspect tab on the homepage) or the <a href="http://www.sindice.com/developers/inspector">Inspector Homepage</a></li>
<li><a href="http://www.sindice.com/developers/inspector">a Bookmarklet</a> (drag it to your bookmark bar and use while browsing)</li>
<li><a href=" http://sindice.com/developers/inspector/">an API</a> either raw (Any23 output, no reasoning) or with reasoning</li>
<li>to send links to structured data files around, every visualization has its own permalink.</li>
</ul>
<p><strong><em>Examples:</em></strong></p>
<ul>
<li>Sortable Triples with Reasoning Closure (<a href="http://sindice.com/developers/inspector/?url=http://dbpedia.org/resource/Lough_Corrib&amp;doReasoning=true#triples">try</a>)</li>
<li>Ontology Import Tree of Axel Polleres&#8217;s DERI foaf file (<a href="http://sindice.com/developers/inspector?doReasoning=true&#038;url=http%3A%2F%2Fwww.deri.ie%2Ffileadmin%2Fscripts%2Ffoaf.php%3Fid%3D58#ontologies">try</a> notice how the GEO ontology and the DCTerms elements are only imported indirectly via other ontologies but yet contribute to the reasoning)</li>
<li>Graph of the RDF representation of Axel&#8217;s Facebook public profile (from microformats) (<a href="http://sindice.com/developers/inspector/?url=http%3A%2F%2Fwww.facebook.com%2FAxelPolleres%3F_fb_noscript%3D1#graph">try</a>).</li>
<li>All the data in a eventful.com page (<a href="http://sindice.com/developers/inspector/?url=http%3A%2F%2Feventful.com%2Fwinnipeg%2Fevents%2Fmetallica-%2FE0-001-025118008-2#fullcontent">try</a>)</li>
</ul>
<p><strong>B) Sindice Cache API</strong></p>
<p>Tired of your favorite data being offline now and then? convinced that you can&#8217;t really do a linked data application without any network safety net? rejoice <img src='http://blog.sindice.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . With the <a href="http://www.sindice.com/developers/cacheapi">Sindice Cache API</a> you can:</p>
<ul>
<li>access and retrieve any of the currently 64 million RDF sources in Sindice with a simple REST api;</li>
<li>access and retrieve the full set of inferred triples created by Online Data Reasoning (instant access to the precomputed closure, 0 wait time);</li>
<li>visualize the cache with the same handy tools as available in the inspector. Just try from any <a href="http://sindice.com/search?q=galway&amp;qt=term">Sindice result page</a>.</li>
</ul>
<p>Feel free to use Sindice Cache with reasoning as a fallback service when data is not available and as a way to add full recursive ontology importing + reasoning support to your application (with none of the massive pain associated with the full procedure). A document you need is not in the cache? just <a href="http://sindice.com/main/submit">ping the URL</a> in and it will be available within minutes;</p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-</p>
<p><strong>But what is Online Data Reasoning exactly? Background and implementation</strong></p>
<p>When RDF/RDFa data is put out there on the web, the &#8220;<em>explicit</em>&#8221; information given in the data is just part of the story.  Reasoning is a process by which the vocabulary used in the data (Ontologies) are analyzed to extract new pieces of information (otherwise <em>implicit</em>) &#8211; e.g. giving a &#8220;date of birth&#8221; to an &#8220;agent&#8221; would makes that a &#8220;person&#8221;.</p>
<p>Typically, Semantic Web software have manually imported the ontologies they needed. If ontologies are published using W3C best practices, however, it becomes possible to do the entire process automatically: ontological properties can be resolved (i.e. as they are resolvable URIs, they are HTTP fetched) and therefore all can be imported automatically &#8230;</p>
<p>&#8230; but ontologies can import other ontologies, and form circles and so on.</p>
<p>Implementing the process efficiently (with maximal reuse of previous results across files), scalable (with algorithms and frameworks based on Hadoop) and correctly  (e.g. no &#8220;contamination&#8221; of reasoning results between different data files) was not &#8211; believe me &#8211; an easy task.</p>
<p>We invested in it, however, since we believe it is important to answer queries based also on the &#8220;implicit&#8221; part of the document: if not else, it allows the markup on web pages to be considerably more concise.</p>
<p><em>A glimpse behind the curtain</em></p>
<p>The implementation of the large scale reasoning pipeline is based on [1] and engineered on top of the Hadoop platform.</p>
<p>Ontologies in RDF/RDFa documents (but also in microformats which have been converted to RDF in some form) can be &#8220;included&#8221; either explicitly with owl:imports declarations or implicitly by using property and class URIs that link directly to the data describing the ontology itself. As ontologies might refer to other ontologies, the import process then needs to be recursively iterated until the dependency graph of ontologies is complete. For example, in Fig. 1 is shown the dependency graph of ontologies of a document Doc. The document imports a first ontology, A, through the use of the property A#property and a second ontology, B, through the use of the class B#class. In addition, the ontology B is importing with the use of an owl:imports assertion the ontology C.</p>
<div id="attachment_134" class="wp-caption alignnone" style="width: 450px"><img class="size-full wp-image-134" title="dfcs3932_10g2dhkzck_b" src="http://blog.sindice.com/wp-content/uploads/2009/10/dfcs3932_10g2dhkzck_b.png" alt="Fig. 1: A document and its dependency graph of ontologies (in bold) materialised. The dashed circles represent inferred ontological assertions in their own context." width="440" height="216" /><p class="wp-caption-text">Fig. 1: A document and its dependency graph of ontologies (in bold) materialised. The dashed circles represent inferred ontological assertions in their own context.</p></div>
<p>In general, for each document one would have to recursively fetch the ontologies, create a model composed by these the original document and only at this point computing the deductive closure.</p>
<p>Clearly, doing this in isolation for each of the file is bound to be very time consuming and in general inefficient since a lot of processing time will be used to recalculate deductions which could be instead reused for possibly large classes of other documents during the indexing procedure. To reuse previous inference results, a simple strategy has been traditionally to put several (if not all) the ontologies together compute and reuse their deductive closures across all the documents to be indexed. While this simple approach is computationally convenient, it turns out to be sometimes inappropriate, since data publishers can reuse or extend ontology terms with divergent points of view.</p>
<p>For example, if an ontology other than the FOAF vocabulary itself extends foaf:name as an inverse functional property (i.e. transform the name as a primary key), an inferencing agent should not consider this axiom outside the scope of the document that references this particular ontology. Doing so would severely decrease the precision of semantic querying, by diluting the results with many false positives. For this reason, a fundamental requirement of the procedure that we developed has been to confine ontological assertions and reasoning tasks into contexts in order to track provenance of inference results. Coming back to Fig. 1, the inferred ontological assertions are stored in &#8220;virtual contexts&#8221; (dashed circles), and the dashed links symbolise the origin of the inferred assertions, i.e. the set of ontologies that lead to such assertions. By tracking the provenance of each single assertion, we are able to restrict inference to a particular context and prevent one ontology to alter the semantics of other ontologies on a global scale.</p>
<p>This context-dependent <em>Online Data Reasoning</em> mechanism allows Sindice to avoid the deduction of undesirable assertions in documents, a common risk when working with the Web of Data. However, this context mechanism does not restrict the freedom of expression of data publisher. Data publisher are still allowed to reuse and extend ontologies in any manner, but the consequences of their modifications will be confined in their own context, i.e. their published documents, and will not alter the intended semantics of the other documents on the Web.</p>
<p>When all the fragments of data, ontologies and partial reasoning results have been identified, these are all put inside instances of <a href="http://www.ontotext.com/owlim/">OntoText OWLIM</a> where the actual final inference occurs to produce the triples which will then be indexed. Big thanks to all at Ontotext for supporting us with issues when we encountered them.</p>
<p>The technique presented here is mainly focused on the TBox level (ontology level), since it considers only import relations between ontologies as dependency relationships. But the context-dependent reasoning mechanism can be extended to the ABox level. Instead of following import relations, it would be possible for example follow relations such as owl:sameAs between instances.</p>
<p><strong>A bit about the performance </strong></p>
<blockquote><p>from	Robert Fuller [DERI]</p>
<p>date	Wed, Aug 26, 2009 at 11:15 AM</p>
<p>subject	Re: do we have numbers about the latest implementation of the reasoner?</p>
<p>Hi,</p>
<p>Based on the current running of dump splitter, which is processing dbpedia.org</p>
<p>Yesterday we processed 2.5million documents which works out to (just under) 30 per second across 3 hadoop nodes, so that&#8217;s 10 per second on each node. I think there are 4 concurrent jobs running on each node, which works out for a single document 400ms to apply reasoning and update the index and hbase.</p>
<p>I hope this helps.</p>
<p>Rob.</p></blockquote>
<p>It does indeed.</p>
<p><strong>Credits:</strong></p>
<p>Reasoning Services and methodology [1] &#8211; <em>Renaud Delbru, Michele Catasta, Robert Fuller</em></p>
<p>Data extraction &#8211; <em>Any23 library &#8211; http://code.google.com/p/any23/ &#8211; Richard Cyganiak, Jurgen Umbrich, Michele Catasta</em></p>
<p>User interface and frontend programming &#8211; <em>Szymon Danielczyk</em></p>
<p>The Card/Frame and SVG visualization courtesy of <a href="http://rhizomik.net/">http://rhizomik.net/</a> &#8211; thanks especially to Roberto Garcia who supported us during his visit this summer with his wife Rosa and little Victor <img src='http://blog.sindice.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  hi guys.</p>
<p>[1] <em>R. Delbru, A. Polleres, G. Tummarello and S. Decker. Context Dependent Reasoning for Semantic Documents in Sindice. In Proceedings of the 4th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS). Kalrsruhe, Germany, 2008.</em></p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F&amp;title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F&amp;title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F&amp;title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F&amp;headline=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;u=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F&amp;title=New%3A+Inspector%2C+Full+Cache+API+-+all+with+Online+Data+Reasoning&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F10%2F12%2Fnew-inspector-full-cache-api-all-with-online-data-reasoning%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=QS1vYteC7Ew:TVgxe9YFtVE:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=QS1vYteC7Ew:TVgxe9YFtVE:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=QS1vYteC7Ew:TVgxe9YFtVE:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=QS1vYteC7Ew:TVgxe9YFtVE:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=QS1vYteC7Ew:TVgxe9YFtVE:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/QS1vYteC7Ew" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2009/10/12/new-inspector-full-cache-api-all-with-online-data-reasoning/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2009/10/12/new-inspector-full-cache-api-all-with-online-data-reasoning/</feedburner:origLink></item>
		<item>
		<title>Position opening: Senior Engineer at Sindice.com</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/kL1-tSn4mR8/</link>
		<comments>http://blog.sindice.com/2009/08/22/position-opening-senior-engineer-at-sindice-com/#comments</comments>
		<pubDate>Sat, 22 Aug 2009 13:57:51 +0000</pubDate>
		<dc:creator>Giovanni Tummarello</dc:creator>
				<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/2009/08/22/position-opening-senior-engineer-at-sindice-com/</guid>
		<description><![CDATA[The Sindice Team at DERI.ie has now an opening for a senior software engineer to work on the Sindice.com infrastructure (and other related projects http://sig.ma and others of the Data Intensive Infrastructures research group). Working location is Galway, Ireland. We require strong java development skills, proven experience with enterprise development practices and frameworks, great flexibility, [...]]]></description>
			<content:encoded><![CDATA[<p>The Sindice Team at DERI.ie has now an opening for a senior software engineer to work on the Sindice.com infrastructure (and other related projects http://sig.ma and others of the Data Intensive Infrastructures research group). Working location is Galway, Ireland. </p>
<p>We require strong java development skills, proven experience with enterprise development practices and frameworks, great flexibility, pragmatic  problem solving mindset and positive attitude. Knowledge of semantic web technologies, and experience with lucene and hadoop are considered important pluses.</p>
<p>Please write directly to g.tummarelloatgmail.com </p>
<p>thanks for forwarding this<br />
Giovanni</p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F&amp;title=Position+opening%3A+Senior+Engineer+at+Sindice.com" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F&amp;title=Position+opening%3A+Senior+Engineer+at+Sindice.com" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F&amp;title=Position+opening%3A+Senior+Engineer+at+Sindice.com" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F&amp;headline=Position+opening%3A+Senior+Engineer+at+Sindice.com" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;u=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F&amp;title=Position+opening%3A+Senior+Engineer+at+Sindice.com&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F08%2F22%2Fposition-opening-senior-engineer-at-sindice-com%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=kL1-tSn4mR8:N_MG-2VH4Tk:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=kL1-tSn4mR8:N_MG-2VH4Tk:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=kL1-tSn4mR8:N_MG-2VH4Tk:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=kL1-tSn4mR8:N_MG-2VH4Tk:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=kL1-tSn4mR8:N_MG-2VH4Tk:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/kL1-tSn4mR8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2009/08/22/position-opening-senior-engineer-at-sindice-com/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2009/08/22/position-opening-senior-engineer-at-sindice-com/</feedburner:origLink></item>
		<item>
		<title>Sig.ma – Live views on the Web of Data</title>
		<link>http://feeds.sindice.com/~r/SindiceBlog/~3/tK-qrHOdGR8/</link>
		<comments>http://blog.sindice.com/2009/07/22/sigma-live-views-on-the-web-of-data/#comments</comments>
		<pubDate>Wed, 22 Jul 2009 18:40:11 +0000</pubDate>
		<dc:creator>Giovanni Tummarello</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Blogroll]]></category>
		<category><![CDATA[Sigma]]></category>
		<category><![CDATA[Sindice]]></category>

		<guid isPermaLink="false">http://blog.sindice.com/?p=84</guid>
		<description><![CDATA[Today we release Sig.ma, Hurray \o/ ! Sig.ma is a pretty advanced application implemented on top of Sindice which gives a very visual and interactive access to the &#8220;Web of Data&#8221; as a whole.  Best thing to do, really, is watch the screencast. Bear the first 60 seconds where I introduce the Web of Data, [...]]]></description>
			<content:encoded><![CDATA[<p>Today we release <a href="http://sig.ma">Sig.ma</a>, Hurray \o/ !<br />
Sig.ma is a pretty advanced application implemented on top of Sindice which gives a very visual and interactive access to the &#8220;Web of Data&#8221; as a whole.  Best thing to do, really, is watch the screencast. Bear the first 60 seconds where I introduce the Web of Data, it&#8217;s pretty fast after that.</p>
<p><object width="608" height="456" data="http://vimeo.com/moogaloop.swf?clip_id=5703809&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" type="application/x-shockwave-flash"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://vimeo.com/moogaloop.swf?clip_id=5703809&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" /></object></p>
<p>While the demo is probably.. agreably cool <img src='http://blog.sindice.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  there is more to talk about.</p>
<p>While Sig.ma is by no mean the first data aggregator for the Semantic Web, its contribution is to show that the sum is really bigger than the single parts and exciting possibilities lie in a holistic approach for automatic semistructured data discovery and consolidation.</p>
<p>In Sig.ma, elements such as large scale semantic web indexing, logic reasoning, data aggregation heuristics, pragmatic ontology alignments and, last but not least, user interaction and refinement, all play together to provide entity descriptions which become live, embeddable data mash ups.</p>
<p>An interesting example:</p>
<p>when we first saw the B&amp;W pictures (e.g. see the demo ) pop up automatically the first time we ran Sigma we were really excited: that DERI data had been there forever yet never meaningfully used or integrated.. let alone automatically! That <a href="http://www.deri.ie/fileadmin/images/elements/foaf.gif">DERI RDF</a> file does no reuse the right URI for people , doesn&#8217;t use Inverse Functional Properties such as &#8220;emails&#8221;, and  uses only one of many ways to say &#8220;author&#8221;.</p>
<p>But here it was! That file was there, discovered automatically and  contributing marvelously to the mashup providing information about papers,  (including technical reports that would not be listed otherwise) an extra picture, the phone number, a confirmation of the personal homepage, research projects and more.</p>
<p>Note: this doesn&#8217;t mean that the DERI file is bad at all actually. It&#8217;s simply <em>not unrealistically great,</em> in other words it was created with a realistic effort, the same that we can expect from any data publisher.</p>
<p>There was no way to get that very useful data with classic  Semantic Web inference and rule consolidation alone. All it took was instead the <em>mix</em> of semantic web practices and tricks with pragmatic and elements of soft computing (quite basic indeed).</p>
<p>In our opinion it all makes sense and inspires the following thoughts:</p>
<ol>
<li>A little semantic might in fact go a long way:  no way there could be something comparable to Sig.ma had we not had a large core of semantically structured data (the Web of Data itself). Publish way more please!  Be this in whatever format can be consolidated to RDF.</li>
<li>&#8230; it goes in fact even more a long way when the user is involved, and can with pragmatic actions (e.g. &#8220;reject&#8221; or &#8220;approve&#8221;) to  steer and validate the results.</li>
<li>For data publishers: just like on the HTML web you can simply care only about your site.  If you <em>don&#8217;t</em> reuse other people URIs or you <em>don&#8217;t</em> put &#8220;sameAs&#8221; links or you <em>don&#8217;t</em> really use the ontology everyone else is using then..  it can work all the same most likely and for most applications!</li>
<li>.. but <em>overdescribe</em><br />
Be verbose with your semantic descriptions,  more than what you would be for a human. A well described entity will be the best possible &#8220;entity&#8221; identifier that one could think. It will <em>automatically generate</em> invisible but robust links to others entity descriptions. So dont just write name = fooguy, make sure you expose all you have (and are willing to share) and let aggregation engines use this data to at least do the best consolidation possible.  Good descriptions will also make you show more often in semantic aggregations, foster new applications and make people more likely to integrate with you.</li>
<li>For data consumers:<strong> </strong>We are working for you really and willing to do the hard work.<br />
This is again very similar to the HTML world. How difficult is to make sense of all the broken HTML out there? Very! How many people have to do it really? just a few, the browser makers.  Others can reuse their efforts and concentrate on other aspects. Sig.ma and Sindice are engines that do the hard part for you as a Web of Data developer.  We provide open services and open source components (heck, at the end of the week we&#8217;re even releasing our index open source <img src='http://blog.sindice.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  , next the reasoning engine).  If there is interest and market others will come and there will be more choice</li>
</ol>
<p>So let me conclude with a good-fortune  Sig.ma of Stefan Decker <img src='http://blog.sindice.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  (50 sources, sigma &#8220;Stefan Decker&#8221; + add info &#8220;Stefan Decker DERI&#8221;, with a  couple of manual sources added or deleted)</p>
<p><script src="http://sig.ma/js/sigma-widget.js"  type="text/javascript"></script><br />
<script type="text/javascript" sigma="true" >
       <!--
       createSigma("5f0cb05349ba2c8270d0de58c243b3ef",{width:600,height:400});
       // the config object (with dimensions) is optional
       //-->
</script></p>
<p>And the rest follows from the small FAQ in the Sig.ma about page. Cheers!</p>
<p><strong>Why is this potentially revolutionary?</strong></p>
<p class="justify">As appropriate data 					sources become available (pages annotated with RDFa or Microformats), 					Sig.ma is in a different league in terms of information richness and 					precision compared to methods solely based on web text analysis.</p>
<p class="justify">Sig.ma can be used by 					humans and software agents alike to obtain structured data about any 					entity.</p>
<h3 class="western">Is Sigma noise free?</h3>
<p class="justify">Not yet. Sig..ma still 					employs heuristics for many aspects and has to deal with heterogeneous 					data in the current Web of Data – a very early stage environment! What 					we can say however is:</p>
<ol>
<li>Sig.ma is interactive 						and can learn from its usage: when a user deletes a piece of 						information or a source, Sigma writes it down and that piece of 						information is less likely to show back at a later time.</li>
<li>We have deliberately 						chosen very simple strategies at this point to test the general idea 						more than advanced strategies: the potential for improvement is 						tremendous.</li>
<li>The Web of Data 						itself is very new: until very recently there was basically no way to 						see this data in action and markup has been done on a best 						effort-hacker enthusiastic-leap of faith way. Now that Google and Yahoo 						are starting to recognize the value of page markup, it is realistic to 						expect improvements in data coverage and quality.</li>
</ol>
<h3 class="western">Why does my phone number/picture/favourite 					movie not appear?</h3>
<p>Pages exposing RDF, 					RDFa or Microformats will appear. If you or your company want 					information to be found on the web of data, it is very simple to mark up 					your HTML using RDFa, then submit it to Sindice. You will find it 					returned by Sig.ma within 10-15 minutes.</p>
<h3 class="western">How is Sig.ma built? Can I build applications 					like Sig.ma?</h3>
<p>Sig.ma is enabled by 					Sindice, an index of the web of data. Thanks to Sindice, Sig.ma can 					accurately locate sources of web data using not only text but also 					precise attribute value searches and more. Sindice is alive and growing, 					constantly finding new information, receiving “pings” and immediately 					adding new documents etc. Where to start? Please write on our forum.</p>
<h3 class="western">Acknowledgements</h3>
<p><em>Sig.ma and Sindice are built at </em> <span style="text-decoration: underline;"><a href="http://deri.ie/"><em>DERI</em></a></span> <em>mainly within the OKKaM Project (ICT-215032) but also with the support of the </em> <span style="text-decoration: underline;"><a href="http://www.sfi.ie/"><em>Science Foundation Ireland</em></a></span> <em>under Grant No. SFI/02/CE1/I131, of the </em> <span style="text-decoration: underline;"><a href="http://www.ict-romulus.eu/home"><em>ROMULUS project</em></a></span><em>(ICT-217031) and the <a href="http://imp-project.eu/">iMP</a> project.</em></p>
<p><em>R&amp;D by  <span style="text-decoration: underline;"><a href="http://www.deri.ie/about/team/member/michele_catasta/">Michele Catasta</a></span>,  					   <span style="text-decoration: underline;"><a href="http://www.deri.ie/about/team/member/richard_cyganiak/">Richard Cyganiak</a></span>,  					   <span style="text-decoration: underline;"><a href="http://www.deri.ie/about/team/member/szymon_danielczyk/">Szymon Danielczyk</a></span> and  					   <span style="text-decoration: underline;"><a href="http://www.deri.ie/about/team/member/giovanni_tummarello/">Giovanni Tummarello.</a></span> </em></p>
<div class="lightsocial_container"><a class="lightsocial_a" href="http://digg.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F&amp;title=Sig.ma+-+Live+views+on+the+Web+of+Data" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/digg.png" alt="Digg This" title="Digg This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.reddit.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F&amp;title=Sig.ma+-+Live+views+on+the+Web+of+Data" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/reddit.png" alt="Reddit This" title="Reddit This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F&amp;title=Sig.ma+-+Live+views+on+the+Web+of+Data" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/stumbleupon.png" alt="Stumble Now!" title="Stumble Now!" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://buzz.yahoo.com/buzz?targetUrl=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F&amp;headline=Sig.ma+-+Live+views+on+the+Web+of+Data" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/yahoo_buzz.png" alt="Buzz This" title="Buzz This" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dzone.com/links/add.html?title=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dzone.png" alt="Vote on DZone" title="Vote on DZone" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.facebook.com/sharer.php?t=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;u=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/facebook.png" alt="Share on Facebook" title="Share on Facebook" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://delicious.com/save?title=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/delicious.png" alt="Bookmark this on Delicious" title="Bookmark this on Delicious" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.dotnetkicks.com/kick/?title=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetkicks.png" alt="Kick It on DotNetKicks.com" title="Kick It on DotNetKicks.com" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://dotnetshoutout.com/Submit?title=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/dotnetshoutout.png" alt="Shout it" title="Shout it" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.linkedin.com/shareArticle?mini=true&amp;url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F&amp;title=Sig.ma+-+Live+views+on+the+Web+of+Data&amp;summary=&amp;source=" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/linkedin.png" alt="Share on LinkedIn" title="Share on LinkedIn" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.technorati.com/faves?add=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/technorati.png" alt="Bookmark this on Technorati" title="Bookmark this on Technorati" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://twitter.com/home?status=Reading+http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/twitter.png" alt="Post on Twitter" title="Post on Twitter" /></a>&nbsp;&nbsp;<a class="lightsocial_a" href="http://www.google.com/buzz/post?url=http%3A%2F%2Fblog.sindice.com%2F2009%2F07%2F22%2Fsigma-live-views-on-the-web-of-data%2F" ><img class="lightsocial_img" src="http://blog.sindice.com/wp-content/plugins/light-social/google_buzz.png" alt="Google Buzz (aka. Google Reader)" title="Google Buzz (aka. Google Reader)" /></a>&nbsp;&nbsp;</div><div class="feedflare">
<a href="http://feeds.sindice.com/~ff/SindiceBlog?a=tK-qrHOdGR8:WXl9T8NloRs:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=tK-qrHOdGR8:WXl9T8NloRs:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=tK-qrHOdGR8:WXl9T8NloRs:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.sindice.com/~ff/SindiceBlog?a=tK-qrHOdGR8:WXl9T8NloRs:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/SindiceBlog?i=tK-qrHOdGR8:WXl9T8NloRs:D7DqB2pKExk" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/SindiceBlog/~4/tK-qrHOdGR8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.sindice.com/2009/07/22/sigma-live-views-on-the-web-of-data/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://blog.sindice.com/2009/07/22/sigma-live-views-on-the-web-of-data/</feedburner:origLink></item>
	</channel>
</rss>
