<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Ivan’s private site</title>
	<atom:link href="http://ivan-herman.name/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://ivan-herman.name</link>
	<description></description>
	<lastBuildDate>Thu, 26 Jan 2012 14:40:55 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>Comment on Nice reading on Semantic Search by Aidan Hogan</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7228</link>
		<dc:creator><![CDATA[Aidan Hogan]]></dc:creator>
		<pubDate>Thu, 26 Jan 2012 14:40:55 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7228</guid>
		<description><![CDATA[&gt; Wow! This is a blog post by itself, not just a comment:-) But thanks!

If I had more time, I would&#039;ve made it shorter. :)

&gt; Isn’t it correct that if you forego symmetric claims then, formally, you do not follow the precise owl:sameAs semantics? 

I guess the distinction between soundness and completeness is important here. By dropping the symmetry of claims, you do follow the precise owl:sameAs semantics, but only partially: as a consumer, you only use the part that you can trust, e.g, based on authoritative considerations.

&gt; In other words, it is exactly for the possibility of expressing non-symmetric identifications that new predicates might be needed, to allow you to do that while staying within the definitions…

Reminds me of an analogous discussion about the use of owl:equivalentProperty and owl:equivalentClass relations given by FOAF between foaf:maker/dct:creator and foaf:Agent/dct:Agent. The discussion is very much related to the symmetry of claims for owl:sameAs:

      http://groups.google.com/group/pedantic-web/browse_thread/thread/ed643ec5f391e053?pli=1

At the time, I didn&#039;t particularly like that FOAF was mandating new inferences for DC data... e.g.,importing all instances of dct:Agent into foaf:Agent. I was arguing that this should be changed, on principle, to the non-symmetric &quot;foaf:Agent rdfs:subClassOf dct:Agent&quot;, with a reciprocal link from DC if desired. Dan pointed out that from his point of view, dct:Agent and foaf:Agent were equivalent classes, and why shouldn&#039;t he be allowed to make that claim in his vocabulary? This was really difficult to argue against (although I tried). I came away with the belief that publishers should be allowed to claim what they want (they will anyways). It&#039;s largely up to consumers then to disentangle these claims (e.g., using authoritative reasoning or similar).

Of course, having a vocabulary of owl:sameAs with symmetry&#124;transitivity&#124;reflexivity turned off can&#039;t be a bad thing: publishers have more properties to make claims with. Towards official support and evangelism, the open question I have is what *claim* is being made with, e.g., a non-symmetric owl:sameAs? From an inference/trust/imperative point of view, I can see the effect (turn off the &quot;eq-ref&quot;/&quot;eq-sym&quot; rules for this property). From a declarative point of view, what does this claim actually *mean* though, and how would it be described in English? For example, what two real-world entities would the non-symmetric claim hold against (that full blown equality wouldn&#039;t hold for)? And, more importantly, does this actually solve the root problem? (For example, what if publishers ignore the properties and continue to use industrial-strength owl:sameAs?)

In summary, I still think it&#039;s first a consumer-side problem, second a publisher-side problem.]]></description>
		<content:encoded><![CDATA[<p>&gt; Wow! This is a blog post by itself, not just a comment:-) But thanks!</p>
<p>If I had more time, I would&#8217;ve made it shorter. <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>&gt; Isn’t it correct that if you forego symmetric claims then, formally, you do not follow the precise owl:sameAs semantics? </p>
<p>I guess the distinction between soundness and completeness is important here. By dropping the symmetry of claims, you do follow the precise owl:sameAs semantics, but only partially: as a consumer, you only use the part that you can trust, e.g, based on authoritative considerations.</p>
<p>&gt; In other words, it is exactly for the possibility of expressing non-symmetric identifications that new predicates might be needed, to allow you to do that while staying within the definitions…</p>
<p>Reminds me of an analogous discussion about the use of owl:equivalentProperty and owl:equivalentClass relations given by FOAF between foaf:maker/dct:creator and foaf:Agent/dct:Agent. The discussion is very much related to the symmetry of claims for owl:sameAs:</p>
<p>      <a href="http://groups.google.com/group/pedantic-web/browse_thread/thread/ed643ec5f391e053?pli=1" rel="nofollow">http://groups.google.com/group/pedantic-web/browse_thread/thread/ed643ec5f391e053?pli=1</a></p>
<p>At the time, I didn&#8217;t particularly like that FOAF was mandating new inferences for DC data&#8230; e.g.,importing all instances of dct:Agent into foaf:Agent. I was arguing that this should be changed, on principle, to the non-symmetric &#8220;foaf:Agent rdfs:subClassOf dct:Agent&#8221;, with a reciprocal link from DC if desired. Dan pointed out that from his point of view, dct:Agent and foaf:Agent were equivalent classes, and why shouldn&#8217;t he be allowed to make that claim in his vocabulary? This was really difficult to argue against (although I tried). I came away with the belief that publishers should be allowed to claim what they want (they will anyways). It&#8217;s largely up to consumers then to disentangle these claims (e.g., using authoritative reasoning or similar).</p>
<p>Of course, having a vocabulary of owl:sameAs with symmetry|transitivity|reflexivity turned off can&#8217;t be a bad thing: publishers have more properties to make claims with. Towards official support and evangelism, the open question I have is what *claim* is being made with, e.g., a non-symmetric owl:sameAs? From an inference/trust/imperative point of view, I can see the effect (turn off the &#8220;eq-ref&#8221;/&#8221;eq-sym&#8221; rules for this property). From a declarative point of view, what does this claim actually *mean* though, and how would it be described in English? For example, what two real-world entities would the non-symmetric claim hold against (that full blown equality wouldn&#8217;t hold for)? And, more importantly, does this actually solve the root problem? (For example, what if publishers ignore the properties and continue to use industrial-strength owl:sameAs?)</p>
<p>In summary, I still think it&#8217;s first a consumer-side problem, second a publisher-side problem.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Nice reading on Semantic Search by Ivan Herman</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7227</link>
		<dc:creator><![CDATA[Ivan Herman]]></dc:creator>
		<pubDate>Thu, 26 Jan 2012 10:03:28 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7227</guid>
		<description><![CDATA[Wow! This is a blog post by itself, not just a comment:-) But thanks!

I think the only area where either we are in disagreement or I am not sure I understand what you say is your last section, referring to the handling of owl:sameAs. Isn&#039;t it correct that if you forego symmetric claims then, formally, you do not follow the precise owl:sameAs semantics? Of course it is perfectly for you to do that but, at least for my taste, this is not entirely ok. In other words, it is exactly for the possibility of expressing non-symmetric identifications that new predicates might be needed, to allow you to do that while staying within the definitions…

Of course, as I say, the issue might be that this train is already gone, and there are too many owl:sameAs out there.]]></description>
		<content:encoded><![CDATA[<p>Wow! This is a blog post by itself, not just a comment:-) But thanks!</p>
<p>I think the only area where either we are in disagreement or I am not sure I understand what you say is your last section, referring to the handling of owl:sameAs. Isn&#8217;t it correct that if you forego symmetric claims then, formally, you do not follow the precise owl:sameAs semantics? Of course it is perfectly for you to do that but, at least for my taste, this is not entirely ok. In other words, it is exactly for the possibility of expressing non-symmetric identifications that new predicates might be needed, to allow you to do that while staying within the definitions…</p>
<p>Of course, as I say, the issue might be that this train is already gone, and there are too many owl:sameAs out there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Nice reading on Semantic Search by Aidan Hogan</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7223</link>
		<dc:creator><![CDATA[Aidan Hogan]]></dc:creator>
		<pubDate>Wed, 25 Jan 2012 19:58:36 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7223</guid>
		<description><![CDATA[Many thanks for reading our (lengthy) paper. The positive feedback is really greatly appreciated! 

I had planned on a short comment, but almost all of the questions you&#039;ve raised have been on the forefronts of our minds over the past while, so . . .

&gt; that may be a good input for a possible Linked Data profile (although the differences are really minor, if one looks at the appendix of the paper that lists the rule sets the engine uses)

We&#039;ve recently been looking into what kinds of RDFS and OWL primitives are being used in popular Linked Data vocabularies. Our (somewhat coarse) high-level observation is that RDFS/OWL features not requiring blank-nodes to express in the RDF mapping (e.g., sub-class, inverse-of, TransitiveProperty, etc.) are much more prominently used than those requiring blank-nodes (e.g., those requiring lists like owl:intersectionOf, or restrictions like owl:allValuesFrom, etc.). The pattern is quite evident, but as to what it means in a practical sense, we&#039;re not sure. Hopefully we&#039;ll have a paper on this soon.

&gt; the experiences of major Semantic Web search engines on handling Linked Data might provide a great set of input for such pragmatic work.

We feel that our primary contribution in SWSE is not the systems or the techniques, but rather a better understanding of how Linked Data works: empirically &quot;putting it though it&#039;s paces&quot; so-to-speak. I&#039;m sure that others working on similar engines also have a good feel for the inner workings of Linked Data. We&#039;ve spent many hours/days/weeks debugging Web datasets to explain strange crawling behaviour, or strange consolidation results, or strange reasoning results, etc. This was part of the inspiration of the &quot;Pedantic Web Group&quot;, although the mailing lists have been quiet of late. It seems that more feedback loops are needed between the consumers and producers of Linked Data.

&gt; A great deal of work had to be done in SWSE on the proper handling of owl:sameAs. ... On the other hand, one of the recurring discussions on various mailing list and elsewhere is on whether the usage of this property is semantically o.k. or not (see, e.g., [3]). 

Yep, the owl:sameAs issue is a priority issue for engines like SWSE, FalconS, Watson, Swoogle, Sindice, FactForge, etc.: if you can&#039;t merge (or at least link) data on a specific entity from multiple sources with a high degree of confidence, then the rest of the system design becomes academic. In an immediate sense, owl:sameAs reasoning far surpasses the importance of generic reasoning (e.g., more complete OWL 2 RL) for search. Some of the guys from the FalconS group have been looking into this in more detail, particularly generating owl:sameAs links using methods beyond deductive reasoning:

	http://www.springerlink.com/content/c21t3w21870t38k7/
	http://dl.acm.org/citation.cfm?doid=1963405.1963421

The latter paper is interesting for applying inductive methods to the problem, which seems promising. We&#039;ve also just wrapped up a more detailed (and lengthy) paper on owl:sameAs using the SWSE results as a baseline:

	http://aidanhogan.com/docs/entcons_jws_final.pdf

Related to your comment, some conclusions from the paper were that the published owl:sameAs relations sampled from our dataset were perhaps not as bad as Halpin et al. found. We applied a much simpler (linear) sampling and judging methodology (we crunched through 1,000 relations with one opinion each) than Halpin&#039;s paper, but found that ~97% of sampled owl:sameAs were deemed &quot;not to be problematic&quot;. We were probably not as picky as Halpin&#039;s judges: our core criteria was essentially, &quot;to our trained eyes, would the merged entity look strange in SWSE?&quot;. Some of the inspection results are up here:

	http://aidanhogan.com/econs/bl/

(Many of the results are trivial given that one or the other URI have insufficient data to actually cause a problem when merging.)

Also, we found that using OWL 2 RL rules to *infer* owl:sameAs doesn&#039;t really buy you all that much. Primarily, owl:InverseFunctionalProperty gives a lot of raw equivalences, but almost all are between blank-nodes on the same domain as per the old FOAF &quot;smushing&quot; practice (you need a long black-list of nonsense values for homepages and email-addresses to make this feasible). owl:FunctionalProperty ekes out a few more new owl:sameAs relations. Again though, as opposed to explicit owl:sameAs triples, which give rich relations between URIs across domains, the additional OWL 2 RL inferences are almost entirely composed of owl:sameAs relations between blank-nodes *on the same site*, which are simply not as interesting. Similar results are echoed by the FalconS guys in their work. 

We also looked at using inconsistency rules from OWL 2 RL to detect and repair cases where owl:sameAs goes awry, with limited success: current vocabularies are unfortunately not very detailed in terms of axiomatising things they consider to be nonsense, like disjoint classes, properties, etc. Which is a real pity. Linked Data needs more inconsistency!!

&gt; A possible alternative would be to define (beyond owl:sameAs) a set of properties borrowed from the SKOS Recommendation, like closeMatch, exactMatch, broadMatch, etc.

I&#039;m not entirely convinced that this is *needed* (yet).

Our current sketchy idea on robust use of owl:sameAs for Web data is to have different views for each individual entity, where for owl:sameAs, we would allow &quot;transitive claims&quot;, but forego &quot;symmetric claims&quot; and only merge entities where reciprocal, authoritative links exist. This will imply different views for a given entity, a bit like considering owl:sameAs as a transitive &quot;imports&quot; property. So if we find the triple &quot;A sameAs B&quot; in a location authoritative for A, the entity-view for A will include data for B, but not vice-versa. If we also find &quot;B sameAs C&quot; in a document authoritative for B, the view for A and the view for B will include the data for C also. If we also find the triple &quot;B sameAs A&quot;, we can merge A and B such that they become one view. The core principle here is that a publisher has complete control over its local entities: they can opt in or opt out of what data its entities are viewed/merged with by dropping the pertinent owl:sameAs relation(s) in the local document. It would not be difficult to then support selecting and traversing these different views for a given entity in a UI.

Importantly, this doesn&#039;t require a new property or vocabulary for publishers, but rather a refined/selective/more robust interpretation of owl:sameAs by consumers. In general, when reasoning over Web data, it&#039;s more a case of disentangling who says what and where, as opposed to interpreting everything you see as formal truth. Obviously the burden is not (only) on publishers to get it right, but also on consumers to not implode when they get it wrong.

&gt; Thanks to the authors!

And thanks for the encouraging feedback and interesting discussion!]]></description>
		<content:encoded><![CDATA[<p>Many thanks for reading our (lengthy) paper. The positive feedback is really greatly appreciated! </p>
<p>I had planned on a short comment, but almost all of the questions you&#8217;ve raised have been on the forefronts of our minds over the past while, so . . .</p>
<p>&gt; that may be a good input for a possible Linked Data profile (although the differences are really minor, if one looks at the appendix of the paper that lists the rule sets the engine uses)</p>
<p>We&#8217;ve recently been looking into what kinds of RDFS and OWL primitives are being used in popular Linked Data vocabularies. Our (somewhat coarse) high-level observation is that RDFS/OWL features not requiring blank-nodes to express in the RDF mapping (e.g., sub-class, inverse-of, TransitiveProperty, etc.) are much more prominently used than those requiring blank-nodes (e.g., those requiring lists like owl:intersectionOf, or restrictions like owl:allValuesFrom, etc.). The pattern is quite evident, but as to what it means in a practical sense, we&#8217;re not sure. Hopefully we&#8217;ll have a paper on this soon.</p>
<p>&gt; the experiences of major Semantic Web search engines on handling Linked Data might provide a great set of input for such pragmatic work.</p>
<p>We feel that our primary contribution in SWSE is not the systems or the techniques, but rather a better understanding of how Linked Data works: empirically &#8220;putting it though it&#8217;s paces&#8221; so-to-speak. I&#8217;m sure that others working on similar engines also have a good feel for the inner workings of Linked Data. We&#8217;ve spent many hours/days/weeks debugging Web datasets to explain strange crawling behaviour, or strange consolidation results, or strange reasoning results, etc. This was part of the inspiration of the &#8220;Pedantic Web Group&#8221;, although the mailing lists have been quiet of late. It seems that more feedback loops are needed between the consumers and producers of Linked Data.</p>
<p>&gt; A great deal of work had to be done in SWSE on the proper handling of owl:sameAs. &#8230; On the other hand, one of the recurring discussions on various mailing list and elsewhere is on whether the usage of this property is semantically o.k. or not (see, e.g., [3]). </p>
<p>Yep, the owl:sameAs issue is a priority issue for engines like SWSE, FalconS, Watson, Swoogle, Sindice, FactForge, etc.: if you can&#8217;t merge (or at least link) data on a specific entity from multiple sources with a high degree of confidence, then the rest of the system design becomes academic. In an immediate sense, owl:sameAs reasoning far surpasses the importance of generic reasoning (e.g., more complete OWL 2 RL) for search. Some of the guys from the FalconS group have been looking into this in more detail, particularly generating owl:sameAs links using methods beyond deductive reasoning:</p>
<p>	<a href="http://www.springerlink.com/content/c21t3w21870t38k7/" rel="nofollow">http://www.springerlink.com/content/c21t3w21870t38k7/</a><br />
	<a href="http://dl.acm.org/citation.cfm?doid=1963405.1963421" rel="nofollow">http://dl.acm.org/citation.cfm?doid=1963405.1963421</a></p>
<p>The latter paper is interesting for applying inductive methods to the problem, which seems promising. We&#8217;ve also just wrapped up a more detailed (and lengthy) paper on owl:sameAs using the SWSE results as a baseline:</p>
<p>	<a href="http://aidanhogan.com/docs/entcons_jws_final.pdf" rel="nofollow">http://aidanhogan.com/docs/entcons_jws_final.pdf</a></p>
<p>Related to your comment, some conclusions from the paper were that the published owl:sameAs relations sampled from our dataset were perhaps not as bad as Halpin et al. found. We applied a much simpler (linear) sampling and judging methodology (we crunched through 1,000 relations with one opinion each) than Halpin&#8217;s paper, but found that ~97% of sampled owl:sameAs were deemed &#8220;not to be problematic&#8221;. We were probably not as picky as Halpin&#8217;s judges: our core criteria was essentially, &#8220;to our trained eyes, would the merged entity look strange in SWSE?&#8221;. Some of the inspection results are up here:</p>
<p>	<a href="http://aidanhogan.com/econs/bl/" rel="nofollow">http://aidanhogan.com/econs/bl/</a></p>
<p>(Many of the results are trivial given that one or the other URI have insufficient data to actually cause a problem when merging.)</p>
<p>Also, we found that using OWL 2 RL rules to *infer* owl:sameAs doesn&#8217;t really buy you all that much. Primarily, owl:InverseFunctionalProperty gives a lot of raw equivalences, but almost all are between blank-nodes on the same domain as per the old FOAF &#8220;smushing&#8221; practice (you need a long black-list of nonsense values for homepages and email-addresses to make this feasible). owl:FunctionalProperty ekes out a few more new owl:sameAs relations. Again though, as opposed to explicit owl:sameAs triples, which give rich relations between URIs across domains, the additional OWL 2 RL inferences are almost entirely composed of owl:sameAs relations between blank-nodes *on the same site*, which are simply not as interesting. Similar results are echoed by the FalconS guys in their work. </p>
<p>We also looked at using inconsistency rules from OWL 2 RL to detect and repair cases where owl:sameAs goes awry, with limited success: current vocabularies are unfortunately not very detailed in terms of axiomatising things they consider to be nonsense, like disjoint classes, properties, etc. Which is a real pity. Linked Data needs more inconsistency!!</p>
<p>&gt; A possible alternative would be to define (beyond owl:sameAs) a set of properties borrowed from the SKOS Recommendation, like closeMatch, exactMatch, broadMatch, etc.</p>
<p>I&#8217;m not entirely convinced that this is *needed* (yet).</p>
<p>Our current sketchy idea on robust use of owl:sameAs for Web data is to have different views for each individual entity, where for owl:sameAs, we would allow &#8220;transitive claims&#8221;, but forego &#8220;symmetric claims&#8221; and only merge entities where reciprocal, authoritative links exist. This will imply different views for a given entity, a bit like considering owl:sameAs as a transitive &#8220;imports&#8221; property. So if we find the triple &#8220;A sameAs B&#8221; in a location authoritative for A, the entity-view for A will include data for B, but not vice-versa. If we also find &#8220;B sameAs C&#8221; in a document authoritative for B, the view for A and the view for B will include the data for C also. If we also find the triple &#8220;B sameAs A&#8221;, we can merge A and B such that they become one view. The core principle here is that a publisher has complete control over its local entities: they can opt in or opt out of what data its entities are viewed/merged with by dropping the pertinent owl:sameAs relation(s) in the local document. It would not be difficult to then support selecting and traversing these different views for a given entity in a UI.</p>
<p>Importantly, this doesn&#8217;t require a new property or vocabulary for publishers, but rather a refined/selective/more robust interpretation of owl:sameAs by consumers. In general, when reasoning over Web data, it&#8217;s more a case of disentangling who says what and where, as opposed to interpreting everything you see as formal truth. Obviously the burden is not (only) on publishers to get it right, but also on consumers to not implode when they get it wrong.</p>
<p>&gt; Thanks to the authors!</p>
<p>And thanks for the encouraging feedback and interesting discussion!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Nice reading on Semantic Search by Ronald Poell</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7222</link>
		<dc:creator><![CDATA[Ronald Poell]]></dc:creator>
		<pubDate>Wed, 25 Jan 2012 08:58:31 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7222</guid>
		<description><![CDATA[&gt; A possible alternative would be to define (beyond owl:sameAs) a set of properties borrowed from the SKOS Recommendation, like closeMatch, exactMatch, broadMatch, etc.

I have always recommended that registered info should correspond as much as possible to reality. I.e. if two things ARE not the same, owl:sameAs should not be used. If during this fase the transformation from reality to the formalization deviates too much from reality, you loose information and you will never be able to get it back without changing the created formalization.

If a user wants (at exploitation level) results to include broader, narrower, similar etceteras relationships, local query expansion for the predicates can be easily done.

Of course there are numerous cases where there is no clear conceptual border between 2 or more concepts (ex. colors) but we can use &quot;cluster&quot; concepts grouping all variants (ex &quot;red&quot; or &quot;red colors&quot;). These clusters might contain personal flavors that might contradict each other (ex reddish-green and a greenish-red might in physical terms (RGB) be the same color but considered &quot;green&quot; for one group of persons and &quot;red&quot; for another).

Roughly spoken: let predicates proliferate, let search engines do the heavy job of expanding the queries. I did some work on similarity recommandation (Pamela - Personal agent for mapping elements that look alike) and this can be done.

Regards,

Ronald Poell]]></description>
		<content:encoded><![CDATA[<p>&gt; A possible alternative would be to define (beyond owl:sameAs) a set of properties borrowed from the SKOS Recommendation, like closeMatch, exactMatch, broadMatch, etc.</p>
<p>I have always recommended that registered info should correspond as much as possible to reality. I.e. if two things ARE not the same, owl:sameAs should not be used. If during this fase the transformation from reality to the formalization deviates too much from reality, you loose information and you will never be able to get it back without changing the created formalization.</p>
<p>If a user wants (at exploitation level) results to include broader, narrower, similar etceteras relationships, local query expansion for the predicates can be easily done.</p>
<p>Of course there are numerous cases where there is no clear conceptual border between 2 or more concepts (ex. colors) but we can use &#8220;cluster&#8221; concepts grouping all variants (ex &#8220;red&#8221; or &#8220;red colors&#8221;). These clusters might contain personal flavors that might contradict each other (ex reddish-green and a greenish-red might in physical terms (RGB) be the same color but considered &#8220;green&#8221; for one group of persons and &#8220;red&#8221; for another).</p>
<p>Roughly spoken: let predicates proliferate, let search engines do the heavy job of expanding the queries. I did some work on similarity recommandation (Pamela &#8211; Personal agent for mapping elements that look alike) and this can be done.</p>
<p>Regards,</p>
<p>Ronald Poell</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Nice reading on Semantic Search by caos! blog &#187; Semantic Web &#8211; Zemanta Experiment</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7221</link>
		<dc:creator><![CDATA[caos! blog &#187; Semantic Web &#8211; Zemanta Experiment]]></dc:creator>
		<pubDate>Wed, 25 Jan 2012 02:26:06 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7221</guid>
		<description><![CDATA[[...] Nice reading on Semantic Search (ivan-herman.name)       comente! [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Nice reading on Semantic Search (ivan-herman.name)       comente! [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Nice reading on Semantic Search by Andreas Gebhard</title>
		<link>http://ivan-herman.name/2012/01/24/nice-reading-on-semantic-search/#comment-7217</link>
		<dc:creator><![CDATA[Andreas Gebhard]]></dc:creator>
		<pubDate>Tue, 24 Jan 2012 19:07:45 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=835#comment-7217</guid>
		<description><![CDATA[Others are much more qualified than I am to speak to the core questions you raise in this post. Just this much:

&gt; Or is it already too late, because the ubiquitous usage of owl:sameAs is already so prevalent that it is not worth touching that stuff?

Being relatively new to the party, I don&#039;t have any history in this discussion. What I think I can safely say is that in the grand scheme of things (i.e., the Internet as a whole), neither owl:sameAs nor any of the SKOS properties are really &quot;prevalent&quot; in the classic definition of that word. So why not tackle this task as you suggest (provided these alternatives actually *do* provide better data for search). 

We should never rule out scrapping years (or even decades) of work in favor of something new -- if it promises to be better.]]></description>
		<content:encoded><![CDATA[<p>Others are much more qualified than I am to speak to the core questions you raise in this post. Just this much:</p>
<p>&gt; Or is it already too late, because the ubiquitous usage of owl:sameAs is already so prevalent that it is not worth touching that stuff?</p>
<p>Being relatively new to the party, I don&#8217;t have any history in this discussion. What I think I can safely say is that in the grand scheme of things (i.e., the Internet as a whole), neither owl:sameAs nor any of the SKOS properties are really &#8220;prevalent&#8221; in the classic definition of that word. So why not tackle this task as you suggest (provided these alternatives actually *do* provide better data for search). </p>
<p>We should never rule out scrapping years (or even decades) of work in favor of something new &#8212; if it promises to be better.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Mac OS Lion: The Good, the Bad and the Ugly by Ruben Verborgh</title>
		<link>http://ivan-herman.name/2011/12/30/mac-os-lion-the-good-the-bad-and-the-ugly/#comment-7129</link>
		<dc:creator><![CDATA[Ruben Verborgh]]></dc:creator>
		<pubDate>Tue, 03 Jan 2012 10:56:42 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=824#comment-7129</guid>
		<description><![CDATA[I got my MacBook 1.5 years ago and I&#039;m still doubting whether or not to upgrade to Lion. However, a faster Mail is one of the things that could convince me, since it&#039;s an important workhorse for me as well. Maybe I should consider a New Year&#039;s present for myself :)]]></description>
		<content:encoded><![CDATA[<p>I got my MacBook 1.5 years ago and I&#8217;m still doubting whether or not to upgrade to Lion. However, a faster Mail is one of the things that could convince me, since it&#8217;s an important workhorse for me as well. Maybe I should consider a New Year&#8217;s present for myself <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Where we are with RDFa 1.1? by Ivan Herman</title>
		<link>http://ivan-herman.name/2011/12/16/where-we-are-with-rdfa-1-1/#comment-7095</link>
		<dc:creator><![CDATA[Ivan Herman]]></dc:creator>
		<pubDate>Sat, 24 Dec 2011 07:51:53 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=809#comment-7095</guid>
		<description><![CDATA[First of all, referring to your last remark: no disrespect taken! :-) 

In the interest of fairness: schema.org, more exactly the members of schema.org, have chosen one W3C spec over the other, i.e., they chose microdata (part of the HTML5 standard-to-be) over RDFa 1.1 (another standard-to-be). In other words, the blame should not be on the fact that they’ve made a choice. The work on RDFa 1.1 of the past few months have done a lot (o.k., this is my subjective assessment) to bring these two closer, and this was done, to a large extend, because the schema.org members have given us very clear comments on RDFa 1.1. to make it usable for them. A bit late, I agree, but better late than never. And schema.org now acknowledges this duality and simply lives with it by accepting both syntaxes. From their point of view, that is a very pragmatic reaction.

The blame, if we want to understand the situation, is rather on us, i.e., W3C at large, to allow two, parallel standards in developments that clearly overlap in functionality. Put it another way, W3C members (including of course the members of schema.org, but others, too) could have, or should have, intervened on that issue way earlier. The blame is… on all of us, really, including all W3C members. B.t.w., this is not a totally unheard-of situation: we do have SVG and Canvas within HTML5, we did have CSS and XSLT side-by-side… But, at this point, this is history, which cannot really be changed. 

Actually, I do not regard RDFa Lite as a &quot;watered-down&quot; version; it is simply a subset of RDFa that is suited for authors who do not know about Semantic Web, nor are they interested by it. Because Lite is a strict subset of RDFa 1.1, authors can later “upgrade” their content if they wish to, but they can adhere to Lite and do a lot already. Yes, schema.org benefits from that but not only; for example, adding Dublin Core terms to a page can be done, I think, with RDFa Lite easily, and I would not be surprised if DCMI would advise authors to use that level. I regard this type of authors’ simple view as helpful; I would, to take again a different example, be pleased to have such a subset of CSS at my disposal if all I want is to do simple formatting and I would not have to go through the maze of the full CSS specification (which I do find daunting sometimes). RDFa Lite is between a good Primer and a full spec, in some way…

Merry Xmas and a very Happy New Year!]]></description>
		<content:encoded><![CDATA[<p>First of all, referring to your last remark: no disrespect taken! <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  </p>
<p>In the interest of fairness: schema.org, more exactly the members of schema.org, have chosen one W3C spec over the other, i.e., they chose microdata (part of the HTML5 standard-to-be) over RDFa 1.1 (another standard-to-be). In other words, the blame should not be on the fact that they’ve made a choice. The work on RDFa 1.1 of the past few months have done a lot (o.k., this is my subjective assessment) to bring these two closer, and this was done, to a large extend, because the schema.org members have given us very clear comments on RDFa 1.1. to make it usable for them. A bit late, I agree, but better late than never. And schema.org now acknowledges this duality and simply lives with it by accepting both syntaxes. From their point of view, that is a very pragmatic reaction.</p>
<p>The blame, if we want to understand the situation, is rather on us, i.e., W3C at large, to allow two, parallel standards in developments that clearly overlap in functionality. Put it another way, W3C members (including of course the members of schema.org, but others, too) could have, or should have, intervened on that issue way earlier. The blame is… on all of us, really, including all W3C members. B.t.w., this is not a totally unheard-of situation: we do have SVG and Canvas within HTML5, we did have CSS and XSLT side-by-side… But, at this point, this is history, which cannot really be changed. </p>
<p>Actually, I do not regard RDFa Lite as a &#8220;watered-down&#8221; version; it is simply a subset of RDFa that is suited for authors who do not know about Semantic Web, nor are they interested by it. Because Lite is a strict subset of RDFa 1.1, authors can later “upgrade” their content if they wish to, but they can adhere to Lite and do a lot already. Yes, schema.org benefits from that but not only; for example, adding Dublin Core terms to a page can be done, I think, with RDFa Lite easily, and I would not be surprised if DCMI would advise authors to use that level. I regard this type of authors’ simple view as helpful; I would, to take again a different example, be pleased to have such a subset of CSS at my disposal if all I want is to do simple formatting and I would not have to go through the maze of the full CSS specification (which I do find daunting sometimes). RDFa Lite is between a good Primer and a full spec, in some way…</p>
<p>Merry Xmas and a very Happy New Year!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Where we are with RDFa 1.1? by theheatexchange</title>
		<link>http://ivan-herman.name/2011/12/16/where-we-are-with-rdfa-1-1/#comment-7094</link>
		<dc:creator><![CDATA[theheatexchange]]></dc:creator>
		<pubDate>Sat, 24 Dec 2011 03:33:44 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=809#comment-7094</guid>
		<description><![CDATA[I understand that schema.org did not actually create or invent a new syntax, but the point I wanted to make is that schema.org is a member of the W3C and the W3C is working on RDFa.  I would think, as a member, schema.org would have worked towards implementing RDFa.  If not, then maybe I am not understanding the goal of membership in the W3C.

If my understanding is correct, RDFa is supposed to supply a generic syntax whereby other, individually developed, vocabularies could be used.  Syntax aside, stablilizing a vocabulary appears difficult enough, especially considering the 8 stable terms of FOAF.  schema.org makes sense as far as developing a search vocabulary, but why use a different syntax?  Why not fully support their own consortium?

Now, even before RDFa 1.1 is finalized, we have RDFa Lite; another specification to muddle the waters.  If RDFa is generic, then why do we need a watered-down, or streamlined, version?  RDFa Lite appears to accommodate schema.org more than anything else.

Having multiple choices does not produce an efficient semantic web.  As have already been proven, having many alternate choices to do the same things often result in more things being done wrong.

If schema.org found something that it did not like about RDFa, maybe the more &quot;member-loyal&quot; approach would have been to revise RDFa and use that revision.  One syntax and multiple vocubalaries: with schema.org backing up this effort, the semantic web would become more stable.

Thanks for responding and no disrespect intended.]]></description>
		<content:encoded><![CDATA[<p>I understand that schema.org did not actually create or invent a new syntax, but the point I wanted to make is that schema.org is a member of the W3C and the W3C is working on RDFa.  I would think, as a member, schema.org would have worked towards implementing RDFa.  If not, then maybe I am not understanding the goal of membership in the W3C.</p>
<p>If my understanding is correct, RDFa is supposed to supply a generic syntax whereby other, individually developed, vocabularies could be used.  Syntax aside, stablilizing a vocabulary appears difficult enough, especially considering the 8 stable terms of FOAF.  schema.org makes sense as far as developing a search vocabulary, but why use a different syntax?  Why not fully support their own consortium?</p>
<p>Now, even before RDFa 1.1 is finalized, we have RDFa Lite; another specification to muddle the waters.  If RDFa is generic, then why do we need a watered-down, or streamlined, version?  RDFa Lite appears to accommodate schema.org more than anything else.</p>
<p>Having multiple choices does not produce an efficient semantic web.  As have already been proven, having many alternate choices to do the same things often result in more things being done wrong.</p>
<p>If schema.org found something that it did not like about RDFa, maybe the more &#8220;member-loyal&#8221; approach would have been to revise RDFa and use that revision.  One syntax and multiple vocubalaries: with schema.org backing up this effort, the semantic web would become more stable.</p>
<p>Thanks for responding and no disrespect intended.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Where we are with RDFa 1.1? by Ivan Herman</title>
		<link>http://ivan-herman.name/2011/12/16/where-we-are-with-rdfa-1-1/#comment-7089</link>
		<dc:creator><![CDATA[Ivan Herman]]></dc:creator>
		<pubDate>Fri, 23 Dec 2011 06:06:56 +0000</pubDate>
		<guid isPermaLink="false">http://ivan-herman.name/?p=809#comment-7089</guid>
		<description><![CDATA[First of all, to be fair: schema.org did &lt;em&gt;not&lt;/em&gt; invent their own syntax. That syntax (microdata) had been around for a while, in parallel to RDFa, and schema.org made the choice, in around April this year, to adopt microdata first. The choice was controversial indeed, and started a lot of discussion, of course, but the fact remain: they did not invent something. In other words, the question on that level is why were there two syntaxes in development, namely RDFa and microdata. And the reasons for that are really very complicated and I am probably not the appropriate person to answer that: first of all, I am active member of the W3C RDFa Working Group (i.e., biased) but, maybe even more importantly, I am also part of the W3C staff. But I would prefer to leave this to history.
However… there is a &lt;a href=&quot;http://blog.schema.org/2011/11/using-rdfa-11-lite-with-schemaorg.html&quot; rel=&quot;nofollow&quot;&gt;public blog of schema.org&lt;/a&gt; that puts an end to the discussion: schema.org &lt;em&gt;will&lt;/em&gt; use RDFa Lite alongside microdata. I.e., it becomes a matter of choice which syntax to use. There is also a document in preparation (at this moment, only an &lt;a href=&quot;http://dvcs.w3.org/hg/htmldata/raw-file/default/html-data-guide/index.html&quot; rel=&quot;nofollow&quot;&gt;editor’s draft&lt;/a&gt; is available) which looks at the different syntaxes to help in this choice. Finally, there is no issue around “W3C will not fully back up RDFa”: RDFa &lt;em&gt;is&lt;/em&gt; fully backed by W3C.]]></description>
		<content:encoded><![CDATA[<p>First of all, to be fair: schema.org did <em>not</em> invent their own syntax. That syntax (microdata) had been around for a while, in parallel to RDFa, and schema.org made the choice, in around April this year, to adopt microdata first. The choice was controversial indeed, and started a lot of discussion, of course, but the fact remain: they did not invent something. In other words, the question on that level is why were there two syntaxes in development, namely RDFa and microdata. And the reasons for that are really very complicated and I am probably not the appropriate person to answer that: first of all, I am active member of the W3C RDFa Working Group (i.e., biased) but, maybe even more importantly, I am also part of the W3C staff. But I would prefer to leave this to history.<br />
However… there is a <a href="http://blog.schema.org/2011/11/using-rdfa-11-lite-with-schemaorg.html" rel="nofollow">public blog of schema.org</a> that puts an end to the discussion: schema.org <em>will</em> use RDFa Lite alongside microdata. I.e., it becomes a matter of choice which syntax to use. There is also a document in preparation (at this moment, only an <a href="http://dvcs.w3.org/hg/htmldata/raw-file/default/html-data-guide/index.html" rel="nofollow">editor’s draft</a> is available) which looks at the different syntaxes to help in this choice. Finally, there is no issue around “W3C will not fully back up RDFa”: RDFa <em>is</em> fully backed by W3C.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

