http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=data%20integration&type=text&output=html
Kingsley Idehen's Blog Data Space
2024-03-29T14:08:35Z
Kingsley Uyi Idehen
kidehen@openlinksw.com
About data integration
59
1
10
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/126
<p><font face="Arial" size="2"><strong>Enabling Integration of Internal and External Data</strong></font></p>
Enabling Integration of Internal and External Data
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/235
<font size="2"> <p>* IBM has introduced new portal software for accessing and integrating disparate applications, business processes, and data while collaborating with colleagues via a single Web-based environment and sign-on. </p> <p><a href="http://www.bijonline.com/News.asp?NewsID=980"><u><font color="#0000ff" size="2">http://www.bijonline.com/News.asp?NewsID=980</font></u></a></p></font>
IBM Announces New Integration Portal
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/476
<font size="2"> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p>By Jack Vaughan, Application Development Trends</p> <p>XML has clear benefits as a lingua franca for integration, but it must co-exist with a well-established body of relational DB know-how. More than a few architects are concerned that they will soon have to throw a slew of finely tuned relational queries out the window and start over. JNetDirect recently unveiled software to address RDB-to-XML data mapping. JSQLMapper is a bidirectional data-mapping tool that cuts requirements for custom coding to bring relational data into XML format. With JSQLMapper, developers can create data mappings from existing relational data stores to XML documents.</p> <p><a href="http://www.adtmag.com/article.asp?id=9092"><u><font color="#0000ff" size="2">http://www.adtmag.com/article.asp?id=9092</font></u></a></p><font size="2"> <p>See also XML and Databases: <a href="http://xml.coverpages.org/xmlAndDatabases.html"><u><font color="#0000ff" size="2">http://xml.coverpages.org/xmlAndDatabases.html</font></u></a></p></font></blockquote> <p dir="ltr">Why on earth does XML's emergence somehow imply dumping existing investement in relational database queries? </p> <p dir="ltr">XML provides an alternative, uniform, and widely adopted mechanism for exploiting existing relational data. The emergence of the recent SQL/XML (SQLX) standard should at least bring this into clearer perspective.</p> <p dir="ltr">Obviously the lack of practical SQLX tutorial material may be the source of some of the confusion about this subject matter, so here are a few urls:</p> <p dir="ltr">Generating RSS from SQL Data - <a href="http://www.openlinksw.com/articles/rssvirtsqlx.htm">http://www.openlinksw.com/articles/rssvirtsqlx.htm</a> (this is one of many ways that this can be achieved using <a href="http://www.openlinksw.com/virtuoso/whatis.htm">OpenLink Virtuoso</a>).</p> <p dir="ltr">SQLX Tutorials and online demos - <br /><a href="http://demo.openlinksw.com:8890/tutorial/xml/index.vsp?f=1">http://demo.openlinksw.com:8890/tutorial/xml/index.vsp?f=1</a></p> <p dir="ltr"> </p> <p dir="ltr"> </p></font>
Moving to XML: Does it Mean Throwing Out Your RDB Queries?
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1457
<p>I just stumbled across an post from <a href="http://www.itbusinessedge.com" id="link-id10f82f50">ITBusines Edge</a> titled: <a href="http://www.itbusinessedge.com/item/?ci=48119" id="link-id10f37b90">How Semantic Technology Can Help Companies with Integration</a>. While reading the post I encountered the term: <a href="http://dbpedia.org/resource/Master_Data_Management" id="link-id11055eb8">Master Data Manager (MDM)</a>, and wondered to myself, "what's that?" only to realize it's the very same thing I described as a <a href="http://dbpedia.org/resource/Federated_database_system" id="link-id13985af0">Data Virtualization</a> or <a href="http://dbpedia.org/resource/Virtual_Database" id="link-id1167c720">Virtual Database technology</a> (circa. 1998).</p> <p>Now, if re-labeling can confuse <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id14aaaaf0">me</a> when applied to a realm I've been intimately involved with for eons (<a href="http://dbpedia.org/resource/Internet" id="link-id112042f0">internet</a> time). I don't want to imagine what it does for others who aren't that intimately involved with the important data access and data integration realms. </p> <p>On the more refreshing side, the article does shed some light on the potency of RDF and OWL when applied to the construction of conceptual views of heterogeneous data sources.</p> <blockquote> <cite>"How do you know that data coming from one place calculates net revenue the same way that data coming from another place does? You’ve got people using the same term for different things and different terms for the same things. How do you reconcile all of that? That’s really what semantic integration is about." </cite> </blockquote> <p>BTW - I discovered this article via another titled: <a href="http://www.itbusinessedge.com/blogs/mia/?p=485" id="link-id11134098">Understanding Integration And How It Can Help with SOA</a>, that covers SOA and Integration matters. Again, in this piece I feel the gradual realization of the virtues that RDF, OWL, and RDF <a href="http://dbpedia.org/resource/Linked_Data" id="link-id11048740">Linked Data</a> bring to bear in the vital realm of data integration across heterogeneous data silos.</p> <h3>Conclusion</h3> <p>A number of events, at the micro and macro economic levels, are forcing attention back to the issue of productive use of existing IT resources. The trouble with the aforementioned quest is that it ultimately unveils the global IT affliction known as: heterogeneous data silos, and the challenges of pain alleviation, that have been ignored forever or approached inadequately as clearly shown by the rapid build up of SOA horror stories in the data integration realm.</p> <p>Data Integration via conceptualization of heterogenous data sources, that result in concrete conceptual layer data access and management, remains the greatest and most potent application of technologies associated with the "<a href="http://dbpedia.org/resource/Semantic_Web" id="link-id10fa5050">Semantic Web</a>" and/or "Linked Data" monikers.</p> <h3>Related</h3> <ul> <li> <a href="http://www.infoworld.com/article/03/05/23/21FEinnovidehen_1.html" id="link-id118c9c00">InforWorld 2003 Innovator article</a> </li> <li> <a href="http://weblog.infoworld.com/udell/2006/04/28.html" id="link-id11057298">2006 Podcast Interview with Jon Udell</a> </li> <li> <a href="http://dbpedia.org/resource/Enterprise_Information_Integration" id="link-id13f89030">Enterprise Information Integration</a> </li> <li>One of <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=data%20integration&type=text&output=html" id="link-id11048b98">several posts</a> about our <a href="http://virtuoso.openlinksw.com" id="link-id10fef0e0">Virtuoso</a> <a href="http://dbpedia.org/resource/Virtuoso_Universal_Server" id="link-id10e5a068">Universal Server</a> and <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1406" id="link-id111d5aa8">Conceptual Model based data integration</a> </li> <li> <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VOSHistory" id="link-id11020108">History of Virtuoso</a> </li> <li> <a href="http://www.mkbergman.com/me/" id="link-id1101e7b0">Mike Bergman</a>'s post titled: <a href="http://www.mkbergman.com/?p=459" id="link-id10fdb640">WOA: A New Enterprise Partner for Linked Data</a> </li> </ul>
The Trouble with Labels (Contd.): Data Integration & SOA
2008-10-12T22:54:22Z
2008-10-12T18:54:22-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1123
<p> <a href="http://fgiasson.com">Frederick Giasson</a> provides compelling data that supports the view that the <a href="http://www.openlinksw.com/blog/~kidehen/?id=1122">Semantic Web bootstrap is a global Data Integration & Data Generation effort</a> that inevitably involves a variety of Data Sources such as: social networks, blogs, wikis etc.</p> <p> The Data in Fred's post is based on <a href="http://fgiasson.com/blog/index.php/2007/01/21/reaching_at_least_600_000_people_with_19">FOAF Ontology instance data generated from a myriad of Data Sources</a>.</p>
Semantic Web Data Generation Activity: FOAF Crawling
2007-01-22T19:25:48Z
2007-01-22T14:25:48-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/301
<font size="2"> <p>IBM TO SHIP DB2 INTEGRATION SOFTWARE</p> <p>Posted May 15, 2003 4:46 PM Pacific Time</p> <p>IBM on Tuesday plans to announce availability of its DB2 Information Integrator software, for integrating and analyzing multiple forms of information, the company acknowledged on Thursday.</p> <p>In beta since February, the software is intended to enable customers to manage centrally data, text, images, photos, video and audio files stored in different databases, according to IBM. XML content and Web services also are supported.</p> <p><em><strong>Interesting Quote:</strong></em></p> <p class="ArticleBody" page="1">"If we move to information as a utility for giant data grids, this is key technology for hiding or making unimportant the location and type of data. This software enables the data to be accessed transparently wherever it might be," Jones said. </p> <p class="ArticleBody" page="1"><em><strong>Product Pricing</strong></em><br />DB2 Information Integrator will be available for $20,000 per processor and $15,000 per data source connector.<br />Detail will also be available on Tuesday. </p> <p class="ArticleBody" page="1">The cost for a bulk adapter license is about $75,000. If change capture is involved, the adapter license costs about $150,000. Real-time integration costs are mips-based, with a starting cost of about $300,000. One adapter can be used to translate and make native calls to all environments. <br /><br /><em>Very interesting pricing! </em></p> <p class="ArticleBody" page="1">For the full story: <a href="http://www.infoworld.com/article/03/05/15/HNdb2integrate_1.html"><u><font color="#0000ff" size="2"><a href="http://www.infoworld.com/article/03/05/15/HNdb2integrate_1.html">http://www.infoworld.com/article/03/05/15/HNdb2integrate_1.html</a></font></u></a></p></font>
<p>IBM TO SHIP DB2 INTEGRATION SOFTWARE</p>
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/276
<p>An interesting <a href="http://www.sys-con.com/xml/article2a.cfm?id=652&count=18437&tot=14&page=12">piece</a> by Michael Carey architect for Liquid Data at BEA re. Enterprise Information Integration from <a href="http://www.sys-con.com/xml">XML Journal</a>.</p> <p>Key quote.</p> <p><em>Since the dawn of the database era more than three decades ago, enterprises have been amassing an ever-increasing volume of information - both current and historical - about their operations. For the past two of those three decades, the database world has struggled with the problem of somehow integrating information that natively resides in multiple database systems or other information sources (Landers and Rosenberg).</em> </p> <p>This is the root cause of many of the systems integration challenges facing may IT decsion makers. They want to exploit the new and emerging technologies, but the internal disparity of data and application logic presents many obstacles.</p> <p>Michael had this to say in his introduction.</p> <p><em>The IT world knows this problem today as the enterprise information integration (EII) problem: enterprise applications need to be able to easily access and combine information about a given business entity from a distributed and highly varied collection of information sources. Relevant sources include various relational database systems (RDBMSs); packaged applications from vendors such as Siebel, PeopleSoft, SAP, and others; "homegrown" proprietary systems; and an increasing number of data sources that are starting to speak XML, such as XML files and Web services</em>.<br /></p> <p>Virtuoso (which coincedentally has been used to build and host this blog) has been developed to address the challenges presented above; by providing a Virtual Database Engine for disparate data and application logic (all the GEMs on this page have been generated on the fly using it's SQL-XML functionality).</p> <p>Additional article excerpts:<br /><em>With XQuery, the solution sketched above can be implemented by viewing the enterprise's different data sources all as virtual XML documents and functions. XQuery can stitch the distributed customer information together into a comprehensive, reusable base view.</em> </p> <p>A critical issue at this point is how sensistive the XML VIEW is to underlying data source changes. Enterprises are dynamic, so static XML VIEWs are going to be suboptimal in many situations. Applications are only as relevant as the underlying data fluidity served up by the data access (this issue is data format agnostic).</p> <p>Virtuoso addresses this problem through its support of Persistent and Transient forms of XML VIEWs (which are derived from SQL, XML, Web Services, or any combination of these).</p> <p>Final excerpt:<br /><em>The relational data sources can be exposed using simple default XML Schemas, and the other sources - SAP and the credit-checking Web service - can be exposed to XQuery as callable XQuery functions with appropriate signatures.</em> </p> <p>Unfortunately XML Schemas aren't easy, so making this a requirement for producing XML VIEWs is somewhat problematic (or should I say challenging). Of course this approach has it merits, but it does put a significant knowledge acquisition burden on the end-user or developer. This is why Virtuoso also supports an approach based on SQL extensions for generating XML from SQL that facilitate the production of Well Formed and/or Valid XML documents on the fly from heterogeneous SQL Data Sources (this syntax is identical to the FOR XML RAW | AUTO | EXPLICIT modes of SQL Server). It can also use it's in-built XSL-T engine to further transform other non SQL XML data sources (and then generate an XML Schema for the final product if required and validate against this schema using it's in-build XML Schema validaton engine).</p> <p>This article certainly sheds light on the kinds of problems that EII based technologies such as Virtual Databases are positioned to address.</p> <p>There is a live XQuery demo of Virtuoso at: <a href="http://demo.openlinksw.com:8890/xqdemo"><a href="http://demo.openlinksw.com:8890/xqdemo">http://demo.openlinksw.com:8890/xqdemo</a></a></p>
<a href="http://www.sys-con.com/xml/article2a.cfm?id=652&count=18437&tot=14&page=12">piece</a>
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1645
<p> <b> <i><a class="auto-href" href="http://dbpedia.org/resource/Linked_Data" id="link-id0x1e81beb0">Linked Data</a> </i> </b> is simply <i><a href="http://dbpedia.org/resource/Hypermedia" id="link-id0x1d9d5e30">hypermedia</a>-based structured <a href="http://dbpedia.org/resource/Data">data</a>.</i> </p> <p>Linked Data offers everyone a <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a>-scale, Enterprise-grade mechanism for platform-independent creation, curation, access, and integration of data.</p> <p>The fundamental steps to creating Linked Data are as follows:</p> <ol> <li> <p>Choose a <i>Name Reference Mechanism</i> — i.e., URIs.</p> </li> <li> <p>Choose a <i>Data Model</i> with which to Structure your Data — minimally, you need a model which clearly distinguishes</p> <ol type="a"> <li> <i>Subjects</i> (also known as <i>Entities</i>)</li> <li> <i>Subject Attributes</i> (also known as <i><a class="auto-href" href="http://dbpedia.org/resource/Entity" id="link-id0x171a1808">Entity</a> Attributes</i>), and</li> <li> <i>Attribute Values</i> (also known as <i>Subject Attribute Values</i> or <i>Entity Attribute Values</i>).</li> </ol> </li> <li> <p>Choose one or more <i>Data Representation Syntaxes</i> (also called <i>Markup Languages</i> or <i>Data Formats</i>) to use when creating <i>Resources</i> with <i>Content</i> based on your chosen <i>Data Model.</i> Some Syntaxes in common use today are HTML+<a class="auto-href" href="http://dbpedia.org/resource/RDFa" id="link-id0x1a95cc58">RDFa</a>, <a href="http://www.w3.org/DesignIssues/Notation3" id="link-id0x1f596330">N3</a>, <a href="http://www.w3.org/TeamSubmission/turtle/" id="link-id0x16fdca68">Turtle</a>, <a href="http://www.w3.org/TR/REC-rdf-syntax/" id="link-id0x1d7cf0c0">RDF/XML</a>, <a href="http://sw.nokia.com/trix/TriX.html" id="link-id0x19690b60">TriX</a>, <a href="http://dbpedia.org/resource/Extensible_Resource_Descriptor" id="link-id0x1bb46968">XRDS</a>, <a href="http://code.google.com/apis/gdata/index.html" id="link-id0x18f63f20">GData</a>, <a href="http://odata.org" id="link-id0x19aee1e0">OData</a>, <a href="http://opengraphprotocol.org/" id="link-id0x1a43eb78">OpenGraph</a>, and many others.</p> </li> <li> <p>Choose a <i><a class="auto-href" href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id0x19aa3900">URI</a> Scheme</i> that facilitates binding <i>Referenced Names</i> to the <i>Resources</i> which will carry your <i>Content</i> -- your <i>Structured Data.</i> </p> </li> <li> <p>Create <i>Structured Data</i> by using your chosen <i>Name Reference Mechanism,</i> your chosen <i>Data Model,</i> and your chosen <i>Data Representation Syntax,</i> as follows:</p> <ol type="a"> <li>Identify <i>Subject(s)</i> using <i>Resolvable URI(s).</i> </li> <li>Identify <i>Subject Attribute(s)</i> using <i>Resolvable URI(s).</i> </li> <li>Assign <i>Attribute Values</i> to <i>Subject Attributes.</i> These <i>Values</i> may be either <i>Literals</i> (e.g., STRINGs, BLOBs) or <i>Resolvable URIs.</i> </li> </ol> </li> </ol> <p>You can create Linked Data (hypermedia-based data representations) Resources from or for many things. Examples include: personal profiles, calendars, address books, blogs, photo albums; there are many, many more.</p> <h3>Related</h3> <ol> <li> <a href="http://socialmedia.net/linked-data-introduction" id="link-id0x1bb13d50">Linked Data an Introduction</a> -- simple introduction to Linked Data and its virtues</li> <li> <a href="http://gigaom.com/2010/10/11/jeff-jonas-big-data/" id="link-id0xa00d7e8">How Data Makes Corporations Dumb</a> -- Jeff Jonas (IBM) interview</li> <li> <a href="http://www.amundsen.com/hypermedia/" id="link-id0x18f64958">Hypermedia Types</a> -- evolving <a class="auto-href" href="http://dbpedia.org/resource/Information" id="link-id0x1903b880">information</a> portal covering different aspects of Hypermedia resource types</li> <li> <a href="http://linkeddata.uriburner.com" id="link-id0x18af0cf8">URIBurner </a>-- service that generates Linked Data from a plethora of heterogeneous data sources</li> <li> <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id0x1929eea0">Linked Data Meme</a> -- <a class="auto-href" href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id0x1e8127c8">TimbL</a> design issues note about Linked Data</li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1624" id="link-id0x18a5b768">Data 3.0 Manifesto</a> -- note about format agnostic Linked Data</li> <li> <a href="http://dbpedia.org/About" id="link-id0x19ae9338">DBpedia</a> -- large Linked Data Hub</li> <li> <a href="http://richard.cyganiak.de/2007/10/lod/" id="link-id0x14d677f8">Linked Open Data Cloud</a> -- collection of Linked Data Spaces</li> <li> <a href="http://linkedopencommerce.com" id="link-id0x17c6dbf8">Linked Open Commerce Cloud </a>-- commerce (clicks & mortar and/or clicks & clicks) oriented <a class="auto-href" href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id0x13959308">Linked Data Space</a> </li> <li> <a href="http://lod.openlinksw.com" id="link-id0x18ccb9e8">LOD Cloud Cache </a>-- massive Linked Data Space hosting most of the LOD Cloud Datasets</li> <li> <a href="http://lod2.eu" id="link-id0x1a472c20">LOD2 Initiative</a> -- EU Co-Funded Project to develop global <a class="auto-href" href="http://dbpedia.org/resource/Knowledge" id="link-id0x1c0ae7d0">knowledge</a> space from LOD</li>. </ol>
What is Linked Data, really?
2010-11-09T18:53:01Z
2010-11-09T13:53:01-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1639
<p> <b> <i><a class="auto-href" href="http://dbpedia.org/resource/Linked_Data" id="link-id0x1e81beb0">Linked Data</a> </i> </b> is simply <i><a href="http://dbpedia.org/resource/Hypermedia" id="link-id0x1d9d5e30">hypermedia</a>-based structured <a href="http://dbpedia.org/resource/Data">data</a>.</i> </p> <p>Linked Data offers everyone a <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a>-scale, Enterprise-grade mechanism for platform-independent creation, curation, access, and integration of data.</p> <p>The fundamental steps to creating Linked Data are as follows:</p> <ol> <li> <p>Choose a <i>Name Reference Mechanism</i> — i.e., URIs.</p> </li> <li> <p>Choose a <i>Data Model</i> with which to Structure your Data — minimally, you need a model which clearly distinguishes</p> <ol type="a"> <li> <i>Subjects</i> (also known as <i>Entities</i>)</li> <li> <i>Subject Attributes</i> (also known as <i><a class="auto-href" href="http://dbpedia.org/resource/Entity" id="link-id0x171a1808">Entity</a> Attributes</i>), and</li> <li> <i>Attribute Values</i> (also known as <i>Subject Attribute Values</i> or <i>Entity Attribute Values</i>).</li> </ol> </li> <li> <p>Choose one or more <i>Data Representation Syntaxes</i> (also called <i>Markup Languages</i> or <i>Data Formats</i>) to use when creating <i>Resources</i> with <i>Content</i> based on your chosen <i>Data Model.</i> Some Syntaxes in common use today are HTML+<a class="auto-href" href="http://dbpedia.org/resource/RDFa" id="link-id0x1a95cc58">RDFa</a>, <a href="http://www.w3.org/DesignIssues/Notation3" id="link-id0x1f596330">N3</a>, <a href="http://www.w3.org/TeamSubmission/turtle/" id="link-id0x16fdca68">Turtle</a>, <a href="http://www.w3.org/TR/REC-rdf-syntax/" id="link-id0x1d7cf0c0">RDF/XML</a>, <a href="http://sw.nokia.com/trix/TriX.html" id="link-id0x19690b60">TriX</a>, <a href="http://dbpedia.org/resource/Extensible_Resource_Descriptor" id="link-id0x1bb46968">XRDS</a>, <a href="http://code.google.com/apis/gdata/index.html" id="link-id0x18f63f20">GData</a>, and <a href="http://odata.org" id="link-id0x19aee1e0">OData</a>; there are many others.</p> </li> <li> <p>Choose a <i><a class="auto-href" href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id0x19aa3900">URI</a> Scheme</i> that facilitates binding <i>Referenced Names</i> to the <i>Resources</i> which will carry your <i>Content</i> -- your <i>Structured Data.</i> </p> </li> <li> <p>Create <i>Structured Data</i> by using your chosen <i>Name Reference Mechanism,</i> your chosen <i>Data Model,</i> and your chosen <i>Data Representation Syntax,</i> as follows:</p> <ol type="a"> <li>Identify <i>Subject(s)</i> using <i>Resolvable URI(s).</i> </li> <li>Identify <i>Subject Attribute(s)</i> using <i>Resolvable URI(s).</i> </li> <li>Assign <i>Attribute Values</i> to <i>Subject Attributes.</i> These <i>Values</i> may be either <i>Literals</i> (e.g., STRINGs, BLOBs) or <i>Resolvable URIs.</i> </li> </ol> </li> </ol> <p>You can create Linked Data (hypermedia-based data representations) Resources from or for many things. Examples include: personal profiles, calendars, address books, blogs, photo albums; there are many, many more.</p> <h3>Related</h3> <ol> <li> <a href="http://www.amundsen.com/hypermedia/" id="link-id0x18f64958">Hypermedia Types</a> -- evolving <a class="auto-href" href="http://dbpedia.org/resource/Information" id="link-id0x1903b880">information</a> portal covering different aspects of Hypermedia resource types</li> <li> <a href="http://linkeddata.uriburner.com" id="link-id0x18af0cf8">URIBurner </a>-- service that generates Linked Data from a plethora of heterogeneous data sources</li> <li> <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id0x1929eea0">Linked Data Meme</a> -- <a class="auto-href" href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id0x1e8127c8">TimbL</a> design issues note about Linked Data</li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1624" id="link-id0x18a5b768">Data 3.0 Manifesto</a> -- note about format agnostic Linked Data</li> <li> <a href="http://dbpedia.org/About" id="link-id0x19ae9338">DBpedia</a> -- large Linked Data Hub</li> <li> <a href="http://richard.cyganiak.de/2007/10/lod/" id="link-id0x14d677f8">Linked Open Data Cloud</a> -- collection of Linked Data Spaces</li> <li> <a href="http://linkedopencommerce.com" id="link-id0x17c6dbf8">Linked Open Commerce Cloud </a>-- commerce (clicks & mortar and/or clicks & clicks) oriented <a class="auto-href" href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id0x13959308">Linked Data Space</a> </li> <li> <a href="http://lod.openlinksw.com" id="link-id0x18ccb9e8">LOD Cloud Cache </a>-- massive Linked Data Space hosting most of the LOD Cloud Datasets</li> <li> <a href="http://lod2.eu" id="link-id0x1a472c20">LOD2 Initiative</a> -- EU Co-Funded Project to develop global <a class="auto-href" href="http://dbpedia.org/resource/Knowledge" id="link-id0x1c0ae7d0">knowledge</a> space from LOD</li>. </ol>
What is Linked Data, really?
2011-02-15T22:28:06Z
2011-02-15T17:28:06.000002-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1174
<p> <a href="http://dannyayers.com/">Danny Ayers</a> responds, via his post titled: <a href="http://dannyayers.com/2007/03/30/sampling">Sampling</a>, to "Stefano Mazzochi's post about <a href="http://www.betaversion.org/~stefano/linotype/news/101/">Data Integration using Semantic Web Technologies</a>.</p> <blockquote> <p>"There is a potential problem with republication of transformed data, in that right away there may be inconsistency with the original source data. Here provenance tracking (probably via named graphs) becomes a must-have. The web data space itself can support very granular separation. Whatever, data integration is a hard problem. But if you have a uniform language for describing resources, at least it can be possible."<br /> </p> <p>Alex James also chimes in with valuable insights in his post: <a href="http://www.base4.net">Sampling the global data model</a>, where he concludes:</p> <blockquote>"Exactly we need to use projected views, or conceptual models. ' <p> See a projected view can be thought of as a conceptual model that has some mapping to a *sampling* of the global data model.</p> <p>The benefits of introducing this extra layer are many and varied: Simplicity, URI predictability, Domain Specificity and the ability to separate semantics from lower level details like data mapping.</p> <p>Unfortunately if you look at today’s ORMs you will quickly notice that they simply map directly from Object Model to Data Model in one step.</p> <p>This naïve approach provides no place to manage the mapping to a conceptual model that sampling the world’s data requires.</p> <p>What we need to solve the problems Stefano sees is to bring together the world of mapping and semantics. And the place they will meet is simply the Conceptual Model."</p> </blockquote> <p>Data Integration challenges arise because the following facts hold true all of the time (whether we like it or not):</p> <ol> <li>Data Heterogeneity is a fact of life at the intranet and internet levels </li> <li>Data is rarely clean</li> <li>Data Integration prowess are ultimately measured by pain alleviation</li> <li>A some point human participation is required, but the trick is to move human activity up the value chain</li> <li>Glue code size and Data Integration success are inversely related</li> <li>Data Integration is best addressed via "M" rather than "C" (if we use the MVC pattern as a guide. "V" is dead on arrival for the scrappers out there)</li> </ol> <p>In 1997 we commenced the <a href="http://www.openlinksw.com/virtuoso/">Virtuoso</a> Virtual DBMS Project that morphed into the <a href="http://en.wikipedia.org/wiki/Virtuoso_Universal_Server">Virtuoso Universal Server</a>; A fusion of DBMS functionality and Middleware functionality in a single product. The goal of this undertaking remains alleviation of the costs associated with Data Integration Challenges by Virtualizing Data at the Logical and Conceptual Layers.</p> <p>The Logical Data Layer has been concrete for a while (e.g Relational DBMS Engines), what hasn't reached the mainstream is the <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=conceptual%20data%20model&type=text&output=html">Concrete Conceptual Model</a>, but this is changing fast courtesy of the activity taking place in the realm of RDF.</p> <p>RDF provides an Open and Standards compliant vehicle for developing and exploiting Concrete Conceptual Data Models that ultimately move the Human aspect of the "Data Integration alleviation quest" higher up the value chain. </p> </blockquote>
RDF based Integration Challenges (update)
2007-03-30T23:35:35Z
2007-03-30T19:35:35-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/483
<p>An <a href="http://www.crn.com/sections/special/HOF/hof00.asp?ArticleID=21432">interesting piece</a> I stumbled across regarding one of the RDBMS industry's notable pioneers. </p> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p><font face="UNIVERS" size="2"><font face="Arial,Helvetica,Geneva,Swiss,SunSans-Regular" size="2">Today, technology areas that catch Stonebraker's eye include wireless and data integration on the Web. <br /><br />Started Ingres project in early 1970s at Berkeley to develop relational databases. Ingres Corp. formed in 1980.<br /><br />Another Berkeley project, Postgres, yielded object relational databases and spawned Illustra Information Technologies in 1992.<br /><br />Became Informix's CTO in 1996, holding that post until September 2000.<br /><br />Launched Cohera, a maker of federated databases, in 1999, based on a Berkeley research project, Miraposa.</font></font> </p></blockquote> <p dir="ltr"><a href="http://www.crn.com/sections/special/HOF/hof00.asp?ArticleID=21432">Read on..</a></p>
DBMS Hall of Fame: Prof. Michael Stonebraker
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1609
<h2>Situation Analysis</h2> <p>Since the beginning of the modern IT era, each period of innovation has inadvertently introduced its fair share of <a href="http://dbpedia.org/resource/Data">Data</a> Silos. The driving force behind this anomaly remains an overemphasis on the role of applications when selecting problem solutions. Unfortunately, most solution selecting decision makers remain oblivious to the fact that most applications are architecturally monolithic; i.e., they fail to separate the following five layers that are critical to all solutions: </p> <ol> <li>Data Unit (Datum or Data Object) Identity,</li> <li>Data Storage/Persistence,</li> <li>Data Access,</li> <li>Data Representation, and</li> <li>Data Presentation/Visualization. </li> </ol> <p>The rise of the <a href="http://dbpedia.org/resource/Internet" id="link-id13fe21b0">Internet</a>, and its exponentially-growing user-friendly enclave known as the <a href="http://dbpedia.org/resource/World_Wide_Web" id="link-id1233c608">World Wide Web</a>, is bringing the intrinsic costs of the monolithic application architecture anomaly to bear -- in manners unanticipated by many. For example, the emergence of network-oriented solutions across the realms of Enterprise 2.0-based Collaboration and <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> 2.0-based Software-as-a-Service (SaaS), combined with the overarching influence of Social Media, are producing more heterogeneously-structured and disparately-located data sources than people can effectively process.</p> <p>As is often the case, a variety of problem and product monikers have emerged for the data access and integration challenges outlined above. Contemporary examples include Enterprise <a href="http://dbpedia.org/resource/Information" id="link-id13f7e458">Information</a> Integration, Master Data Management, and <a href="http://dbpedia.org/resource/Federated_database_system" id="link-id13f57da0">Data Virtualization</a>. Labeling aside, the fundamental issues of the unresolved Data Integration challenge boil down to the following:</p> <ul> <li>Data Model Heterogeneity</li> <li>Data Quality (Cleanliness)</li> <li>Semantic Variance across Contexts (e.g., weights and measures).</li> </ul> <p>Effectively solving today's data integration challenges requires a move away from monolithic application architecture to loosely-coupled, network-centric application architectures. Basically, we need a ubiquitous network-centric application protocol that lends itself to loosely-coupled across-the-wire orchestration of data interactions. In short, this will be what revitalizes the art of application development and deployment.</p> <p>The World Wide Web is built around a network application protocol called HTTP. This protocol intrinsically separates the five layers listed earlier, thereby enabling:</p> <ul> <li>Use of Generic HTTP URIs as Data Object (<a href="http://dbpedia.org/resource/Entity" id="link-id113b7318">Entity</a>) Identifiers;</li> <li>Identifier Co-reference, such that multiple <a href="http://dbpedia.org/resource/Identity_(object-oriented_programming)" id="link-id117151d8">Data Object Identifiers</a> may reference the same Data Object;</li> <li>Use of the <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id13fa4fa0">Entity</a>-Attribute-Value Model to describe Data Objects using real world modeling friendly conceptual graphs;</li> <li>Use of HTTP URLs to Identify Locations of Resources that bear (host) Data Object Descriptions (Representations);</li> <li>Data Access mechanism for retrieving Data Object Representations from persistent or transient storage locations.</li> </ul> <h2>What is <a href="http://virtuoso.openlinksw.com" id="link-id116af950">Virtuoso</a>?</h2> <p>A uniquely designed to address today's escalating Data Access and Integration challenges without compromising performance, security, or platform independence. At its core lies an unrivaled commitment to industry standards combined with unique technology innovation that transcends erstwhile distinct realms such as: </p> <ul> <li>Data Management (<a href="http://blogs.usnet.private:8893/main/rdbms-engine.html" id="link-id11943dc0">Relational</a>, <a href="http://blogs.usnet.private:8893/main/rdf-quad-store.html" id="link-id12312240">RDF Graph</a>, or Document), </li> <li> <a href="http://blogs.usnet.private:8893/main/middleware.htm" id="link-id115d71c0">Data Access Middleware</a>, </li> <li> <a href="http://blogs.usnet.private:8893/main/web-application-server.html" id="link-id142ca788">Web Application & Services Deployment</a>, </li> <li> <a href="http://blogs.usnet.private:8893/main/linked-data.html" id="link-id112b92c0">Linked Data Deployment</a>, and </li> <li>Messaging. </li> </ul> <p>When Virtuoso is installed and running, HTTP-based Data Objects are automatically created as a by-product of its powerful data virtualization, transcending data sources and data representation formats. The benefits of such power extend across profiles such as:</p> <ul> <li> <a href="http://blogs.usnet.private:8893/information-and-knowledge-worker-benefits" id="link-id118df198">Information & Knowledge Workers</a>, </li> <li> <a href="http://blogs.usnet.private:8893/systems-integrator-benefits" id="link-id1429d178">Systems Integrators & Architects</a>, </li> <li> <a href="http://blogs.usnet.private:8893/distributed-collaboration-benefits" id="link-id142fa2a0">Distributed Collaboration & Social Media</a>, </li> <li> <a href="http://blogs.usnet.private:8893/cloud-computing-benefits" id="link-id11aee6b0">Cloud Computing</a>, and </li> <li> <a href="http://blogs.usnet.private:8893/application-developer-benefits" id="link-id142440b8">Application Development</a>. </li> </ul> <h2>Product Benefits Summary</h2> <ul> <li> <b>Enterprise Agility</b> — Virtuoso lets you mix-&-match best-of-class combinations of Operating Systems, Programming Environments, Database Engines and Data-Access Middleware when building or tweaking your IS infrastructure, without the typical impedance of vendor-lock-in.</li> <li> <b>Data Model Dexterity</b> — By supporting multiple protocols and data models in a single product, Virtuoso protects you against costly vulnerabilities such as: perennial acquisition and accumulation of expensive data model specific DBMS products that still operate on the fundamental principle of: proprietary technology lock-in, at a time when heterogeneity continues to intrinsically define the information technology landscape.</li> <li> <b>Cost-effectiveness</b> — By providing a single point of access (and single-sign-on, SSO) to a plethora of Web 2.0-style social networks, Web Services, and Content Management Systems, and by using Data Object Identifiers as units of Data Virtualization that become the focal points of all data access, Virtuoso lowers the cost to exploit emerging frontiers such as socially-enhanced enterprise collaboration.</li> <li> <b>Speed of Exploitation</b> — Virtuoso provides the ability to rapidly assemble 360-degree conceptual views of data, across internal line-of-business application (CRM, ERP, ECM, HR, etc.) data and/or external data sources, whether these are unstructured, semi-structured, or fully structured.</li> </ul> <p>Bottom line, Virtuoso delivers unrivaled flexibility and scalability, without compromising performance or security.</p> <h2>Related</h2> <ul> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com's BLOG [127]/1567" id="link-id13ee6840">HTTP URI Abstraction and Linked Data</a> </li> <li> <a href="http://blog.jonudell.net/2009/09/09/talking-with-kingsley-idehen-about-mastering-your-own-search-index/" id="link-id1428b698">Be The Master of Your Own Search Index</a> </li> <li> <a href="http://walkingoncoals.blogspot.com/2009/12/whos-data-is-it-part-1.html" id="link-id117db508">Who's Data Is It?</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1482" id="link-id13f64d90">MDM & Linked Data</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1453" id="link-id118861d8">What is Linked Data Oriented RDF-zation?</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1444" id="link-id11820d70">Semantic Web: Travails to Harmony Illustrated</a> </li> </ul> <p> </p>
OpenLink Virtuoso - Product Value Proposition Overiew
2010-02-27T17:46:36Z
2010-02-27T12:46:36-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1357
<p>After listening to the <a href="http://semanticgang.talis.com/2008/05/02/april-2008-the-semantic-web-gang-discuss-a-wikipedia-for-data/" id="link-id1089e218">latest Semantic Web Gang podcast</a>, I found myself agreeing with some of the points made by <a href="http://www.linkedin.com/in/iskold" id="link-id10b91e58">Alex Iskold</a>, specifically: </p> <ul>-- <a href="http://dbpedia.org/resource/Linked_Data" id="link-id106e24e0">Linked Data</a> does not implicitly imply making all your <a href="http://dbpedia.org/resource/Data" id="link-id17ab3d48">data</a> public</ul> <ul>-- <a href="http://dbpedia.org/resource/Linked_Data" id="link-id11fdcef0">Linked Data</a> principles benefit <a href="http://dbpedia.org/resource/Intranet" id="link-id109756e8">Intranet</a> and <a href="http://dbpedia.org/resource/Extranet" id="link-id1099cfd8">Extranet</a> style <a href="http://dbpedia.org/resource/Data" id="link-id10cd25b0">data</a> integration (trumps alternative <a href="http://dbpedia.org/resource/federated_database_system" id="link-id14f29940">distributed database</a> integration approaches any day)</ul> <ul>-- Business exploitation of <a href="http://dbpedia.org/resource/Linked_Data" id="link-id0xca51940">Linked Data</a> on the <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> will certainly be driven by the correlation of opportunity costs (which is more than likely what Alex meant by "use cases") associated with the lack of URIs originating from the domain of a given business (Tom Heath: also effectively alluded to this via his <a href="http://dbpedia.org/resource/BBC" id="link-id16f33348">BBC</a> and <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id10decf38">URI</a> land grab anecdotes; same applies Georgi's examples)</ul> <ul>-- History is a great tutor, answers to many of today's problems always lie somewhere in plain sight of the past.</ul> <p>Of course, I also believe that <a href="http://dbpedia.org/resource/Linked_Data">Linked Data</a> serves Web <a href="http://dbpedia.org/resource/Data" id="link-id0x1afebd58">Data</a> Integration across the <a href="http://dbpedia.org/resource/Internet" id="link-id10aa5668">Internet</a> very well too, and the fact that it will be beneficial to businesses in a big way. No individual or organization is an island, I think the <a href="http://dbpedia.org/resource/Internet" id="link-id0xb25fbd0">Internet</a> and Web have done a good job of demonstrating that thus far :-) We're all <a href="http://dbpedia.org/resource/Data">data</a> nodes in a <a href="http://dbpedia.org/resource/Giant_Global_Graph" id="link-id5d8a3a8">Giant Global Graph</a>.</p> <p> <a href="http://myopenlink.net/dataspace/person/danieljohnlewis#this" id="link-id17cac8a0">Daniel lewis</a> did shed light on the read-write aspects of the Linked Data <a href="http://dbpedia.org/resource/Giant_Global_Graph" id="link-id10be8590">Web</a>, which is actually very close to the callout for a Wikipedia for Data. <a href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id10a810c0">TimBL</a> has been working on this via <a href="http://dig.csail.mit.edu/2005/ajar/release/tabulator/0.8/tab.html" id="link-id184b7108">Tabulator</a> (see <a href="http://dig.csail.mit.edu/2007/tab/tutorial/editing.mov" id="link-id1416f1e8">Tabulator Editing Screencast</a>), <a href="http://bnode.org/about" id="link-id17e33750">Bengamin Nowack</a> also added <a href="http://arc.semsol.org/download/plugins/data_wiki" id="link-id1688cc40">similar functionality to ARC</a>, and of course we support the same <a href="http://dbpedia.org/resource/SPARQL" id="link-id10bff7c8">SPARQL</a> UPDATE into an <a href="http://dbpedia.org/resource/Resource_Description_Framework" id="link-id168ace08">RDF</a> <a href="http://dbpedia.org/resource/Information" id="link-id10641878">information</a> resource via the <a href="http://dbpedia.org/resource/Resource_Description_Framework" id="link-id0xddb5240">RDF</a> Sink feature of our WebDAV and <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/OdsBriefcase" id="link-id0x11199310">ODS</a>-Briefcase implementations.</p>
Comments about recent Semantic Gang Podcast
2008-05-06T00:06:42Z
2008-05-05T20:06:42.000004-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1325
<p> <a href="http://www.linkedin.com/pub/0/207/489" id="link-id10914030">John Schmidt</a>, from Informatica, penned an interesting post titled: <a href="http://blogs.informatica.com/enterprise_data_management/2008/03/it_doesnt_matter_integration_d.html" id="link-idd6d76d8">IT Doesn't Matter - Integration Does</a>. </p> <p>Yes, integration is hard, but I do profoundly believe that what's been happening on the Web over the last 10 or so years also applies to the Enterprise, and by this I absolutely do not mean "Enterprise 2.0" since "2.0" and productive agility do not compute in my realm of discourse. </p> <blockquote>large collections of RSS feeds, Wikiwords, Shared Bookmarks, Discussion Forums etc.. when disconnected at the data level (i.e. hosted in pages with no access to the "data behind") simply offer information deluge and inertia (there are only so many hours for processing opaque information sources in a given day).</blockquote> <p>Enterprises fundamentally need to process information efficiently as part of a perpetual assessment of their relative competitive Strengths, Weaknesses, Opportunities, and Threats (<a href="http://dbpedia.org/resource/SWOT_analysis" id="link-id10776fe8">SWOT</a>), in existing and/or future markets. Historically, IT acquisitions have run counter intuitively to the aforementioned quest for "Ability" due to the predominance of "rip and replace" approach technology acquisition that repeatedly creates and perpetuates information silos across Application, Database, Operating System, Development Environment boundaries. The sequence of events typically occurs as follows:</p> <ol> <li> applications are acquired on a problem by problem basis</li> <li>back-end application databases are discovered once ad-hoc information views are sought by <a href="http://dbpedia.org/resource/Information_worker" id="link-id10a111c8">information workers</a> </li> <li>back-end database disparity across applications is discovered once holistic views are sought by <a href="http://dbpedia.org/resource/Knowledge_worker" id="link-id107997d8">knowledge workers</a> (typically <a href="http://dbpedia.org/resource/Domain_expert" id="link-id102ddf08">domain experts</a>).</li> </ol> <p>In the early to mid 90's (pre ubiquitous Web), operating system, programming language, operating system, and development framework independence inside the enterprise was technically achievable via ODBC (due to it's platform independence). That said, DBMS specific <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id10889d20">ODBC</a> channels alone couldn't address the holistic requirements associated with Conceptual Views of disparate data sources, hence the need for Data Access Virtualization via <a href="http://dbpedia.org/resource/Federated_database_system" id="link-id10884490">Virtual Database</a> Engine technology.</p> <p>Just as is the case on the Web today, with the emergence of the "Linked Data" meme, enterprises now have a powerful mechanism for exploiting the Data Integration benefits associated with generating Data Objects from disparate data sources, endowed with HTTP based IDs (URIs).</p> <p>Conceptualizing access to data exposed Databases APIs, SOA based Web Services (SOAP style Web Services), Web 2.0 APIs (REST style Web Services), XML Views of <a href="http://dbpedia.org/resource/SQL" id="link-id117f8a00">SQL</a> Data (<a href="http://dbpedia.org/resource/SQL/XML" id="link-id104bb730">SQLX</a>), pure XML etc.. is problem area addressed by RDF aware middleware (<a href="http://esw.w3.org/topic/ConverterToRdf" id="link-id10a9deb8">RDFizers</a> e.g <a href="http://virtuoso.openlinksw.com/presentations/Virtuoso_Sponger_1/Virtuoso_Sponger_1.html" id="link-id10256fb0">Virtuoso Sponger</a>).</p> <img src="http://myopenlink.net:8890/%7Ekidehen/Public/images/URI_Data_Source_Pyra_Enterp.png" /> <p>Here are examples of what <a href="http://virtuoso.openlinksw.com/presentations/RDF_Mapping_Presentation_W3C_workshop3/RDF_Mapping_Presentation_W3C_workshop3.html" id="link-id129a6a30">SQL Rows exposed as RDF Data Objects </a>(identified using HTTP based URIs) would look like outside or behind a corporate firewall:</p> <ul> Customer - <a href="http://demo.openlinksw.com/Northwind/Customer/ALFKI#this" id="link-id1183acd8">Alfreds Futterkiste</a> </ul> <ul>Customer Contact - <a href="http://demo.openlinksw.com/Northwind/CustomerContact/ALFKI#this" id="link-id11746bb0">Maria Anders</a> </ul> <ul>Salesrep - <a href="http://demo.openlinksw.com/Northwind/Employee/NancyDavolio1#this" id="link-idff76ed8">Nancy Davolio</a> </ul> <ul>Customer Orders Numbers - <a href="http://demo.openlinksw.com/Northwind/Order/11084#this" id="link-id10ca2648">11084</a>, <a href="http://demo.openlinksw.com/Northwind/Order/11011#this" id="link-id11736160">11011</a>, <a href="http://demo.openlinksw.com/Northwind/Order/11078#this" id="link-id108156e0">11078</a>, <a href="http://demo.openlinksw.com/Northwind/Order/11088#this" id="link-id10747f30">11085</a> </ul> <p>What's Good for the Web Goose (<a href="http://kidehen.idehen.net/dataspace/person/kidehen%23this%3E" id="link-id10a33c50">Personal Data Space URIs</a>) is good for the Enterprise Gander (<a href="http://www.openlinksw.com/dataspace/organization/openlink#this" id="link-id109fbbe0">Enterprise Data Space URIs</a>).</p> <h2>Related</h2> <ul> <a href="http://blogs.informatica.com/enterprise_data_management/2008/02/data_access_a_cultural_or_tech.html" id="link-idffe8168">Data Access - A Cultural or Technical Challenge?</a> </ul>
Linked Data is vital to Enterprise Integration driven Agility
2008-03-22T18:13:41Z
2008-03-22T14:13:41.000002-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1122
<p> <a href="http://www.betaversion.org/~stefano/">Stefano Mazzocchi</a>, via his blog: <a href="http://www.betaversion.org/~stefano/linotype/">Stefano's Linotype</a>, delivers <a href="http://www.betaversion.org/~stefano/linotype/news/99/">insightful contribution</a> to the ongoing effort to recapture the essence of the original <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web </a>vision.</p> <p>The Semantic Web is about granular exposure of the underlying web-of-data that fuels the World Wide Web. It models "<a href="http://www.w3.org/1999/04/WebData">Web Data</a>" using a <a href="http://en.wikipedia.org/wiki/Graph_(mathematics)">Directed Graph</a> Data Model (back-to-the-future: <a href="http://en.wikipedia.org/wiki/Network_model">Network Model Database</a>) called <a href="http://www.w3.org/TR/rdf-primer/">RDF</a>.</p> <p>In line with contemporary database technology thinking, the Semantic Web also seeks to expose Web Data to architects, developers, and users via a concrete <a href="http://en.wikipedia.org/wiki/Conceptual_schema">Conceptual Layer</a> that is defined using <a href="http://www.w3.org/TR/rdf-schema/">RDF Schema</a>.</p> <p>The abstract nature of Conceptual Models implies that actual instance data (<a href="http://en.wikipedia.org/wiki/Entity-relationship_diagrams">Entities, Attributes, and Relationships/Associations</a>) occurs by way of "Logical to Conceptual" schema mapping and data generation that can involve a myriad of logical data sources (SQL, XML, Object databases, traditional web content, <a href="http://en.wikipedia.org/wiki/Rss_%28file_format%29">RSS</a>/<a href="http://en.wikipedia.org/wiki/Atom_%28standard%29">Atom</a> feeds etc.). Thus, by implication, it is safe assume that the Semantic Web's construction is basically a <a href="http://en.wikipedia.org/wiki/Data_integration">Data Integration</a> and exposure effort. The point that Stefano alludes to in the blog post excerpts that follow: </p> <blockquote> <p>The semantic web is really just data integration at a global scale. Some of this data might end up being consistent, detailed and small enough to perform symbolic reasoning on, but even if this is the case, that would be such a small, expensive and fragile island of knowledge that it would have the same impact on the world as calculus had on deciding to invade Iraq.</p> <p>The biggest problem we face right now is a way to 'link' information that comes from different sources that can scale to hundreds of millions of statements (and hundreds of thousands of equivalences). Equivalences and subclasses are the only things that we have ever needed of <a href="http://www.w3.org/TR/owl-features/">OWL</a> and RDFS, we want to 'connect' dots that otherwise would be unconnected. We want to suggest people to use whatever ontology pleases them and then think of just mapping it against existing ones later. This is easier to bootstrap than to force them to agree on a conceptualization before they even know how to start!</p> </blockquote> <p>Additional insightful material from Stefano:</p> <ol> <li> <a href="http://www.betaversion.org/~stefano/linotype/news/57/">A No-Nonsense Guide to Semantic Web Specs for XML People [Part I]</a> </li> <li> <a href="http://www.betaversion.org/~stefano/linotype/news/78/">A No-nonsense Guide to Semantic Web Specs for XML People [Part II]</a> </li> </ol> <p> <a href="http://bnode.org/blog/sw_en">Benjamin Nowack</a> also chimes into this conversation via his <a href="http://rdfer.com/swk/data-information-knowledge">simple guide to understanding Data, Information, and Knowledge</a> in relation so the Semantic Web.</p>
Semantic Web & Data Integration
2007-01-18T14:25:51Z
2007-01-18T09:25:51.000006-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1030
<p>Note: An updated version of a previously unpublished blog post:</p> <p> Continuing from <a href="http://weblog.infoworld.com/udell/2006/04/28.html">our recent Podcast conversation</a>, Jon Udell sheds further insight into the essence of our conversation via a “Strategic Developer” column article titled: <a href="http://www.infoworld.com/cgi-bin/redirect?source=rss&url=http://www.infoworld.com/article/06/05/03/77873_19OPstrategic_1.html">Accessing the web of databases</a>. </p> <p> Below, I present an initial dump of a DataSpace FAQ below that hopefully sheds light on the DataSpace vision espoused during my podcast conversation with Jon. </p> <p> What is a DataSpace? <br /> </p> <p>A moniker for Web-accessible atomic containers that manage and expose Data, Information, Services, Processes, and Knowledge. </p> <p> What would you typically find in a Data Space? Examples include: </p> <ul> <li>Raw Data - SQL, HTML, XML (raw), XHTML, RDF etc.<br /> <br /> </li> <li>Information (Data In Context) - XHTML (various microformats), Blog Posts (in RSS, Atom, RSS-RDF formats), Subscription Lists (OPML, OCS, etc), Social Networks (FOAF, XFN etc.), and many other forms of applied XML.</li> </ul> <ul> <li>Web Services (Application/Service Logic) - REST or SOAP based invocation of application logic for context sensitive and controlled data access and manipulation.</li> </ul> <ul> <li>Persisted Knowledge - Information in actionable context that is also available in transient or persistent forms expressed using a Graph Data Model. A modern knowledgebase would more than likely have RDF as its Data Language, RDFS as its Schema Language, and OWL as its Domain Definition (Ontology) Language. Actual Domain, Schema, and Instance Data would be serialized using formats such as RDF-XML, N3, Turtle etc).</li> </ul> <p> How do Data Spaces and Databases differ? <br />Data Spaces are fundamentally problem-domain-specific database applications. They offer functionality that you would instinctively expect of a database (e.g. AICD data management) with the additonal benefit of being data model and query language agnostic. Data Spaces are for the most part DBMS Engine and Data Access Middleware hybrids in the sense that ownership and control of data is inherently loosely-coupled. </p> <p>How do Data Spaces and Content Management Systems differ?<br />Data Spaces are inherently more flexible, they support multiple data models and data representation formats. Content management systems do not possess the same degree of data model and data representation dexterity. </p> <p>How do Data Spaces and Knowledgebases differ?<br />A Data Space cannot dictate the perception of its content. For instance, what I may consider as knowledge relative to my Data Space may not be the case to a remote client that interacts with it from a distance, Thus, defining my Data Space as Knowledgebase, purely, introduces constraints that reduce its broader effectiveness to third party clients (applications, services, users etc..). A Knowledgebase is based on a Graph Data Model resulting in significant impedance for clients that are built around alternative models. To reiterate, Data Spaces support multiple data models. </p> <p> What Architectural Components make up a Data Space? </p> <ul> <li>ORDBMS Engine - for Data Modeling agility (via complex purpose specific data types and data access methods), Data Atomicity, Data Concurrency, Transaction Isolation, and Durability (aka ACID).<br /> <br /> </li> <li>Virtual Database Engine - for creating a single view of, and access point to, heterogeneous SQL, XML, Free Text, and other data. This is all about Virtualization at the Data Access Level.</li> </ul> <ul> <li>Web Services Platform - enabling controlled access and manipulation (via application, service, or protocol logic) of Virtualized or Disparate Data. This layer handles the decoupling of functionality from monolithic wholes for function specific invocation via Web Services using either the SOAP or REST approach.</li> </ul> <br />Where do Data Spaces fit into the Web's rapid evolution?<br />They are an essential part of the burgeoning Data Web / Semantic Web. In short, they will take us from data “Mash-ups” (combining web accessible data that exists without integration and repurposing in mind) to “Mesh-ups” (combining web accessible data that exists with integration and repurposing in mind).<p> Where can I see a DataSpace along the lines described, in action? </p> <p> Just look at my blog, and take the journey as follows: </p> <ul> <li> <a href="http://www.openlinksw.com/blog/%7Ekidehen/">Front Door</a> (Web 1.0)</li> <li>Lounge (Web 2.0) via <a href="http://www.openlinksw.com/GData/127">GData</a> or <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&type=text&kwds=%27semantic+web%27&OpenSearch">OpenSearch</a> </li> <li>Floor Plan via <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/about.rdf">FOAF</a> or <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/sioc.rdf">SIOC</a> RDF Data Sets (Graphs)</li> <li>Rest of the house (beyond Web 2.0) sending <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VOSODSSparqlSamples">SPARQL Queries</a> to a <a href="http://myopenlink.net:8890/sparql/">SPARQL Endpoint</a>.<br /> </li> </ul> <p> What about other Data Spaces? </p> <p> There are several and I will attempt to categorize along the lines of query method available: <br />Type 1 (Free Text Search over HTTP): <br />Google, MSN, Yahoo!, Amazon, eBay, and most Web 2.0 plays . </p> <p> Type 2 (Free Text Search and XQuery/XPath over HTTP) <br />A few blogs and Wikis (Jon Udell's and a few others)</p>Type 3 (RDF Data Sets and SPARQL Queryable):<br /> <ul> <li> <a href="http://esw.w3.org/topic/SIOC/EnabledSites">SIOC enabled sites</a> (aka points of semantic web presence)<br /> </li> <li> <a href="http://pingthesemanticweb.com/">PingTheSemantic</a> <br /> </li> </ul>Type 4 (Generic Free Text Search, OpenSearch, GData, XQuery/XPath, and SPARQL):<br />Points of Semantic Web presence such as the Data Spaces at: <br /> <ul> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com">My Blog Data Space</a> (as stated earlier in this post)<br /> </li> <li> <a href="http://myopenlink.net:8890/dataspace/kidehen@openlinksw.com">My General Data Space</a> - (ditto; note that this is currently experimental)<br /> </li> </ul> <p>What About Data Space aware tools?<br /> <br /> </p> <ul> <li> <a href="http://demo.openlinksw.com/DAV/JS/oat/index.html/">OpenLink Ajax Toolkit </a>- provides Javascript Control level binding to Query Services such as XMLA for SQL, GData for Free Text, OpenSearch for Free Text, SPARQL for RDF, in addition to service specific Web Services (Web 2.0 hosted solutions that expose service specific APIs)</li> <li> <a href="http://rdfs.org/sioc/firefox">Semantic Radar </a>- a Firefox Extension</li> <li> <a href="http://pingthesemanticweb.com/">PingTheSemantic</a> - the Semantic Webs equivalent of Web 2.0's weblogs.com</li> <li> <a href="http://simile.mit.edu/piggy-bank/">PiggyBank</a> - a Firefox Extension</li> </ul> <p> </p>
Data Spaces and Web of Databases
2006-09-04T22:58:56Z
2006-09-04T18:58:56.000001-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/957
<p> <a href="http://www.softwarememories.com/2006/02/09/prerelational-dbms-vendors-a-quick-overview/">Prerelational DBMS vendors — a quick overview</a>: "</p> <p> <strong>IBM. </strong> With BOMP and D-BOMP, IBM was probably the first company to commercialize precursors to DBMS. (BOMP stood for Bill Of Materials Planning, foreshadowing the hierarchical architecture of IMS.) Out of those grew DL/1 and IMS, IBM’s flagship hierarchical DBMS, and the world’s first dominant DBMS product(s). Of course, IBM also innovated relational DBMS, via the research of E. F. ‘Ted’ Codd, then some prototype products, and eventual the mainframe version of DB2. To this day DB2 on the mainframe remains one of the world’s major DBMS, as does the separate but related product of DB2 for ‘open systems.’</p> <p> <strong>Cincom. </strong> In the 1970s, Cincom was probably the most successful independent software product company. Its flagship product was Total, a shallow-network DBMS that was a little more general than the strictly hierarchical IMS. What’s more, Total ran on almost any brand of computer hardware. Cincom remains independent and privately held to this day.</p> <p> <strong>Cullinane/Cullinet.</strong> Charlie Bachman innovated a true network DBMS at Honeywell, but it didn’t turn into a serious product at that time. B. F. Goodrich, however, ran a version. This is what John Cullinane’s company bought and turned into IDMS, which at least on the mainframe supplanted Total as the technical, mind share, and probably revenue market leader. Cullinet (as it was then called) ran into technical difficulties, however, losing ground to the more flexible index-based DBMS. It was eventually sold to Computer Associates. </p> <p>A lot of software industry leaders cut their teeth at Cullinet, notably Andrew ‘Flip’ Filipowski, later the colorful founder of Platinum. Other alumni include Renato ‘Ron’ Zambonini, Dave Litwack, Dave Ireland, and the original PowerBuilder development team. John Landry and Bob Weiler ran the firm for a while toward the end, but they don’t really count; rather, they’re the most prominent alumni of applications pioneer McCormack & Dodge.</p> <p> <strong>Note:</strong> <em>Index-based</em> is a term I used in and probably coined for my first report in 1982, comprising both inverted-list and relational RDBMS, as opposed to the link(ed)-list hierarchical and network products such as IMS, Total, and IDBMS. The companies that beat Cullinet were long-time rival Software AG, and then especially Applied Data Research; then all three of those independents were blown out by IBM’s DB2. And then the whole mainframe DBMS business was in turn obsoleted by the rise of UNIX … but I’m getting ahead of my story.</p> <p> <strong>Software AG.</strong> Like Cincom, Germany-based Software AG is a 1970s DBMS pioneer that has always remained independent and privately held. Sort of. Twice, Software AG of North America was spun off as a separate, eventually public company. Software AG’s flagship DBMS was the inverted list product ADABAS. SAP’s MaxDB was also owned by Software AG for a while (and seemingly by every other significant German computer company as well – or more precisely, by Nixdorf where it was developed, and by Siemens after it bought Nixdorf).</p> <p>I actually visited Software AG in Darmstadt once. Founder Peter Schnell and key techie Peter Page were both gracious hosts. Schnell was proud of their new building, and especially of the hexagon-based wooden dual desks he’d personally designed. General analytic rule – when the CEO is focused on the décor, this is not a good sign for the company’s near-term prospects. (I call this having an ‘edifice complex.’)</p> <p> <strong>Applied Data Research (ADR). </strong> ADR is often credited as being the first independent software company, having introduced products in the late 1960s and prevailed in antitrust struggles against IBM to allow the business to survive. Basically, it sold programmer productivity tools. This led it to acquire Datacom/DB, an inverted-list DBMS developed in the Dallas area. In the early 1980s, Datacom/DB began to boom, and was on a track to surpass both IDMS and ADABAS in market share until DB2 showed up and blew them all away. ADR was particularly aided by its fourth-generation language (4GL) IDEAL, which was an excellent product notwithstanding the famous State of New Jersey fiasco. (As John Landry said to me about that one, ‘4GLs are powerful tools. In particular, they allow you to write bad programs really quickly.’)</p> <p>ADR was an underappreciated powerhouse, boasting all of the Fortune 100 as customers way back in the early 1980s (yes, even archrival IBM). When the DBMS business stalled, however, ADR was quickly sold — first to Ameritech (the Illinois-based Baby Bell company), and soon thereafter to Computer Associates.</p> <p> <strong>Computer Corporation of America (CCA). </strong> CCA’s DBMS Model 204 may have been the best of the prerelational products, boasting an inverted-list architecture akin to that of ADABAS and Datacom/DB. The company was also interesting in that it was first and foremost a government contract research shop, and hence did all sorts of interesting prototype work that sadly never got commercialized. In about 1983 it became that the company wasn’t going anywhere, and it put itself up for sale. </p> <p>I was personally instrumental in that decision. Our investment banker pretended he was considering taking CCA public. CCA President Jim Rothnie showed us revenue projections. I asked how he had gotten them. He replied that he had taken the market size projection 5 years out, assumed 10%, and drawn a ‘plausible curve.’ However, I quickly got Socratic with him. ‘How many salesmen do you have?’ ‘How much revenue does the average experienced salesman produce?’ ‘How many experienced salesmen do you expect to have next year?’ ‘How high do you think their average productivity can grow?’ ‘Let us multiply.’ (Yes, I really said that. I can be a jerk. And anyway Jim was the sort of analytic guy one can say that to without giving serious offense.)</p> <p>CCA was sold to a Canadian insurance company whose name I’ve now forgotten. Eventually, it was spun back out (perhaps after some intermediate changes of ownership), and resurfaced as primarily a data integration company, called Praxis.</p> <p>In the real old days (mid 1970s, perhaps), Model 204 was resold by Informatics (later Informatics General, later the hostile takeover that became the guts of Sterling Software, which like so many other companies was eventually absorbed into Computer Associates). I know this because Richard Currier used to sell the product when he worked at Informatics. That probably makes Richard and me about the only two people who still remember the fact.</p> <p>Hmm. I forgot to mention <strong>Intel’s System 2000. </strong> Well, truth be told it was a dying product even back when I first became an analyst in 1981, and I recall nothing about it, except Gene Lowenthal’s observation that Intel had had trouble selling chips and DBMS through the same salesforce. I think Al Sisto, who I probably met when he was head of sales at RTI (Relational Technology, Inc. — later called Ingres), came out of that business, but I’m not 100% sure. I remember Pete Tierney from that RTI management team more clearly anyway, although that’s mainly because we stayed in touch at subsequent companies over the years.</p>" <p>(Via <a href="http://www.softwarememories.com">Software Memories</a>.)</p>
Prerelational DBMS vendors — a quick overview
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/938
<p> <a href="http://www.25hoursaday.com/weblog/PermaLink.aspx?guid=d39467a3-7662-4fc4-a782-9c068d47e1b4">ETech 2006 Trip Report: eBay Web Services: A Marketplace Platform for Fun and Profit</a>: "</p> <p> These are my notes from the session <a class="url" href="http://conferences.oreillynet.com/cs/et2006/view/e_sess/8513">eBay Web Services: A Marketplace Platform for Fun and Profit</a> by <a href="http://conferences.oreillynet.com/cs/et2006/view/e_spkr/1518">Adam Trachtenberg</a>. </p> <p> This session was about the <a href="http://developer.ebay.com/">eBay developer program</a>. The talk started by going over the business models for 'Web 2.0' startups. Adam Trachtenberg surmised that so far only two viable models have shown up (i) get bought by <a href="http://www.yahoo.com">Yahoo!</a> and (ii) put a lot of <a href="https://www.google.com/adsense/">Google AdSense</a> ads on your site. The purpose of the talk was to introduce a third option, making money by integrating with eBay's APIs. </p> <p> Adam Trachtenberg went on to talk about the differences between providing information and providing services. Information is read-only while services are read/write. Services have value because they encourage an 'architecture of participation'. </p> <p> <a href="http://www.ebay.com">eBay</a> is a global, online marketplace that facilitates the exchange of goods. The site started off as being a place to purchase used collectibles but now has grown to encompass old and new items, auctions and fixed price sales (fixed price sales are now a third of their sales) and even sales of used cars. There are currently 78 million items being listed at any given time on <a href="http://www.ebay.com">eBay</a>. </p> <p> As <a href="http://www.ebay.com">eBay</a> has grown more popular they have come to realize that one size doesn't fit all when it comes to the website. It has to be customized to support different languages and markets as well as running on devices other the PC. Additionally, they discovered that some companies had started screen scraping their site to give an optimized user experience for some power users. Given how fragile screen scraping is the <a href="http://www.ebay.com">eBay</a> team decided to provide a SOAP API that would be more stable and performant for them than having people screen scrape the website. </p> <p> The API has grown to over 100 methods and about 43% of the items on the website are added via the SOAP API. The API enables one to build user experiences for <a href="http://www.ebay.com">eBay</a> outside the web browser such as integration with cell phones, Microsoft Office, gadgets & widgets, etc. The API has an affiliate program so developers can make money for purchases that happen through the API. An example of the kind of mashup one can build to make money from the eBay API is <a href="https://www.dudewheresmyusedcar.com/">https://www.dudewheresmyusedcar.com</a>. Another example of a mashup that can be used to make money using the eBay API is <a href="http://www.ctxbay.com/">http://www.ctxbay.com</a> which provides contextual eBay ads for web publishers. </p> <p> The aforementioned sites are just a few examples of the kinds of mashups that can be built with the eBay API. Since the API enables buying and listing of items for sale as well as obtaining inventory data from the service, one can build a very diverse set of applications. </p>" <p>(Via <a href="http://www.25hoursaday.com/weblog/">Dare Obasanjo aka Carnage4Life</a>.)</p>
ETech 2006 Trip Report: eBay Web Services: A Marketplace Platform for Fun and Profit
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/900
<blockquote><p><a href="http://www.ajaxian.com/archives/2005/11/solutions_to_al.html">Solutions to allow XMLHttpRequest to talk to external services</a>: "</p><p>Over on XML.com they published <a href="http://www.xml.com/pub/a/2005/11/09/fixing-ajax-xmlhttprequest-considered-harmful.html">Fixing AJAX: XmlHttpRequest Considered Harmful</a>.</p> <p>This article discusses a few ways to get around the security constraints that we have to live with in the browsers theses days, in particular, only being able to talk to your domain via XHR.</p> <p>The article walks you through three potential solutions:</p> <ol> <li><strong>Application proxies</strong>. Write an application in your favorite programming language that sits on your server, responds to <code>XMLHttpRequest</code>s from users, makes the web service call, and sends the data back to users.</li> <li><strong>Apache proxy</strong>. Adjust your Apache web server configuration so that <code>XMLHttpRequest</code>s can be invisibly re-routed from your server to the target web service domain.</li> <li><strong>Script tag hack with application proxy</strong> (doesn't use <code>XMLHttpRequest</code> at all). Use the HTML <code>script</code> tag to make a request to an application proxy (see #1 above) that returns your data wrapped in JavaScript. This approach is also known as <a href="http://ajaxpatterns.org/On-Demand_Javascript">On-Demand JavaScript</a>.</li> </ol> <p>I can't wait for <em>Trusted Relationships</em> within the browser - server infrastructure. </p> <p>With respect to Apache proxies, these things are priceless. I recently talked about them in relation to <a href="http://www.almaer.com/blog/archives/001099.html">Migrating data centers with zero downtime</a>.</p> <p>What do you guys think about this general issue? Have you come up with any interesting solutions? Any ideas on how we can keep security, yet give us the freedom that we want?</p> <p>(Via <a href="http://www.ajaxian.com/">Ajaxian Blog</a>.)</p></blockquote> <p>Well here is what I think (actually know): </p> <p>Our <a href="http://virtuoso.openlinksw.com">Virtuoso Universal Server</a> has been sitting waiting to deliver this for years (for the record see the <a href="http://www.openlinksw.com/press/v2releas.htm">Virtuoso 2000 Press Release</a>). Virtuoso can proxy for disparate data sources and expose disparate data as Well-Formed XML using an array of vocabularies (you experience this SQL-XML integration on the fly every time you interact with various elements of my <a href="http://www.openlinksw.com/blog/~kidehen/">public blog</a>).</p> <p>Virtuoso has always been able to expose Application Logic as SOAP and/or RESTful/RESTian style XML Web Services. <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127">This blog's search page</a> is a simple demo of this capability.</p> <p>Virtuoso is basically a Junction Box / Aggregator / Proxy for disparate Data, Applications, Services, and BPEL compliant business processes. AJAX clients talk to this single multi-purpose server which basically acts as a conduit to content/data, services, and processes (which are composite services).</p> <p>BTW - there is a lot more, but for now, thou shall have to seek in order to find :-) </p>
Solutions to allow XMLHttpRequest to talk to external services
2006-07-21T11:23:03Z
2006-07-21T07:23:03.000001-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/680
<blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p><a href="http://techdirt.com/articles/20050209/1329235_F.shtml">Email As A Platform</a> It looks like more people are starting to realize that email is more than it seems. Especially given the drastic increase in storage size of web-based email applications, more people are realizing that <a href="http://news.bbc.co.uk/2/hi/business/4167633.stm">email is basically a personal database</a>. People simply store information in their email, from contact information that was emailed to them to schedule information to purchase tracking from emailed receipts. Lots of people email messages to themselves, realizing that email is basically the best "permanent" filing system they have. That's part of the reason why good email search is so important. Of course, what the article doesn't discuss is the next stage of this evolution. If you have a database of important information, the next step is to build useful applications on top of it. In other words, people are starting to realize that email, itself, is a <i>platform</i> for personal information management. </p></blockquote> <div align="right">[via <a href="http://www.techdirt.com/">Techdirt</a>]</div> <div align="left"> </div> <div align="left">Yep! And this is where the Unified Storage vision comes into play. Many years ago the same issues emerged in the business application realm, and at the time the issue at hand was: separating the DBMS engine from the Application logic. This is what the SQL Access Group (SAG) addressed via the CLI that laid the foundation for ODBC, JDBC, and recent derivatives; OLE DB and ADO.NET. </div> <div align="left"> </div> <div align="left">Most of us live inside our email applications and the need to integrate the content of emails, address books, notes, calendars with other data sources (Web Portal, Blogs, Wikis, CRM, ERP, and more) as part of our application interaction cycles and domain specific workflow is finally becoming obvious. There is a need for separation of the application/service layer from the storage engine across each one of these functionality realms. XML, RDF, and Triple Stores (RDF / Semantic Data Stores) collectively provide a standards based framework for achieving this goal. On the other hand so does WinFS albeit total proprietary (by this I mean none standards compliant) at the current time.</div> <div align="left"> </div> <div align="left">As you can already see there are numerous applications (conventional or hosted) that address email, address books, bookmarking, notes, calendars, blogs, wikis, crm etc. specifically, but next to none that address the obvious need for transparent integration across each functionality realm - the ultimate goal.</div> <div align="left"> </div> <div align="left">Yes, you know what I am about to say! <a href="http://virtuoso.openlinksw.com">OpenLink Virtuoso</a> is the platform for developing and/or implementing these next generation solutions. We have also decided to go one step further by developing a number of applications that demonstrate the vision (and ultimate reality); and each of these applications (and the inherent integration tapestry) will be the subject of a future Virtuoso Application specific post.</div>
Email As A Platform
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/442
<blockquote style="margin-right: 0px;" dir="ltr"> <p><a class="listLinkLrg" title="http://newsletter.infoworld.com/t?ctl=4FEDB6:1F3948D" href="http://newsletter.infoworld.com/t?ctl=4FEDB6:1F3948D" target="_new"><strong><font face="Verdana">Databases get a grip on XML</font></strong></a><br /><font size="2"></font><font face="Verdana">From <a href="http://newsletter.infoworld.com/t?ctl=4FEDB6:1F3948D">Inforworld</a>.</font><br /><br /><font face="Verdana,Geneva,Arial,sans-serif" size="2">The next iteration of the SQL standard was supposed to arrive in 2003. But SQL standardization has always been a glacially slow process, so nobody should be surprised that SQL:2003 ? now known as SQL:200n ? isn?t ready yet. Even so, 2003 was a year in which XML-oriented data management, one of the areas addressed by the forthcoming standard, showed up on more and more developers? radar screens. <a title="http://newsletter.infoworld.com/t?ctl=4FEDB6:1F3948D" href="http://newsletter.infoworld.com/t?ctl=4FEDB6:1F3948D" target="_blank">>> READ MORE</a></font></p></blockquote> <p dir="ltr"><font face="Verdana" size="2">This article rounds up product for 2003 in the critical area of Enterprise Database Technology. It's certainly provides an apt reflection of how Virtuoso compares with offerings from some the larger (but certainly slower to implement) database vendors in this space. As usual Jon Udell's quote pretty much sums this up:</font></p> <blockquote style="margin-right: 0px;" dir="ltr"> <p dir="ltr"><!--StartFragment --><span class="artText"><em>"While the spotlight shone on the heavyweight contenders, a couple of agile innovators made noteworthy advances in 2003. </em><a class="regularArticleU" href="http://www.infoworld.com/699"><em>OpenLink Software?s Virtuoso 3.0</em></a><em>, which we reviewed in March, stole thunder from all three major players. Like Oracle, it offers a WebDAV-accessible XML repository. Like DB2 Information Integrator, it functions as database middleware that can perform federated ?joins? across SQL and XML sources. And like the forthcoming Yukon, it embeds the .Net CLR (Common Language Runtime), or in the case of Linux, Novell/Ximian?s Mono."</em></span> </p></blockquote> <p dir="ltr"><font face="Verdana" size="2">Albeit still somewhat unknown to the broader industry we have remained true our "innovator" discipline, which still remains our chosen path to market leadership. Thus, its worth a quick Virtuoso release history, and features recap as we get set to up the ante even further in 2004:</font></p> <p dir="ltr"><font face="Verdana" size="2"><a href="http://www.openlinksw.com/press/virtuoso.htm">1998 - Virtuoso's initial public beta</a> release with functional emphasis on Virtual Database Engine for ODBC and JDBC Data Sources.</font></p> <p dir="ltr"><font face="Verdana" size="2"><a href="http://www.openlinksw.com/press/virtuoso1.htm">1999 - Virtuoso's official commercial</a> release, with emphasis still on Virtual Database functionality for ODBC, JDBC accessible SQL Databases.</font></p> <p dir="ltr"><font face="Verdana" size="2"><a href="http://www.openlinksw.com/press/v2releas.htm">2000 - Virtuoso 2.0</a> adds XML Storage, XPath, XML Schema, XQuery, XSL-T, WebDAV, SOAP, UDDI, HTTP, Replication, Free Text Indexing (*feature update*), POP3, and NNTP support.</font></p> <p dir="ltr"><font face="Verdana" size="2"><a href="http://www.openlinksw.com/press/v27releas.htm">2002 - Virtuoso 2.7</a> extends Virtualization prowess beyond data access via enhancements to its Web Services protocol stack implementation by enabling SQL Stored Procedures to be published as Web Services. It also debuts its Object-Relational engine enhancements that include the incorporation of Java and Microsoft .NET Objects into its User Defined Type, User Defined Functions, and Stored Procedure offerings.</font></p> <p dir="ltr"><font face="Verdana" size="2"><a href="http://www.openlinksw.com/press/virt3beta.htm">2003 - Virtuoso 3.0</a> extends data and application logic virtualization into the Application Server realm (basically a Virtual Application server too!), by adding support for ASP.NET, PHP, Java Server Pages runtime hosting (making applications built using any of these languages deployable using Virtuoso across all supported platforms).</font></p> <p dir="ltr"><font face="Verdana" size="2">Collectively each of these releases have contributed to a very premeditated architecture and vision that will ultimately unveil the inherent power of critical I.S infrastructure virtualization along the following lines; data storage, data access , and application logic via coherent integration of SQL, XML, Web Services, and Persistent Stored Modules (.NET, Java, and other object based component building blocks).</font></p> <p dir="ltr"><font face="Verdana"></font> </p>
Enterprise Databases get a grip on XML
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/395
<font size="2"> <p><a href="http://www.nwfusion.com/index.html">NETWORK WORLD</a> NEWSLETTER: MARK GIBBS ON WEB APPLICATIONS </p> <p><font size="2">Today's focus: A Virtuoso of a server</font></p> <p>By <a href="http://www.nwfusion.com/columnists/gibbs.html">Mark Gibbs</a></p> <p>One of the bigger drags of Web applications development is that building a system of even modest complexity is a lot like herding cats - you need a database, an applications server, an XML engine, etc., etc. And as they all come from different vendors you are faced with solving the constellation of integration issues that inevitably arise.</p> <p>If you are lucky, your integration results in a smoothly functioning system. If not, you have a lot of spare parts flying in loose formation with the risk of a crash and burn at any moment.</p> <p>An alternative is to look for all of these features and services in a single package but you'll find few choices in this arena.</p> <p>One that is available and looks very promising is OpenLink's Virtuoso (see links below).</p> <p>Virtuoso is described as a cross platform (runs on Windows, all Unix flavors, Linux, and Mac OS X) universal server that provides databases, XML services, a Web application server and supporting services all in a single package.</p> <p>OpenLink's list of supported standards is impressive and includes .Net, Mono, J2EE, XML Web Services (Simple Object Application Protocol, Web Services Description Language, WS-Security, Universal Description, Discovery and Integration), XML, XPath, XQuery, XSL-T, WebDav, HTTP, SMTP, LDAP, POP3, SQL-92, ODBC, JDBC and OLE-DB.</p> <p>Virtuoso provides an HTTP-compliant Web Server; native XML document creation, storage and management; a Web services platform for creation, hosting and consumption of Web services; content replication and synchronization services; free text index server, mail delivery and storage and an NNTP server.</p> <p>Another interesting feature is that with Virtuoso you can create Web services from existing SQL Stored Procedures, Java classes,</p> <p>C++ classes, and 'C' functions as well as create dynamic XML</p> <p>documents from ODBC and JDBC data sources.</p> <p>This is an enormous product and implies a serious commitment on the part of adopters due to its scope and range of services.</p> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p><em>Virtuoso is enormous by virtue of its architectural ambitions, but actual disk requirements are</em></p></blockquote></font>
A Virtuoso of a Server
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/209
<p dir="ltr">Microsoft just made the <a href="http://www.vsipdev.com/">VSIP program free of charge</a>. Awesome.</p> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p dir="ltr">[via <a href="http://radio.weblogs.com/0001011/">The Scobleizer Weblog</a>]</p></blockquote> <p>Now this is good news from Microsoft! This means that products like <a href="http://www.openlinksw.com/virtuoso/whatis.htm">Virtuoso</a> can now compete head-on with Yukon (on a level playing field when it arrives) as far as Visual Studio.NET integration goes. Hopefully I will no longer have to rant about any of the following:</p> <ol> <li>Missing Data Access Controls and Wizards for ODBC (we already have annbsp interesting Generic ADO.NET Provider en route to GA release)</li> <li>Tightly bound integration between Visual Studio.NET <a href="http://www.microsoft.com/presspass/press/2003/Jul03/07-29InnovationListPR.asp">("Whidbey" or "Orcas")</a>nbspand Yukon (next release of SQL Server), it's up to us (OpenLink) to get the same degree of integration re. Virtuoso (via VSIP), but most importantly Visual Studio's future will not be inextricably linked to Yukon's (let's hope the same applies to IE and Longhorn)</li></ol> <p>I wonder if the same degree of openness could extend to Web Matrix? That would be something indeed!</p>
VSIP program free of charge
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/201
<p><a href="http://blog.bitflux.ch/p1077.html">Tim O'Reilly about network aware software</a> </p> <p>Tim O'Reilly wrote some thoughts about network aware software. Good sumup and nice ideas, why not only blogs should be net-aware (and where even blogs can be improved ;) ) </p> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <div align="left">"<i>For the desktop, my personal vision is to see existing software instrumented to become increasingly web aware. It seems that Apple are doing a good job with this. (What does web aware mean for me? Being able to grok URIs, speaking WebDAV, and using open standard data formats.)</i>" -- <strong>Edd Dumbill</strong> </div> <div align="left"></div> <div align="left">[via <a href="http://blog.bitflux.ch/">Bitflux Blog</a>]</div></blockquote> <div align="left">I agree, but you do have to add Open Data Access formats (such as ODBC and to some degree JDBC) to this mix otherwise the you will need to create data for Open Standard Data Formats from sratch (tough for any enterprise irrespective of size).</div> <div align="left"></div> <div align="left">Tim O'Reilly added the following items to Edd's list:</div> <div align="left"> <ul> <li> <p>Rendezvous-like functionality for automatic discovery of and potential synchronization with other instances of the application on other computers. Apple is showing the power of this idea with iChat and iTunes, but it really could be applied in so many other places. For example, if every PIM supported this functionality, we could have the equivalent of "phonester" where you could automatically ask peers for contact information. Of course, that leads to guideline 2. </p></li></ul></div> <p>Another application is discovery of <a href="http://www.openlinksw.com/info/docs/uda50/mt/features.html#features">ODBC data sources</a>, and database servers. Rendezvous can also simply security and administration of data sources accessible by either one of these standards data access mechanisms. It can also apply to XML databases and data sources exposed by <a href="http://www.openlinksw.com/virtuoso/whatis.htm">XML Databases</a>.</p> <p></p> <p></p> <ul> <li>If you assume ad-hoc networking, you have to automatically define levels of access. I've always thought that the old Unix ugo (user, group, other) three-level permission system was simple and elegant, and if you replace the somewhat arbitrary "group" with "on my buddy list", you get something quite powerful. Which leads me to... <p></p> <p></p></li> <ul> <li>Buddy lists ought to be supported as a standard feature of many apps, and in a consistent way. What's more, our address books really ought to make it easy to indicate who is in a "buddy list" and support numerous overlapping lists for different purposes. <br /></li></ul> <li>Every application ought to expose some version of its data as an XML feed via some well-defined and standard access mechanism. It strikes me that one of the really big wins that fueled the early web was a simple naming scheme: you could go to a site called www.foo.com, and you'd find a web server there. While it wasn't required, it made web addresses eminently guessable. We missed the opportunity for xml.foo.com to mean "this is where you get the data feed" but it's probably still possible to come up with a simple, consistent naming scheme. And of course, if we can do it for web sites, we also need to think about how to do it for local applications, since... </li></ul> <p>The very point I continue to make about Internet Points of Presence beingactual data acces points, in short these end points should be served by database serverprocesses. This is the very basis of <a href="http://www.openlinksw.com/virtuoso">Virtuoso</a>, the inevitability of this realization remains the undepinings of this product. There are other products out there that have some sense of this vision too, but there is a little snag (at least so far in my research efforts), and that is the tendency to create dedicated independent server per protocol (an ultimate integration, administration, and maintenance nightmare).</p> <ul> <li>We ought to be able to have the expectation that all applications, whether local or remote (web) will be set up for two-way interactions. That is, they can be either a source or sink of online data. So, for example, the natural complement to amazon's web services data feeds is data input (for example, the ability to comment on a book on your local blog, and syndicate the review via RSS to amazon's detail page for the book.) And that leads to: <p></p> <p></p></li> <li>We really need to understand who owns what, and come up with mechanisms that protect the legitimate rights of individuals and businesses to their own data, while creating the "liquidity" and free movement of data that will fuel the next great revolution in computer functionality. (I'm doing a panel on this subject at next week's Open Source Convention, entitled "<a href="http://conferences.oreillynet.com/cs/os2003/view/e_sess/4526">We Need a Bill of Rights for Web Services</a>.") <p></p> <p></p></li> <li>We need easy gateways between different application domains. I was recently in Finland at a Nokia retreat, and we used camera-enabled cell phones to create a mobile photoblog. That was great. But even more exciting was the ease with which I could send a photo from the phone not just to another phone but also to an email address. This is the functionality that enabled the blog gateway, but it also made it trivial to send photos home to my family and friends. Similarly, I often blog things that I hear on mailing lists, and read many web sites via screen-scraping enabled email lists. It would be nice to have cross-application gateways be a routine part of software, rather than something that has to be hacked on after the fact.</li></ul> <div align="left">The wish list is pretty much a clear articulation of key items that should matter most to decision makers (CTOs and CIOs) ; in particular those that continue to wrestle with the identification and isolation of relevantcomponentsfor their enterprisearchitectures. </div>
Tim O'Reilly about network aware software
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/177
<p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"><a href="http://www.macdevcenter.com/pub/a/mac/2003/06/20/odbc.html">Open Database Connectivity for Mac OS X</a></span></p> <p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"></span><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial">It continues to amaze me that the fundamental implications of corporate data access remains misunderstood by all parties in the ITsphere. How can any organization afford to be ambivalent about where data is stored, and their ability to transform this data into information and knowledge (ultimate competitive advantage)? Data is the most valuable company asset (we even had data in the enterprise before computers!).</span></p> <p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"></span><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial">Mac OS X is attempting to make a serious push into the enterprise, but how can this be taken seriously if solving one of the biggest problems in the enterprise today isn't a flagship item driving the enterprise marketing strategy? The excerpt below simply sums this up:</span></p> <blockquote dir="ltr" style="MARGIN-RIGHT: 0px"> <p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"></span><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial">One of the new, albeit virtually undocumented features included in Jaguar is ODBC, or Open Database Connectivity. ODBC allows programs to connect to databases from different vendors using the same set of connectivity protocols. This allows for simplified database programming as well as database access from programs that normally would not allow such access. For instance, with ODBC you can use Excel to get data from MySQL, or you can use FileMaker to get data from Oracle. </span></p> <p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"></span><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial">From article titled <a href="http://www.macdevcenter.com/pub/a/mac/2003/06/20/odbc.html"><span style="FONT-SIZE: 12pt; FONT-FAMILY: 'Times New Roman'">Open Database Connectivity in Jaguar</span></a> by <a href="http://www.macdevcenter.com/pub/au/1236"><span style="FONT-SIZE: 12pt; FONT-FAMILY: 'Times New Roman'">Andrew Anderson</span></a></span></p> <p><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial"></span><span style="FONT-SIZE: 10pt; FONT-FAMILY: Arial">Open Database Connectivity is the only mechanism today that will enable any application to connect to any database without compromising choices across the following lines: Operating System, Programming Language, Desktop Productivity Tools, and Database Engine. All alternatives fail in one of the listed areas, with the ultimate destination being the painful realization that you are down a technology cul-de-sac (and these cost money via integration and data access quagmires). <?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" ?><p xmlns="o"></p></span></p></blockquote> <p> </p>
Open Database Connectivity for Mac OS X
2006-06-22T12:56:58Z
2006-06-22T08:56:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1662
<p>There is increasing coalescence around the idea that HTTP-based <a class="auto-href" href="http://dbpedia.org/resource/Linked_Data" id="link-id0x1e93cbd0">Linked Data</a> adds a tangible dimension to the <a class="auto-href" href="http://dbpedia.org/resource/World_Wide_Web" id="link-id0x1dfdde10">World Wide Web</a> (<a href="http://dbpedia.org/resource/World_Wide_Web">Web</a>). This <i><a href="http://dbpedia.org/resource/Data">Data</a> Dimension</i> grants end-users, power-users, integrators, and developers the ability to experience the Web not solely as a <i><a class="auto-href" href="http://dbpedia.org/resource/Information" id="link-id0x19d02b00">Information</a> Space</i> or <i>Document Space,</i> but now also as a <i><a class="auto-href" href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id0x1ac33378">Data Space</a>.</i> </p> <p>Here is a simple What and Why guide covering the essence of Data Spaces.</p> <h2>What is a Data Space?</h2> <p>A Data Space is a point of presence on a network, where every <i>Data Object</i> (item or <a class="auto-href" href="http://dbpedia.org/resource/Entity" id="link-id0x1d55f910">entity</a>) is given a <i>Name</i> (e.g., a <a class="auto-href" href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id0x1736ea28">URI</a>) by which it may be Referenced or Identified. </p> <p>In a Data Space, every <i>Representation</i> of those Data Objects (i.e., every <i>Object Representation</i>) has an <i>Address</i> (e.g., a <a class="auto-href" href="http://dbpedia.org/resource/Uniform_Resource_Locator" id="link-id0x1f17f5a8">URL</a>) from which it may be Retrieved (or "gotten").</p> <p>In a Data Space, every Object Representation is a time variant (that is, it changes over time), streamable, and format-agnostic <i>Resource.</i> </p> <p>An Object Representation is simply a Description of that Object. It takes the form of a graph, pictorially constructed from sets of 3 elements which are themselves named <i>Subject,</i> <i>Predicate,</i> and <i>Object</i> (or <i>SPO</i>); or <i>Entity,</i> <i>Attribute,</i> and <i>Value</i> (or <i>EAV</i>). Each <a class="auto-href" href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id0x1dedcfe0">Entity</a>+Attribute+Value or Subject+Predicate+Object set (or <i>triple</i>), is one datum, one piece of data, one persisted observation about a given Subject or Entity.</p> <p>The underlying Schema that defines and constrains the construction of Object Representations is based on Logic, specifically <i>First-Order Logic</i>. Each Object Representation is a collection of persisted observations (<i>Data</i>) about a given Subject, which aid observers in materializing their perception (<i>Information</i>), and ultimately comprehension (<i><a class="auto-href" href="http://dbpedia.org/resource/Knowledge" id="link-id0x1a4c7bf8">Knowledge</a></i>), of that Subject.</p> <h2>Why are Data Spaces important?</h2> <p>In the real-world -- which is networked by nature -- data is heterogeneously (or "differently") shaped, and disparately located. </p> <p>Data has been increasing at an alarming rate since the advent of computing; the interWeb simply provides <a class="auto-href" href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id0x1ad97358">context</a> that makes this reality more palpable and more exploitable, and in the process virtuously ups the ante through increasingly exponential growth rates.</p> <p>We can't stop data heterogeneity; it is endemic to the nature of its producers -- humans and/or human-directed machines. What we can do, though, is create a powerful Conceptual-level "bus" or "interface" for data integration, based on <i>Data Description oriented Logic</i> rather than Data Representation oriented Formats. Basically, it's possible for us to use a <i><a href="http://en.wikipedia.org/wiki/First-order_predicate_logic" id="link-id0x1a481248">Common Logic</a></i> as the basis for expressing and blending SPO- or EAV-based Object Representations in a variety of Formats (or "dialects").</p> <p>The roadmap boils down to:</p> <ol> <li> <p>Assigning unambiguous Object Names to:</p> <ul> <li> <p>Every record (or, in table terms, every row); </p> </li> <li> <p>Every record attribute (or, in table terms, every field or column);</p> </li> <li> <p>Every record relationship (that is, every relationship between one record and another);</p> </li> <li> <p>Every record container (e.g., every table or view in a relational database, every named graph, every spreadsheet, every text file, etc.);</p> </li> </ul> </li> <li> <p>Making each Object Name resolve to an Address through which Create, Read, Update, and Delete ("CRUD") operations can be performed against (can <i>access</i>) the associated Object Representation graph.</p> </li> </ol>
Data Spaces
2011-03-01T22:26:15Z
2011-03-01T17:26:15-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1608
<p>In recent times a lot of the commentary and focus re. <a href="http://virtuoso.openlinksw.com" id="link-id16a22f48">Virtuoso</a> has centered on the RDF Quad Store and <a href="http://dbpedia.org/resource/Linked_Data" id="link-id112d82a0">Linked Data</a>. What sometimes gets overlooked is the sophisticated <a href="http://dbpedia.org/resource/Virtual_Database" id="link-id6493cc8">Virtual Database</a> Engine that provides the foundation for all of Virtuoso's <a href="http://dbpedia.org/resource/Data">data</a> integration capabilities.</p> <p>In this post I provide a brief re-introduction to this essential aspect of Virtuoso.</p> <h3>What is it?</h3> <p>This component of Virtuoso is known as the Virtual Database Engine (VDBMS). It provides transparent high-performance and secure access to disparate data sources that are external to Virtuoso. It enables federated access and integration of data hosted by any <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id13c26008">ODBC</a>- or <a href="http://dbpedia.org/resource/Java_Database_Connectivity" id="link-id166604c0">JDBC</a>-accessible <a href="http://dbpedia.org/resource/Relational_database_management_system" id="link-id139dfdb8">RDBMS</a>, RDF Store, XML database, or Document (Free Text)-oriented Content Management System. In addition, it facilitates integration with <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> Services (SOAP-based SOA RPCs or REST-fully accessible Web Resources). </p> <h3>Why is it important?</h3> <p>In the most basic sense, you shouldn't need to upgrade your existing database engine version simply because your current DBMS and Data Access Driver combo isn't compatible with ODBC-compliant desktop tools such as Microsoft Access, Crystal Reports, BusinessObjects, Impromptu, or other of ODBC, JDBC, <a href="http://dbpedia.org/resource/ADO.NET" id="link-id13c7ceb8">ADO</a>.NET, or OLE DB-compliant applications. Simply place Virtuoso in front of your so-called "legacy database," and let it deliver the compliance levels sought by these tools</p> <p>In addition, it's important to note that today's enterprise, through application evolution, company mergers, or acquisitions, is often faced with disparately-structured data residing in any number of line-of-business-oriented data silos. Compounding the problem is the exponential growth of user-generated data via new social media-oriented collaboration tools and platforms. For companies to cost-effectively harness the opportunities accorded by the increasing intersection between line-of-business applications and social media, virtualization of data silos must be achieved, and this virtualization must be delivered in a manner that doesn't prohibitively compromise performance or completely undermine security at either the enterprise or personal level. Again, this is what you get by simply installing Virtuoso.</p> <h3>How do I use it?</h3> <p>The VDBMS may be used in a variety of ways, depending on the data access and integration task at hand. Examples include: </p> <h4>Relational Database Federation</h4> <p>You can make a single ODBC, JDBC, ADO.NET, OLE DB, or XMLA connection to multiple ODBC- or JDBC-accessible RDBMS data sources, concurrently, with the ability to perform intelligent distributed joins against externally-hosted database tables. For instance, you can join internal human resources data against internal sales and external stock market data, even when the HR team uses <a href="http://dbpedia.org/resource/Oracle_Database" id="link-id16706720">Oracle</a>, the Sales team uses <a href="http://dbpedia.org/resource/IBM_Informix" id="link-ide5a15c8">Informix</a>, and the Stock Market figures come from <a href="http://dbpedia.org/resource/Ingres" id="link-id13c0e138">Ingres</a>!</p> <h4>Conceptual Level Data Access using the RDF Model</h4> <p>You can construct RDF Model-based Conceptual Views atop Relational Data Sources. This is about generating HTTP-based <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id115150f8">Entity</a>-Attribute-Value (E-A-V) graphs using data culled "on the fly" from native or external data sources (Relational Tables/Views, XML-based Web Services, or User Defined Types).</p> <p>You can also derive RDF Model-based Conceptual Views from Web Resource transformations "on the fly" -- the Virtuoso <a href="http://virtuoso.openlinksw.com/Whitepapers/html/VirtSpongerWhitePaper.html" id="link-id1675db50">Sponger</a> (RDFizing middleware component) enables you to generate RDF Model Linked Data via a RESTful Web Service or within the process pipeline of the <a href="http://dbpedia.org/resource/SPARQL" id="link-id166b8d90">SPARQL</a> query engine (i.e., you simply use the <a href="http://dbpedia.org/resource/Uniform_Resource_Locator" id="link-id167d00c8">URL</a> of a Web Resource in the FROM clause of a SPARQL query).</p> <p>It's important to note that Views take the form of HTTP links that serve as both Data Source Names and Data Source Addresses. This enables you to query and explore relationships across entities (i.e., People, Places, and other Real World Things) via HTTP clients (e.g., Web Browsers) or directly via SPARQL Query Language constructs transmitted over HTTP.</p> <h4>Conceptual Level Data Access using ADO.NET <a href="http://dbpedia.org/resource/Entity" id="link-id13c6bb60">Entity</a> <a href="http://dbpedia.org/resource/ADO.NET_Entity_Framework" id="link-id16ad3f68">Frameworks</a> </h4> <p>As an alternative to RDF, Virtuoso can expose ADO.NET Entity Frameworks-based Conceptual Views over Relational Data Sources. It achieves this by generating Entity Relationship graphs via its native ADO.NET Provider, exposing all externally attached ODBC- and JDBC-accessible data sources. In addition, the ADO.NET Provider supports direct access to Virtuoso's native RDF database engine, eliminating the need for resource intensive Entity Frameworks model transformations.</p> <h3>Related</h3> <ul> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtLinkRemoteTables" id="link-id1183acd8">Attaching ODBC or JDBC accessible Relational Tables to Virtuoso</a> </li> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtRdb2RDFViewsGeneration#One-Click%20Linked%20Data%20Generation%20&%20Deployment" id="link-id113f2fd8">Using an HTML based Wizard to Generate RDF based Linked Views over Relational Tables</a> </li> <li> <a href="http://www.youtube.com/watch?v=bj7AbJ0ZYCk&feature=channel" id="link-id16ad4480">Screencast Demonstrating Wizard based generation of RDF based Linked Data Views Part 1</a> </li> <li> <a href="http://www.youtube.com/watch?v=yXNlcISS0aY&feature=channel" id="link-id114eb720">Screencast Demonstrating Wizard based generation of RDF based Linked Data Views Part 1</a> </li> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSponger" id="link-id116e5810">Generating RDF based Linked Data from non RDF based Web Resources via the Sponger</a> </li> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtAdoNet35Provider" id="link-id16706118">Building ADO.NET based Entity Frameworks Views over Relational Data</a> </li> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSilverlightSPARQLExample" id="link-id139c1278">Building Silverlight Rich Internat Applicaitons using ADO.NET, Entity Frameworks, and RDF based Linked Data</a>.</li> </ul>
Re-introducing the Virtuoso Virtual Database Engine
2010-02-17T21:46:53Z
2010-02-17T16:46:53-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1602
<p>For this particular user experience chronicle, I've simply inserted the content of <a href="http://trueg.wordpress.com" id="link-id1368b4d8">Sebastian Trueg</a>'s post titled: <a href="http://trueg.wordpress.com/2010/01/26/what-we-did-last-summer-and-the-rest-of-2009-a-look-back-onto-the-nepomuk-development-year-with-an-obscenely-long-title/#comments" id="link-id139dddb0">What We Did Last Summer (And the Rest of 2009) – A Look Back Onto the Nepomuk Development Year ...</a>, directly into this post, without any additional commentary or modification.</p> <div class="snap_preview"> <p>2009 is over. <em>Yeah, sure, trueg, we know that, it has been over for a while now!</em> Ok, ok, I am a bit late, but still I would like to get this one out - if only for my archive. So here goes.</p> <h2> <a href="http://virtuoso.openlinksw.com" id="link-id64672f0">Virtuoso</a> </h2> <p>Let’s start with the major topic of 2009 (and also the beginning of 2010): The new Nepomuk database backend: <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/" id="link-id13cc47e0">Virtuoso</a>. Everybody who used Nepomuk had the same problems: you either used the <a href="http://openrdf.org/" id="link-id13a4ac88">sesame2</a> backend which depends on Java and steals all of your memory or you were stuck with <a href="http://librdf.org/" id="link-id11b6a550">Redland</a> which had the worst performance and missed some <a href="http://dbpedia.org/resource/SPARQL" id="link-id139d82b8">SPARQL</a> features making important parts of Nepomuk like queries unusable. So more than a year ago I had the idea to use the one GPL’ed database server out there that supported RDF in a professional manner: <a href="http://www.openlinksw.com/" id="link-id139fd948">OpenLin</a>k’s <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/" id="link-id12329590">Virtuoso</a>. It has all the features we need, has a very good performance, and scales up to dimensions we will probably never reach on the desktop (<em>yeah, right, and 64k main memory will be enough forever!</em>). So very early I started coding the necessary Soprano plugin which would talk to a locally running Virtuoso server through <a href="http://en.wikipedia.org/wiki/Open_Database_Connectivity" id="link-id14930d90">ODBC</a>. But since I ran into tons of small problems (as always) and got sidetracked by other tasks I did not finish it right away. OpenLink, however, was very interested in the idea of their server being part of every KDE installation (why wouldn’t they ;)). So they not only introduced a <a href="http://docs.openlinksw.com/virtuoso/databaseadmsrv.html#ini_Parameters" id="link-id136763c0">lite-mode</a> which makes Virtuoso suitable for the desktop but also helped in debugging all the problems that I had left. Many test runs, patches, and a Virtuoso 5.0.12 release later <a href="http://trueg.wordpress.com/2009/10/22/virtuoso-once-more-with-feeling/" id="link-id13c5a5a0">I could finally announce the Virtuoso integration</a> as usable.</p> <p>Then end of last year I dropped the support for sesame2 and redland. Virtuoso is now the only supported database backend. The reason is simple: Virtuoso is way more powerful than the rest - not only in terms of performance - and it is fully implemented in <a href="http://dbpedia.org/resource/C%2B%2B" id="link-id13a17cd8">C</a>(++) without any traces of Java. Maybe even more important is the integration of the full text index which makes the previously used CLucene index unnecessary. Thus, we can finally combine full text and graph queries in one SPARQL query. This results in a cleaner API and way faster return of search results since there is no need to combine the results from several queries anymore. A direct result of that is the new <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/namespaceNepomuk_1_1Query.html" id="link-id149a9fd8">Nepomuk Query API</a> which I will discuss later.</p> <p>So now the only thing I am waiting for is the first bugfix release of Virtuoso 6, i.e. 6.0.1 which will fix the bugs that make 6.0.0 fail with Nepomuk. Should be out any day now. :)</p> <h2>The Nepomuk Query API</h2> <p>Querying <a href="http://dbpedia.org/resource/Data">data</a> in Nepomuk pre-KDE-4.4 could be done in one of two ways: 1. Use the very limited capabilities of the <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/classNepomuk_1_1ResourceManager.html" id="link-id139ad3d0">ResourceManager</a> to list resources with certain properties or of a certain type; or 2. Write your own <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/AdvancedQueries" id="link-id13c74608">SPARQL query using ugly QString::arg replacements</a>.</p> <p>With the introduction of Virtuoso and its awesome power we can now do pretty much everything in one query. This allowed <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id13c4cf18">me</a> to finally create a query API for KDE: <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/classNepomuk_1_1Query_1_1Query.html" id="link-id602e818">Nepomuk::Query::Query</a> and friends. I won’t go into much detail here since I did that <a href="http://trueg.wordpress.com/2009/12/07/convenient-querying-in-libnepomuk/" id="link-id11282ff8">before</a>.</p> <p>All in all you should remember one thing: whenever you think about writing your own SPARQL query in a KDE application - have a look at libnepomukquery. It is very likely that you can avoid the hassle of debugging a query by using the query API.</p> <p>The first nice effect of the new API (apart from me using it all over the place obviously) is the new query interface in Dolphin. Internally it simply combines a bunch of <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/classNepomuk_1_1Query_1_1Term.html" id="link-id11952270">Nepomuk::Query::Term</a> objects into a <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/classNepomuk_1_1Query_1_1AndTerm.html" id="link-id13aa85b8">Nepomuk::Query::AndTerm</a>. All very readable and no ugly query strings.</p> <div id="attachment_234" class="wp-caption aligncenter" style="width: 610px"> <a href="http://trueg.files.wordpress.com/2010/01/dolphin-kde-4-4-search-panel.png" id="link-id11454028"><img class="size-full wp-image-234" title="Dolphin Search Panel in KDE SC 4.4" src="http://trueg.files.wordpress.com/2010/01/dolphin-kde-4-4-search-panel.png?w=600&h=208" alt="" width="600" height="208" /> </a> <p class="wp-caption-text">Dolphin Search Panel in KDE SC 4.4</p> </div> <h2>Shared Desktop Ontologies</h2> <p>An important part of the <a href="http://nepomuk.semanticdesktop.org/" id="link-id13a35a90">Nepomuk research project</a> was the creation of a set of <a href="http://www.semanticdesktop.org/ontologies/" id="link-id123a6700">ontologies</a> for describing desktop resources and their metadata. After the <a href="http://xesam.org/main/XesamAbout" id="link-id13c70ab8">Xesam</a> project under the umbrella of <a href="http://www.freedesktop.org/" id="link-id139e2108">freedesktop.org</a> had been convinced to use RDF for describing file metadata they developed their own ontology. Thanks to Evgeny (phreedom) Egorochkin and Antonie Mylka both the Xesam ontology and the Nepomuk <a href="http://dbpedia.org/resource/Information" id="link-id119be318">Information</a> Elements Ontology were already very close in design. Thus, it was relatively easy to merge the two and be left with only one ontology to support. Since then not only KDE but also <a href="http://strigi.sourceforge.net/" id="link-id123b63f0">Strigi</a> and <a href="http://projects.gnome.org/tracker/" id="link-id13d02a30">Tracker</a> are using the Nepomuk ontologies.</p> <p>At the Gran Canaria Desktop Summit I met some of the guys from Tracker and we tried to come up with a plan to create a joint project to maintain the ontologies. This got off to a rough start as nobody really felt responsible. So I simply took the initiative and released the <a href="http://sourceforge.net/projects/oscaf/files/" id="link-id148d7078">shared-desktop-ontologies</a> version 0.1 in November 2009. The result was a s***-load of hate-mails and bug reports due to me breaking KDE build. But in the end it was worth it. Now the package is established and other projects can start to pick it up to create data compatible to the Nepomuk system and Tracker.</p> <p>Today the ontologies (and the shared-desktop-ontologies package) are maintained in the <a href="http://sourceforge.net/apps/trac/oscaf/" id="link-id10ce1038">Oscaf project at Sourceforge</a>. The situation is far from perfect but it is a good start. If you need specific properties in the ontologies or are thinking about creating one for your own application - come and join us in the <a href="http://sourceforge.net/apps/trac/oscaf/report/1" id="link-id11413910">bug tracker</a>…</p> <h2>Timeline KIO Slave</h2> <p>It was at the Akonadi meeting that Will Stephenson and myself got into talking about mimicking some <a href="http://live.gnome.org/Zeitgeist" id="link-id116888b0">Zeitgeist</a> functionality through Nepomuk. Basically it meant gathering some data when opening and when saving files. We quickly came up with a hacky patch for KIO and <a href="http://api.kde.org/4.x-api/kdelibs-apidocs/kio/html/classKFileDialog.html" id="link-id13637348">KFileDialog</a> which covered most cases and allowed us to track when a file was modified and by which application. This little experiment did not leave that state though (it will, however, this year) but another one did: Zeitgeist also provides a fuse filesystem which allows to browse the files by modification dates. Well, whatever fuse can do, KIO can do as well. <a href="http://trueg.wordpress.com/2009/10/26/just-another-way-of-browsing-your-files/" id="link-id13cf58c0">Introducing the timeline:/ KIO slave</a> which gives a calendar view onto your files.</p> <p> <a href="http://trueg.wordpress.com/2009/10/26/just-another-way-of-browsing-your-files/" id="link-id113d4988"><img class="aligncenter size-medium wp-image-208" title="timeline-october" src="http://trueg.files.wordpress.com/2009/10/timeline-october.png?w=300&h=235" alt="" width="300" height="235" /> </a> </p> <h2>Tips And Tricks</h2> <p>Well, I thought I would mention the <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/TipsAndTricks" id="link-id116357d0">Tips And Tricks</a> section I wrote for the <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk" id="link-id14473520">techbase</a>. It might not be a big deal but I think it contains some valuable information in case you are using Nepomuk as a developer.</p> <h2>Google Summer Of Code 2009</h2> <p>This time around I had the privilege to <a href="http://trueg.wordpress.com/2009/04/22/nepomuk-in-the-summer-x2/" id="link-id116b0cf8">mentor two students</a> in the Google Summer of Code. Alessandro Sivieri and Adam Kidder did outstanding work on <a href="http://trueg.wordpress.com/2009/08/25/gsoc-wrap-up-part-1/" id="link-id13c9f2f8">Improved Virtual Folders</a> and the <a href="http://trueg.wordpress.com/2009/08/28/gsoc-wrap-up-part-2/" id="link-id123bac00">Smart File Dialog</a>.</p> <p>Adam’s work lead me to some heavy improvements in the Nepomuk KIO slaves myself which I only finished this week (more details on that coming up). Alessandro continued his work on faceted file browsing in KDE and created:</p> <h3>Sembrowser</h3> <p>Alessandro is following up on his work to make faceted file browsing a reality in 2010 (and KDE SC 4.5). Since it was too late to get faceted browsing into KDE SC 4.4 he is working on <a href="http://www.kde-apps.org/content/show.php/Sembrowser?content=117692" id="link-id117c67d0">Sembrowser</a>, a stand-alone faceted file browser which will be the grounds for experiments until the code is merged into Dolphin.</p> <div id="attachment_238" class="wp-caption aligncenter" style="width: 310px"> <a href="http://trueg.files.wordpress.com/2010/01/sembrowser.png" id="link-id13aa8e80"><img class="size-medium wp-image-238" title="sembrowser" src="http://trueg.files.wordpress.com/2010/01/sembrowser.png?w=300&h=189" alt="" width="300" height="189" /> </a> <p class="wp-caption-text">Faceted Browsing in KDE with Sembrowser</p> </div> <h2>Nepomuk Workshops</h2> <p>In 2009 I organized the first Nepomuk workshop in Freiburg, Germany. And also the second one. While <a href="http://trueg.wordpress.com/2009/06/23/the-first-nepomuk-workshop-its-a-wrap/" id="link-id13b553e0">I reported properly on the first one</a> I still owe a summary for the second one. I will get around to that - sooner or later. ;)</p> <h2>CMake Magic</h2> <p> <a href="http://soprano.sourceforge.net/" id="link-id148bfad8">Soprano</a> gives us a nice command line tool to create a C++ namespace from an ontology file: <a href="http://soprano.sourceforge.net/apidox/trunk/soprano_devel_tools.html" id="link-iddac3b58">onto2vocabularyclass</a>. It produces nice convenience namespaces like <a href="http://soprano.sourceforge.net/apidox/trunk/namespaceSoprano_1_1Vocabulary_1_1NAO.html" id="link-idfd4b970">Soprano::Vocabulary::NAO</a>. Nepomuk adds another tool named <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/ResourceGenerator" id="link-id11b60200">nepomuk-rcgen</a>. Both were a bit clumsy to use before. Now we have nice cmake macros which make it very simple to use both.</p> <p>See the <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/ResourceGenerator" id="link-id11963490">techbase article</a> on how to use the new macros.</p> <h2>Bangarang</h2> <p>Without my <a href="http://dbpedia.org/resource/Knowledge" id="link-iddcbd7c8">knowledge</a> (imagine that!) Andrew Lake created an amazing new media player named <a href="http://bangarangkde.wordpress.com/" id="link-id113d9500">Bangarang</a> - <em>a Jamaican word for noise, chaos or disorder.</em> This player is Nepomuk-enabled in the sense that it has a media library which lets you browse your media files based on the Nepomuk data. It remembers the number of times a song or a video has been played and when it was played last. It allows to add detail such as the TV series name, season, episode number, or actors that are in the video - all through Nepomuk (I hope we will soon get <a href="http://thetvdb.com/" id="link-id1154d7a0">tvdb</a> integration).</p> <div id="attachment_242" class="wp-caption aligncenter" style="width: 310px"> <a href="http://trueg.files.wordpress.com/2010/01/bangarang2.png" id="link-id148bcdb8"><img class="size-medium wp-image-242" title="bangarang2" src="http://trueg.files.wordpress.com/2010/01/bangarang2.png?w=300&h=208" alt="" width="300" height="208" /> </a> <p class="wp-caption-text">Edit metadata directly in Bangarang</p> </div> <div id="attachment_243" class="wp-caption aligncenter" style="width: 303px"> <a href="http://trueg.files.wordpress.com/2010/01/bangarang-dolphin-fileinfo.png" id="link-id11c70a48"><img class="size-full wp-image-243" title="bangarang-dolphin-fileinfo" src="http://trueg.files.wordpress.com/2010/01/bangarang-dolphin-fileinfo.png?w=293&h=242" alt="" width="293" height="242" /> </a> <p class="wp-caption-text">Dolphin showing TV episode metadata created by Bangarang</p> </div> <div id="attachment_245" class="wp-caption aligncenter" style="width: 310px"> <a href="http://trueg.files.wordpress.com/2010/01/bangarang-dolphin-search.png" id="link-id149200f8"><img class="size-medium wp-image-245" title="bangarang-dolphin-search" src="http://trueg.files.wordpress.com/2010/01/bangarang-dolphin-search.png?w=300&h=212" alt="" width="300" height="212" /> </a> <p class="wp-caption-text">And of course searching for it works, too...</p> </div> <div id="attachment_244" class="wp-caption aligncenter" style="width: 310px"> <a href="http://trueg.files.wordpress.com/2010/01/bangarang1.png" id="link-id114f7c80"><img class="size-medium wp-image-244" title="bangarang1" src="http://trueg.files.wordpress.com/2010/01/bangarang1.png?w=300&h=225" alt="" width="300" height="225" /> </a> <p class="wp-caption-text">And it is pretty, too...</p> </div> <p>I am especially excited about this since finally applications not written or mentored by me start contributing Nepomuk data.</p> <h2>Gran Canaria Desktop Summit</h2> <p>2009 was also the year of the first Gnome-KDE joint-conference. Let me make a bulletin for completeness and refer to <a href="http://trueg.wordpress.com/2009/07/13/gran-canaria-desktop-summit-2009-the-nepomuk-perspective/" id="link-id143ff668">my previous blog post reporting on my experiences on the island</a>.</p> <p>Well, that was by far not all I did in 2009 but I think I covered most of the important topics. And after all it is ‘just a blog entry’ - there is no need for completeness. Thanks for reading.</p> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/trueg.wordpress.com/232/" id="link-id118a1950"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/trueg.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/trueg.wordpress.com/232/" id="link-id148ffb08"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/trueg.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/trueg.wordpress.com/232/" id="link-id13c65a88"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/trueg.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/trueg.wordpress.com/232/" id="link-id119b85a0"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/trueg.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/trueg.wordpress.com/232/" id="link-id13f5d6b8"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/trueg.wordpress.com/232/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=trueg.wordpress.com&blog=6648236&post=232&subd=trueg&ref=&feed=1" /> </div>"
Virtuoso Chronicles from the Field: Nepomuk, KDE, and the quest for a sophisticated RDF DBMS.
2010-02-01T14:02:55Z
2010-02-01T09:02:55-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1601
<p>Thanks to the <a href="http://www.techcrunch.com/" id="link-id114eb070">TechCrunch</a> post titled: <a href="http://www.techcrunch.com/2010/01/01/ten-technologies-2010/" id="link-id1146e550">Ten Technologies That Will Rock 2010</a>, I've been able to quickly construct a derivative post that condenses the ten item list down to a Single Technology That Will Rock 2010 :-)</p> <p>Sticking with the TechCrunch layout, here is why all roads simply lead to <a href="http://dbpedia.org/resource/Linked_Data" id="link-id11141d50">Linked Data</a> come 2010 and beyond: </p> <ol> <li> <strong>The Tablet: </strong>a new form factor addition re. <a href="http://dbpedia.org/resource/Internet" id="link-id13f09418">Internet</a> and <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> application hosts which is just another way of saying: Linked Data will be accessible from Tablet applications.</li> <li> <strong>Geo:</strong> GPS chips are now standard features of mobile phones, so <a href="http://www.techcrunch.com/2009/12/23/location-2010/" id="link-id112cfdd0">geolocation</a> is increasingly becoming a necessary feature for any killer app. Thus, GeoSpatial Linked Data and GeopSpatial Queries are going to be a critical success factor for any endeavor that seeks to engage mobile applications developers and ultimately their end-users. Basiacally, you want to be able to perform Esoteric Search from these devices of the form: Find Vendors of a Camcorder (e.g., with a Zoom Factor: Weight Ratio of X) within a 2km Radius of my current location. Or how many items from my WishList are available from a Vendor within a 2km radius of my current location. Conversely, provide Vendors with the ability to spot potential Customers within a 2km of a given "clicks & mortar" location (e.g. BestBuy store).</li> <li> <strong>Realtime Search: </strong>Rich Structured Profiles that leverage standards such as <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id140ece38">FOAF</a> and <a href="http://blogs.sun.com/bblfish/entry/foaf_ssl_creating_a_global" id="link-id11856318">FOAF+SSL</a> will enable Highly Personalized Realtime Search (HPRS) without compromisng privacy. Tecnically, this is about <a href="http://esw.w3.org/topic/WebID" id="link-id13ec6260">WebID</a>s securely bound to X.509 Certificates, providing access to verifiable and highly navigable Personal Profile <a href="http://dbpedia.org/resource/Data">Data</a> Spaces that also double as personal search index entry points.</li> <li> <strong>Chrome OS: </strong>Just another operating system for exploiting the burgeoning Web of Linked Data</li> <li> <strong>HTML5: </strong>Courtesy of <a href="http://dbpedia.org/resource/RDFa" id="link-id115b08f0">RDFa</a>, just another mechanism for exposing Linked Data by making HTML+RDFa a bona fide markup for <a href="http://dbpedia.org/resource/Metadata" id="link-id1195b070">metadata</a> (i.e., format for describing real world objects via their attribute-value graphs)</li> <li> <strong>Mobile Video:</strong> Simplifies the production and sharing of Video annotations (comments, reviews etc.) en route to creating rich Linked Discourse Data Spaces.</li> <li> <strong>Augmented Reality:</strong> Ditto</li> <li> <strong>Mobile Transactions:</strong> As per points 1&2 above, Vendor Discovery and Transaction Conusmation will increasingly be driven by high SDQ applications. The "Funnel Effect" (more choices based on individual preferences) will be a critical success factor for any one operating in the Mobile Transaction realm. Note, without Linked Data you cannot deliver scalable solutions that handle the combined requirements of: SDQ, "Funnel Effect", and Mobile Device form factor, will simply maginify the importance of Web accessible Linked Data.</li> <li> <strong>Android:</strong> An additional platform for items 1-8; basically, 2010 isn't going to be an iPhone only zone. Personally, this reminds <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id111ab5e8">me</a> of a battle from the past i.e., Microsoft vs Apple, re. desktop computing dominance. Google has studied history very well :-)</li> <li> <strong>Social CRM:</strong> this is simply about applying points 1-9 alongide the construction of Linked Data from eCRM Data Spaces.</li> </ol> <p>As I've stated in the past (across a variety of mediums), you cannot build applications that have long term value without addressing the following issues:</p> <ol> <li>Data Item or Object Identity</li> <li>Data Structure -- Data Models</li> <li>Data Representation -- Data Model <a href="http://dbpedia.org/resource/Entity" id="link-id1148eaf8">Entity</a> & Relationships Representation mechanism (as delivered by metadata oriented markup)</li> <li>Data Storage -- Database Management Systems</li> <li>Data Access -- Data Access Protocols </li> <li>Data Presentation -- How you present Views and Reports from Structured Data Sources</li> <li>Data Security -- Data Access Policies</li> </ol> <p>The items above basically showcase the very essence of the HTTP <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id1239af68">URI</a> abstraction that drives HTTP based Linked Data; which is also the basic payload unit that underlies <a href="http://dbpedia.org/resource/Representational_State_Transfer" id="link-id11489a98">REST</a>.</p> <h3>Conclusion</h3> <p>I simply hope that the next decade marks a period of broad appreciation and comprehension of Data Access, Integration, and Management issues on the parts of: application developers, integrators, analysts, end-users, and decision makers. Remember, without structured Data we cannot produce or share <a href="http://dbpedia.org/resource/Information" id="link-id13cb5040">Information</a>, and without Information, we cannot produce of share <a href="http://dbpedia.org/resource/Knowledge" id="link-id647abb0">Knowledge</a>.</p> <h3>Related</h3> <ul> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1567" id="link-id13fa3a20">HTTP URI Abstraction and Linked Data</a> </li> <li> <a href="http://www.dataflux.com/dfblog/?p=1458," id="link-id138f3ea8">First Law of Data Quality</a> </li> <li> <a href="http://walkingoncoals.blogspot.com/2009/12/whos-data-is-it-part-1.html" id="link-id13efccb8">Who's Data Is It?</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1442" id="link-id1355df68">Serendipitous Discovery Quotient</a> (SDQ)</li> <li> <a href="http://www.seangolliher.com/2009/linked-data/serendipitous-discovery-quotient-sdq-the-future-of-seo-or-an-abstract-concept/" id="link-id11217cb8">SDQ: The Future of SEO or an Abstract Concept?</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/oerling/weblog/Orri%20Erling%27s%20Blog/1587" id="link-id139cfbe0">SPARQL & GeoSpatial Indexing</a> (implications of <a href="http://dbpedia.org/resource/SPARQL" id="link-id13f51b78">SPARQL</a>-GEO)</li> <li> <a href="http://blog.jonudell.net/2009/09/09/talking-with-kingsley-idehen-about-mastering-your-own-search-index/" id="link-id13c5c248">Mastering Your Own Search Index</a> </li> <li> <a href="http://blog.jonudell.net/2009/11/23/talking-with-martin-hepp-about-solving-the-paradox-of-choice/" id="link-id135ba4d0">Solving the Paradox of Choice</a>.</li> </ul>
One Technology That Will Rock 2010 (Update 1)
2010-02-01T14:02:41Z
2010-02-01T09:02:41-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1600
<p>I've created a new discussion space that's squarely focused on the business development and marketing aspects of "HTTP based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id129e32d8">Linked Data" (Linked Data</a>). As its name indicates, It's a BOLD attempt to fill a VoiD. :-)</p> <h3>Background</h3> <p>A few months ago, <a href="http://blog.aldobucchi.com/#this" id="link-id1110eb30">Aldo Bucchi</a> posted a message to the <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData" id="link-id111d08a0">LOD</a> <a href="http://lists.w3.org/Archives/Public/public-lod/" id="link-id118b3778">mailing list</a> seeking a discussion space for more business and marketing oriented topic, in relation to Linked Data. At the time, my assumption was that the existing LOD mailing list served that purpose absolutely fine, but in due course I came to realize that Aldo's request had a much lager foundation than I initially suspected.</p> <h3>Historic Oversight</h3> <p>Linked Data, like its umbrella <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id16ceb618">Semantic Web</a> Project, has suffered from an inadvertent oversight on the parts of many of its enthusiasts (myself included): 100% of the discussion spaces are created by, geared towards, or dominated by researchers (from Academia primarily) and/or developers. Thus, at the very least, we've been operating in an echo chamber that only feed the existing void between the core community and those who are more interested in discussing business and marketing related topics.</p> <p>The new discussion space seeks to cover the following:</p> <ol> <li> Brainstorming Value Proposition Articulation</li> <li>War Story Exchanges</li> <li>Case Studies and Use-cases</li> <li>Market Research & Positioning (for instance Linked Data is killer technology that redefines <a href="http://dbpedia.org/resource/Data">Data</a> Integration, but none of the major research firms currently make that connection)</li>. </ol> <p>How Do I Join The Conversation? Simply sign up on the Google hosted <a href="http://groups.google.com/group/business-of-linked-data-bold" id="link-id129e4d08">BOLD mailing list</a>, introduce yourself (ideally), and then start conversing! :-)</p>
The Business Of Linked Data (BOLD) Discussion Space
2010-02-01T14:02:27Z
2010-02-01T09:02:27.000001-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1596
<p>I've created a new discussion space that's squarely focused on the business development and marketing aspects of "HTTP based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id129e32d8">Linked Data" (Linked Data</a>). As its name indicates, It's a BOLD attempt to fill a VoiD. :-)</p> <h3>Background</h3> <p>A few months ago, <a href="http://blog.aldobucchi.com/#this" id="link-id1110eb30">Aldo Bucchi</a> posted a message to the <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData" id="link-id111d08a0">LOD</a> <a href="http://lists.w3.org/Archives/Public/public-lod/" id="link-id118b3778">mailing list</a> seeking a discussion space for more business and marketing oriented topic, in relation to Linked Data. At the time, my assumption was that the existing LOD mailing list served that purpose absolutely fine, but in due course I came to realize that Aldo's request had a much lager foundation than I initially suspected.</p> <h3>Historic Oversight</h3> <p>Linked Data, like its umbrella <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id16ceb618">Semantic Web</a> Project, has suffered from an inadvertent oversight on the parts of many of its enthusiasts (myself included): 100% of the discussion spaces are created by, geared towards, or dominated by researchers (from Academia primarily) and/or developers. Thus, at the very least, we've been operating in an echo chamber that only feed the existing void between the core community and those who are more interested in discussing business and marketing related topics.</p> <p>The new discussion space seeks to cover the following:</p> <ol> <li> Brainstorming Value Proposition Articulation</li> <li>War Story Exchanges</li> <li>Case Studies and Use-cases</li> <li>Market Research & Positioning (for instance Linked Data is killer technology that redefines <a href="http://dbpedia.org/resource/Data" id="link-id0x1d491e90">Data</a> Integration, but none of the major research firms currently make that connection)</li>. </ol> <p>How Do I Join The Conversation? Simply sign up on the Google hosted <a href="http://groups.google.com/group/business-of-linked-data-bold" id="link-id129e4d08">BOLD mailing list</a>, introduce yourself (ideally), and then start conversing! :-)</p>
The Business Of Linked Data (BOLD) Discussion Space
2010-01-31T22:48:48Z
2010-01-31T17:48:48-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1591
<ol> <li> It isn't <a href="http://dbpedia.org/resource/World_Wide_Web" id="link-id115dfd68">World Wide Web</a> Specific (HTTP != World Wide Web)</li> <li> It isn't Open <a href="http://dbpedia.org/resource/Data">Data</a> Specific </li> <li> It isn't about "Free" (Beer or Speech) </li> <li> It isn't about Markup (so don't expect to grok it via "markup first" approach) </li> <li>It's about <a href="http://dbpedia.org/resource/Linked_Data" id="link-id13a6aa98">Hyperdata</a> - the use of HTTP and REST to deliver a powerful platform agnostic mechanism for Data Reference, Access, and Integration.</li> </ol> <p> When trying to understand HTTP based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id18aa1490">Linked Data</a>, especially if you're well versed in DBMS technology use (User, Power User, Architect, Analyst, DBA, or Programmer) think: <br /> </p> <ul> <li> Open Database Connectivity (<a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id1428fba0">ODBC</a>) without operating system, data model, or wire-protocol specificity or lock-in potential </li> <li> Java Database Connectivity (<a href="http://dbpedia.org/resource/Java_Database_Connectivity" id="link-id18d3c2a8">JDBC</a>) without programming language specificity </li> <li> <a href="http://dbpedia.org/resource/ADO.NET" id="link-id125725b8">ADO</a>.NET without .NET runtime specificity and .NET bound language specificity </li> <li> OLE-DB without Windows operating system & programming language specificity </li> <li> XMLA without XML format specificity - with Tabular and Multidimensional results formats expressible in a variety of data representation formats. </li> <li>All of the above scoped to the Record rather than Container level, with Generic HTTP scheme URIs associated with each Record, Field, and Field value (optionally) </li> </ul> <p>Remember the need for Data Access & Integration technology is the by product of the following realities:</p> <ol> <li> Human curated data is ultimately dirty, because: <ul> <li>our thick thumbs, inattention, distractions, and general discomfort with typing, make typos prevalent</li> <li>database engines exist for a variety of data models - Graph, Relational, Hierarchical;</li> <li>within databases you have different record container/partition names e.g. Table Names;</li> <li>within a database record container you have records that are really aspects of the same thing (different keys exist in a plethora of operational / line of business systems that expose aspects of the same <a href="http://dbpedia.org/resource/Entity" id="link-id13378338">entity</a> e.g., customer data that spans Accounts, CRM, ERP application databases);</li> <li>different field names (one database has "EMP" while another has "Employee") for the same record</li>.</ul> </li> <li>Units of measurement is driven by locale, the UK office wants to see sales in Pounds Sterling while the French office prefers Euros etc.</li> <li>All of the above is subject to <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id17e46398">context</a> halos which can be quite granular re. sensitivity e.g. staff travel between locations that alter locales and their roles; basically, profiles matters a lot.</li> </ol> <h3>Related</h3> <ul> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1364" id="link-id128f0fe8">ODBC and WODBC (Web Open Database Connectivity) Comparison</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1364" id="link-id1367cd18">Creating, Deploying, and Exploiting Linked Data Presentation</a> </li> <li> <a href="http://www.odata.org/" id="link-id122ab708">Open Data Protocol Project</a> </li> </ul>
5 Very Important Things to Note about HTTP based Linked Data
2010-02-01T14:00:56Z
2010-02-01T09:00:56-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1567
<h3> Situation Analysis</h3> <p> As the "<a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id12f96a00">Linked Data" meme</a> has gained momentum you've more than likely been on the receiving end of dialog with Linked Open <a href="http://dbpedia.org/resource/Data">Data</a> community members (myself included) that goes something like this:</p> <blockquote> <cite>"Do you have a <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id139252a0">URI</a>", "Get yourself a URI", "Give <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id140eab68">me</a> a de-referencable URI" etc..</cite> </blockquote> <p> And each time, you respond with a <a href="http://dbpedia.org/resource/Uniform_Resource_Locator" id="link-id112c1860">URL</a> -- which to the best of your <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> <a href="http://dbpedia.org/resource/Knowledge" id="link-id140b51c0">knowledge</a> is a bona fide URI. But to your utter confusion you are told: Nah! You gave me a Document URI instead of the URI of a real-world thing or object etc..</p> <h3> What's up with that?</h3> <p> Well our everyday use of the Web is an unfortunate conflation of two distinct things, which have Identity: Real World Objects (RWOs) & Address/Location of Documents (<a href="http://dbpedia.org/resource/Information" id="link-id144838b0">Information</a> bearing Resources).</p> <p> The "Linked Data" meme is about enhancing the Web by unobtrusively reintroducing its core essence: the generic HTTP URI, a vital piece of Web Architecture DNA. Basically, its about so realizing the full capabilities of the Web as a platform for Open Data Identification, Definition, Access, Storage, Representation, Presentation, and Integration.</p> <h3> What is a Real World Object?</h3> <p> People, Places, Music, Books, Cars, Ideas, Emotions etc..</p> <h3> What is a URI?</h3> <p> A Uniform Resource Identifier. A global identifier mechanism for network addressable data items. Its sole function is Name oriented Identification.</p> <h4> URI Generic Syntax</h4> <p> The constituent parts of a URI (from <a href="http://www.ietf.org/rfc/rfc2396.txt" id="link-id1180c700">URI Generic Syntax RFC</a>) are depicted below: <img src="http://virtuoso.openlinksw.com/images/generic_uri_syntax_image.png" /> </p> <h3> What is a URL?</h3> <p> A location oriented HTTP scheme based URI. The HTTP scheme introduces a powerful and inherent duality that delivers:</p> <ol> <li> Resource Address/Location Identifier</li> <li> Data Access mechanism for an Information bearing Resource (Document, File etc..)</li> </ol> <p> So far so good!</p> <h3> What is an HTTP based URI?</h3> <p> The kind of URI <a href="http://dbpedia.org/resource/Linked_Data" id="link-id11100a28">Linked Data</a> aficionados mean when they use the term: URI.</p> <p> An HTTP URI is an HTTP scheme based URI. Unlike a URL, this kind of HTTP scheme URI is devoid of any Web Location orientation or specificity. Thus, Its inherent duality provides a more powerful level of abstraction. Hence, you can use this form of URI to assign Names/Identifiers to Real World Objects (RWO). Even better, courtesy of the Identity/Address duality of the HTTP scheme, a single URI can deliver the following:</p> <ol> <li> RWO Identfier/Name</li> <li> RWO Metadata document Locator (courtesy of URL aspect)</li> <li> Negotiable Representation of the Located Document (courtesy of HTTP's content negotiation feature).</li> </ol> <h3> What is Metadata?</h3> <p> Data about Data. Put differently, data that describes other data in a structured manner.</p> <h3> How Do we Model Metadata?</h3> <p> The predominant model for metadata is the <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id11193d30">Entity</a>-Attribute-Value + Classes & Relationships model (<a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id11725710">EAV</a>/CR). A model that's been with us since the inception of modern computing (long before the Web).</p> <h3> What about RDF?</h3> <p> The Resource Description Framework (RDF) is a framework for describing Web addressable resources. In a nutshell, its a framework for adding Metadata bearing Information Resources to the current Web. Its comprised of:</p> <ol> <li> Entity-Attribute-Value (aka. Subject-Predictate-Object) plus Classes & Relationships (<a href="http://dbpedia.org/resource/Data_dictionary" id="link-id138df0f8">Data Dictionaries</a> e.g., <a href="http://dbpedia.org/resource/Web_Ontology_Language" id="link-id116bf590">OWL</a>) metadata model</li> <li> A plethora of instance data representation formats that include: <a href="http://dbpedia.org/resource/RDFa" id="link-id13360b90">RDFa</a> (when doing so within (X)HTML docs), Turtle, N3, TriX, RDF/XML etc.</li> </ol> <h3> What's the Problem Today?</h3> <p> The ubiquitous use of the Web is primarily focused on a Linked Mesh of Information bearing Documents. URLs rather than generic HTTP URIs are the prime mechanism for Web tapestry; basically, we use URLs to conduct Information -- which is inherently subjective -- instead of using HTTP URIs to conduct "Raw Data" -- which is inherently objective.</p> <blockquote> <strong>Note:</strong> Information is "data in <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id1395ca50">context</a>", it isn't the same thing as "Raw Data". Thus, if we can link to Information via the Web, why shouldn't we be able to do the same for "Raw Data"?</blockquote> <h3> How Does the Link Data <a href="http://dbpedia.org/resource/Meme" id="link-id1160ab70">meme</a> solve the problem?</h3> <p> The meme simply provides a set of guidelines (best practices) for producing Web architecture friendly metadata. Meaning: when producing EAV/CR model based metadata, endow Subjects, their Attributes, and Attribute Values (optionally) with HTTP URIs. By doing so, a new level of Link Abstraction on the Web is possible i.e., "Data Item to Data Item" level links (aka <a href="http://dbpedia.org/resource/Linked_Data" id="link-id137a78a0">hyperdata</a> links). Even better, when you de-reference a RWO hyperdata link you end up with a negotiated representations of its metadata.</p> <h3> Conclusion</h3> <p> Linked Data is ultimately about an HTTP URI for each item in the <a href="http://dbpedia.org/resource/Data_hierarchy" id="link-id1393c3e0">Data Organization Hierarchy</a> :-)</p> <h3> Related</h3> <ol> <li> <a href="http://lists.w3.org/Archives/Public/www-tag/2009Aug/0000.html" id="link-id140c1e78">History of how "Resource" became part of URI</a> - historic account by <a href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id1172b128">TimBL</a> </li> <li> <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id1338cbd0">Linked Data Design Issues Document</a> - <a href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id13536ad8">TimBL</a>'s initial Linked Data Guide</li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1561" id="link-id116c1af8">Linked Data Rules Simplified</a> - My attempt at simplifying the Linked Data Meme without <a href="http://dbpedia.org/resource/SPARQL" id="link-id116c3b40">SPARQL</a> & RDF distraction</li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1547" id="link-id135dd1b8">Linked Data & Identity</a> - another related post</li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1565" id="link-id134afc50">The Linked Data Meme's Value Proposition</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1323" id="link-id1251e9248">So What Does "HREF" stand for anyway?</a> </li> <li> <a href="http://delicious.com/kidehen/identifier_scheme" id="link-id14cc7e18">My Del.icio.us hosted Bookmark Data Space for Identity Schemes</a> </li> <li> <a href="http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html" id="link-id115a3748">TimBL's Ted Talk re. "Raw Linked Data"</a> </li> <li> <a href="http://www.infoq.com/articles/roa-rest-of-rest" id="link-id11b25558">Resource Oriented Architecture</a> </li> <li> <a href="http://www.blipfoto.com/view.php?id=465380&month=2&year=2010" id="link-id139824c8">More Famous Than Simon Cowell</a> .</li> </ol>
The URI, URL, and Linked Data Meme's Generic HTTP URI (Updated)
2010-03-28T16:19:00Z
2010-03-28T12:19:00-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1565
<h3>What is <a href="http://dbpedia.org/resource/Linked_Data" id="link-id138c9aa8">Linked Data</a>?</h3> <p> The primary topic of a <a href="http://dbpedia.org/resource/Meme" id="link-id12f86100">meme</a> penned by <a href="http://www.w3.org/People/Berners-Lee/card#i" id="link-id115b4c98">TimBL</a> in the form of a <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id1333f300">Design Issues Doc</a> (note: this is how TimBL has shared his thoughts since the <a href="http://www.w3.org/DesignIssues/" id="link-id1128a1d0">Beginning of the Web</a>). </p> <p> There are a number of dimensions to the meme, but its primary purpose is the reintroduction of the HTTP <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id13c43cb8">URI</a> -- a vital component of the Web's core architecture. </p> <h3> What's Special about HTTP URIs?</h3> <p> They possess an intrinsic duality that combines persistent and unambiguous <a href="http://dbpedia.org/resource/Data">Data</a> Identity with platform & representation format independent Data Access. Thus, you can use a string of characters that look like a contemporary Web <a href="http://dbpedia.org/resource/Uniform_Resource_Locator" id="link-id119cd8a0">URL</a> to unambiguously achieve the following: </p> <ol> <li>Identity or Name Anything of Interest</li> <li>Describe Anything of Interest by associating the Description Subject's Identity with a constellation of Attribute and Value pairs (technically: an <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id1133e8a8">Entity</a>-Attribute-Value or Subject-Predicate-Object graph)</li> <li>Make the Description of Named Things of Interest discoverable on the Web by implicitly binding the aforementioned to Documents that hold their descriptions (technically: metadata documents or <a href="http://dbpedia.org/resource/Information" id="link-id1391da40">information</a> resources)</li> </ol> <h3>What's the basic value proposition of the <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id113bb690">Linked Data meme</a>?</h3> <p>Enabling more productive use of the Web by users and developers alike. All of which is achieved by tweaking the Web's Hyperlinking feature such that it now includes Hypertext and <a href="http://dbpedia.org/resource/Linked_Data" id="link-id1337a3f0">Hyperdata</a> as link types.</p> <p>Note: Hyperdata Linking is simply what an HTTP URI facilitates.</p> <p>Examples problems solved by injecting Linked Data into the Web:</p> <ol> <li>Federated Identity by enabling Individuals to unambiguously Identify themselves (Profiles++) courtesy of existing <a href="http://dbpedia.org/resource/Internet" id="link-id13926e28">Internet</a> and Web protocols (e.g., <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id13646ec8">FOAF</a>+SSL's WebIDs which combine Personal Identity with X.509 certificates and HTTPs based client side certification)</li> <li>Security and Privacy challenge alleviation by delivering a mechanism for policy based data access that feeds off federated individual identity and social network (graph) traversal</li> <li>Spam Busting via the above</li>. <li> Increasing the Serendipitous Discovery Quotient (SDQ) of Web accessible resources by embedding Rich Metadata into (X)HTML Documents e.g., structured descriptions of your "WishLists" and "OfferLists" via a common set of terms offered by vocabularies such as <a href="http://www.heppnetz.de/projects/goodrelations/" id="link-id1199b4d0">GoodRelations</a> and <a href="http://dbpedia.org/resource/SIOC" id="link-id1334cfb0">SIOC</a> </li> <li>Coherent integration of disparate data across the Web and/or within the Enterprise via "Data Meshing" rather than "Data Mashing"</li> <li>Moving beyond imprecise statistically driven "Keyword Search" (e.g. Page Rank) to "Precision Find" driven by typed link based <a href="http://dbpedia.org/resource/Entity" id="link-id135f6fe8">Entity</a> Rank plus Entity Type and Entity Property filters.</li> </ol> <h3>Conclusion</h3> <p>If all of the above still falls into the technical mumbo-jumbo realm, then simply consider Linked Data as delivering Open Data Access in granular form to Web accessible data -- that goes beyond data containers (documents or files).</p> <p>The value proposition of Linked Data is inextricably linked to the value proposition of the <a href="http://dbpedia.org/resource/World_Wide_Web" id="link-id1356f5c0">World Wide Web</a>. This is true, because the Linked Data meme is ultimately about an enhancement of the current Web; achieved by reintroducing its architectural essence -- in new <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id11300828">context</a> -- via a new level of link abstraction, courtesy of the Identity and Access duality of HTTP URIs.</p> <p>As a result of Linked Data, you can now have Links on the Web for a Person, Document, Music, Consumer Electronics, Products & Services, Business Opening & Closing Hours, Personal "WishLists" and "OfferList", an Idea, etc.. in addition to links for Properties (Attributes & Values) of the aforementioned. Ultimately, all of these links will be indexed in a myriad of ways providing the substrate for the next major period of Internet & Web driven innovation, within our larger human-ingenuity driven innovation continuum.</p> <h3>Related</h3> <ul> <li> <a href="http://www.ebusiness-unibw.org/wiki/GoodRelations#Recipes_and_Examples" id="link-id11386648">Recipes for Describing Your Business and its Offerings using the GoodRelations Vocabulary / Schema</a> </li> <li> <a href="http://slidesix.com/view/SolvingRealProblemsUsingLinkedData" id="link-id13658ee0">Solving Real Problems with RDF based Linked Data</a> </li> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=linked%20data&type=text&output=html" id="link-id1175a650">Other Linked Data Posts from this Blog oriented Linked Data Space</a> (goes back a few years!)</li> <li>Various practical <a href="http://delicious.com/kidehen/linked_data_demo" id="link-id13390cf8">Linked Data demo links from my Del.icio.us Bookmark oriented Data Space</a> </li> <li> <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id132cda80">My personal WebID</a> which is conduit to a Linked Data mesh covering vast variety of things I've opted to share with others via the Web (best viewed using a Linked Data aware User Agent like ODE).</li> </ul>
Exploring the Value Proposition of Linked Data
2009-07-24T12:20:01Z
2009-07-24T08:20:01-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1522
<p>Yesterday, I stumbled across an <a href="http://www.ebizq.net" id="link-id13e41be8">ebiz</a> article by <a href="http://www.ebizq.net/MT4/mt-cp.cgi?__mode=view&blog_id=43&id=16" id="link-id11c080a0">David Linthicum</a> titled:<a href="http://www.ebizq.net/blogs/linthicum/2009/01/rdf_and_data_integration.php" id="link-id13620940"> RDF & Data Integration</a>. Naturally, I read it, and while reading encountered a number of inaccuracies that compelled <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id152f4828">me</a> to comment on the post. </p> <p>Today, I revisited the same article -- and to my shock and horror -- my comments do not exist (note: the site did accept my comments yesterday!). Even more frustrating for me, I now have to expend time I don't have re-writing my comments due to the depth and danger of the inaccuracies in this post re. RDF in general.</p> <h3>Important Note to ebiz and David: </h3> <p>Please look into what happened to my comments. It's too early for me to conclude that subjective censorship is a play on the <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> -- which isn't a hard copy journalistic format style of platform where editors get away with such shenanigans. The Web is a sticky database, and outer joining is well and truly functional (meaning: exclusion and omission ultimately come back to bite via full outer join query results against the Web DB).</p> <p>By the way, if you publish the comments I made to the post (yesterday), I will add a note to this post, accordingly.</p> <p>Yes! David just confirmed to me via <a href="http://linkeddata.uriburner.com/about/html/http://dbpedia.org/resource/Category:Data_modeling" id="link-id15293c20">Twitter</a> that this is yet another comment system related issue and absolutely no intent to censor etc. His words <a href="http://linkeddata.uriburner.com/about/html/http://linkeddata.uriburner.com/about/rdf/http://twitter.com/DavidLinthicum/status/1159201301%23this" id="link-id14e5ac98">Twervatim</a> :-) </p> <p>For sake of clarity, I've itemized the inaccuracies and applied my correction comments (inline) accordingly:</p> <blockquote> <h3>Inaccuracy #1:</h3> <p>Resource Description Framework (RDF), a part of the XML story, provides interoperability between applications that exchange <a href="http://dbpedia.org/resource/Information" id="link-id15f48080">information</a>. </p> <h3>Correction #1: </h3> <p>RDF and XML are not inextricably linked in any way. RDF is part Data Model (<a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id119a7300">EAV</a>/CR style Graph) with associated markup and data serialization formats that include: N3, Turtle, TriX, RDF/XML etc.</p> <h3>Inaccuracy #2:</h3> <p>RDF uses XML to define a foundation for processing metadata and to provide a standard metadata infrastructure for both the Web and the enterprise. </p> <h3>Correction #2: </h3> <p>RDF/XML is an XML based markup and data serialization format. As a markup language it can be used for creating RDF model records/statements (using Subject, Predicate, Object or <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id15120c28">Entity</a>, Attribute, Value). As a serialization format, it provides a mechanism for marshaling RDF data across data managers and data consumers.</p> <h3>Inaccuracy #3:</h3> <p>The difference between the two is that XML is used to transport data using a common format, while RDF is layered on top of XML defining a broad category of data. </p> <h3>Correction #3:</h3> <p>See earlier corrections above.</p> <h3>Inaccuracy #4:</h3> <p>When the XML data is declared to be of the RDF format, applications are then able to understand the data without understanding who sent it. </p> <h3>Correction #4:</h3> <p>You do not declare data to be of RDF format. RDF isn't a format it is a data model (as stated above). You can "up lift" or map data from XML to RDF (hierarchical to graph model mapping). Likewise you can "down shift" or map data from RDF to XML (example: <a href="http://dbpedia.org/resource/SPARQL" id="link-id146966c0">SPARQL</a> SELECT query patterns "down shift" to SPARQL Results XML, which isn't RDF/XML, while keeping access to graphs via URIs or <a href="http://dbpedia.org/resource/Entity" id="link-id14282528">Entity</a> Identifiers that reside within the serialization).</p> <h3>Inaccuracy #5:</h3> <p>RDF extends the XML model and syntax to be specified for describing either resources or a collection of information. (XML points to a resource in order to scope and uniquely identify a set of properties known as the schema.).</p> <h3>Correction #5:</h3> <p>See earlier comments. </p> </blockquote> <p>The single accurate paragraph in this ebiz article lies right at the end and it states the following:</p> <blockquote> <cite>"I've always thought RDF has been underutilized for data integration, and it's really an old standard. Now that we're focused on both understanding and integrating data, perhaps RDF should make a comeback."</cite> </blockquote> <h3>Related:</h3> <ul> <li> <a href="http://www.w3.org/2001/sw/SW-FAQ#whrdfxml" id="link-id1534cdc8">Semantic Web FAQ fragment re. RDF and XML</a> </li> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=rdf%20data%20integration&type=text&output=html" id="link-id15a7dbc0">Various posts re. RDF and Data Integration</a> from this <a href="http://dbpedia.org/resource/Blog" id="link-id15da4618">Blog</a> <a href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id1120d810">Data Space</a>.</li> </ul>
ebiz RDF & Data Integration Article Retort
2009-01-29T21:25:58Z
2009-01-29T16:25:58-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1520
<p> As the world works it way through a "once in a generation" economic crisis, the long overdue downgrade of the <a href="http://dbpedia.org/resource/Relational_database_management_system" id="link-id15750540">RDBMS</a>, from its pivotal position at the apex of the <a href="http://dbpedia.org/resource/Data" id="link-id0x24ea3650">data</a> access and data management pyramid is nigh.</p> <h3>What is the Data Access, and Data Management Value Pyramid?</h3> <p> As depicted below, a top-down view of the data access and data management value chain. The term: apex, simply indicates value primacy, which takes the form of a data access API based entry point into a DBMS realm -- aligned to an underlying data model. Examples of data access APIs include: Native Call Level Interfaces (CLIs), <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id11c254c0">ODBC</a>, <a href="http://dbpedia.org/resource/Java_Database_Connectivity" id="link-id149b16a8">JDBC</a>, <a href="http://dbpedia.org/resource/ADO.NET" id="link-id11451eb0">ADO</a>.NET, <a href="http://dbpedia.org/resource/OLE_DB" id="link-id15b02478">OLE-DB</a>, <a href="http://dbpedia.org/resource/XML_for_Analysis" id="link-id1181fa10">XMLA</a>, and <a href="http://dbpedia.org/resource/World_Wide_Web" id="link-id0x1f8394a8">Web</a> Services.</p> See: <a href="http://virtuoso.openlinksw.com/images/Agility_Value_Factors_Pyramid.png" id="link-id146cadd8"> AVF Pyramid Diagram.</a> <p> The degree to which ad-hoc views of data managed by a DBMS can be produced and dispatched to relevant data consumers (e.g. people), without compromising concurrency, data durability, and security, collectively determine the "Agility Value Factor" (AVF) of a given DBMS. Remember, agility as the cornerstone of environmental adaptation is as old as the concept of evolution, and intrinsic to all pursuits of primacy. </p> <p>In simpler business oriented terms, look at AVF as the degree to which DBMS technology affects the ability to effectively implement "Market Leadership Discipline" along the following pathways: innovation, operation excellence, or customer intimacy. </p> <h3>Why has RDBMS Primacy has Endured?</h3> <p> Historically, at least since the late '80s, the RDBMS genre of DBMS has consistently offered the highest AVF relative to other DBMS genres en route to primacy within the value pyramid. The desire to improve on paper reports and spreadsheets is basically what DBMS technology has fundamentally addressed to date, even though conceptual level interaction with data has never been its forte.</p> See: <a href="http://virtuoso.openlinksw.com/images/Old_RDBMS_Primacy_Pyramid.png" id="link-id134dab90"> RDBMS Primacy Diagram.</a> <p> For more then 10 years -- at the very least -- limitations of the traditional RDBMS in the realm of conceptual level interaction with data across diverse data sources and schemas (enterprise, Web, and <a href="http://dbpedia.org/resource/Internet" id="link-id116001c0">Internet</a>) has been crystal clear to many RDBMS technology practitioners, as indicated by some of the quotes excerpted below:</p> <blockquote> <cite> <p> "Future of Database Research is excellent, but what is the future of data?" </p> "..it is hard for <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id14932398">me</a> to disagree with the conclusions in this report. It captures exactly the right thoughts, and should be a must read for everyone involved in the area of databases and database research in particular." <p>-- <a href="http://jhingran.typepad.com/anant_jhingrans_musings/" id="link-id11334c50">Dr. Anant Jingran</a>, CTO, IBM <a href="http://dbpedia.org/resource/Information" id="link-id150c7970">Information</a> Management Systems, commenting on the <a href="http://db.cs.berkeley.edu/claremont/" id="link-id11c3b408">2007 RDBMS technology retreat</a> attended by a number of key DBMS technology pioneers and researchers.</p> </cite> </blockquote> <blockquote> <cite> <p> "<a href="http://www.databasecolumn.com/2007/09/one-size-fits-all.html" id="link-id15c14f08">One size fits all: A concept whose time has come and gone</a> </p> <p> </p> <ol> <li> They are direct descendants of System R and <a href="http://dbpedia.org/resource/Ingres" id="link-id146da780">Ingres</a> and were architected more than 25 years ago</li> <li> They are advocating "one size fits all"; i.e. a single engine that solves all DBMS needs. </li> </ol> <p>-- Prof. <a href="http://en.wikipedia.org/wiki/Michael_Stonebraker" id="link-id145c4e28">Michael Stonebreaker</a>, one of the founding fathers of the RDBMS industry.</p> </cite> </blockquote> <p>Until this point in time, the requisite confluence of "circumstantial pain" and "open standards" based technology required to enable an objective "compare and contrast" of RDBMS engine virtues and viable alternatives hasn't occurred. Thus, the RDBMS has endured it position of primacy albeit on a "one size fits all basis". </p> <h4>Circumstantial Pain</h4> <p> As mentioned earlier, we are in the midst of an economic crisis that is ultimately about a consistent inability to connect dots across a substrate of interlinked data sources that transcend traditional data access boundaries with high doses of schematic heterogeneity. Ironically, in a era of the dot-com, we haven't been able to make meaningful connections between relevant "real-world things" that extend beyond primitive data hosted database tables and content management style document containers; we've struggled to achieve this in the most basic sense, let alone evolve our ability to connect inline with the <a href="http://www.vldb2007.org/program/slides/s1161-brodie.pdf" id="link-id11a0dcf0">exponential rate at which the Internet & Web are spawning "universes of discourse" (data spaces) that emanate from user activity</a> (within the enterprise and across the Internet & Web). In a nutshell, we haven't been able to upgrade our interaction with data such that "conceptual models" and resulting "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id12da4b00">context</a> lenses" (or facets) become concrete; by this I mean: real-world <a href="http://dbpedia.org/resource/Entity" id="link-id146a48a8">entity</a> interaction making its way into the computer realm as opposed to the impedance we all suffer today when we transition from conceptual model interaction (real-world) to logical model interaction (when dealing with RDBMS based data access and data management). </p> <p>Here are some simple examples of what I can only best describe as: "critical dots unconnected", resulting from an inability to interact with data conceptually:</p> <strong>Government (Globally) -</strong> <p> Financial regulatory bodies couldn't effectively discern that a <a href="http://dbpedia.org/resource/Credit_default_swap" id="link-id115ba0e0">Credit Default Swap</a> is an Insurance policy in all but literal name. And in not doing so the cost of an unregulated <a href="http://dbpedia.org/resource/Insurance" id="link-id158d4960">insurance policy</a> laid the foundation for exacerbating the toxicity of fatally flawed mortgage backed securities. Put simply: a flawed insurance policy was the fallback on a toxic security that financiers found exotic based on superficial packaging.</p> <strong>Enterprises - </strong> <p> Banks still don't understand that capital really does exists in tangible and intangible forms; with the intangible being the variant that is inherently dynamic. For example, a tech companies intellectual capital far exceeds the value of fixture, fittings, and buildings, but you be amazed to find that in most cases this vital asset has not significant value when banks get down to the nitty gritty of debt collateral; instead, a buffer of flawed securitization has occurred atop a borderline static asset class covering the aforementioned buildings, fixtures, and fittings. </p> <p> In the general enterprise arena, IT executives continued to "rip and replace" existing technology without ever effectively addressing the timeless inability to connect data across disparate data silos generated by internal enterprise applications, let alone the broader need to mesh data from the inside with external data sources. No correlations made between the growth of buzzwords and the compounding nature of data integration challenges. It's 2009 and only a miniscule number of executives dare fantasize about being anywhere within distance of the: relevant information at your fingertips vision. </p> <p> Looking more holistically at data interaction in general, whether you interact with data in the enterprise space (i.e., at work) or on the Internet or Web, you ultimately are delving into a mishmash of disparate computer systems, applications, service (Web or SOA), and databases (of the RDBMS variety in a majority of cases) associated with a plethora of disparate schemas. Yes, but even today "rip and replace" is still the norm pushed by most vendors; pitting one mono culture against another as exemplified by irrelevances such as: FOSS/LAMP vs Commercial or Web vs. Enterprise, when none of this matters if the data access and integration issues are recognized let alone addressed (see: <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1497?sid=0df0294caee8b37925c6a888bbbca136&realm=wa" id="link-id15c27300">Applications are Like Fish and Data Like Wine</a>). </p> <p> Like the current credit-crunch, exponential growth of data originating from disparate application databases and associated schemas, within shrinking processing time frames, has triggered a rethinking of what defines data access and data management value today en route to an inevitable RDBMS downgrade within the value pyramid.</p> <h3>Technology</h3> <p>There have been many attempts to address real-world modeling requirements across the broader DBMS community from Object Databases to Object-Relational Databases, and more recently the emergence of simple <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id1128dad0">Entity</a>-Attribute-Value model DBMS engines. In all cases failure has come down to the existence of one or more of the following deficiencies, across each potential alternative:</p> <ol> <li>Query language standardization - nothing close to <a href="http://dbpedia.org/resource/SQL" id="link-id16002d60">SQL</a> standardization</li> <li>Data Access API standardization - nothing close to ODBC, JDBC, OLE-DB, or ADO.NET</li> <li>Wire protocol standardization - nothing close to HTTP</li> <li>Distributed Identity infrastructure - nothing close to the non-repudiatable digital Identity that <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id14926b18">foaf</a>+ssl accords</li> <li>Use of Identifiers as network based pointers to data sources - nothing close to RDF based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id16180a28">Linked Data</a> </li> <li>Negotiable data representation - nothing close to Mime and HTTP based Content Negotiation</li> <li>Scalability especially in the era of Internet & Web scale.</li> </ol> <h4>Entity-Attribute-Value with Classes & Relationships (<a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id13e741b8">EAV</a>/CR) data models</h4> <p>A common characteristic shared by all post-relational DBMS management systems (from Object Relational to pure Object) is an orientation towards variations of EAV/CR based data models. Unfortunately, all efforts in the EAV/CR realm have typically suffered from at least one of the deficiencies listed above. In addition, the same "one DBMS model fits all" approach that lies at the heart of the RDBMS downgrade also exists in the EAV/CR realm.</p> <h3>What Comes Next?</h3> <p>The RDBMS is not going away (ever), but its era of primacy -- by virtue of its placement at the apex of the data access and data management value pyramid -- is over! I make this bold claim for the following reasons: </p> <ol> <li> The Internet aided "Global Village" has brought "<a href="http://en.wikipedia.org/wiki/Open_World_Assumption" id="link-id1148e560">Open World</a>" vs "<a href="http://en.wikipedia.org/wiki/Closed_World_Assumption" id="link-id11967cd0">Closed World</a>" assumption issues to the fore e.g., the current global economic crisis remains centered on the inability to connect dots across "Open World" and "Closed World" data frontiers </li> <li> Entity-Attribute-Value with Classes & Relationships (EAV/CR) based DBMS models are more effective when dealing with disparate data associated with disparate schemas, across disparate DBMS engines, host operating systems, and networks. </li> </ol> <p>Based on the above, it is crystal clear that a different kind of DBMS -- one with higher AVF relative to the RDBMS -- needs to sit atop today's data access and data management value pyramid. The characteristics of this DBMS must include the following:</p> <ol> <li> Every item of data (Datum/Entity/Object/Resource) has Identity</li> <li> Identity is achieved via Identifiers that aren't locked at the DBMS, OS, Network, or Application levels</li> <li> Object Identifiers and Object values are independent (extricably linked by association)</li> <li> Object values should be de-referencable via Object Identifier</li> <li> Representation of de-referenced value graph (entity, attributes, and values mesh) must be negotiable (i.e. content negotiation)</li> <li>Structured query language must provide mechanism for Creation, Deletion, Updates, and Querying of data objects</li> <li> Performance & Scalability across "Closed World" (enterprise) and "Open World" (Internet & Web) realms.</li> </ol> <p>Quick recap, I am not saying that RDBMS engine technology is dead or obsolete. I am simply stating that the era of RDBMS primacy within the data access and data management value pyramid is over. </p> <p>The problem domain (conceptual model views over heterogeneous data sources) at the apex of the aforementioned pyramid has simply evolved beyond the natural capabilities of the RDBMS which is rooted in "Closed World" assumptions re., data definition, access, and management. The need to maintain domain based conceptual interaction with data is now palpable at every echelon within our "Global Village" - Internet, Web, Enterprise, Government etc.</p> <p>It is my personal view that an EAV/CR model based DBMS, with support for the seven items enumerated above, can trigger the long anticipated RDBMS downgrade. Such a DBMS would be inherently multi-model because you would need to the best of RDBMS and EAV/CR model engines in a single product, with in-built support for HTTP and other Internet protocols in order to effectively address data representation and serialization issues.</p> <h4>EAV/CR Oriented Data Access & Management Technology</h4> <p>Examples of contemporary EAV/CR frameworks that provide concrete conceptual layers for data access and data management currently include:</p> <ul> <li> <a href="http://dbpedia.org/resource/Resource_Description_Framework" id="link-id115d1cb0"> Resource Description Framework</a> (RDF) - an EAV/CR based framework</li> <li> <a href="http://dbpedia.org/resource/Linked_Data" id="link-id116cf810">RDF Linked Data </a>- EAV/CR based framework that mandates de-referencable HTTP based Identifiers</li> <li> <a href="http://dbpedia.org/resource/ADO.NET_Entity_Framework" id="link-id13daa160">ADO.NET Entity Frameworks</a> - Microsoft .NET based EAV/CR framework</li> <li> <a href="http://dbpedia.org/page/Core_Data" id="link-id11111838">Core Data Services </a>- Mac OS X based EAV/CR framework that evolved from NeXT's <a href="http://dbpedia.org/resource/Enterprise_Objects_Framework" id="link-id15c27df0">Enterprise Object Frameworks</a> (EOF).</li> </ul> <p>The frameworks above provide the basis for a revised AVF pyramid, as depicted below, that reflects today's data access and management realities i.e., an Internet & Web driven global village comprised of interlinked distributed data objects, compatible with "Open World" assumptions.</p> See: <a href="http://virtuoso.openlinksw.com/images/New_EAV_RDBMS_Pyramid.png" id="link-id158e0760">New EAV/CR Primacy Diagram.</a> <h3>Related</h3> <ul> <li> <a href="http://dynamicorange.com/2009/01/22/blueblog-how-and-why-glue-is-using-amazon-simpledb-instead-of-a-relational-database/" id="link-id15e07c10">How & Why Glue is Using Amazon SimpleDB</a> </li> <li> <a href="http://www.cs.cmu.edu/afs/cs.cmu.edu/user/clamen/OODBMS/Manifesto/htManifesto/node4.html#SECTION00022000000000000000" id="link-id116cf450">Object Database Manifesto (Identity excerpt)</a> </li> <li> <a href="http://www.unixspace.com/context/databases.html" id="link-id150b2c20">Database Models Overview</a> </li> <li> <a href="http://www.youtube.com/watch?v=WEj9vqVvHPc&feature=related" id="link-id0x1135d978">Ted Nelson Explaining Irregularity and Idiosyncrasy of Data Structures</a> - ZigZag Demo </li> </ul>
Time for RDBMS Primacy Downgrade is Nigh! (No Embedded Images Edition - Update 1)
2009-03-17T15:50:58Z
2009-03-17T11:50:58-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1519
<p> As the world works it way through a "once in a generation" economic crisis, the long overdue downgrade of the <a href="http://dbpedia.org/resource/Relational_database_management_system" id="link-id15750540">RDBMS</a>, from its pivotal position at the apex of the <a href="http://dbpedia.org/resource/Data" id="link-id0x66a74b8">data</a> access and data management pyramid is nigh.</p> <h3>What is the Data Access, and Data Management Value Pyramid?</h3> <p> As depicted below, a top-down view of the data access and data management value chain. The term: apex, simply indicates value primacy, which takes the form of a data access API based entry point into a DBMS realm -- aligned to an underlying data model. Examples of data access APIs include: Native Call Level Interfaces (CLIs), <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id11c254c0">ODBC</a>, <a href="http://dbpedia.org/resource/Java_Database_Connectivity" id="link-id149b16a8">JDBC</a>, <a href="http://dbpedia.org/resource/ADO.NET" id="link-id11451eb0">ADO</a>.NET, <a href="http://dbpedia.org/resource/OLE_DB" id="link-id15b02478">OLE-DB</a>, <a href="http://dbpedia.org/resource/XML_for_Analysis" id="link-id1181fa10">XMLA</a>, and <a href="http://dbpedia.org/resource/World_Wide_Web" id="link-id0x2fef498">Web</a> Services.</p> <div> <img alt="Image" src="http://virtuoso.openlinksw.com/images/Agility_Value_Factors_Pyramid.png" /> </div> <p> The degree to which ad-hoc views of data managed by a DBMS can be produced and dispatched to relevant data consumers (e.g. people), without compromising concurrency, data durability, and security, collectively determine the "Agility Value Factor" (AVF) of a given DBMS. Remember, agility as the cornerstone of environmental adaptation is as old as the concept of evolution, and intrinsic to all pursuits of primacy. </p> <p>In simpler business oriented terms, look at AVF as the degree to which DBMS technology affects the ability to effectively implement "Market Leadership Discipline" along the following pathways: innovation, operation excellence, or customer intimacy. </p> <h3>Why has RDBMS Primacy has Endured?</h3> <p> Historically, at least since the late '80s, the RDBMS genre of DBMS has consistently offered the highest AVF relative to other DBMS genres en route to primacy within the value pyramid. The desire to improve on paper reports and spreadsheets is basically what DBMS technology has fundamentally addressed to date, even though conceptual level interaction with data has never been its forte.</p> <div> <img alt="Image" src="http://virtuoso.openlinksw.com/images/Old_RDBMS_Primacy_Pyramid.png" /> </div> <p> For more then 10 years -- at the very least -- limitations of the traditional RDBMS in the realm of conceptual level interaction with data across diverse data sources and schemas (enterprise, Web, and <a href="http://dbpedia.org/resource/Internet" id="link-id116001c0">Internet</a>) has been crystal clear to many RDBMS technology practitioners, as indicated by some of the quotes excerpted below:</p> <blockquote> <cite> <p> "Future of Database Research is excellent, but what is the future of data?" </p> "..it is hard for <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id14932398">me</a> to disagree with the conclusions in this report. It captures exactly the right thoughts, and should be a must read for everyone involved in the area of databases and database research in particular." <p>-- <a href="http://jhingran.typepad.com/anant_jhingrans_musings/" id="link-id11334c50">Dr. Anant Jingran</a>, CTO, IBM <a href="http://dbpedia.org/resource/Information" id="link-id150c7970">Information</a> Management Systems, commenting on the <a href="http://db.cs.berkeley.edu/claremont/" id="link-id11c3b408">2007 RDBMS technology retreat</a> attended by a number of key DBMS technology pioneers and researchers.</p> </cite> </blockquote> <blockquote> <cite> <p> "<a href="http://www.databasecolumn.com/2007/09/one-size-fits-all.html" id="link-id15c14f08">One size fits all: A concept whose time has come and gone</a> </p> <p> </p> <ol> <li> They are direct descendants of System R and <a href="http://dbpedia.org/resource/Ingres" id="link-id146da780">Ingres</a> and were architected more than 25 years ago</li> <li> They are advocating "one size fits all"; i.e. a single engine that solves all DBMS needs. </li> </ol> <p>-- Prof. <a href="http://en.wikipedia.org/wiki/Michael_Stonebraker" id="link-id145c4e28">Michael Stonebreaker</a>, one of the founding fathers of the RDBMS industry.</p> </cite> </blockquote> <p>Until this point in time, the requisite confluence of "circumstantial pain" and "open standards" based technology required to enable an objective "compare and contrast" of RDBMS engine virtues and viable alternatives hasn't occurred. Thus, the RDBMS has endured it position of primacy albeit on a "one size fits all basis". </p> <h4>Circumstantial Pain</h4> <p> As mentioned earlier, we are in the midst of an economic crisis that is ultimately about a consistent inability to connect dots across a substrate of interlinked data sources that transcend traditional data access boundaries with high doses of schematic heterogeneity. Ironically, in a era of the dot-com, we haven't been able to make meaningful connections between relevant "real-world things" that extend beyond primitive data hosted database tables and content management style document containers; we've struggled to achieve this in the most basic sense, let alone evolve our ability to connect inline with the <a href="http://www.vldb2007.org/program/slides/s1161-brodie.pdf" id="link-id11a0dcf0">exponential rate at which the Internet & Web are spawning "universes of discourse" (data spaces) that emanate from user activity</a> (within the enterprise and across the Internet & Web). In a nutshell, we haven't been able to upgrade our interaction with data such that "conceptual models" and resulting "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id12da4b00">context</a> lenses" (or facets) become concrete; by this I mean: real-world <a href="http://dbpedia.org/resource/Entity" id="link-id146a48a8">entity</a> interaction making its way into the computer realm as opposed to the impedance we all suffer today when we transition from conceptual model interaction (real-world) to logical model interaction (when dealing with RDBMS based data access and data management). </p> <p>Here are some simple examples of what I can only best describe as: "critical dots unconnected", resulting from an inability to interact with data conceptually:</p> <strong>Government (Globally) -</strong> <p> Financial regulatory bodies couldn't effectively discern that a <a href="http://dbpedia.org/resource/Credit_default_swap" id="link-id115ba0e0">Credit Default Swap</a> is an Insurance policy in all but literal name. And in not doing so the cost of an unregulated <a href="http://dbpedia.org/resource/Insurance" id="link-id158d4960">insurance policy</a> laid the foundation for exacerbating the toxicity of fatally flawed mortgage backed securities. Put simply: a flawed insurance policy was the fallback on a toxic security that financiers found exotic based on superficial packaging.</p> <strong>Enterprises - </strong> <p> Banks still don't understand that capital really does exists in tangible and intangible forms; with the intangible being the variant that is inherently dynamic. For example, a tech companies intellectual capital far exceeds the value of fixture, fittings, and buildings, but you be amazed to find that in most cases this vital asset has not significant value when banks get down to the nitty gritty of debt collateral; instead, a buffer of flawed securitization has occurred atop a borderline static asset class covering the aforementioned buildings, fixtures, and fittings. </p> <p> In the general enterprise arena, IT executives continued to "rip and replace" existing technology without ever effectively addressing the timeless inability to connect data across disparate data silos generated by internal enterprise applications, let alone the broader need to mesh data from the inside with external data sources. No correlations made between the growth of buzzwords and the compounding nature of data integration challenges. It's 2009 and only a miniscule number of executives dare fantasize about being anywhere within distance of the: relevant information at your fingertips vision. </p> <p> Looking more holistically at data interaction in general, whether you interact with data in the enterprise space (i.e., at work) or on the Internet or Web, you ultimately are delving into a mishmash of disparate computer systems, applications, service (Web or SOA), and databases (of the RDBMS variety in a majority of cases) associated with a plethora of disparate schemas. Yes, but even today "rip and replace" is still the norm pushed by most vendors; pitting one mono culture against another as exemplified by irrelevances such as: FOSS/LAMP vs Commercial or Web vs. Enterprise, when none of this matters if the data access and integration issues are recognized let alone addressed (see: <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1497?sid=0df0294caee8b37925c6a888bbbca136&realm=wa" id="link-id15c27300">Applications are Like Fish and Data Like Wine</a>). </p> <p> Like the current credit-crunch, exponential growth of data originating from disparate application databases and associated schemas, within shrinking processing time frames, has triggered a rethinking of what defines data access and data management value today en route to an inevitable RDBMS downgrade within the value pyramid.</p> <h3>Technology</h3> <p>There have been many attempts to address real-world modeling requirements across the broader DBMS community from Object Databases to Object-Relational Databases, and more recently the emergence of simple <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id1128dad0">Entity</a>-Attribute-Value model DBMS engines. In all cases failure has come down to the existence of one or more of the following deficiencies, across each potential alternative:</p> <ol> <li>Query language standardization - nothing close to <a href="http://dbpedia.org/resource/SQL" id="link-id16002d60">SQL</a> standardization</li> <li>Data Access API standardization - nothing close to ODBC, JDBC, OLE-DB, or ADO.NET</li> <li>Wire protocol standardization - nothing close to HTTP</li> <li>Distributed Identity infrastructure - nothing close to the non-repudiatable digital Identity that <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id14926b18">foaf</a>+ssl accords</li> <li>Use of Identifiers as network based pointers to data sources - nothing close to RDF based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id16180a28">Linked Data</a> </li> <li>Negotiable data representation - nothing close to Mime and HTTP based Content Negotiation</li> <li>Scalability especially in the era of Internet & Web scale.</li> </ol> <h4>Entity-Attribute-Value with Classes & Relationships (<a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id13e741b8">EAV</a>/CR) data models</h4> <p>A common characteristic shared by all post-relational DBMS management systems (from Object Relational to pure Object) is an orientation towards variations of EAV/CR based data models. Unfortunately, all efforts in the EAV/CR realm have typically suffered from at least one of the deficiencies listed above. In addition, the same "one DBMS model fits all" approach that lies at the heart of the RDBMS downgrade also exists in the EAV/CR realm.</p> <h3>What Comes Next?</h3> <p>The RDBMS is not going away (ever), but its era of primacy -- by virtue of its placement at the apex of the data access and data management value pyramid -- is over! I make this bold claim for the following reasons: </p> <ol> <li> The Internet aided "Global Village" has brought "<a href="http://en.wikipedia.org/wiki/Open_World_Assumption" id="link-id1148e560">Open World</a>" vs "<a href="http://en.wikipedia.org/wiki/Closed_World_Assumption" id="link-id11967cd0">Closed World</a>" assumption issues to the fore e.g., the current global economic crisis remains centered on the inability to connect dots across "Open World" and "Closed World" data frontiers </li> <li> Entity-Attribute-Value with Classes & Relationships (EAV/CR) based DBMS models are more effective when dealing with disparate data associated with disparate schemas, across disparate DBMS engines, host operating systems, and networks. </li> </ol> <p>Based on the above, it is crystal clear that a different kind of DBMS -- one with higher AVF relative to the RDBMS -- needs to sit atop today's data access and data management value pyramid. The characteristics of this DBMS must include the following:</p> <ol> <li> Every item of data (Datum/Entity/Object/Resource) has Identity</li> <li> Identity is achieved via Identifiers that aren't locked at the DBMS, OS, Network, or Application levels</li> <li> Object Identifiers and Object values are independent (extricably linked by association)</li> <li> Object values should be de-referencable via Object Identifier</li> <li> Representation of de-referenced value graph (entity, attributes, and values mesh) must be negotiable (i.e. content negotiation)</li> <li>Structured query language must provide mechanism for Creation, Deletion, Updates, and Querying of data objects</li> <li> Performance & Scalability across "Closed World" (enterprise) and "Open World" (Internet & Web) realms.</li> </ol> <p>Quick recap, I am not saying that RDBMS engine technology is dead or obsolete. I am simply stating that the era of RDBMS primacy within the data access and data management value pyramid is over. </p> <p>The problem domain (conceptual model views over heterogeneous data sources) at the apex of the aforementioned pyramid has simply evolved beyond the natural capabilities of the RDBMS which is rooted in "Closed World" assumptions re., data definition, access, and management. The need to maintain domain based conceptual interaction with data is now palpable at every echelon within our "Global Village" - Internet, Web, Enterprise, Government etc.</p> <p>It is my personal view that an EAV/CR model based DBMS, with support for the seven items enumerated above, can trigger the long anticipated RDBMS downgrade. Such a DBMS would be inherently multi-model because you would need to the best of RDBMS and EAV/CR model engines in a single product, with in-built support for HTTP and other Internet protocols in order to effectively address data representation and serialization issues.</p> <h4>EAV/CR Oriented Data Access & Management Technology</h4> <p>Examples of contemporary EAV/CR frameworks that provide concrete conceptual layers for data access and data management currently include:</p> <ul> <li> <a href="http://dbpedia.org/resource/Resource_Description_Framework" id="link-id115d1cb0"> Resource Description Framework</a> (RDF) - an EAV/CR based framework</li> <li> <a href="http://dbpedia.org/resource/Linked_Data" id="link-id116cf810">RDF Linked Data </a>- EAV/CR based framework that mandates de-referencable HTTP based Identifiers</li> <li> <a href="http://dbpedia.org/resource/ADO.NET_Entity_Framework" id="link-id13daa160">ADO.NET Entity Frameworks</a> - Microsoft .NET based EAV/CR framework</li> <li> <a href="http://dbpedia.org/page/Core_Data" id="link-id11111838">Core Data Services </a>- Mac OS X based EAV/CR framework that evolved from NeXT's <a href="http://dbpedia.org/resource/Enterprise_Objects_Framework" id="link-id15c27df0">Enterprise Object Frameworks</a> (EOF).</li> </ul> <p>The frameworks above provide the basis for a revised AVF pyramid, as depicted below, that reflects today's data access and management realities i.e., an Internet & Web driven global village comprised of interlinked distributed data objects, compatible with "Open World" assumptions.</p> <div> <image src="http://virtuoso.openlinksw.com/images/New_EAV_RDBMS_Pyramid.png"></image> </div> <h3>Related</h3> <ul> <li> <a href="http://allanslibrary.blogspot.com/2009/06/semantic-way.html" id="link-id0xb8c5e498">The Semantic Way</a> - Alan Cho's Summary of <a href="http://www.pwc.com/extweb/home.nsf/docid/1308AF8EA7929CCA852575BA00720F26" id="link-id0xb80f5e10">PwC 2009 tech forecast report on the Semantic Web</a> </li> <li> <a href="http://www.readwriteweb.com/archives/is_the_relational_database_doomed.php" id="link-id0xb8c20658">Is the RDBMS Doomed</a> - <a href="http://www.readwriteweb.com">ReadWriteWeb</a> Article</li> <li> <a href="http://www.metabrew.com/article/anti-rdbms-a-list-of-distributed-key-value-stores/" id="link-id0x1ab4778">Anti-RDBMS: a list of Distributed Key-Value Stores</a> - by <a href="http://www.last.fm/user/RJ" id="link-id0x5a968060">Richard Jones</a> (CTO Last.FM)</li> <li> <a href="http://dynamicorange.com/2009/01/22/blueblog-how-and-why-glue-is-using-amazon-simpledb-instead-of-a-relational-database/" id="link-id15e07c10">How & Why Glue is Using Amazon SimpleDB</a> </li> <li> <a href="http://www.cs.cmu.edu/afs/cs.cmu.edu/user/clamen/OODBMS/Manifesto/htManifesto/node4.html#SECTION00022000000000000000" id="link-id116cf450">Object Database Manifesto (Identity excerpt)</a> </li> <li> <a href="http://www.unixspace.com/context/databases.html" id="link-id150b2c20">Database Models Overview</a> </li> <li> <a href="http://www.youtube.com/watch?v=WEj9vqVvHPc&feature=related" id="link-id0x66b0850">Ted Nelson Explaining Irregularity and Idiosyncrasy of Data Structures</a> - ZigZag Demo </li> </ul>
The Time for RDBMS Primacy Downgrade is Nigh!
2009-06-03T22:09:58Z
2009-06-03T18:09:58.000001-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1512
<p>Happy New Year!</p> <p>In 2009 I hope the following happens re. "<a href="http://dbpedia.org/resource/Linked_Data" id="link-id15acc7d0">Linked Data</a>":</p> <ol> <li>We realize it's a <a href="http://dbpedia.org/resource/Meme" id="link-id1101eb90">Meme</a> </li> <li>We collectively connect the Meme to the concept of granular hyperlinks between <a href="http://dbpedia.org/resource/Data">data</a> entities/objects (datum to datum linkage aka. <a href="http://dbpedia.org/resource/Linked_Data" id="link-id113d96a0">Hyperdata</a> Linking)</li> <li>We generally connect the Meme to technology ancestry such as the <a href="http://dbpedia.org/resource/Entity-attribute-value_model" id="link-id1136d980">Entity-Attribute-Value with Classes & Relationships</a> (EAV/CR) data model (then broader commonality with erstwhile unrelated realms will be unveiled e.g., <a href="http://dbpedia.org/resource/ADO.NET_Entity_Framework" id="link-id1122ab80">Entity Frameworks from Microsoft</a>, <a href="http://dbpedia.org/resource/Core_Data" id="link-id138b5b28">Core Data from Apple</a>, <a href="http://en.wikipedia.org/wiki/Simpledb" id="link-id118576d0">SimpleDB</a> from Amazon, and the <a href="http://en.wikipedia.org/wiki/Freebase_(database)" id="link-id19107a90">Freebase Graph Model DB</a> amongst others)</li> <li>We instinctively connect the Meme to the concept of Entity Oriented Data Access and Management (RDF based Linked Data is basically EAV/CR scheme that uses HTTP based Pointers for Entity, Attribute, and Relationship Identifiers)</li> <li>We naturally connect the Meme with the notion that an identifier for a unit of data (aka. Datum) should be the conduit to a negotiable representation of said Datum's description (i.e., it's attribute and relationship properties in HTML, XHTML, <a href="http://dbpedia.org/resource/RDFa" id="link-id151cc688">RDFa</a>, Turtle, N3, RDF/XML etc., for example)</li> <li>We ultimately connect the Meme with a conceptual-level approach to data integration across disparate data sources (also known as <a href="http://dbpedia.org/resource/Master_Data_Management" id="link-id1596b8d8">Master Data Management</a> (MDM) ).</li> </ol> <p> 2009 is about a reboot on a monumental scale. We need new thinking, new technology, new approaches, and new solutions. No matter what route we take, we can't negate the importance of "Data". When dealing with organic or inorganic computers systems -- Data is simply everything!</p> <p> The ability of individuals and enterprises to access, mesh, and disseminate data to relevant nodes across public and private networks will ultimately determine the winners and losers in the new frontier, ushered in by 2009.</p> <p> Do not take data access and data management technology for granted. User interfaces come and ago, application logic comes and goes, but your data stays with you forever. If you are mystified by data access technology then make 2009 the year of data access technology demystification :-) </p> <h3>Related</h3> <ul> <li> <a href="http://www.openlinksw.com/weblog/oerling/?id=1510" id="link-id11246da8">Linked Data & The Year 2009</a> </li> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=data%20access&type=text&output=html" id="link-id11848a20">Various posts from my blog space</a> </li> </ul>
My Hopes for Linked Data in 2009 (Update #2)
2009-01-07T02:35:19Z
2009-01-06T21:35:19.000002-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1508
<p>Adding to the collection of Amazon EC2 AMI based knowledgebases already unveiled for <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtEC2AMIDBpediaInstall" id="link-id117a3710">DBpedia</a> and <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtEC2AMINeuroCommonsInstall" id="link-id11293c10">NeuroCommons</a>, we now have a <a href="http://www.bio2rdf.org/" id="link-id14ba6338">Bio2Rdf</a> knowledgebase AMI.</p> <h3>What is Bio2Rdf?</h3> <p>A community developed knowledgebase comprised of Bio Informatics <a href="http://dbpedia.org/resource/Data">data</a> from across <a href="http://www.freebase.com/view/user/bio2rdf/public/sparql" id="link-id1468fb90">30 or so public data sources</a>. The standard deployment of Bio2Rdf includes a a federation of <a href="http://dbpedia.org/resource/SPARQL" id="link-id111962a8">SPARQL</a> endpoints provided by project members and collaborators.</p> <h3>What is the Bio2Rdf EC2 AMI?</h3> <p> An Amazon EC2 hosted variant of the Bio2Rdf knowledgebase. In addition to providing a SPARQL endpoint, the data exposed by the Amazon AMI is published in compliance with <a href="http://dbpedia.org/resource/Linked_Data" id="link-id1560ac50">Linked Data</a> publishing best practices espoused by the <a href="http://community.linkeddata.org/dataspace/organization/lod#this" id="link-id153aff30">Linking Open Data community</a> (<a href="http://community.linkeddata.org/dataspace/organization/lod#this" id="link-id1390d968">LOD</a>).</p> <h3>Benefits?</h3> <p>The ability to instantiate a personal or service-specific variant of this powerful knowledgebase via the Amazon EC2 Cloud. Instead of a 22+ hour error prone odyssey - you simply get down to the task of data analysis and integration within 1.5 hrs (when setting up you AMI for the first time).</p> <h3>How do I get going?</h3> Just follow the instructions in the <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtEC2AMIBio2rdfInstall" id="link-id114fc4a8">Bio2Rdf EC2 AMI installation guide</a>. <h3>Related</h3> <ul> <li> <a href="http://bio2rdf.wiki.sourceforge.net/" id="link-id19109ed8">Bio2Rdf Wiki</a> </li> <li> <a href="http://bio2rdf.wiki.sourceforge.net/Demo+queries" id="link-id1134c988">Sample Bio2Rdf Queries Collection</a> </li> <li> <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VirtuosoEC2AMI" id="link-id11c28e08">Virtuoso's Amazon Machine Image (AMI) for EC2 Home Page</a> </li> <li> <a href="http://www.youtube.com/watch?v=XdBd14rjcs0" id="link-id14b4a390">Cloud Computing Explanation Video</a> </li> </ul>
Bio2Rdf EC2 AMI is now Ready! (Updated)
2008-12-24T16:05:13Z
2008-12-24T11:05:13-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1489
<h3>What is it?</h3> <p>A pre-installed edition of <a href="http://virtuoso.openlinksw.com" id="link-id14bea838">Virtuoso</a> for Amazon's EC2 Cloud platform.</p> <h3>What does it offer?</h3> From a <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> Entrepreneur perspective it offers: <ol> <li> Low cost entry point to a game-changing Web 3.0+ (and beyond) platform that combines <a href="http://dbpedia.org/resource/SQL" id="link-id11309b38">SQL</a>, <a href="http://dbpedia.org/resource/Resource_Description_Framework" id="link-id135f7988">RDF</a>, XML, and Web Services functionality</li> <li> Flexible variable cost model (courtesy of <a href="http://aws.amazon.com/devpay/" id="link-id17941018">EC2 DevPay</a>) tightly bound to revenue generated by your services</li> <li> Delivers federated and/or centralized model flexibility for you SaaS based solutions</li> <li> Simple entry point for developing and deploying sophisticated database driven applications (SQL or RDF <a href="http://dbpedia.org/resource/Linked_Data" id="link-id14ea6b10">Linked Data Web</a> oriented)</li> <li> Complete framework for exploiting OpenID, OAuth (including Role enhancements) that simplifies exploitation of these vital Identity and <a href="http://dbpedia.org/resource/Data">Data</a> Access technologies</li> <li>Easily implement RDF Linked Data based Mail, Blogging, Wikis, Bookmarks, Calendaring, Discussion Forums, Tagging, Social-Networking as <a href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id11519928">Data Space</a> (data containers) features of your application or service offering</li> <li>Instant alleviation of challenges (e.g. service costs and agility) associated with <a href="http://dbpedia.org/resource/DataPortability" id="link-id111cb610">Data Portability</a> and Open Data Access across Web 2.0 data silos</li> <li> LDAP integration for <a href="http://dbpedia.org/resource/Intranet" id="link-id114a8270">Intranet</a> / <a href="http://dbpedia.org/resource/Extranet" id="link-id10fe4f08">Extranet</a> style applications.</li> </ol> <p>From the DBMS engine perspective it provides you with one or more pre-configured instances of Virtuoso that enable immediate exploitation of the following services:</p> <ol> <li> RDF Database (a Quad Store with <a href="http://dbpedia.org/resource/SPARQL" id="link-id11911bf8">SPARQL</a> & SPARUL Language & Protocol support)</li> <li> <a href="http://dbpedia.org/resource/SQL" id="link-id110544c8">SQL</a> Database (with <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-id1524c7d0">ODBC</a>, <a href="http://dbpedia.org/resource/Java_Database_Connectivity" id="link-id14cfb658">JDBC</a>, OLE-DB, <a href="http://dbpedia.org/resource/ADO.NET" id="link-id110ec6c8">ADO</a>.NET, and XMLA driver access)</li> <li>XML Database (XML Schema, <a href="http://dbpedia.org/resource/XQuery" id="link-id10ebf218">XQuery</a>/<a href="http://dbpedia.org/resource/XPath" id="link-id142a7898">Xpath</a>, XSLT, Full Text Indexing)</li> <li>Full Text Indexing.</li> </ol> <p>From a Middleware perspective it provides:</p> <ol> <li> RDF Views (Wrappers / Semantic Covers) over SQL, XML, and other data sources accessible via SOAP or REST style Web Services</li> <li> Sponger Service for converting non RDF <a href="http://dbpedia.org/resource/Information" id="link-id11931c60">information</a> resources into RDF <a href="http://dbpedia.org/resource/Linked_Data" id="link-id118f7168">Linked Data</a> "on the fly" via a large collection of pre-installed RDFizer Cartridges.</li> </ol> <p>From the Web Server Platform perspective it provides an alternative to LAMP stack components such as <a href="http://dbpedia.org/resource/MySQL" id="link-id10f7b780">MySQL</a> and Apace by offering</p> <ol> <li> HTTP Web Server</li> <li> WebDAV Server</li> <li> Web <a href="http://dbpedia.org/resource/Application_server" id="link-id1268daa8">Application Server</a> (includes <a href="http://dbpedia.org/resource/PHP" id="link-id1585d238">PHP</a> runtime hosting)</li> <li> SOAP or REST style Web Services Deployment</li> <li> RDF Linked Data Deployment</li> <li> SPARQL (SPARQL Query Language) and SPARUL (SPARQL Update Language) endpoints</li> <li>Virtuoso Hosted PHP packages for <a href="http://dbpedia.org/resource/MediaWiki" id="link-id15568818">MediaWiki</a>, <a href="http://dbpedia.org/resource/Drupal" id="link-id110bd7a8">Drupal</a>, <a href="http://dbpedia.org/resource/WordPress" id="link-id10f66918">Wordpress</a>, and <a href="http://dbpedia.org/resource/PhpBB" id="link-id13fda4d0">phpBB3</a> (just install the relevant Virtuoso Distro. Package). </li> </ol> <p>From the general System Administrator's perspective it provides:</p> <ol> <li> Online Backups (Backup Set dispatched to S3 buckets, FTP, or HTTP/WebDAV server locations)</li> <li>Synchronized Incremental Backups to Backup Set locations</li> <li>Backup Restore from Backup Set location (without exiting to EC2 shell).</li> </ol> <p>Higher level user oriented offerings include:</p> <ol> <li>OpenLink Data Explorer front-end for exploring the burgeoning Linked Data <a href="http://dbpedia.org/resource/Giant_Global_Graph" id="link-id11646dc8">Web</a> </li> <li> Ajax based SPARQL Query Builder (iSPARQL) that enables SPARQL Query construction by Example</li> <li>Ajax based SQL Query Builder (QBE) that enables SQL Query construction by Example.</li> </ol> <p>For Web 2.0 / 3.0 users, developers, and entrepreneurs it offers it includes Distributed Collaboration Tools & Social Media realm functionality courtesy of <a href="http://dbpedia.org/resource/OpenLink_Data_Spaces" id="link-id11009930">ODS</a> that includes:</p> <ol> <li> Point of presence on the Linked Data Web that meshes your Identity and your Data via URIs</li> <li> System generated Social Network Profile & Contact Data via <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id1185a1c0">FOAF</a>?</li> <li> System generated <a href="http://dbpedia.org/resource/SIOC" id="link-id14791890">SIOC</a> (Semantically Interconnected Online Community) <a href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id1577cad8">Data Space</a> (that includes a Social Graph) exposing all your Web data in RDF Linked Data form</li> <li> System generated OpenID and automatic integration with FOAF</li> <li> Transparent Data Integration across Facebook, Digg, LinkedIn, FriendFeed, Twitter, and any other Web 2.0 data space equipped with RSS / Atom support and/or REST style Web Services</li> <li> In-built support for SyncML which enables data synchronization with Mobile Phones.</li> </ol> <h3>How Do I Get Going with It?</h3> <ul> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/ODSInstallationEC2" id="link-id114e1600">Standard Installation Guide</a> </li> <li> <a href="http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtEC2AMIDBpediaInstall" id="link-id110a98e8">Personal or Service Specific DBpedia Installation Guide</a> </li> </ul>
Introducing Virtuoso Universal Server (Cloud Edition) for Amazon EC2
2008-11-28T21:06:02Z
2008-11-28T16:06:02.000006-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1475
<p>Recent <a href="http://en.wiktionary.org/wiki/perturbation" id="link-id1bdb9ec8">perturbations</a> in <a href="http://dbpedia.org/resource/Data">Data</a> Access and Data Management technology realms are clear signs of an imminent inflection. In a nutshell, the focus of data access is moving from the "Logical Level" (what you see if you've ever looked at a DBMS schema derived from an <a href="http://dbpedia.org/resource/Entity" id="link-id18735f38">Entity</a> Data Model) to the "Conceptual Level" (i.e., the Entity Model becoming concrete).</p> <p>In recent times I've stumbled across Master Data Management (MDM) which is all about entities that provide holistic views of enterprise data (or what I call: <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id18f07ec8">Context</a> Lenses). I've also stumbled across emerging tensions in the .NET realm between Linq to Entities and Linq to <a href="http://dbpedia.org/resource/SQL" id="link-id19429e88">SQL</a>, where in either case the fundamental issues comes down to the optimal paths "Conceptual Level Access" over the "Logical Logical Level" when dealing with data access in the .NET realm.</p> <p> Strangely, the emerging realm of RDF <a href="http://dbpedia.org/resource/Linked_Data" id="link-id115b3780">Linked Data</a>, MDM, and .NET's Entity Frameworks, remain strangely disconnected.</p> <p>Another oddity is the obvious, but barely acknowledged, blurring of the lines between the "traditional enterprise employee" and the "individual <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> <a href="http://dbpedia.org/resource/Netizen" id="link-id0x1ffd8640">netizen</a>". The fusion between these entities is one of the most defining characteristics of how the Web is reshaping the data landscape.</p> <p>At the current time, I tend to crystalize my data access world view under the moniker: <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1474" id="link-id1544ee60">YODA</a> ("You" Oriented Data Access), based on the following:</p> <ol> <li> Entities are the new focal point of data access, management, and integration </li> <li> "You" are the entry point (Data Source Name) into this new realm of inter connected Entities that the Web exposes</li> <li> "You" the "Person" Entity is associated with many other "Things" such as "Organizations", "Other People", "Books", "Music", "Subject Matter" etc. </li> <li> "You" the "Person" needs Identity in this new global database, which is why "You" need to Identify "Yourself" using an an HTTP based Entity <a href="http://dbpedia.org/resource/Identity_%28object-oriented_programming%29" id="link-id145d0438">ID</a> (aka. <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id1873ad08">URI</a>) </li> <li> When "You" have an ID for "Yourself" it becomes much easier for the essence of "You" to be discovered via the Web </li> <li> When "Others" have IDs for "Themselves" on the Web it becomes much easier for "You" to serendipitously discover or explicitly "Find" things on the Web. </li> </ol> <h3>Related</h3> <ul> <li> <a href="http://www.infoq.com/news/2008/11/DLINQ-Future" id="link-id17501eb0">Is LINQ to SQL truly dead?</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1420" id="link-id10fbf920">Virtuoso, Linked Data, and Linq2Rdf</a> </li> <li> <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1224" id="link-id19c44b00">Enterprise 0.0, Linked Data, and the Semantic Data Web</a> (*an old post*)</li> </ul>
Entity Oriented Data Access
2008-11-04T03:51:48Z
2008-11-03T22:51:48-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1426
<p>Here is another "Linked Discourse" effort via a <a href="http://dbpedia.org/resource/Blog" id="link-id13edcda8">blog</a> post that attempts to add perspective to a developing <a href="http://dbpedia.org/resource/World_Wide_Web">Web</a> based conversation. In this case, the conversation originates from <a href="http://geekaustin.org" id="link-id15a33728">Juan Sequeda</a>'s recent interview with <a href="http://www.linkedin.com/in/luxzia" id="link-id182a4a80">Jana Thompson</a> titled: <a href="http://geekaustin.org/2008/08/21/juan-sequeda-jana-thompson-necessity-semantic-web/" id="link-id146e1f40">Is the Semantic Web necessary (and feasible)?</a> </p> <blockquote> <cite>Jana: What are the benefits you see to the business community in adopting semantic technology? </cite> </blockquote> <p> <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id1941e3b0">Me</a>: Exposure, exploitation, of untapped treasure trove of interlinked <a href="http://dbpedia.org/resource/Data">data</a>, <a href="http://dbpedia.org/resource/Information" id="link-id13593fc0">information</a>, and <a href="http://dbpedia.org/resource/Knowledge" id="link-id1290c318">knowledge</a> across disparate IT infrastructure via conceptual entry points (<a href="http://dbpedia.org/resource/Entity" id="link-id107bad60">Entity</a> IDs / URIs / Data Source Names) that refer to as "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id15fab9f8">Context</a> Lenses".</p> <br /> <blockquote> <cite> Jana: Do you think these benefits are great enough for businesses to adopt the changes?</cite> </blockquote> <p> <a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id0x584ffe0">Me</a>: Yes, infrastructural heterogeneity is a fact of corporate life (growth, mergers, acquisitions etc). Any technology that addresses these challenges is extremely important and valuable. Put differently, the opportunity costs associated with IT infrastructural heterogeneity remains high!</p> <br /> <blockquote> <cite>Jana: How large do you think this impact will actually be?</cite> </blockquote> <p>Me: Huge, enterprise have been aware of their data, <a href="http://dbpedia.org/resource/Information" id="link-id0x1b8057b0">information</a>, and <a href="http://dbpedia.org/resource/Knowledge" id="link-id0x1b3e3760">knowledge</a> treasure troves etc. for eons. Tapping into these via a materialization of the "<a href="http://dbpedia.org/resource/Information">information</a> at your fingertips" vision is something they've simply been waiting to pursue without any platform lock-in, for as long as I've been in this industry.</p> <br /> <blockquote> <cite>Jana: I’ve heard, from contacts in the Bay Area, that they are skeptical of how large this impact of semantic technology will actually be on the web itself, but that the best uses of the technology are for fields such as medical information, or as you mentioned, geo-spatial data.</cite> </blockquote> <p>Me: Unfortunately, those people aren't connecting the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id10a337d8">Semantic Web</a> and open access to heterogeneous data sources, or the intrinsic value of holistic exploration location of <a href="http://dbpedia.org/resource/Entity" id="link-id0xaa58c520">entity</a> based data networks (aka <a href="http://dbpedia.org/resource/Linked_Data" id="link-id188a1910">Linked Data</a>).</p> <br /> <blockquote> <cite> Jana: Are semantic technologies going to be part of the web because of people championing the cause or because it is actually a necessary step?</cite> </blockquote> <p>Me: <a href="http://dbpedia.org/resource/Linked_Data" id="link-id0x9eb9aca0">Linked Data</a> technology on the Web is a vital extension of the current Web. Semantic Technology without the "Web" component, or what I refer to as "Semantics Inside only" solutions, simply offer little or no value as Web enhancements based on their incongruence with the essence of the Web i.e., "Open Linkage" and no Silos! A nice looking Silo is still a Silo.</p> <br /> <blockquote> <cite>Jana: In the early days of the web, there was an explosion of new websites, due to the ease of learning HTML, from a business to a person to some crackpot talking about aliens. Even today, CSS and XHTML are not so difficult to learn that a determined person can’t learn them from W3C or other tutorials easily. If OWL becomes the norm for websites, what do you think the effects will be on the web? Do you think it is easy enough to learn that it will be readily adopted as part of the standard toolkit for web developers for businesses?</cite> </blockquote> <p>Me: Correction, learning HTML had nothing to do with the Web's success. The value proposition of the Web simply reached critical mass and you simply couldn't afford to not be part of it. The easiest route to joining the Web juggernaut was a Web Page hosted on a Web Site. The question right now is: what's the equivalent driver for the <a href="http://dbpedia.org/resource/Linked_Data">Linked Data</a> <a href="http://dbpedia.org/resource/Giant_Global_Graph" id="link-id12e25c98">Web</a> bearing in mind the initial Web bootstrap. My answer is simply this: Open Data Access i.e., getting beyond the data silos that have inadvertently emerged from Web 2.0.</p> <br /> <blockquote> <cite> Jana: Following the same theme, do you think this will lead to an <a href="http://dbpedia.org/resource/Internet" id="link-id17041398">internet</a> full of corporate-controlled websites, with sites only written by developers rather than individuals?</cite> </blockquote> <p> Me: Not at all, we will have an <a href="http://dbpedia.org/resource/Internet" id="link-id0x16a4abe0">Internet</a> owned by it's participants i.e., You and the agents that work on your behalf.</p> <br /> <blockquote> <cite>Jana: So, you are imagining technologies such as <a href="http://dbpedia.org/resource/Drupal" id="link-id107d1d70">Drupal</a> or <a href="http://dbpedia.org/resource/WordPress" id="link-id13f48db8">Wordpress</a>, that allow users to manage sites without a great deal of <a href="http://dbpedia.org/resource/Knowledge">knowledge</a> of the nuts and bolts of current web technologies?</cite> </blockquote> <p>Me: Not at all! I envisage simple forms that provide conduits to powerful meshes of interlinked data spaces associated with Web users.</p> <br /> <blockquote> <cite>Jana: Given all of the buzz, and my own familiarity with ontology, I am just very curious if the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id0x1955d360">semantic web</a> is truly necessary? </cite> </blockquote> <p>Me:This question is no different than saying: I hear the Web is becoming a Database, and I wonder if a Data Dictionary is necessary, or even if access to structured data is necessary. It's also akin to saying: I accept "Search" as my only mechanism for Web interaction even though in reality, I really want to be able to "Find" and "Process" relevant things at a quicker rate than I do today, relative to the amount of information, and information processing time, at my disposal.</p> <br /> <blockquote> <cite>Jana: Will it be worth it to most people to go away from the web in its current form, with keyword searches on sites like Google, to a richer and more interconnected internet with potentially better search technology?</cite> </blockquote> <p>Me: As stated above, we need to add "Find" to the portfolio of functions we seek to perform against the Web. "Finding" and "Searching" are mutually inclusive pursuits at different ends of an activity spectrum.</p> <br /> <blockquote> <cite>Jana: For our more technical readers, I have a few additional questions: If no standardization comes about for mapping relational databases to domain ontologies, how do you see that as influencing the decisions about adoption of semantic technology by businesses? After all, the success of technology often lives or dies on its ease of adoption.</cite> </blockquote> <p>Me: Standardization of<a href="http://esw.w3.org/topic/Rdb2RdfXG/StateOfTheArt" id="link-id10abbc30"> RDBMS to RDF Mapping</a> is not the critical success factor here (of course it would be nice). As stated earlier, the issue of data integration that arises from IT infrastructural heterogeneity has been with decision makers in the enterprise for ever. The problem is now seeping into the broader consumer realm via Web ubiquity. The mistakes made in the enterprise realm are now playing out in the consumer Web realm. In both realms the critical success factors are:</p> <ol> <li> Scalable productivity relative to exponential growth of data generated across Intranets, Extranets, and the Internet</li> <li>Concept based <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id0x114e6888">Context</a> Lenses that transcend logical and physical data heterogeneity by putting dereferencable URIs in front of the Line of Business Application Data and/or Web Data Spaces such as Blogs, Wikis, Discussion Forums etc.).</li> </ol>
Is the Semantic Web necessary (and feasible)?
2008-08-29T15:08:12Z
2008-08-29T11:08:12.000002-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1405
As the <a href="http://www.w3.org/DesignIssues/LinkedData.html" id="link-id13dfe618">Linked Data meme</a> continues on it's quest to unravel the mysteries of the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id10527b30">Semantic Web</a> vision, it's quite gratifying to see that <a href="http://dbpedia.org/resource/Federated_database_system" id="link-id104f58b0">data virtualization</a> comprehension: creating "Conceptual Views" into logically organized "Disparate & Heterogeneous <a href="http://dbpedia.org/resource/Data">Data</a> Sources" via "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id14a46998">Context</a> Lenses" is taking shape, as illustrated in the "<a href="http://feeds.feedburner.com/%7Er/SemanticBusiness/%7E3/353668031/note-to-self-virtualconceptual-as-wwwsw.html" id="link-id13179dd8">note-to-self</a>" post by <a href="http://www.linkedin.com/in/davidprovost" id="link-id1403dc88">David Provost</a>.<br /> <br /> <br /> <br /> <br />Virtualization of heterogeneous data sources is only achievable if you have a dexterous data model based "Bus" into which the data sources are plugged. RDF has offered such a model for a long time.<br /> <br /> <br /> <br /> <img style="max-width: 800px;" src="http://www.w3.org/DesignIssues/diagrams/sw-clients.png" /> <br /> <br /> <br /> <br />When heterogeneous data sources are plugged into an RDF based integration bus e.g., customer records sourced from a variety of tables, across a plethora of databases, you can only end up with true value if the emergent entities from such an effort are coherently linked and (de)referencable; which is what <a href="http://dbpedia.org/resource/Linked_Data" id="link-id12b06e20">Linked Data</a>'s fundamental preoccupation with dereferencable URIs is all about. Of course, Even when you have all of the above in place, you also need to be able to construct "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id103c2c80">Context</a> Lenses" i.e., <a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id1037a260">context</a> driven views of the <a href="http://dbpedia.org/resource/Linked_Data" id="link-id13e48ab8">Linked Data</a> Mesh (or <a href="http://dbpedia.org/resource/Linked_Data" id="link-id101c7718">Linked Data</a> Spaces).<br /> <br /> <br />Additional Diagrams:<br /> <br /> <br />1. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2824%29" id="link-id10808cb8">Clients of the RDF Bus</a> <br />2. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2825%29" id="link-id11e5a300">RDF Bus Server plugins: Scripts that emit RDF</a> <br />3. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2826%29" id="link-id13ea46a0">RDF Bus Servers: RDF Data Managers (Triple or Quad Stores)</a> <br />4. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2827%29" id="link-id101d3470">RDF Bus Servers: Relational to RDF Mappers (RDF Views, Semantic Covers etc.)</a> <br />5. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2828%29" id="link-id1052c450">RDF Bus Server plugins: XML to RDF Mappers </a> <br />6. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2829%29" id="link-id10281ec0">RDF Bus Server plugins: GRDDL based XSLT stylesheets that emit RDF</a> <br />7. <a href="http://www.w3.org/2005/Talks/1110-iswc-tbl/#%2830%29" id="link-id1444faf0">RDF Bus Server plugins: Intelligent RDF Middleware</a> <br /> <br /> <br /> <br /> <br /> <br /> <br />
Time for Context Lenses (Update)
2008-08-04T15:24:50Z
2008-08-04T11:24:50.000001-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1386
<p>Yikes! I've just discovered that the final part of the semanticweb.com's interview with <a href="http://www.cs.umd.edu/~hendler/2003/foaf.rdf#jhendler" id="link-id10483c28">Jim Hendler</a> and I, includes critical paragraphs that omit my example links :-( As you can imagine, this is a quite excruciating, bearing in mind that "Literals" are of marginal value in a <a href="http://dbpedia.org/resource/Linked_Data" id="link-id1418a578">Linked Data</a> world.</p> <p>Anyway, thanks to the Blogosphere, I can attempt to fix this problem myself -- via this post :-)</p> <p> <strong>Q. If you wanted to provide a bewildered but still curious novice a public example of <a href="http://dbpedia.org/resource/Linked_Data" id="link-id107e67d0">Linked Data</a> at work in their everyday life, what would it be?</strong> </p> <p> <strong><a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id13851f80">Kingsley Idehen</a>:</strong> Any one of the following:</p> <p> <a href="http://community.linkeddata.org/dataspace/person/kidehen2" id="link-id144c84b0">My Linking Open Data community Profile Page</a> - the <a href="http://dbpedia.org/resource/Linked_Data" id="link-id10339910">Linked Data</a> integration is exposed via the "Explore <a href="http://dbpedia.org/resource/Data">Data</a>" Tab <a href="http://demo.openlinksw.com/rdfbrowser2/?uri%5B%5D=http%3A%2F%2Fmyopenlink.net%2Fdataspace%2Fkidehen&" id="link-id12116d10">My Linked Data Space</a> - viewed via OpenLink's AJAR (Asynchronous Javascript and RDF) based <a href="http://dbpedia.org/resource/Linked_Data" id="link-id10728ed0">Linked Data</a> Brower <a href="http://demo.openlinksw.com/rdfbrowser2/?uri=http%3A%2F%2Fmyopenlink.net%2Fdataspace%2Fkidehen%2Fcalendar%2FKingsley%2527s%2520Calendar" id="link-id144ef138">My Events Calendar Tag Cloud</a> - a <a href="http://dbpedia.org/resource/Linked_Data" id="link-id0xb8fdf10">Linked Data</a> view of my Calendar Space using an RDF-aware browser In all cases, you have the ability to explore my data spaces by simply clicking on the links, which on the surface appear to be standard hypertext links, although in reality you are dealing with <a href="http://dbpedia.org/resource/Linked_Data" id="link-id142827a8">hyperdata</a> links (i.e., links to entities that result in the generation of <a href="http://dbpedia.org/resource/Entity" id="link-id144c8438">entity</a> description pages that expose <a href="http://dbpedia.org/resource/Entity" id="link-id149c9cf8">entity</a> properties via <a href="http://dbpedia.org/resource/Linked_Data" id="link-id10551628">hyperdata</a> links). Thus, you have a single page that describes me in a very rich way since it encompasses all data associated with me, covering: personal profile, <a href="http://dbpedia.org/resource/Blog" id="link-id10ac5148">blog</a> posts, bookmarks, <a href="http://dbpedia.org/resource/Tag" id="link-id122ff4b0">tag</a> clouds, social networks etc.</p> <p> <strong>Q. What would you show the CEO or CTO of a company outside the tech industry?</strong> </p> <p> <strong><a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id106143a8">Kingsley Idehen</a>:</strong> A link to the <a href="http://demo.openlinksw.com/Northwind/Customer/ALFKI#this" id="link-id106144e0">Entity ALFKI</a>, from the popular Northwind Database associated with Microsoft Access and <a href="http://dbpedia.org/resource/SQL" id="link-id10ca6f68">SQL</a> Server database installations. This particular link exposes a typical enterprise <a href="http://en.wikipedia.org/wiki/Data_Spaces" id="link-id14202348">data space</a> (orders, customers, employees, suppliers ...) in a single page. The <a href="http://dbpedia.org/resource/Linked_Data" id="link-id144e9070">hyperdata</a> links represent intricate data relationships common to most business systems that will ultimately seek to repurpose existing legacy data sources and SOA services as <a href="http://dbpedia.org/resource/Linked_Data">Linked Data</a>. Alternatively, I would show the same links via the <a href="http://dataviewer.zitgist.com/?uri=http%3A//demo.openlinksw.com/Northwind/Customer/ALFKI" id="link-id10e2e1d0">Zitgist Data Viewer</a> (another Linked Data-aware browser). In both cases, I am exploiting direct access to entities via HTTP due to the protocols incorporation into the Data Source Naming scheme.</p>
Missing Bits from semanticweb.com Interview
2008-06-13T13:01:40Z
2008-06-13T09:01:40.000003-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1373
<p>I just stumbled across a post titled: <a href="http://clarkparsia.com/weblog/2008/06/06/why-reasoning-matters-consistency-checking/" id="link-id11003f00">Why Reasoning Matters: Consistency Checking</a> from <a href="http://clarkparsia.com/about" id="link-id137e8bc0">Clark and Parsia</a> </p> <p>As you can see from my recent post about how we've started the process of <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1372" id="link-id100b7d20">inoculating DBpedia against the potential dangers of "contextual incoherence"</a>, we are entering a newer era in the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id106c35e0">Semantic Web</a>'s evolution. My post and the one from Clark & Parsia both touch different aspects of the "<a href="http://dbpedia.org/resource/Data">Data</a> Dictionary" for the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id0x9d80080">Semantic Web</a> issue.</p> <p>Note: in my universe of discourse, a Data Dictionary manifests when the constraints and class hierarchies defined in an ontology (e.g. a <a href="http://dbpedia.org/resource/World_Wide_Web">web</a> accessible shared ontology) are functionally bound to a data manager. Interestingly the binding can take the following forms:</p> <ul> <li>Engine Hosted - which is what you get with <a href="http://docs.openlinksw.com:80/virtuoso/rdfsparqlrule.html#rdfsparqlruleintro" id="link-id105c4408">Virtuoso's in-built Inference Engine</a> </li> <li>External - which is what you get when the Inference Engine is a distinct component from the data manager (example: <a href="http://pellet.owldl.org/owlgres" id="link-id13fa37f8">Owlgres</a> which can sit in front of 3rd party <a href="http://dbpedia.org/resource/SPARQL" id="link-id107127e8">SPARQL</a> endpoints via ARQ)</li> </ul> <p>The classification terminology I use above is very much off-the-cuff, its sole purpose is architectural distinction.</p> <p>Anyway, it's really nice to see that we are entering an era re. the <a href="http://dbpedia.org/resource/Semantic_Web">Semantic Web</a> vision, where the virtues of reasoning are getting simpler to demonstrate and articulate.</p> <p>In a nutshell, the point-point data integration era is coming to an end! The era of intelligent ontology based enterprise data integration is nigh!</p> <p>Of course, there is much more to come on the practical utility front, so stay tuned as we work our way through the <a href="http://dbpedia.org/resource/DBpedia" id="link-id10424078">DBpedia</a> inoculation program.</p>
Reasoning Matters Contd
2008-06-06T18:38:54Z
2008-06-06T14:38:54-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1372
<p> The current live instance of <a href="http://dbpedia.org/resource/DBpedia" id="link-id107c7b60">DBpedia</a> has just received dose #1 of a series of planned "<a href="http://dbpedia.org/resource/Context_%28language_use%29" id="link-id10d3ec78">Context</a>" oriented booster shots. These shots seek to to protect <a href="http://dbpedia.org/resource/DBpedia" id="link-id143648f0">DBpedia</a> from contextual incoherence as it grows in <a href="http://dbpedia.org/resource/Data">data</a> set expanse and popularity. Dose #1 (vaccine label: <a href="http://www.mpi-inf.mpg.de/~suchanek/downloads/yago/" id="link-id16d497d0">Yago</a>) equips <a href="http://dbpedia.org/resource/DBpedia" id="link-id13f90120">DBpedia</a> with a functional (albeit non exclusive) Data Dictionary component courtesy of the <a href="http://www.mpi-inf.mpg.de/~suchanek/downloads/yago/" id="link-id10509a08">Yago</a> Class Hierarchy .</p> <p> When the <a href="http://dbpedia.org/resource/DBpedia" id="link-id10a1b378">DBpedia</a> & <a href="http://www.mpi-inf.mpg.de/~suchanek/downloads/yago/" id="link-id10934068">Yago</a> integration took place last year (around WWW2007, Banff) there was a little, but costly omission that occurred: nobody sought to load the <a href="http://www.mpi-inf.mpg.de/~suchanek/downloads/yago/" id="link-id106e47f0">Yago</a> Class Hierarchy into the <a href="http://virtuoso.openlinksw.com" id="link-id13f90890">Virtuoso</a>'s Inference Engine :-(</p> <p> Anyway, the Class Hierarchy has now been loaded into the <a href="http://virtuoso.openlinksw.com" id="link-id100004f8">Virtuoso</a>'s inference engine (as <a href="http://virtuoso.openlinksw.com" id="link-id139900e8">Virtuoso</a> Inference Rules) and the following queries are now feasible using the live <a href="http://virtuoso.openlinksw.com" id="link-id109b02c8">Virtuoso</a> based <a href="http://dbpedia.org/resource/DBpedia" id="link-id143624d8">DBpedia</a> instance hosted by <a href="http://www.openlinksw.com/dataspace/organization/openlink#this" id="link-id12f918c0">OpenLink Software</a>:</p> <p> -- Find all Fiction Books associated with a property "dbpedia:name" that has literal value: "The Lord of the Rings" .</p> <p> <span _fck_bookmark="1" style="display: none; "> </span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">DEFINE input:inference "http://dbpedia.org/resource/inference/rules/yago#"<br /> </span></span> </p> <p> <span style="font-size:16px;"> <span style="font-family:courier new,courier,monospace;"><span class="Apple-style-span">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span> </span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">PREFIX dbpedia: &lt;http://dbpedia.org/property&gt;</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">PREFIX yago: &lt;http://dbpedia.org/class/yago&gt;&nbsp;</span></span> </p> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;"><br /> </span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">SELECT DISTINCT ?s</span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">FROM < xmlns="http" dbpedia.org="dbpedia.org">//dbpedia.org></span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">WHERE {</span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">?s a yago:Fiction106367107 .</span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">?s dbpedia:name "The Lord of the Rings"@en .</span></span> </div> <div> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">}</span></span> </div> <p> -- Variant of query with <a href="http://virtuoso.openlinksw.com" id="link-id10870920">Virtuoso</a>'s Full Text Index extension via the bif:contains function/magic predicate</p> <p> <span style="font-size:14px;"><span style="font-family:courier new,courier,monospace;">DEFINE input:inference "http://dbpedia.org/resource/inference/rules/yago#"</span></span> </p> <p> <span style="font-size:14px;"><span style="font-family:courier new,courier,monospace;">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">PREFIX dbpedia: &lt;http://dbpedia.org/property&gt;</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">PREFIX yago: &lt;http://dbpedia.org/class/yago&gt;&nbsp;</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">SELECT DISTINCT ?s ?n</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">FROM < xmlns="http" dbpedia.org="dbpedia.org">//dbpedia.org></span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">WHERE {</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">?s a yago:Fiction106367107 .</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">?s dbpedia:name ?n .</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">?n bif:contains 'Lord and Rings'</span></span> </p> <p> <span style="font-size:16px;"><span style="font-family:courier new,courier,monospace;">}</span></span> </p> <p> -- Retrieve all individuals instances of Fiction Class which should include all Books.</p> <p> <span style="font-family:courier new,courier,monospace;"><span style="font-size:16px;">DEFINE input:inference "http://dbpedia.org/resource/inference/rules/yago#"<br /> </span></span> </p> <p> <span style="font-family:courier new,courier,monospace;"> <span style="font-size:16px;"><span class="Apple-style-span">PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</span> </span></span> </p> <p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"> <span style="font-family:courier new,courier,monospace;"><span style="font-size:16px;">PREFIX dbpedia: &lt;http://dbpedia.org/property&gt;<br /> </span></span> </p> <p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"> <span style="font-family:courier new,courier,monospace;"><br /> </span> </p> <p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"> <span style="font-family:courier new,courier,monospace;"><span style="font-size:16px;">PREFIX yago: &lt;http://dbpedia.org/class/yago&gt;&nbsp;</span></span> </p> <div> <span style="font-family:courier new,courier,monospace;"><br /> </span> </div> <div> <span style="font-family:courier new,courier,monospace;">SELECT DISTINCT ?s</span> </div> <div> <span style="font-family:courier new,courier,monospace;">FROM < xmlns="http" dbpedia.org="dbpedia.org">//dbpedia.org></span> </div> <div> <span style="font-family:courier new,courier,monospace;">WHERE {</span> </div> <div> <span style="font-family:courier new,courier,monospace;">?s a yago:Fiction106367107 .</span> </div> <div> <span style="font-family:courier new,courier,monospace;">} LIMIT 50</span> </div> <p> Note: you can also move the inference pragmas to the <a href="http://virtuoso.openlinksw.com" id="link-id13dd0d20">Virtuoso</a> Sever side i.e place the inference rules in a server instance config file, thereby negating the need to place "define input:inference 'http://dbpedia.org/resource/inference/rules/yago#'" pragmas directly in your <a href="http://dbpedia.org/resource/SPARQL" id="link-id10dddd08">SPARQL</a> queries.</p> <h3> Related</h3> <ul> <li> Mike's <a href="http://www.mkbergman.com/?p=431" id="link-id13f2f318">UMBEL: Making Linked Data Classy</a>post</li> <li> Fred's announcement about the <a href="http://fgiasson.com/blog/index.php/2007/12/20/second-version-of-yago-more-facts-and-entities/" id="link-id10a1b178">Yago revamp en route to UMBEL</a> </li> <li> <uo> <a href="http://fgiasson.com/blog/index.php/2008/04/20/exploding-the-domain-umbel-web-services-by-zitgist/" id="link-id14363358">Expanding Data Object Domains via UMBEL</a> </uo> </li> <li> My <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=umbel&type=text&output=html" id="link-id1101ca98">Prior posts about UMBEL</a> </li> </ul>
DBpedia receives shot #1 of CLASSiness vaccine
2010-07-13T14:45:40Z
2010-07-13T10:45:40-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1361
<p> <a href="http://blogs.talis.com/nodalities/2008/05/kingsley-idehen-talks-about-openlink-software-linked-data-and-the-semantic-web.php" id="link-id1036b118">My podcast interview</a> with <a href="http://www.linkedin.com/in/pau1mi11er" id="link-id1026ed10">Paul Miller</a> of <a href="http://www.talis.com" id="link-id12d210d8">Talis</a> is out. As I listened to the podcast (naturally awkward affair) I got a first hand sense of Paul's mastery of the art of interviewing, even when dealing with a fast talking <a href="http://dbpedia.org/resource/Data" id="link-id180e1208">data</a> blitzers like me. Personally, I think I still talk a little too fast (the Nigerian in me), especially when the subject matter hones right into the epicenter of my professional passions: Open <a href="http://dbpedia.org/resource/Data" id="link-id1737a258">Data</a> Access and Heterogeneous <a href="http://dbpedia.org/resource/Data" id="link-id180f0668">Data</a> Integration (aka. <a href="http://dbpedia.org/resource/Virtual_Database" id="link-id10c62348">Virtual Database</a> Technology) -- so you may need to rewind every now and then during the interview :-)</p> <p>During this particular podcast interview, I deliberately wanted to have an conversation about the practical value of <a href="http://dbpedia.org/resource/Linked_Data" id="link-id180c9f88">Linked Data</a>, rather than the technical innards. The fundamental utility of <a href="http://dbpedia.org/resource/Linked_Data" id="link-id17387618">Linked Data</a> remains somewhat mercurial, and I am certainly hoping to do my bit at the upcoming <a href="http://dbpedia.org/resource/Linked_Data" id="link-id183ec288">Linked Data</a> Planet conference re. demonstrating and articulating <a href="http://dbpedia.org/resource/Linked_Data" id="link-id1401f250">linked data</a> value across the blurring realms of "the individual" and "the enterprise".</p> <p> <strong>Note to my old schoolmates on Facebook</strong>: when you listen to this podcast you will at least reconcile "Uyi Idehen" with "<a href="http://myopenlink.net/dataspace/person/kidehen#this" id="link-id180a7060">Kingsley Idehen</a>". Unfortunately, Facebook refuses to let me Identify myself in the manner I choose. Ideally, I would like to have the name: "Kingsley (Uyi) Idehen" associated with my Facebook ID since this is the Identifier known to my personal network of friends, family, and old schoolmates. This Identity predicament is a long running Identity case study in the making.</p>
My Talis Podcast re. Semantic Web, Linked Data, and OpenLink Software
2008-05-16T16:53:49Z
2008-05-16T12:53:49.000002-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1324
<p>These days I increasingly qualify myself and my Semantic Web advocacy as falling under the realm Linked Data. Thus, I tend to use the following introduction: I am <a href="http://kidehen.idehen.net/dataspace/person/kidehen%23this" id="link-idfd257f0">Kingsley Idehen</a>, of the Tribe <a href="http://dbpedia.org/resource/Linked_Data" id="link-idfec62f8">Linked Data</a>.</p> <p>The aforementioned qualification is increasingly necessary for the following reasons:</p> <ol> <li>The Semantic Web vision is broad and comprised of many layers</li> <li>A new era of confusion is taking shape just as we thought we had quelled the prior AI dominated realm of confusion</li> <li>None of the Semantic Web vision layers are comprehensible in practical ways without a basic foundation</li> <li>Open Data Access is the foundation of the Semantic Web (in prior post I used the term: <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1037" id="link-idfe71640">Semantic Web Layer 1</a>)</li> <li>URIs units of Open Data Access in Semantic Web parlance i.e.. each datum on the Web must have an ID (minted by the host Data Space).</li> </ol> <p>The terms <a href="http://dbpedia.org/resource/Giant_Global_Graph" id="link-id1224e020">GGG</a>, Linked Data, Data Web, Web of Data, and Web 3.0 (when I use this term) all imply URI driven Open Data Access for the Web Database (maybe call this <a href="http://dbpedia.org/resource/Open_Database_Connectivity" id="link-idfeb86e8">ODBC</a> for the Web) -- ability to point to records across data spaces without any adverse effect to the remote data spaces. It's really important to note that none of the aforementioned terms have nothing to do with the "Linguistic Meaning of blurb". Building a smarter document exposed via a URL without exposing descriptive data links doesn't provide open access to information data sources. </p> <p>As human beings we are all endowed with reasoning capability. But we can't reason without access to data. Dearth of openly accessible structured data is the source of many ills in cyberspace and across society in general. Today we still have Subjectivity reigning over Objectivity due to the prohibitive costs of open data access.</p> <p>We can't cost-effectively pursue objectivity without cost-effective infrastructure for creating alternative views of the data behind information sources (e.g. Web Pages). More Objectivity and less Subjectivity is what the next Web Frontier is about. At OpenLink we simply use the moniker: Analysis for All! Everyone becomes a data analyst in some form, and even better, the analysis are easily accessible to anyone connected to the Web. Of course, you will be able to share special analysis with your private network of friends and family, or if you so choose, not at all :-)</p> <p>Recap, it's important to note that Linked Data is the foundation layer of the Semantic Web vision. It's not only facilitates open data access, it also enables data integration (Meshing as opposed to Mashing) across disparate data schemas</p> <p>As demonstrated by <a href="http://dbpedia.org/about" id="link-idfe37fd8">DBpedia</a> and the <a href="http://richard.cyganiak.de/2007/10/lod/" id="link-idfeeef40">Linked Data Solar system</a> emerging around it, if you <a href="http://www.w3.org/Provider/Style/URI" id="link-idee98310">URI everything, then everything is Cool</a>.</p> <p>Linked Data and <a href="http://dbpedia.org/resource/Information_silo" id="link-idfcae4a0">Information Silos</a> are mutually exclusive concepts. Thus, you cannot produce a web accessible Information Silo and then refer to it as "Semantic Web" technology. Of course, it might be very Semantic, but it's fundamentally devoid of critical "Semantic Web" essence (<a href="http://dbpedia.org/resource/DNA" id="link-id10dddd08">DNA</a>).</p> <p>My acid test for any Semantic Web solution is simply this (using a <a href="http://dbpedia.org/resource/User_agent" id="link-idff7b4e8">Web User Agent or Client</a>):</p> <ol> <li>go to the profile page of the service</li> <li>ask for an RDF representation of my profile (by this I mean "get me the raw data in structured form")</li> <li>attempt to traverse the structured data graph (RDF) that the service provides via live de-referncable URIs.</li> </ol> <p>Here is the Acid test against my Data Space:</p> <ol> <li> <a href="http://kidehen.idehen.net/dataspace/person/kidehen" id="link-idfd2e5c8">My Profile Page</a> (HTML representation dispatched via an instance of <a href="http://en.wikipedia.org/wiki/OpenLink_Data_Spaces" id="link-id10d3d0f8">OpenLink Data Spaces</a>)</li> <li>Click on the "Linked Data Tab" (HTML representation endowed with Data Links the link to information resources containing other structured descriptions of things).</li> </ol>
Semantic Web Advocate of Tribe Linked Data! (Updated)
2008-03-20T20:29:47Z
2008-03-20T16:29:47-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1300
<p>In 2006, I stumbled across <a href="http://www.jasonkolb.com" id="link-id17165b98">Jason Kolb</a> (online) via a 4-part series of posts titled: <a href="http://www.jasonkolb.com/weblog/2006/08/reinventing_the_1.html" id="link-id14204cf8">Reinventing the Internet</a>. At the time, I realized that Jason was postulating about what is popularly known today as "<a href="http://en.wikipedia.org/wiki/Data_portability" id="link-id1412b280">Data Portability</a>", so I made contact with him (blogosphere style) via a post of my own titled: <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1033" id="link-id13b1cb20">Data Spaces, Internet Reinvention, and the Semantic Web</a>. Naturally, I tried to unveil to Jason the connection between his vision and the essence of the <a href="http://dbpedia.org/resource/Semantic_Web" id="link-id143117f0">Semantic Web</a>. Of course, he was skeptical :-)</p> <p>Jason recently moved to <a href="http://dbpedia.org/resource/Massachusetts" id="link-id13c4a470">Massachusetts</a> which lead to me pinging him about our earlier blogosphere encounter and the emergence of a <a href="http://dataportability.org/" id="link-id17395c60">Data Portability Community</a>. I also informed him about the fact that <a href="http://dbpedia.org/resource/Tim_Berners-Lee" id="link-id105507f0">TimBL</a>, myself, and a number of other Semantic Web technology enthusiasts, frequently meet on the 2nd Tuesday of each month at the <a href="http://dbpedia.org/resource/Massachusetts_Institute_of_Technology" id="link-id1719f798">MIT</a> hosted <a href="http://esw.w3.org/topic/CambridgeSemanticWebGatherings" id="link-id1734d460">Cambridge Semantic Web Gatherings</a>, to discuss, demonstrate, debate all aspects of the Semantic Web. Luckily (for both of us), Jason attended the last event, and we got to meet each other in person.</p> <p>Following our face to face meeting in Cambridge, a number of follow-on conversations ensued covering, Linked Data and practical applications of the Semantic Web vision. Jason writes about our exchanges a recent post titled: <a href="http://www.jasonkolb.com/weblog/2008/01/the-semantic-we.html" id="link-id13be6280">The Semantic Web</a>. His passion for Data Portability enabled me to use <a href="http://esw.w3.org/topic/FoafOpenid" id="link-id141516a8">OpenID and FOAF integration</a> to connect the Semantic Web and Data Portability via the Linked Data concept.</p> <p>During our conversations, Jason also eluded to the fact that he had already encountered <a href="http://en.wikipedia.org/wiki/OpenLink_Software" id="link-id17038218">OpenLink Software</a> while working with our <a href="http://data.openlinksw.com/oplweb/product_category/odbc#this" id="link-id14325f08">ODBC Drivers</a> (part of or <a href="http://data.openlinksw.com/oplweb/product_family/uda#this" id="link-id11ab1008">UDA product family</a>) for <a href="http://dbpedia.org/resource/Informix" id="link-id125858d0">IBM Informix</a> (<a href="http://data.openlinksw.com/oplweb/product/odbc-informix-st#this" id="link-id13b85e30">Single-Tier</a> or <a href="http://data.openlinksw.com/oplweb/product/odbc-informix-mt#this" id="link-id13edceb0">Multi-Tier</a> Editions) a few years ago (interesting random connection).</p> <p>As I've stated in the past, I've always felt that the Semantic Web vision will materialize by way of a global epiphany. The count down to this inevitable event started at the birth of the blogosphere, ironically. And accelerated more recently, through the emergence of <a href="http://dbpedia.org/resource/Web_2.0" id="link-id171d4ec8">Web 2.0</a> and <a href="http://dbpedia.org/page/Social_network" id="link-id140da830">Social Networking</a>, even more ironically :-)</p> <p>The blogosphere started the process of Data Space coalescence via RSS/Atom based semi-strucutured data enclaves, Web 2.0 RDFpropagated Web Service usage en route to creating service provider controlled, data and information silosRDF, Social NetworkingRDF brought attention to the fact that User Generated Data wasn't actually owned or controlled by the Data Creators etc.</p> <p>The emergence of "Data Portability" has created a palatable moniker for a clearly defined, and slightly easier to understand, problem: the meshing of Data and Identity in cyberspace i.e. individual points of presence in cyberspace, in the form of "Personal Data Spaces in the Clouds" (think: doing really powerful stuff with .name domains). In a sense, this is the critical inflection point between the document centric "Web of Linked Documents" and the data centric "Web or Linked Data". There is absolutely no other way solve this problem in a manner that alleviates the imminent challenges presented by information overload -- resulting from the exponential growth of user generated data across the Internet and enterprise Intranets.</p>
Semantic Data Web Epiphanies: One Node at a Time
2008-01-18T07:27:27Z
2008-01-18T02:27:27.000004-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1267
<blockquote> <p>"The phrase Open Social implies portability of personal and social data. That would be exciting but there are entirely different protocols underway to deal with those ideas. As some people have told me tonight, it may have been more accurate to call this "OpenWidget" - though the press wouldn't have been as good. We've been waiting for data and identity portability - is this all we get?" <br /> [Source: <a href="http://blogs.usnet.private:8893/[Excerpted from: http://feeds.feedburner.com/~r/readwriteweb/~3/178622741/opensocial_three_big_concerns.php]" id="link-id1143a428">Read/Write Web's Commentary & Analysis of Google's OpenSocial API</a>]</p> </blockquote> <blockquote> <p>..Perhaps the world will read the terms of use of the API, and realize this is not an open API; this is a free API, owned and controlled by one company only: Google. Hopefully, the world will remember another time when Google offered a free API and then pulled it. Maybe the world will also take a deeper look and realize that the functionality is dependent on Google hosted technology, which has its own terms of service (including adding ads at the discretion of Google), and that building an OpenSocial application ties Google into your application, and Google into every social networking site that buys into the Dream. Hopefully the world will remember. Unlikely, though, as such memories are typically filtered in the Great Noise....</p>[Source: <a href="http://burningbird.net/technology/terms/" id="link-id116f8c98">Poignant commentary excerpt from <a href="http://burningbird.net" id="link-id11216e98">Shelly Power's Blog</a></a> (as always)]</blockquote> <p>The "<a href="http://dbpedia.org/resource/Semantic_Web" id="link-id1102bc20">Semantic Data Web</a>" vision has always been about "Data & Identity" portability across the Web. Its been that and more from day one.</p> <p>In a nutshell, we continue to exhibit varying degrees of <a href="http://dbpedia.org/resource/Cognitive_dissonance" id="link-id121bb728">Cognitive Dissonance</a> re the following realities:</p> <ol> <li>The <a href="http://dbpedia.org/resource/Network" id="link-id114567b0">Network</a> is the Computer (Internet/Intranet/Extranet depending on your TCP/IP usage scenarios)</li> <li>The Web is the OS (ditto) and it provides a communications subsystem (<a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s+BLOG+%5B127%5D/1231" id="link-id1212b390">Information BUS</a>) comprised of</li> <ul>- <a href="http://dbpedia.org/resource/Hypertext_Transfer_Protocol" id="link-id11b1b760">HTTP</a> Protocol</ul> <ul>- <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier" id="link-id11043020">URI</a>s (pointer system for identifying, accessing, and manipulating data)</ul> <li>HTTP based Interprocess (i.e Web Apps are processes when you discard the HTML UI and interact with the application logic containers called "Web Services" behind the pages) ultimately hit data</li> <li>Web Data is best Modeled as a Graph (RDF, Containers/Items/Item Types, Property & Value Pairs associated with something, and other labels)</li> <li>Network are Graphs and vice versa</li> <li>Social Networks are graphs where nodes are connected via social connectors ( [x]--knows-->[y] ) </li> <li>The Web is a Graph that exposes a People and Data Network (to the degree we allude to humans not being data containers i.e. just nodes in a network, otherwise we are talking about a Data Network)</li> <li>Data access and manipulation depends inherently on canonical Data Access mechanisms such as Data Source Identifiers / Names (time-tested practice in various DBMS realms)</li> <li>Data is forever, it is the basis of Information, and it is increasing exponentially due to proliferation of Web Services induced user activities (User Generated Content)</li> <li>Survival, Vitality, Longevity, Efficiency, Productivity etc.. are all depend on our ability to process data effectively in a shrinking time continuum where Data and/or Information overload is the alternative.</li> </ol> <p> The Data Web is about Presence over Eyeballs due to the following realities:</p> <ol> <li>Eyeballs are input devices for a <a href="http://dbpedia.org/resource/DNA" id="link-id118b29a0">DNA</a> based processing system (Humans). The aforementioned processing system can reason very well, but simply cannot effectively process masses of data or information</li> <li>Widgets offer little value long term re. the imminent data and information overload dilemma, ditto Web pages (however pretty), and any other Eyeballs-only centric Web Apps</li> <li>Computers (machines) are equipped with inorganic (non DNA) based processing power, they are equipped to process huge volumes of data and/or information, but they cannot reason</li> <li>To be effective in the emerging frontier comprised of a Network Computer and a Web OS, we need an effective mechanism that makes best use of the capabilities possessed by humans and machines, by shifting the focus to creation and interaction with points of "Data Web Presence" that openly expose "<a href="http://dbpedia.org/resource/Data_structure" id="link-id10e56458">Structured Linked Data</a>". </li> </ol> <p>This is why we need to inject a mesh of Linked Data into the existing Web. This is what the often misunderstood vision of the "Semantic Data Web" or "Web of Data" or "Web or Structured Data" is all about. </p> <p>As stated earlier (point 10 above), "Data is forever" and there is only more of it to come! Sociality and associated Social Networking oriented solutions are at best a spec in the Web's ocean of data once you comprehend this reality.</p> <p>Note: I am writing this post as an early implementor of <a href="http://dbpedia.org/resource/GData" id="link-id11349808">GData</a> and an implementor of <a href="http://dbpedia.org/resource/Linked_Data" id="link-id120f3a68">RDF Linked Data</a> technology and a "Web Purist". </p> <blockquote> <p>OpenSocial implementation and support across our relevant product families: <a href="http://dbpedia.org/resource/Virtuoso_Universal_Server" id="link-id1217bf20">Virtuoso</a> (i.e the <a href="http://virtuoso.openlinksw.com/Whitepapers/html/VirtSpongerWhitePaper.html" id="link-id12154258">Sponger Middleware</a> for RDF component), <a href="http://virtuoso.openlinksw.com/wiki/main/Main/Ods" id="link-id11369930">OpenLink Data Spaces</a> (Data Space Controller / Services), and the <a href="http://oat.openlinksw.com/" id="link-id113e4da0">OpenLink Ajaxt Toolkit</a> (i.e OAT Widgets and Libraries), is a triviality now that the OpenSocial APIs are public. </p> </blockquote> <p>The concern I have, and the problem that remains mangled in the vast realms of Web Architecture incomprehension, is the fact that GData and GData based APIs cannot deliver Structured Linked Data in line with the essence of the Web without introducing "lock-in" that ultimately compromises the "Open Purity" of the Web. <a href="http://dbpedia.org/resource/Facebook" id="link-id11073980">Facebook</a> and Google's <a href="http://code.google.com/apis/opensocial/docs/" id="link-id1215e020">OpenSocial</a> response to the Facebook juggernaut (i.e. open variant of the Facebook Activity Dashboard and Social Network functionality realms, primarily), are at best icebergs in the ocean we know as the "World Wide Web". The nice and predictable thing about icebergs is that they ultimately melt into the larger ocean :-)</p> On a related note, I had the pleasure of attending the <a href="http://www.w3.org/2007/03/RdfRDB/" id="link-id1106f678">W3C's RDF and DBMS Integration Workshop</a>, last week. The event was well attended by organizations with knowledge, experience, and a vested interested in addressing the issues associated with exposing none RDF data (e.g. SQL) as RDF, and the imminence of data and/or information overload covered in different ways via the following presentations: <ul>- <a href="http://virtuoso.openlinksw.com/presentations/RDF_Mapping_Presentation_W3C_workshop3.ppt" id="link-id11053440">RDF Views of SQL Data</a> - <a href="http://www.openlinksw.com/weblog/oerling" id="link-id1218bf70">Orri Erling </a>on behalf of OpenLink Software</ul> <ul>- <a href="http://www.michaelbrodie.com/documents/Brodie%20VLDB%202007%20V3.zip" id="link-id11eda380">Computer Science 2.0</a> (covering User Generated Content Explosion) - Michael Brodie</ul> <ul>- <a href="http://www.w3.org/2007/03/RdfRDB/talks/Finding_our_way.ppt" id="link-id113b9620">Experiences re. solving SPARQL Access to Distributed Data Sources</a> - Phil Ashworth </ul> <ul>- <a href="http://www.w3.org/2007/03/RdfRDB/program" id="link-id11265180">Other presentations</a> </ul>.
Reminder: Why We Need Linked Data!
2007-11-02T22:52:34Z
2007-11-02T18:52:34-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1265
<p>A new release of Virtuoso is now available in both <a href="http://virtuoso.openlinksw.com/wiki/main/" id="link-id1282d260">Open Source</a> and <a href="http://virtuoso.openlinksw.com" id="link-id1317deb0">Commercial</a> variants. The main features and Enhancements associated with this release include:</p> <ul> * 64-bit Integer Support</ul> <ul> * RDF Sink Folders for WebDAV - enabling RDF Quad Store population by simply dropping RDF files into WebDAV or via HTTP (meaning you can use CURL as an RDF in put mechanism for instance)</ul> <ul>* Additional Sponger Cartridges from Audio binary files (i.e ID3 tag extraction and Music Ontology mapping which exposes the fine details of music as RDF based Structured Data; one for the DJs & Remixers out there!)</ul> <ul>* New Sponger Cartridges for Facebook, Freebase, Wikipedia, GRDDL, RDFa, eRDF and more</ul> <ul>* Support for PHP 5.2 runtime hosting (Virtuoso is a bona fide deployment platform for: Wordpress, MediaWiki, phpBB, Drupal etc.)</ul> <ul>* Enhanced UI for managing <a href="http://dbpedia.org/resource/Linked_Data" id="link-id12837b20">RDF Linked Data</a> deployment (covering Multi Homed domains, Virtual Directories associated with URL-rewrite rules</ul> <ul>* Demonstration Database includes <a href="http://www.openlinksw.com/virtuoso/Whitepapers/html/rdf_views/virtuoso_rdf_views_example.html" id="link-id130c2830">SQL-RDF Views </a>& SQL Table samples for the THALIA Web Data Integration benchmark and test-suite</ul> <ul>* Tutorial Application includes Linked Data style SQL-RDF Views for the Northwind SQL DBMS schema (which is the same as the standard Virtuoso demo atabase schema)</ul> <ul>* SQL-RDF Views implementation of the TPC-D benchmark (Yes, we can run this grueling SQL benchmark via RDF views of SQL Data!)</ul> <ul>* A new Amazon EC2 Image for Virtuoso that enables you to instantiate a fully configured instance comprising the Virtuoso core,<a href="http://virtuoso.openlinksw.com/wiki/main/Main/OdsIndex" id="link-id126c5eb8"> OpenLink Data Spaces</a> platform and the <a href="http://sourceforge.net/projects/oat" id="link-id1341cb68">OpenLink Ajax Toolkit</a> (OAT) (we now have bona fide Data Spaces in the Clouds as an addition to the emerging Semantic Data Web mesh).</ul> <p>Download Lnks: </p> <ul>* <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VOSDownload" id="link-id12745128">Open Source Edition</a> </ul> <ul>* <a href="http://download.openlinksw.com/download/product_matrix.vsp?p=f_os&fm=26&fam=2&df=16" id="link-id12f15ed0">Commercial Edition</a> </ul>
Virtuoso 5.0.2 Released!
2007-10-08T14:27:27Z
2007-10-08T10:27:27-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1261
<p>I've written extensively on the subject of <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=data%20spaces&type=text&output=html" id="link-id134c2280">Data Spaces</a> in relation to the <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=data%20web%0D%0A&type=text&output=html" id="link-id105aef90">Data Web</a> for while. I've also written sparingly about <a href="http://virtuoso.openlinksw.com/wiki/main/Main/OdsIndex" id="link-id105bd100">OpenLink Data Spaces</a> (a Data Web Platform that build using Virtuoso). On the other hand, I haven't shed much light on installation and deployment of OpenLink Data Spaces.</p> <p> <a href="http://blog.jonudell.net" id="link-id14347f20">Jon Udell</a> recently penned a post titled: <a href="http://blog.jonudell.net/2007/09/21/the-fourth-platform/" id="link-id1439ed48">The Fourth Platform</a>. The post arrives at a spookily coincidental time (this happens quite often between Jon and I as demonstrated last year during our <a href="http://weblog.infoworld.com/udell/gems/ju_idehen.mp3" id="link-id107d17a8">podcast</a>; the "Fourth" in his Innovators Podcast series).</p> <p>The platform that Jon describes is "Cloud Based" and comprised of Storage and Computation. I would like to add Data Access and Management (native and virtual) under the fourth platform banner with the end product called: "Cloud based Data Spaces". </p> <p>As I write, we are releasing a Virtuoso AMI (Amazon Image) labeled: virtuoso-dataspace-server. This edition of<a href="http://virtuoso.openlinksw.com" id="link-id13543210"> Virtuoso</a> includes the OpenLink Data Spaces Layer and all of the OAT applications we've been developing for a while.</p> <h2>What Benefits Does this offer?</h2> <ol> <li>Personal Data Spaces in the Cloud - a place where you can control and consolidate data across your Blogs, Wikis, RSS/Atom Feed Subscriptions, Shared Bookmarks, Shared Calendars, Discussion Threads, Photo Galleries etc</li> <li>All the data in your Data <a href="http://en.wikipedia.org/wiki/Data_Spaces">Space</a> is <a href="http://dbpedia.org/resource/SPARQL" id="link-id1149a4f8">SPARQL</a> or <a href="http://dbpedia.org/resource/GData" id="link-id107a9f28">GData</a> accessible.</li> <li>All of the data in your Personal Data Space is <a href="http://dbpedia.org/resource/Linked_Data">Linked Data</a> from the get go. Each Item of data is <a href="http://dbpedia.org/resource/Uniform_Resource_Identifier">URI</a> addressable</li> <li> <a href="http://dbpedia.org/resource/SIOC" id="link-id104f4160">SIOC</a> support - your Blogs, Wikis, Bookmarks etc.. are based on the SIOC ontology for Semantically Interlinking Online Communities (think: Open social-graph++) </li> <li> <a href="http://dbpedia.org/resource/Friend_of_a_friend" id="link-id105beb78">FOAF</a> support - your FOAF Profile page provides a URI that is an in-road to all Data in your Data Space.</li> <li> <a href="http://dbpedia.org/resource/OpenID" id="link-id1144e138">OpenID</a> support - your Personal Data Space ID is usable wherever OpenID is supported. OpenID and FOAF are integrated as per latest FOAF specs</li> <li>Two Integration with Facebook - You can access your Data Space from Facebook or access Facebook from your Data Space</li> <li>Unified Storage - The WebDAV based filesystem provides Cloud Storage that's integrated with Amazon S3; It also exposes all of your Data Space data via a traditional filesystem UI (think virtual Spotlight); You can also mount this drive to your local filesystem via your native operating system's WebDAV support</li> <li> <a href="http://dbpedia.org/resource/SyncML" id="link-id11128f48">SyncML</a> - you can sync calendar and contact details with your Data Space in the cloud from your Mobile phone.</li> <li>A practical Semantic Data Web solution - based on Web Infrastructure and doesn't require you to do anything beyond exposing URIs for data in your Data Spaces.</li> </ol> <h2> <a href="http://dbpedia.org/resource/Amazon_Elastic_Compute_Cloud" id="link-id115d1920">EC2</a>-AMI Details:</h2> <ul>AMI ID: ami-e2ca2f8b</ul> <ul>Manifest file: virtuoso-images/virtuoso-dataspace-server.manifest.xml</ul> <h2>Installation Guide:</h2> <ol> <li>Get an Amazon Web Services (AWS) account</li> <li>Signup for S3 and EC2 services</li> <li>Install the EC2 plugin for Firefox</li> <li>Start the EC2 plugin</li> <li>Locate the row containing <b>ami-7c31d515 Manifest virtuoso-test/virtuoso-cloud-beta-9-i386.manifest.xml </b>(sort using the AMI ID or Manifest Columns or search on pattern: virtuoso, due to name flux)</li> <li>Start the Virtuoso Data Space Server AMI</li> <li>Wait 4-5 minutes (*take a few minutes to create the pre-configured Linux Image*)</li> <li>Connect to http://<public_dns_name_of_your_instance>http://your-ec2-instance-cname:8890/ Log in with user/password dba/dba</public_dns_name_of_your_instance> </li> <li>Go to the Admin UI (Virtuoso Conductor) and change the PWDs for the 'dba' and 'dav' accounts (*Important!*)</li> <li>Give the "SPARQL" user "SPARQL_UPDATE" privileges (required if you want to exploit the in-built Sponger Middleware)</li> <li>Click on the <a href="http://dbpedia.org/resource/OpenLink_Data_Spaces">ODS</a> (OpenLink Data Spaces) link to start an Personal Editon of OpenLink Data Spaces (or go to: http://your-ec2-instance-cname/dataspace/ods/index.html)</li> <li>Log-in using the username and password credentials for the 'dav' account (or register a new user note: OpenID is an option here also) Create an Data Space Application Instance by clicking on a Data Space App. Tab</li> <li>Import data from your existing Web 2.0 style applications into OpenLink Data Spaces e.g. subscribe to a few RSS/Atom feeds via the "Feeds Manager" application or import some Bookmarks using the "Bookmarks" application</li> <li>Then look at the imported data in Linked Data form via your ODS generated URIs based on the patterns: http://your-ec2-instance-cname/dataspace/person/your-ods-id#this (URI for You the Person), http://your-ec2-instance-cname/dataspace/person/your-ods-id (FOAF File URI), http://your-ec2-instance-cname/dataspace/your-ods-id (SIOC File URI)<br /> </li> </ol> <h2> (OAT) from your Data Space instance</h2>Install the OAT VAD package via the Admin UI and then apply the URI patterns below within your browser:<br /> <ol> <li>http://<public_dns_name_of_your_instance>:8890/oatdemo - Entire OAT Demo Collection</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/rdfbrowser - RDF Browser</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/isparql - SPARQL Query Builder (iSPARQL)</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/qbe - SQL Query Builder (iSQL)</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/formdesigner - Forms Builder (for building Meshups based on RDF, SQL, or Web Servives Data Souces)</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/dbdesigner - SQL DB Schema Designer (note a Visual SQL-RDF Mapper is also on it's way</public_dns_name_of_your_instance> </li> <li>http://<public_dns_name_of_your_instance>:8890/DAV/JS/ - To view the OAT Tree (there are some experimental demos that are missing from the main demo app etc..) </public_dns_name_of_your_instance> </li> </ol> <p>There's more to come!</p>
Fourth Platform: Data Spaces in The Cloud (Update)
2008-10-26T21:59:33Z
2008-10-26T17:59:33-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1254
<p>The motivation behind this post is a response to the <a href="http://www.readwriteweb.com">Read/WriteWeb</a> post titled: <a href="http://www.readwriteweb.com/archives/semantic_web_difficulties_with_classic_approach.php">Semantic Web: Difficulties with the Classic Approach</a>.</p> <p>First off, I am going to focus on the Semantic Data Web aspect of the overall Semantic Web vision (a continuum) as this is what we have now. I am also writing this post as a deliberate contribution to the discourse swirling around the real topic: Semantic Web Value Proposition.</p> <h2>Situation Analysis</h2> <p>We are in the early stages of the long anticipated<a href="http://dbpedia.org/resource/Knowledge_economy"> Knowledge Economy</a>. That being the case, it would be safe to assume that information access, processing, and dissemination are of utmost importance to individuals and organizations alike. You don't produce knowledge in a vacum! Likewise, you can produce Information in a vacum, you need Data.</p> <h2>The Semantic Data Web's value to Individuals</h2> <b>Problem:</b> <p>Increasingly, <a href="http://dbpedia.org/resource/Blog">Blogs</a>, <a href="http://dbpedia.org/resource/Wiki">Wikis</a>, <a href="http://dbpedia.org/resource/Social_bookmarking">Shared Bookmarks</a>, Photo Galleries, Discussion Forums, Shared Calendars and the like, have become invaluable tools for individual and organizational participation in Web enabled global discourse (where a lot of knowledge is discovered). These tools, are typically associated with <a href="http://dbpedia.org/resource/Web_2">Web 2.0</a>, implying Read-Write access via <a href="http://dbpedia.org/resource/Web_service">Web Services</a>, centralized application hosting, and data lock-in (silos).</p> <p>The reality expressed above is a recipe for "<a href="http://dbpedia.org/resource/Information_overload">Information Overload</a>" and complete annihilation of ones effective pursuit and exploitation of knowledge due "Time Scarcity" (note: disconnecting is not an option). Information abundance is inversely related to available processing time (for humans in particular). In my case for instance, I was actively subscribed to over 500+ RSS feeds in 2003. As of today, I've simply stopped counting, and that's just my Weblog Data Space. Then add to that, all of the Discussions I track across Blogs, wikis, message boards, mailing lists, traditional usnet discussion forumns, and the like, and I think you get the picture. </p> <p>Beyond information overload, Web 2.0 data is "Semi-Structured" by way of it's dominant data containers ((X)HTML, RSS, Atom documents and data streams etc.) lacking semantics that formally expose individual data items as distinct entities, endowed with unambiguous naming / identification, descriptive attributes (a type of property/predicate), and relationships (a type of property/predicate).</p> <b>Solution:</b> <p>Devise a standard for Structured Data Semantics that is compatible with the <a href="http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1231">Web Information BUS</a>.</p> <p>Produce <a href="http://www.mkbergman.com/?p=153">structured data</a> (entities, entity types, entity relationships) from Web 1.0 and Web 2.0 resources that already exists on the Web such that individual entities, their attributes, and relationships are accessible and discernible to software agents (machines).</p> <p>Once the entities are individually exposed, the next requirement is a mechanism for selective access to these entities i.e. a query language. </p> <p> Semantic Data Web Technologies that facilitate the solution described above include:</p> <b>Structured Data Standards:</b> <ul> <a href="http://dbpedia.org/resource/RDF">RDF</a> - Data Model for structured data</ul> <ul>RDF/XML - A serialization format for RDF based structured data</ul> <ul> <a href="http://dbpedia.org/resource/Notation_3">N3</a> / <a href="http://dbpedia.org/resource/Turtle_%28syntax%29">Turtle</a> - more human friendly serialization formats for RDF based structured data</ul> <b>Entity Exposure & Generation:</b> <ul> <a href="http://dbpedia.org/resource/GRDDL">GRDDL</a> - enables association between XHTML pages and XSLT stylesheets that facilitates loosely coupled "on the fly" extraction of RDF from non RDF documents</ul> <ul> <a href="http://dbpedia.org/resource/RDFa">RDFa</a> - enables document publishers or viewers (i.e those repurposing or annotating) to embed structured data into existing XHTML documents</ul> <ul> <a href="http://research.talis.com/2005/erdf/wiki/Main/RdfInHtml">eRDF</a> - another option for embedding structured RDF data within (X)HTML documents</ul> <ul> <a href="http://www.openlinksw.com/blog/%7Ekidehen/?id=1172">RDF Middleware</a> - typically incorporating GRDDL, RDFa, eRDF, and custom extraction and mapping as part of a structured data production pipeline</ul>. <b>Entity Naming & Identification:</b> <p>Use of URIs or IRIs for uniquely identifying physical (HTML Documents, Image Files, Multimedia Files etc..) and abstract (People, Places, Music, and other abstract things). </p> <b>Entity Access & Querying:</b> <ul> <p> <a href="http://dbpedia.org/resource/SPARQL">SPARQL</a> Query Language - the <a href="http://dbpedia.org/resource/SQL">SQL</a> analog of the Semantic Data Web that enables query constructs that target named entities, entity attributes, and entity relationships</p> </ul> <ul> <a href="http://www.w3.org/TR/rdf-sparql-protocol/">SPARQL Protocol</a> - a <a href="http://dbpedia.org/resource/Representational_State_Transfer">REST</a> or <a href="http://dbpedia.org/resource/SOAP">SOAP</a> style Web Service for transporting SPARQL Queries to Structured Data Sources.</ul> <ul> <a href="http://www.w3.org/TR/rdf-sparql-XMLres/">SPARQL Results Serialization Formats</a> - query results serialization formats that includes XML(sparql+xml) and JSON.</ul> <h2>The Semantic Data Web's value to Organizations</h2> <b>Problem:</b> <p>Organizations are rife with a plethora of business systems that are built atop a myriad of database engines, sourced from a variety of DBMS vendors. A typical organization would have a different database engine, from a specific DBMS vendor, underlying critical business applications such as: Human Resource Management (HR), Customer Relationship Management (CRM), Accounting, Supply Chain Management etc. In a nutshell, you have DBMS Engines, and DBMS Schema heterogeneity permeating the IT infrastructure of organizations on a global scale, making Data & Information Integration the biggest headache across all IT driven organizations.</p> <b>Solution:</b> <p>Alleviation of the pain (costs) associated with Data & Information Integration. </p> <b>Semantic Data Web offerings:</b> <p>A dexterous data model (RDF) that enables the construction of conceptual views of disparate data sources across an organization based on existing web architecture components such as HTTP and URIs.</p> <p>Existing middleware solutions that facilitate the exposure of SQL DBMS data as RDF based Structured Data include:</p> <ul> <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VOSSQLRDF">Virtuoso's Meta Schema Language for RDF Views of SQL Data</a> (also see the <a href="http://virtuoso.openlinksw.com/Whitepapers/pdf/Virtuoso_SQL_to_RDF_Mapping.pdf">Virtuoso SQL-RDF Technical White Paper</a>)</ul> <ul> <a href="http://sites.wiwiss.fu-berlin.de/suhl/bizer/D2RQ/">D2RQ</a> </ul> <ul> <a href="http://ccnt.zju.edu.cn/projects/dartgrid">DataGrid</a> </ul> <ul> <a href="http://esw.w3.org/topic/RdfAndSql">Others</a> </ul> <p> BTW - There is an upcoming <a href="http://www.w3.org/2007/03/RdfRDB/">W3C Workshop covering the integration of SQL and RDF data</a>.</p> <h2>Conclusion</h2> <p>The Semantic Data Web is here, it's value delivery vehicle is the URI. The URI is a conduit to Interlinked Structured Data (RDF based Linked Data) derived from existing data sources on the World Wide Web alongside data continuously injected into the Web by organizations world wide. Ironically, the Semantic Data Web only platform that crystallizes the: Information at Your Fingertips vision, without development environment, operating system, application, or database lock-in. You simply click on a <a href="http://dbpedia.org/resource/Linked_Data">Linked Data URI</a> and the serendipitous exploration and discovery of data commences.</p> <p>The unobtrusive emergence of the Semantic Data Web is a reflection of the soundness of the underlying Semantic Web vision.</p> <p>If you are excited about <a href="http://dbpedia.org/resource/Mashup_%28web_application_hybrid%29">Mash-ups</a> then your are a Semantic Web enthusiast and benefactor in the making, because you only "Mash" (brute force data extraction and interlinking) because you can't "Mesh" (natural data extraction and interlinking). Likewise, if you are a social-networking, open social-graph, or portable social-network enthusiast, then you are also a Semantic Data Web benefactor and enthusiasts, because your "values" (yes, the values associated with the properties that define you e.g your interests etc) are the fundamental basis for portable, open, social-networking, which is what the Semantic Data Web hands to you on a platter without compromise (i.e. data lock-in or loss of data ownership).</p> <b>Some practical examples of Semantic Data Web prowess:</b> <ul> <a href="http://demo.openlinksw.com/DAV/JS/rdfbrowser/index.html?uri=http%3A%2F%2Fwww.readwriteweb.com%2Farchives%2Fsemantic_web_difficulties_with_classic_approach.php">Read/WriteWeb via the OpenLink Data Web Browser</a> (click on the different viewing tabs to see what structured data exploitation in action)</ul> <ul> <a href="http://browser.zitgist.com/?uri=http%3A//www.readwriteweb.com/archives/semantic_web_difficulties_with_classic_approach.php">Read/WriteWeb via the Zitgist Data Web Browser</a> </ul> <ul> <a href="http:/dbpedia.org">DBpedia</a> (*note: I deliberately use DBpedia URIs in my posts where I would otherwise have used a Wikipedia article URI*)</ul> <ul> <a href="http://wordpress.org/extend/plugins/zitgist-browser-linker/">Zitgist zLinks</a> - <a href="http://www.mkbergman.com/?p=400">Mike Bergman's Blog Post also demonstrating zLinks</a> </ul>
Semantic Web Value Proposition
2007-09-21T12:05:07Z
2007-09-21T08:05:07.000009-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1231
<p> <a href="http://www.wiwiss.fu-berlin.de/suhl/ueber_uns/team/chris_bizer.htm">Chris Bizer</a>, <a href="http://richard.cyganiak.de/">Richard Cyganiak</a>, and <a href="http://kmi.open.ac.uk/people/tom/html">Tom Heath</a> have just published a <a href="http://sites.wiwiss.fu-berlin.de/suhl/bizer/pub/LinkedDataTutorial/">Linked Data Publishing Tutorial</a> that provides a guide to the mechanics of Linked Data injection into the Semantic Data Web.</p> <p> On different, but related, thread, <a href="http://www.mkbergman.com">Mike Bergman</a> recently penned a post titled: <a href="http://feeds.feedburner.com/~r/AI3_AdaptiveInformation/~3/134989485/">What is the Structured Web?</a>. Both of these public contributions shed light on the "Information BUS" essence of the World Wide Web by describing the evolving nature of the payload shuttled by the BUS. </p> <h3>What is an Information BUS? </h3> <p>Middleware infrastructure for shuttling "Information" between endpoints using a messaging protocol.</p> <p>The Web is the dominant Information BUS within the Network Computer we know as the "Internet". It uses HTTP to shuttle information payloads between "Data Sources" and "Information Consumers" - what happens when we interact with Web via User Agents / Clients (e.g Browsers). </p> <h3>What are Web Information Payloads?</h3> <p>HTTP transported streams of contextualized data. Hence the terms: "Information Resource" and "Non Information" when reading material related to <a href="http://www.w3.org/2001/tag/doc/httpRange-14/2007-05-31/HttpRange-14#iddiv2104118728">http-range-14 and Web Architecture</a>. For example, an (X)HTML document is a specific data context (representation) that enables us to perceive, or comprehend, a data stream originating from a Web Server as a Web Page. On the other hand, if the payload lacks contextualized data, a fundamental Web requirement, then the resource is referred to as a "Non Information" resource. Of course, there is really no such thing as a "Non Information" resource, but with regards to Web Architecture, it's the short way of saying: "the Web Transmits Information only". That said, I prefer to refer to these "Non Information" resources as "Data Sources", are term well understood in the world of Data Access Middleware (ODBC, JDBC, OLEDB, ADO.NET etc.) and Database Management Systems (Relational, Objec-Relational, Object etc).</p> <p>Examples of Information Resource and Data Source URIs:</p> <ul> <a href="http://demo.openlinksw.com/Northwind/Customer/ALFKI">http://demo.openlinksw.com/Northwind/Customer/ALFKI</a> (Information Resource)</ul> <ul> <a href="http://demo.openlinksw.com/sparql?query=CONSTRUCT+{+%3Chttp%3A//demo.openlinksw.com/Northwind/Customer/ALFKI%23this%3E+%3Fp+%3Fo+}+FROM+%3Chttp%3A//demo.openlinksw.com/Northwind%3E+WHERE+{+%3Chttp%3A//demo.openlinksw.com/Northwind/Customer/ALFKI%23this%3E+%3Fp+%3Fo+}&format=application/rdf%2Bxml">http://demo.openlinksw.com/Northwind/Customer/ALFKI#this</a> (Data Source)</ul> <p>Explanation: The Information Resource is a conduit to the Entity identified by Data Source (an entity in my RDF Data Space that is the Subject or Object of one of more Triple based Statements. The triples in question can that can be represented as an RDF resource when transmitted over the Web via an Information Resource that takes the form of a SPARQL REST Service URL or a Physical RDF based Information Resource URL). </p> <h3>What about Structured Data?</h3> <p>Prior to the emergence of the Semantic Data Web, the payloads shuttled across the Web Information BUS comprised primarily of the following:</p> <ol> <li>HTML - Web Resource with presentation focused structure (Web 1.0 dominant payload form)</li> <li>XML - Web Resource with structure that separates presentation and data (Web 2.0's dominant payload form).</li> </ol> <p>The Semantic Data Web simply adds <a href="http://blogs.usnet.private:8893/Resource_Description_Framework">RDF</a> to the payload formats that shuttle the Web Information BUS. RDF addresses formal data structure which XML doesn't cover since it is semi-structured (distinct data entities aren't formally discernible). In a nutshell, an RDF payload is basically a conceptual model database packaged as an Information Resource. It's comprised of granular data items called "Entities", that expose fine grained properties values, individual and/or group characteristics (attributes), and relationships (associations) with other Entities.</p> <h3>Where is this all headed? </h3> <p>The Web is in the final stages of the 3rd phase of it's evolution. A phase characterized by the shuttling of structured data payloads (RDF) alongside less data oriented payloads (HTML, XHTML, XML etc.). As you can see, <a href="http://linkeddata.org">Linked Data</a> and Structured Data are both terms used to describe the addition of more data centric payloads to the Web. Thus, you could view the process of creating a Structured Web of Linked Data as follows:</p> <ol> <li>Identify or Create Structured Data Sources</li> <li>Name these Data Sources using Data Source URIs</li> <li>Expose Structured Data Sources to the Web as Linked Data using Information Resource (conduit) URIs</li> </ol> <h3>Conclusions</h3> <p>The Semantic Data Web is an evolution of the current Web (an Information Space) that adds structured data payloads (RDF) to current, less data oriented, structured payloads (HTML, XHTML, XML, and others).</p> <p>The Semantic Data Web is increasingly seen as an inevitability because it's rapidly reaching the point of critical mass (i.e. network effect kick-in). As a result, Data Web emphasis is moving away from: "What is the Semantic Data Web?" To: "How will Semantic Data Web make our globally interconnected village an even better place?", relative to the contributions accrued from the Web thus far. Remember, the initial "Document Web" (Web 1.0) bootstrapped because of the benefits it delivered to blurb-style content publishing (remember the term electronic brochure-ware?). Likewise, in the case of the "Services Web" (Web 2.0), the bootstrap occurred because it delivered platform independence to Web Application Developers - enabling them to expose application logic behind Web Services. It is my expectation that the Data Integration prowess of the Data Web will create a value exchange realm for data architects and other practitioners from the database and data access realms.</p> <h3>Related Items</h3> <ol> <li> <a href="http://www.mkbergman.com/?p=153">Mike Bergman's post about Semi-Structured Data</a> </li> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=structured%20data&type=text&output=html">My Posts covering Structured and Un-Structured Containers</a> </li> </ol>
Linked Data & The Web Information BUS
2007-08-08T22:26:55Z
2007-08-08T18:26:55-04:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1224
<p>Last week we <a href="http://www.openlinksw.com/press/virt_501.htm">officially released Virtuoso 5.0.1</a> (in Commercial and Open Source Editions). The press release provided us with an official mechanism and timestamp for the current Virtuoso feature set.</p> <p>A vital component of the new Virtuoso release is the finalization of our SQL to RDF mapping functionality -- enabling the declarative mapping of SQL Data to RDF. Additional technical insight covering other new features (delivered and pending) is provided by <a href="http://www.openlinksw.com/weblogs/oerling/">Orri Erling</a>, as part of a series of post-Banff posts.</p> <h2>Why is SQL to RDF Mapping a Big Deal?</h2> <p>A majority of the world's data (especially in the enterprise realm) resides in SQL Databases. In addition, Open Access to the data residing in said databases remains the biggest challenge to enterprises for the following reasons:</p> <ol> <li> SQL Data Sources are inherently heterogeneous because they are acquired with business applications that are in many cases inextricably bound to a particular DBMS engine </li> <li> Data is predictably dirty </li> <li> DBMS vendors ultimately hold the data captive and have traditionally resisted data access standards such as ODBC (*trust me they have, just look at the unprecedented bad press associated with ODBC the only truly platform independent data access API. Then look at how this bad press arose..*) </li> </ol> <p> Enterprises have known from the beginning of modern corporate times that data access, discovery, and manipulation capabilities are inextricably linked to the "Real-time Enterprise" nirvana (hence my use of 0.0 before this becomes 3.0).</p> <p>In my experience, as someone whose operated in the data access and data integration realms since the late '80s, I've painfully observed enterprises pursue, but unsuccessfully attain, full control over enterprise data (the prized asset of any organization) such that data-, information-, knowledge-workers are just a click away from commencing coherent platform and database independent data drill-downs and/or discovery that transcend intranet, internet, and extranet boundaries -- serendipitous interaction with relevant data, without compromise!</p> <p>Okay, situation analysis done, we move on.. </p> <p>At our most recent (<a href="http://esw.w3.org/topic/CambridgeSemanticWebGatherings/Meeting/2007-06-12_Gathering">12th June</a>) monthly <a href="http://esw.w3.org/topic/CambridgeSemanticWebGatherings">Semantic Web Gathering</a>, I unveiled to <a href="http://www.w3.org/People/Berners-Lee/card#i">TimBL</a> and a host of other attendees a simple, but powerful, demonstration of how <a href="http://en.wikipedia.org/wiki/Linked_Data">Linked Data</a>, as an aspect of the <a href="http://www.businessweek.com/technology/content/apr2007/tc20070409_961951.htm">Semantic Data Web</a>, can be applied to enterprise data integration challenges.</p> <h2>Actual SQL to RDF Mapping Demo / Experiment</h2> <h4>Hypothesis</h4> A SQL Schema can be effectively mapped declaratively to RDF such that SQL Rows morph into RDF Instance Data (Entity Sets) based on the Concepts & Properties defined in a Concrete Conceptual Data Model oriented Data Dictionary (<a href="http://www.w3schools.com/rdf/rdf_schema.asp">RDF Schema</a> and/or <a href="http://www.w3schools.com/rdf/rdf_owl.asp">OWL Ontology</a>). In addition, the solution must demonstrate how "Linked Data in the Web" is completely different from "Data on the Web" or "Linked Data on the Web" (btw - <a href="http://kasei.us/people/Tom_Heath/">Tom Heath</a> eloquently unleashed this point in his recent <a href="http://blogs.talis.com/nodalities/2007/06/tom_heath_talks_with_talis_abo.php">podcast interview with Talis</a>). <h4>Apparatus</h4> An Ontology - in this case we simply derived the <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/SQLRDFIntegraton/Explore_Northwind_Ontology.isparql">Northwind Ontology</a> from the XML Schema based CSDL (<a href="http://blogs.msdn.com/adonet/archive/2007/01/30/entity-data-model-part-1.aspx">Conceptual Schema Definition Language</a>) used by Microsoft's public <a href="http://astoria.mslivelabs.com/Default.aspx">Astoria demo</a> (specifically the <a href="http://astoria.mslivelabs.com/termsOfUseNorthwind.aspx?returnURL=Northwind">Northwind Data Services demo</a>). SQL Database Schema - <a href="http://www.microsoft.com/library/media/1033/technet/images/prodtechnol/sql/2000/maintain/sscpop07_big.gif">Northwind</a> (comes bundled with ACCESS, SQL Server, and Virtuoso) comprised of tables such as: <a href="http://www.openlinksw.com/schemas/northwind#Customer">Customer</a>, <a href="http://www.openlinksw.com/schemas/northwind#Employee">Employee</a>, <a href="http://www.openlinksw.com/schemas/northwind#Product">Product</a>, <a href="http://www.openlinksw.com/schemas/northwind#Category">Category</a>, <a href="http://www.openlinksw.com/schemas/northwind#Supplier">Supplier</a>, <a href="http://www.openlinksw.com/schemas/northwind#Shipper">Shipper</a> etc. <a href="http://www.openlinksw.com/virtuoso/">OpenLink Virtuoso</a> - SQL DBMS Engine (although this could have been any <a href="http://en.wikipedia.org/wiki/Open_Database_Connectivity">ODBC</a> or <a href="http://en.wikipedia.org/wiki/Java_Database_Connectivity">JDBC</a> accessible Database), <a href="http://www.openlinksw.com/virtuoso/Whitepapers/pdf/Virtuoso_SQL_to_RDF_Mapping.pdf">SQL-RDF Metaschema Language</a>, HTTP URL-rewriter, WebDAV Engine, and DBMS hosted XSLT processor Client Tools -<a href="http://demo.openlinksw.com/isparql/"> iSPARQL Query Builder</a>, <a href="http://demo.openlinksw.com/DAV/JS/rdfbrowser/index.html">RDF Browser</a> (which could also have been <a href="http://www.w3.org/2005/ajar/tab">Tabulator</a> or<a href="http://sites.wiwiss.fu-berlin.de/suhl/bizer/ng4j/disco/"> DISCO</a> or a standard Web Browser) <h4>Experiment / Demo</h4> <ol> <li> Declaratively map the Northwind SQL Schema to RDF using the Virtuoso Meta Schema Language (see: <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/SQLRDFIntegraton/northwind_sql_rdf.sql">Virtuoso PL based Northwind_SQL_RDF script</a>) </li> <li> Start browsing the data by clicking on the URIs that represent the RDF Data Model Entities resulting from the SQL to RDF Mapping </li> </ol> <h4>Observations</h4> <ol> <li> Via a single Data Link click I was able to obtain specific information about the Customer represented by the URI <a href="http://demo.openlinksw.com/Northwind/Customer/ALFKI">"ALFKI"</a> (act of URI Dereferencing as you would an Object ID in an Object or Object-Relational Database) </li> <li> Via a <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/SQLRDFIntegraton/Explore_Northwind.isparql">Dynamic Data Page </a> I was able to explore all the entity relationships or specific entity data (i.e Exploratory or Entity specific dereferencing) in the Northwind Data Space </li> <li> I was able to perform similar exploration (as per item 2) using our <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/SQLRDFIntegraton/Explore_Northwind_Customer_ALFKI.wqx">OpenLink Browser. </a> </li> </ol> <h4>Conclusions</h4> <p>The vision of data, information, or knowledge at your fingertips is nigh! Thanks to the infrastructure provided by the Semantic Data Web (URIs, <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF Data Model</a>, variety of RDF Serialization Formats[<a href="http://www.dajobe.org/2004/01/turtle/">1</a>][<a href="http://www.w3.org/DesignIssues/Notation3">2</a>][<a href="http://www.w3.org/TR/2002/WD-rdf-syntax-grammar-20020325/">3</a>], and Shared Data Dictionaries / Schemas / Ontologies [<a href="http://xmlns.com/foaf/spec/">1</a>][<a href="http://rdfs.org/sioc/spec/">2</a>][<a href="http://www.w3.org/TR/swbp-skos-core-guide/">3</a>][<a href="http://musicontology.com/">4</a>][<a href="http://bblfish.net/work/atom-owl/2006-06-06/AtomOwl.html">5</a>]) it's now possible to Virtualize enterprise data from the Physical Storage Level, through the Logical Data Management Levels (Relational), up to a Concrete Conceptual Model (Graph) without operating system, development environment or framework, or database engine lock-in.</p> <h2>Next Steps</h2> <p>We produce a shared ontology for the CRM and Business Reporting Domains. I hope this experiment clarifies how this is quite achievable by converting XML Schemas to RDF Data Dictionaries (RDF Schemas or Ontologies). Stay tuned :-) </p> <p>Also watch <a href="http://news.com.com/1606-2-6189377.html">TimBL amplify and articulate Linked Data value</a> in a recent interview.</p> <h2>Other Related Matters</h2> <p>To deliver a mechanism that facilitates the crystallization of this reality is a contribution of boundless magnitude (as we shall all see in due course). Thus, it is easy to understand why even "her majesty", the queen of England, simply had to get in on the act and <a href="http://www.royal.gov.uk/output/Page1880.asp">appoint TimBL to the "British Order of Merit</a>" :-)</p> <p>Note: All of the demos above now work with IE & Safari (a "remember what Virtuoso is epiphany") by simply putting Virtuoso's DBMS hosted XSLT engine to use :-) This also applies to my earlier collection of demos from the <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=hello%20data%20web&type=text&output=html">Hello Data Web</a> and other <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=.isparql&type=text&output=html">Data Web & Linked Data related demo style posts</a>.</p>
Enterprise 0.0, Linked Data, and Semantic Data Web
2008-02-05T04:19:26Z
2008-02-04T23:19:26.000001-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1144
<p>As I have stated, and implied, in various posts about the Data Web and burgeoning Semantic Web in general; the value of RDF is felt rather than seen (driven by presence as opposed to web sites). That said, it is always possible to use the visual Interactive-Web dimension (Web 1.0) as a conduit to the Data-Web dimension.</p> <p>In this third take on my introduction to the Data Web I would like to share a link with you (a Dynamic Start Page in Web 2.0 parlance) with a Data Web twist: You do not have to preset the Start Page Data Sources (this is a small-big thing, if you get my drift, hopefully!).</p> <p>Here are some Data Web based Dynamic Start Pages that I have built for some key play ers from the Semantic Web realm (in random order):</p> <ol> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/danbri_dataspace.isparql.xml">Dan Brickley</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/timbl_dataspace.isparql.xml">Tim Berners-Lee</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/danc_dataspace.isparql.xml">Dan Connolly</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/danja_dataspace.isparql.xml">Danny Ayers</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/planet_rdf_dataspace.isparql.xml">Planet RDF</a> </li> </ol> <p>"These are RDF prepped Data Sources....", you might be thinking, right? Well here is the reminder: <a href="http://www.openlinksw.com/blog/~kidehen/?id=1122">The Data Web is a Global Data Generation and Integration Effort</a>. Participation may be active (Semantic Web & Microformats Community), or passive (web sites, weblogs, wikis, shared bookmarks, feed subscription, discussion forums, mailing lists etc..). Irrespective of participation mode, RDF instance can be generated from close to anything (I say this because I plan to add binary files holding metadata to this mix shortly). Here are examples of Dynamic Start Pages for non RDF Data Sources:</p> <ol> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/delicious_web20_events_dataspace.isparql.xml">del.icio.us Web 2.0 Events Bookmarks</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/vecosys_dataspace.isparql.xml">Vecosys</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/techcrunch_dataspace.isparql.xml">Techcrunch</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/jonudell_dataspace.isparql.xml">Jon Udell's Blog</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/davewiner_dataspace.isparql.xml">Dave Winer's Scripting News</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/scobelizer_dataspace.isparql.xml">Robert Scoble's Blog</a> </li> </ol> <p>what about Microformats you may be wondering? Here goes:</p> <ol> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/microformats_dataspace.isparql.xml">Microformats Wiki</a> (click on the Brian Suda link for instance) </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/planet_microformats_dataspace.isparql.xml">Microformats Planet</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/delicious_microformats_dataspace.isparql.xml">Del.icio.us Microformats Bookmarks</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/ben_adida_dataspace.isparql.xml">Ben Adida's home page</a> (RDFa)</li> </ol> <p>Let's carry on.</p> <p>How about some traditional Web Sites? Here goes:</p> <ol> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/openlink_dataspace.isparql.xml">OpenLink Software's Home Page</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/oracle_dataspace.isparql.xml">Oracle's Home Page</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/apple_dataspace.isparql.xml">Apple's Home Page</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/microsoft_dataspace.isparql.xml">Microsoft's Home Page</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/ibm_dataspace.isparql.xml">IBM's Home Page</a> </li> </ol> <p>And before I forget, here is <a href="http://demo.openlinksw.com/DAV/home/demo/Public/Queries/kidehen_dataspace.isparql.xml">My Data Web Start Page </a>.</p> <p>Due to the use of Ajax in the Data Web Start Pages, IE6 and Safari will not work. For Mac OS X users, Webkit works fine. Ditto re. IE7 on Windows.</p>
Hello Data Web (Take 3 - Feel The "RDF" Force)
2007-02-24T22:01:28Z
2007-02-24T17:01:28-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1143
<p>I just overheard the following dialog between my six year old son and his play date:</p> <blockquote> <pre> Play Date: What is that thing on the Wall? My Son: Security Alarm Play Date: How does it work My Son: If you click on that top button and then open the door, I will have to enter a code when we come back in or the alarm will go off Play Date: What is the code? My Son: I can't tell you that! Play Date: Why not? My Son: You might come and steal something from our house! Play Date: No I won't! My Son: Well, you might tell someone that might come and steal something from our house! or that person could tell someone who could tell someone that would steal from our house</pre></blockquote> <p>LOL!! of course! At the same time wondering, how come a majority of adults don't quite see the need for granular access to Web Data in a manner that enables computers and humans to collectively arrive at similar decisions? </p> <p>Putting Data in context en route to producing actionable knowledge is a transient endeavor that engages a myriad of human senses. We demonstrate comprehension of this fact in our daily existence as social creatures (at a very early age as depicted above). That said, we seem to forget this fact when engaging the Web: If we can't see it then it can't be valuable.</p> <blockquote> <p>BTW - I just received a ping about the "<a href="http://www.flickr.com/photos/route79/399029535/">Sensory Web</a>" (which is just another way of describing a Data Driven Web experience from my vantage point.)</p> </blockquote> <p>In the popular M-V-C pattern you don't see the "M", but the "M" will kill you if you get it wrong (it is the FORCE)! Coming to think about it, the pattern could have been coined: V-C-M or C-M-V, but isn't for obvious reasons :-)</p> <p>RDF is the vehicle that enables us tap into the Data aspect of the Web. We started off with pages of blurb linked via hypertext (Web 1.0) and then looked to "Keywords" for some kind of data access; we then isolated some "Verbs" and discovered another dimension of Web Interaction (Web 2.0) but looked to these "Verbs" for data access which left us with Mashups; and now we are starting to extract "Nouns" and "Adjectives" from sentences (Subject, Predicate, Object - Triples) associated with resources on the Web (Data Web / Web 3.0 / Semantic Web Layer 1) which provides a natural data access substrate for <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=meshups&type=text&output=html">Meshups</a> (natural joining of disparate data from a plethora of data sources) while providing the foundation layer for the <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a>.</p> <p>For those who need use-cases that demonstrate tangible value re. the Semantic Web, here are some projects to note courtesy of the <a href="http://www.w3.org/2001/sw/sweo/">Semantic Web Education and Outreach</a> (SWEO) interest group: </p> <ol> <li> <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/FOAFWhitelisting">FOAF based White-lists</a> - Attacking SPAM </li> <li> <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData">Open Data Access and Linking for the Data Web</a> - Data Integration and Generation effort that creates a cluster of RDF instance data from a myriad of data sources relating to every day things such as: People, Places, Events, Projects, Discussions, Music, Books, and other things </li> <li> <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/PowderExtension">Content Labeling</a> - Protecting our kids on the Web amongst other matters relating to knowledge about data sources </li> <li> <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects">Others..</a> </li> </ol> Related posts: <ol> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q=rdf%20data%20integration&type=text&output=html">Data Web and Global Data Integration & Generation Effort</a> </li> <li> <a href="http://www.openlinksw.com/weblog/public/search.vspx?blogid=127&q='data%20web'&type=text&output=html">Previous Data Web posts</a>.</li> </ol>
Our Basic Human Instincts
2007-02-24T00:55:49Z
2007-02-23T19:55:49-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1129
<blockquote> <p> <a href="http://ajaxian.com/archives/oat-openajax-alliance-compliant-toolkit">OAT: OpenAjax Alliance Compliant Toolkit</a>: "</p> <p>Ondrej Zara and his team at Openlink Software have created a Openlink Software JS Toolkit, known as OAT. It is a full-blown JS framework, suitable for developing<br /> rich applications with special focus to data access.</p> <p>OAT works standalone, offers vast number of widgets and has some rarely seen features, such as on-demand library loading (which reduces the total amount of downloaded JS code).</p> <p>OAT is one of the first JS toolkits which show full OpenAjax Alliance conformance: see the appropriate <a href="http://www.openajax.org/member/wiki/InteropFest_2007_March)">wiki page</a> and <a href="http://demo.openlinksw.com/DAV/JS/openajax/HubTest-OATConformance.html">conformance test page</a>.</p> <p>There is a lot to see with this toolkit:</p> <p>You can see some of the widgets in a <a href="http://demo.openlinksw.com/DAV/JS/demo/index.html">Kitchen sink application</a> </p> <p>Sample data access applications:</p> <ul> <li> <a href="http://demo.openlinksw.com/DAV/JS/qbe/index.html">SQL Query By Example</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/formdesigner/index.html">Forms designer</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/designer/index.html">DB Designer</a> </li> </ul> <p>OAT is Open Source and GPL’ed over at <a href="http://sourceforge.net/project/showfiles.php?group_id=168143">sourceforge</a> and the team has recently managed to incorporate our OAT data access layer as a<br /> module to <a href="http://demo.openlinksw.com/DAV/JS/dojo-oatstore-demo/test_OATStore_in_FilteringTable.html">dojo datastore</a>.</p> <p>(Via <a href="http://ajaxian.com">Ajaxian Blog</a>.)</p> </blockquote> <p>This is a corrected version of the initial post. Unfortunately, the initial post was inadvertently littered with invalid links :-( Also, since the original post we have released <a href="http://sourceforge.net/mailarchive/forum.php?thread_id=31568932&forum_id=49207">OAT 1.2</a> that includes integration of our iSPARQL QBE into the OAT Form Designer application.</p> <p>Re. Data Access, It is important to note that OAT's Ajax Database Connectivity layers supports data binding to the following data source types:</p> <ol> <li> <a href="http://www.w3.org/TR/rdf-primer/">RDF</a> - via <a href="http://demo.openlinksw.com/DAV/JS/demo/index.html?dav">SPARQL</a> (Query Language, Protocol, and Resultset Serialization formats: <a href="http://en.wikipedia.org/wiki/JDBC">RDF/XML</a>, <a href="http://www.w3.org/DesignIssues/Notation3">RDF/N3</a>, <a href="http://www.dajobe.org/2004/01/turtle/">RDF/Turtle</a>, <a href="http://en.wikipedia.org/wiki/XML">XML</a>, and <a href="http://www.json.org/">JSON</a>)</li> <li> <a href="http://en.wikipedia.org/wiki/SQL">SQL</a> - via <a href="http://www.xmla.org/faq.asp">XMLA</a> (somewhat forgotten <a href="http://en.wikipedia.org/wiki/SOAP">SOAP</a> protocol for SQL Data Access that can sit atop <a href="http://en.wikipedia.org/wiki/Open_Database_Connectivity">ODBC</a>, <a href="http://en.wikipedia.org/wiki/ADO.NET">ADO.NET</a>, <a href="http://en.wikipedia.org/wiki/OLE_DB">OLE-DB</a>, and even <a href="http://en.wikipedia.org/wiki/JDBC">JDBC</a>)</li> <li>XML - via SOAP or <a href="http://en.wikipedia.org/wiki/Representational_State_Transfer">REST</a> style Web Services</li> </ol> In all cases, OAT also provides Data Aware controls for the above that include: <ol> <li> <a href="http://demo.openlnksw.com/DAV/JS/demo/index.html?grid">Tabular Grids</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/demo/index.html?pivot">Pivot Tables</a> </li> <li> <a href="http://demo.openlnksw.com/DAV/JS/demo/index.html?timeline">TimeLines</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/demo/index.html?anchor">Extended Anchor Tags</a> </li> <li> <a href="http://demo.openlnksw.com/DAV/JS/demo/index.html?mashups">Map Service Controls</a> (Google, Yahoo!, OpenLayers, Microsoft Visual Earth)</li> <li> <a href="http://demo.openlinksw.com/DAV/JS/demo/index.html?rdf">SVG based RDF Graph Control</a> (Opera 9.x provides best viewing experience at the current time)</li> </ol> <p>OAT also includes a number of prototype applications that are completely developed using OAT Controls and Libraries:</p> <ol> <li> <a href="http://demo.openlnksw.com/isparql/">Visual SPARQL Query Builder</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/qbe/index.html">Visual SQL Query Builder</a> </li> <li> <a href="http://demo.openlinksw.com/DAV/JS/formdesigner/index.html">Web Forms Designer</a> (includes Drag-Drop usage of Data Aware Controls etc.)</li> <li> <a href="http://demo.openlinksw.com/DAV/JS/designer/index.html">Visual DB Designer</a> </li> </ol> <p>Note: Pick "Local DSN" from page initialization dialog's drop-down list control when prompted</p>
OAT: OpenAjax Alliance Compliant Toolkit (Live Links Version)
2007-02-02T15:29:55Z
2007-02-02T10:29:55-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com
http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1064
<p>A new technical white paper about our declarative language for SQL Schema to RDF Ontology Mapping has just been published.</p> <h2>What is this?</h2> <p>A declarative language adapted from SPARQL's graph pattern language (N3/Turtle) for mapping SQL Data to RDF Ontologies. We currently refer to this as a Graph Pattern based RDF VIEW Definition Language.</p> <h2>Why is it important?</h2> <p>It provides an effective mechanism for exposing existing SQL Data as virtual RDF Data Sets (Graphs) negating the data duplication associated with generating physical RDF Graphs from SQL Data en route to persistence in a dedicated Triple Store. </p> <p>Enterprise applications (traditional and web based) and most Web Applications (Web 1.0 and Web 2.0) sit atop relational databases, implying that SQL/RDF model and data integration is an essential element of the burgeoning "Data Web" (Semantic Web - Layer 1) comprehension and adoption process.</p> <p>In a nutshell, this is a quick route for non disruptive exposure of existing SQL Data to SPARQL supporting RDF Tools and Development Environments.</p> <h2>How does it work?</h2> <h3>RDF Side</h3> <ol> <li>locate one or more Ontologies (e.g FOAF, SIOC, AtomOWL, SKOS etc.) that effectively defines the Concepts (Classes) and Terms (Predicates) to be exposed via your RDF Graph</li> <li>Using the Virtuoso's RDF View Definition Language declare a International Resource Identifier (or URI) for your Graph. Example:<pre>CREATE GRAPH IRI("http://myopenlink.net/dataspace")</pre> </li> <li>Then create Classes (Concepts), Class Properties/Predicates (Memb), and Class Instances (Inst) for the new Graph. Example: <pre>CREATE IRI CLASS odsWeblog:feed_iri "http://myopenlink.net/dataspace/kidehen/weblog/MyFeeds" ( in memb varchar not null, in inst varchar not null)</pre> </li> </ol> <h3>SQL Side</h3> <ol> <li>If Virtuoso isn't your SQL Data Store, Identify the ODBC or JDBC SQL data source(s) containing the SQL data to be mapped to RDF and then link the relevant tables into Virtuoso's Virtual DBMS Layer</li> <li>Then use the RDF View Definition Language's graph pattern feature to generate SQL to RDF Mapping Template for your Graph. As shown in this <a href="http://www.usnet.private:8889/wiki/main/Main/VOSSQLRDF#MappingOdsBlogToAtomOwlExample">ODS Weblog -> AtomOWL Mapping example</a>.</li> </ol>
Virtuoso's SQL Schema to RDF Ontology Mapping Language (1.0)
2006-11-17T23:24:25Z
2006-11-17T18:24:25-05:00
Kingsley Uyi Idehen
kidehen@openlinksw.com