Details
Kingsley Idehen
Lexington, United States
Subscribe
Post Categories
Recent Articles
Display Settings
Translate
|
Showing posts in all categories Refresh
My Talis Podcast re. Semantic Web, Linked Data, and OpenLink Software
My podcast interview with Paul Miller of Talis is out. As I listened to the podcast (naturally awkward affair) I got a first hand sense of Paul's mastery of the art of interviewing, even when dealing with a fast talking data blitzers like me. Personally, I think I still talk a little too fast (the Nigerian in me), especially when the subject matter hones right into the epicenter of my professional passions: Open Data Access and Heterogeneous Data Integration (aka. Virtual Database Technology) -- so you may need to rewind every now and then during the interview :-)
During this particular podcast interview, I deliberately wanted to have an conversation about the practical value of Linked Data, rather than the technical innards. The fundamental utility of Linked Data remains somewhat mercurial, and I am certainly hoping to do my bit at the upcoming Linked Data Planet conference re. demonstrating and articulating linked data value across the blurring realms of "the individual" and "the enterprise".
Note to my old schoolmates on Facebook: when you listen to this podcast you will at least reconcile "Uyi Idehen" with "Kingsley Idehen". Unfortunately, Facebook refuses to let me Identify myself in the manner I choose. Ideally, I would like to have the name: "Kingsley (Uyi) Idehen" associated with my Facebook ID since this is the Identifier known to my personal network of friends, family, and old schoolmates. This Identity predicament is a long running Identity case study in the making.
|
05/16/2008 00:10 GMT
|
Modified:
05/16/2008 10:33 GMT
|
On "Semantic", "Semantic Web", and "Linked Data Web"
Nova Spivack has just penned a post titled: On the Difference Between "Semantic" and "Semantic Web", where he covers the fundamental difference between "Semantic" (what I call "Semantics Inside") and "Semantic Web" applications. I would like to extend the distinctions further by adding the "Linked Data Web" distinctions to the developing discourse.
The Linked Data Web (aka. Linked Data) describes RDF data injected into the Web, where the Data Object Identifiers (URIs) in an RDF graph (collection of RDF triples) are endowed with HTTP based URIs. The net effect of this approach to Data Object Identity is that it facilitates "Open Data Access by Reference" on the Web (aka data dereferencing).
If you recall pre Web ubiquity, in the enterprise realm for instance, Open Database Connectivity (ODBC) emerged as a mechanism for separating Data Access and Data Management in the database oriented Client-Sever model. Although ODBC gave you access to data, the data access entry point took the form of a data access specific naming mechanism called a "Data Source Name" (DSN). ODBC DSNs typically exposed Tables or Views. The same thing applies to JDBC where a non HTTP based URN scheme applies.
Zip forward to where we are today on the Web; the Web is evolving from a Document centric Database to a Distributed Object Database, and you should see that in Linked Data we are now truly looking at the best of all worlds: Web Open Database Connectivity (WODBC) with the following advantages:
- direct Access to a single Record (an Entity) or Record Sets (RDF based Entity Sets) by reference over HTTP across disparate Data Spaces on the Web
- the ability to mesh disparate data sources without being impeded by back-end DBMS engine model, vendor, host operating development frameworks, or host operating system specificity
- an opportunity to learn from the enterprise DBMS market and Client-Server markets of yore with regards to the shape and form of next generation Linked Data Web oriented solutions.
To conclude, we now have "Semantics Inside" (RDF or non RDF), "Semantic Web" (RDF graphs with Object Identifiers that may or may not be HTTP based), and "Linked Data Web" (RDF graphs with Object Identifiers that must be HTTP based and dereferencable) oriented applications, in the emerging landscape associated with the "Semantics" moniker.
As per usual, this post is a record in my Blog oriented Data Space on the Web. The permalink of this post is a URI constructed with Giant Global Graph enrichment in mind :-)
|
05/15/2008 14:11 GMT
|
Modified:
05/15/2008 14:51 GMT
|
Comments about recent Semantic Gang Podcast
After listening to the latest Semantic Web Gang podcast, I found myself agreeing with some of the points made by Alex Iskold, specifically:
-- Business exploitation of Linked Data on the Web will certainly be driven by the correlation of opportunity costs (which is more than likely what Alex meant by "use cases") associated with the lack of URIs originating from the domain of a given business (Tom Heath: also effectively alluded to this via his BBC and URI land grab anecdotes; same applies Georgi's examples)
-- History is a great tutor, answers to many of today's problems always lie somewhere in plain sight of the past.
Of course, I also believe that Linked Data serves Web Data Integration across the Internet very well too, and the fact that it will be beneficial to businesses in a big way. No individual or organization is an island, I think the Internet and Web have done a good job of demonstrating that thus far :-) We're all data nodes in a Giant Global Graph.
Daniel lewis did shed light on the read-write aspects of the Linked Data Web, which is actually very close to the callout for a Wikipedia for Data. TimBL has been working on this via Tabulator (see Tabulator Editing Screencast), Bengamin Nowack also added similar functionality to ARC, and of course we support the same SPARQL UPDATE into an RDF information resource via the RDF Sink feature of our WebDAV and ODS-Briefcase implementations.
|
05/02/2008 21:44 GMT
|
Modified:
05/05/2008 20:06 GMT
|
In Perpetual Pursuit of Context
I've always been of the opinion that concise value proposition articulation shouldn't be the achilles of the Semantic Web. As the Linked Data wave climbs up the "value Appreciation and Comprehension chain", it's getting clearer by the second that "Context" is a point of confluence for Semantic Web Technologies and easy to comprehend value, from the perspectives of those outside the core community.
In today's primarily Document centric Web, the pursuit of Context is akin to pursuing a mirage in a desert of user generated content. The quest is labor intensive, and you ultimaely end up without water at the end of the pursuit :-)
Listening to the Christine Connor's podcast interview with Talis simply reinforces my strong belief that "Context, Context, Context" is the Semantic Web's equivalent of Real Estate's "Location, Location, Location" (ignore the subprime loans mess for now). The critical thing to note is that you cannot unravel "Context" from existing Web content without incorporating powerful disambiguation technology into an "Entity Extraction" process. Of course, you cannot even consider seriously pursing any entity extraction and disambiguation endeavor without a lookup backbone that exposes "Named Entities" and their relationships to "Subject matter Concepts" (BTW - this is what UMBEL is all about). Thus, when looking at the broad subject of the Semantic Web, we can also look at "Context" as the vital point of confluence for the Data oriented (Linked Data) and the "Linguistic Meaning" oriented perspectives.
I am even inclined to state publicly that "Context" may ultimately be the foundation for 4th "Web Interaction Dimension" where practical use of AI leverages a Linked Data Web substrate en route to exposing new kinds of value :-)
"Context" may also be the focal point of concise value proposition articulation to VCs as in: "My solution offers the ability to discover and exploit "Context" iteratively, at the rate of $X.XX per iteration, across a variety of market segments :-)
|
05/02/2008 19:18 GMT
|
Modified:
05/02/2008 16:25 GMT
|
XTech Talks covering Linked Data
Courtesy a post by Chris Bizer to the LOD community mailing list, here is a list of Linked Data oriented talks at the upcoming XTech 2008 event (also see the XTech 2008 Schedule which is Linked Data friendly). Of course, I am posting this to my Blog Data Space with the sole purpose of adding data to the rapidly growing Giant Global Graph of Linked Data, basically adding to my collection of live Linked Data utility demos :-)
Here is the list:
-
Linked Data Deployment (Daniel Lewis, OpenLink Software)
-
The Programmes Ontology (Tom Scott, BBC and all)
-
SemWebbing the London Gazette (Jeni Tennison, The Stationery Office)
-
Searching, publishing and remixing a Web of Semantic Data (Giovanni Tummarello, DERI Galway)
-
Building a Semantic Web Search Engine: Challenges and Solutions (Aidan Hogan, DERI Galway)
- 'That's not what you said yesterday!' - evolving your Web API (Ian Davis, Talis)
-
Representing, indexing and mining scientific data using XML and RDF: Golem and CrystalEye (Andrew Walkingshaw,
University of Cambridge)
For the time challenged (i.e. those unable to view this post using it's permalink / URI as a data source via the OpenLink RDF Browser, Zitgist Data Viewer, DISCO Hyperdata Browser, or Tabulator), the benefits of this post are as follows:
- automatic URI generation for all linked items in this post
- automatic propagation of tags to del.icio.us, Technorati, and PingTheSemanticWeb
- automatic association of formal meanings to my Tags using the MOAT Ontology
- automatic collation and generation of statistical data about my tags using the SCOT Ontology (*missing link is a callout to SCOT Tag Ontology folks to sort the project's home page URL at the very least*)
- explicit typing of my Tags as SKOS Concepts.
Put differently, I cost-effectively contribute to the GGG across all Web interaction dimensions (1.0, 2.0, 3.0) :-)
|
05/02/2008 14:53 GMT
|
Modified:
05/02/2008 16:25 GMT
|
Linked Data enters state of Evoluation
During a brief chat with Michael Hausenblas about a new Linked Data project he is championing called: LForum, I made a freudian slip, in the form of the typo: Evoluation, which at the time was supposed to have been: Evolution. Anyway, we had a chuckle and realized we were on to something, so I proceeded to formalize the definition:
Evoluation is evolution devoid of the randomness of mutation. A state of being in which it is possible to evaluate and choose evolutionary paths.
Evoluation actually describes where we are today in relation to the World Wide Web; to the Linking Open Data community (LOD), it's taking the path towards becoming a Giant Global Graph of Linked Data; to the Web 2.0 community, it's simply a collection of Web Services and associated APIs; and to many others, it remains an opaque collection of interlinked documents.
The great thing about the Web is that it allows netizens to explore a plethora of paths without adversely affecting the paths of others. That said, controlling one's path may take mutation out of evolution, but we are still left with the requirement to adapt and eventually survive in a competitive environment. Thus, although we can evaluate and choose from the many paths the Web's evolution offers us, the path that delivers the most benefits ultimately dominates. :-)
Back to LForum
This project is about generating Linked Data from existing boardscape data spaces e.g the content of phpBB based discussion forums, in line with my earlier post about the changing Linked Data landscape and the rising desire for automagically generated Linked Data URIs and/or entire Linked Data Spaces from existing (X)HTML Web Pages. Of course, this means that the following takes place under the covers:
- Subject matter Concept Extraction
To effectively achieve the steps above, there has to be an underlying "Lookup" and "Reference" oriented Linked Data substrate (a backbone in a sense). This is essentially what the UMBEL effort achieves through the use of SKOS to construct a Subject matter oriented Concept Scheme that incorporates Concepts and Named Entities from OpenCyc and Yago respectively.
|
04/29/2008 19:56 GMT
|
Modified:
04/29/2008 23:46 GMT
|
Linked Data Trip Report - Part 1 (Update 2)
Typo cleansed edition :-)
Objectives
- Meet LOD Community Members
- Participate in Workshop
Meeting LOD Community Members
Although the Web continues to shrink the planet by removing the restrictions of geopgrahic location, meeting people face-to-face remains invaluable (*priceless in Mastercard AD speak*). Naturally, meeting and chatting with as many LOD community members as possible was high up on my agenda.
Participate in Workshop
As one of the co-chairs of the Linking Open Data Workshop (LODW), I had a 5 minute workshop opening slot during which I spoke about the following:
Where we are today:
We have DBpedia as a major hub on the burgeoning Linked Data Web. When OpenLink offered to host DBpedia (a combination of Virtuoso DBMS Software and sizable backend Hardware infrastructure), it did so knowing that such an effort would emphatically address the "chicken and egg" conundrum that, prior to this undertaking, stifled the ability to demonstrate practical utility of HTTP based Linked Data.
Today, the Linked Data bootstrap mission has been accomplished.
Where we go next:
Although DBpedia is a hub (ground zero of Linked Data), we have to put it into perspective in relation to a new set of needs and expectations moving forward. Today, DBpedia is a Sun at the heart of a Solar System within the Linked Data Galaxy. But unlike Space as we know it, in Cyberspace we can have connectivity and collaboration across Solar Systems -- life exists elsewhere and we are part of a collaborative collective unimpeded by constraints of space travel etc. Thus, expect to see the emergence of other Solar Systems accessible to DBpedia and its collections of planets (see. LOD diagram). Examples underway include UMBEL which will serve the Linked Data planets from OpenCyc (Subject Matter Concepts), Yago (Named Entities), and Bio2RDF (which provides powerful Bio Informatics based Linked Data planet).
I urged the community to veer more aggressively towards developing and demonstrating practical Linked Data driven solutions that are aligned to well known problems. Of course, I encouraged all presenters to make this an integral part of their presentations :-)
Workshop Summary:
The workshop was well attended and I found all the presentations engaging and full of enthusiasm.
As the sessions progressed, it became clear during a number of accompanying Q&A sessions that a new Linked Data exploitation frontier is emerging. The frontier in question takes the form of a Linked Data substrate capable of addressing the taxonomic needs of solutions aimed at automated Named Entity Extraction, Disambiguation, Subject matter Concept alignment, transparently integrated with existing Web Content. Thus, we are moving beyond the minting and deployment of of dereferencable URIs and RDF data sets to automagically associating existing Web Content with Named Entities (People, Organizations, Places, Events etc..) and Subject matter Concepts (Politics, Music, Sports, and others) while remaining true to the Linking Open Data Community creed i.e. ensuring the Named Entity and Subject matter Concept URIs are available to user agents or users seeking to produce alternative data views (i.e. Mesh-ups).
I will get to part 2 of this report once the actual workshop sessions slides go live (*these are different from the pre-event PDFs links*).
|
04/28/2008 18:48 GMT
|
Modified:
04/29/2008 11:07 GMT
|
Linked Data Illustrated and a Virtuoso Functionality Reminder
Daniel Lewis has put together a nice collection of Linked Data related posts that illustrate the fundamentals of the Linked Data Web and the vital role that Virtuoso plays as a deployment platform.
Remember, Virtuoso was architected in 1998 (see Virtuoso History) in anticipation of the eventual Internet, Intranet, and Extranet level requirements for a different kind of Server. At the time of Virtuoso's inception, many thought our desire to build a multi-protocol, multi-model, and multi-purpose, virtual and native data server was sheer craziness, but we pressed on (courtesy of our vision and technical capabilities).
Today, we have a very sophisticated Universal Server Platform (in Open Source and Commercial forms) that is naturally equipped to do the following via very simple interfaces:
- Provide highly scalable RDF Data Management via a Quad Store (DBpedia is an example of a live demonstration)
- Powerful WebDAV innovations that simplify read-write mode interaction with Linked Data
|
04/28/2008 17:32 GMT
|
Modified:
04/28/2008 14:47 GMT
|
Linked Data Solution for Exposing OpenLink Product Portfolio
At OpenLink Software, we've had an immense problem explaining the depth and breadth of our product porfolio via traditional Document Web pages. Thanks to SPARQL and Linked Data, we are now able to use Web Data Object IDs (HTTP based URIs) to produce super SKUs for every item in our product portfolio. Even better, we are able to handle the additional challenge of exposing features and benefits which by their very nature are mercurial across an array of fronts (products releases, product formats, and supported platforms etc).
Now I can simply state the following using Linked Data (hyperdata) links:
OpenLink Software's product porfolio is comprised of the following product families:
-
Universal Data Access Drivers Suite (UDA) for ODBC, JDBC, ADO.NET, OLE-DB, and XMLA
-
OpenLink Data Spaces
-
Virtuoso
We no longer have to explain (repeatedly) why our drivers exist in Express, Lite, and Multi-Tier Edition formats, or why you ultimately need Multi-Tier Drivers over Single Tier Drivers (Express or Lite Editions) since you ultimately heed high-performance, data encryption, and policy based security across each of the data access driver formats.
|
02/16/2008 21:08 GMT
|
Modified:
02/16/2008 16:15 GMT
|
|
|
|
| Sun | Mon | Tue | Wed | Thu | Fri | Sat |
| | | | |
1
|
2
|
3
|
|
4
|
5
|
6
|
7
|
8
|
9
|
10
|
|
11
|
12
|
13
|
14
|
15
|
16
|
17
|
|
18
|
19
|
20
|
21
|
22
|
| |