Aristotle University, Department of Mathematics Master in Web Sciencesupported by Municipality of VeriaFrom Gore’s information highways to Obama’s Linked dataΜιχάλης Βαφόπουλος
From Al Gore (internet highways) to Barack Obama (linked data)2
The Web of DocumentsAnalogy:a global file system
Designed for: human consumption
Primary objects: documents
Links between: documents(or sub-parts of)
Degree of structure in objects: fairly low
Semantics of content and links: implicit (humans)(Tom Heath)The web = the internet+ links + documents3
The Web of DocumentsSimple, big and unstructured
Organized in SilosBut humans are interested in:Things, no documents and
these Thingsmight bein documents or elsewhereHumans: Limited capacity to extract meaning...4
Limited SEARCH capacitySearch for: Football Players who went to the University of Texas at Austin, played for the Dallas Cowboys as Cornerback(Juan F. Sequeda)5
Google, Bing, yahoo! irrelevant6
Wikipedia through LD: relevant7
The Web of DataAnalogy:a global filesystem----> globaldatabase
Designed for: human consumption ->machines first, humans later
Primary objects: documents --> things (or descriptions of things)
Links between: documents--> things
Degree of structure in objects: fairly low ---> high
Semantics of content and links: implicit --> explicit(Tom Heath)8
The Modigliani TestShow me all the locations of all the original paintings of ModiglianiDaniel Koller (@dakoller) showed that you can find this with a SPARQL query on DBpediaThanks Richard MacManus - ReadWriteWeb
Results of the Modigliani TestAtanas Kiryakov from OntotextUsed LDSR – Linked Data Semantic RepositoryDbpediaFreebaseGeonamesUMBELWordnetPublished April 26, 2010: http://www.readwriteweb.com/archives/the_modigliani_test_for_linked_data.php
The Web of Data: why?– encourages reuse– reduces redundancy– maximises its (real and potential) inter-connectedness– enables network effects to add value to data13
The Web of Data: how?– current state on the WebRelational Databases
APIs
XML
CSV
XLS(see EXHIBIT)Computers can’t consume data because:Different formats & models
Not inter-connected14
The Web of Data: how?– we need to create a standard way of publishing Data on the Web (like HTML for docs)This is the Resource Description Framework (RDF)(a simple example here from Juan F. Sequeda), more next semester!)15
Resource Description Framework (RDF)A data model
A way to model data
Inspired form Relational databases and Logic
RDF is a triple data model
Labeled Graph (semantic networks)
Subject, Predicate, Object<Isidoro> <was born in> <Chios><Chios> <is part of> <Greece>
The RDF Data ModelTriplessubject -> predicate -> objectTom -> worksFor -> TalisTalis -> basedIn -> Birmingham<uri> -> <uri> -> <uri> or "literal"
παράδειγμα“Talis is Based Near Birmingham”<http://dbpedia.org/resource/Talis_Group><http://xmlns.com/foaf/0.1/based_near><http://sws.geonames.org/3333125/>
Example: Document on the Web

2011 05-02 linked data intro