SEMANTIC WEB PUBLISHING WITH DRUPAL
|
|
|
- Samson Matthews
- 10 years ago
- Views:
Transcription
1 DERI DIGITAL ENTERPRISE RESEARCH INSTITUTE SEMANTIC WEB PUBLISHING WITH DRUPAL Stéphane Corlosquet Axel Polleres Richard Cyganiak Stefan Decker DERI TECHNICAL REPORT APRIL 2009 DERI Ireland University Road Galway IRELAND DERI DIGITAL ENTERPRISE RESEARCH INSTITUTE
2
3 DERI TECHNICAL REPORT DERI TECHNICAL REPORT , APRIL 2009 SEMANTIC WEB PUBLISHING WITH DRUPAL Stéphane Corlosquet, Richard Cyganiak, Axel Polleres, Stefan Decker 1, Abstract. Getting Semantic Web data and annotations into and out of end-user applications is one of the many challenges of making the Semantic Web fly. While linked data principles are slowly taking off and being adopted by a small number of sites and gradually more exporters to link existing Web content to the Web of data, we still find ourselves in the cold start phase. While producing Web content has become easy for end users by content managment systems (CMS), blogging tools, etc. the problem of enabling end users to produce semantic Web content persists. In this short paper, we propose a framework for a one-click solution to lift the huge amounts of Web content residing in CMS systems to the Web of Data. We tackle one of the most popular CMS systems nowadays, Drupal, where we enable site administrators to export their site content model and data to the Web of Data without requiring extensive knowledge on Semantic Web technologies. We have developed a Drupal module that maps the inherent site structure of a typical Drupal site to a lightweight ontology that we call the Site Vocabulary, and that exposes site data directly in RDFa. As such, this simple solution would not link to existing Semantic Web data, since site vocabularies exist decoupled from the widely used vocabularies in the Linked data cloud. To close this gap, we have incorporated an easy-to-use, fail-safe ontology import and reuse mechanism in our module, that allows site administrators with a few clicks to link their site vocabulary, and thus their site data, to existing, widely used vocabularies on the Web. In whole, our approach shall help to bootstrap the Semantic Web by leveraging the huge amounts of data in CMS. In approaching CMS site administrators rather than end users, we tackle the problem of adding semantics where we consider it easiest: Semantics are fixed at design time based on the site structure, whereafter end users entering data produce Linked data automatically. We have evaluated our approach in user experiments and report on deployment of our module in the Drupal community. Keywords: Content Management, Linked Data, RDFa, Drupal. 1 Digital Enterprise Research Institute Galway, National University of Ireland Galway University Road Galway, Ireland [email protected] Acknowledgements: This work was supported by the European Commission under the projects DIP, Knowledge Web, Ontoweb, SEKT, SWWS, Esperonto, and h-techsight; by Science Foundation Ireland under the DERI-Lion project; and by the Vienna city government under the CoOperate program. Copyright c 2009 by the authors
4 DERI TR I Contents 1 Motivation 1 2 Related works 2 3 Drupal: An Popular CMS 2 4 From Content Models to Site Vocabularies 4 5 Adhering to linked data principles 5 6 Mapping to Existing Ontologies 5 7 Evaluation Acceptance Publishing effort Motivation and Benefits Conclusions and Outlook 9
5 DERI TR Motivation Since the late 90ies and early 2000s a paradigm shift has taken place in Web publishing towards a separation of data (content) and structure (mainly layout). The first ideas to have the data represented in a way which allows its reuse in various ways and not only HTML, emerged in parallel in various systems such as Typo3 (1997), Plone (1999), webml [13]. These open source systems and their commercial counterparts are nowedays typically subsumed under the common term content management systems (CMS). While it is worthwhile to mention that the first of these systems appreared at around the same time as Semantic Web technologies emerged, with RDF [14] being standardized in 1999, the development of CMS and Semantic Web technologies have gone largely separate parts. Semantic Web technologies have matured to the point where they are increasingly being deployed on the Web. Large amounts of RDF data can now be accessed over the Web as Linked Data. This data is used by a variety of clients, such as RDF data mashups that integrate information from various sources, search engines that allow structured queries across multiple sites and datasets, and data browsers that present a site s content in new ways. But the traditional Web still dwarfs this emerging Web of Data and in fact, the HTML Web boosted by technologies such as CMS is still growing at much faster pace as the Semantic Web, see also [3]. When thinking about how to re-link the HTML Web and the Semantic Wevb of Linked data. Thus, the task of RDFizing existing web sites, particularly the provision of means to produce Linked data form sites that are anyways already highly structured, is of high potential benefit. The recently finished RDFa [1] standard additionally supports this by allowing RDF to be embedded directly into existing HTML pages. Alongside with the success story of CMS, we observe that a large portion of Web data is de facto already available in such highly structured form, though typically still exposed in plain HTML to the outside: Content Management Systems are typically built on a structured model of the domain of the site which is reflected in both the underlying database, but also and often more accurately in the content types defined in the CMS system itself by a site administrator. A typical example of a state-of-the art CMS is Drupal, which has been gaining popularity recently by its rapidly growing user community, openness and modular design. In this paper we presnet a module for Drupal which allows to link back this content model to the Web of Data. After installing this module on an existing Drupal site, the content types and fields typically defined by a Drupal site administrator using Drupal s Content Construction Kit (CCK) will be automatially exposed in terms of classes and properties of a canonical OWL ontology, the site vocabulary. Likewise, the site data itself becomes available as RDFa embedded in the live Web site following linked data principles without additional effort for the administrator. While this off-the-shelf solution already potentially makes any Drupal site which installs our module amenable to Semantic Web tools such as RDF crawlers, search engines, and SPARQL engines to search and query the site, this is only a first step. For real linkage to the Web of data, we need to enable mappings of the site vocabulary to existing, widely used ontologies. To this end, our module extends CCK by additional means to import and map to existing vocabularies. To keep the burden low for site administrators who normally will have little interest in learning the details of RDF and description logics, we support save-reuse, i.e. only allow to link content types and fields to existing classes and properties in a fail-safe manner, that is we avoid the modification of existing ontologies as well as the introduction of potential inconsistencies in our user interface as far as possible.
6 2 DERI TR Related works Let us explain how our approach differs from previous attempts to link CMSs with the Semantic Web and why we believe that it is more adequate. Staab et al. [18] proposed at a stage where CMSs in their current form where still in their infancy what could be conceived an ontology based Content management system. The idea was to separate Web Content from domain models, very similar to the core ideas that made the success of CMSs. Similarly, Ontowebber [12], promoted the creation of Websites out of ontologies with a system built on a native RDF repository. These approaches though focus on ontologies and ontology design themselves as a backbone to organise data on Web sites. On the contrary, content management systems typically support very lightwight structuring of information items supported by User interfaces limited to the needed functionality to structure data for Web presentation, rather than full ontology editing tools. Moreover, current CMS typically rely on off-the-shelf relational databases to store data on the backend, rather than RDF repositories. Our module for Drupal thus has a very orthogonal goal:rather than building an ontology-based CMS we aim to extract and link ontologies from the content models and within the tools typical site administrators are familiar with nowadays. We belief that this strategy taking users from where they in an unobrusive way to be the key enabler to leverage Semantic Web technologies with low entry barriers. Somewhat closer to our approach are the ideas presented by Stojanovic et al. [19], where a relational database schema underlying a CMS is mapped to RDF Schema or Frame Logic. A more recent approach Triplify [3] follows a similar path, providing a generic mapping tool for lifting relational databases into Linked Data by dedicated mappings from a relational Schema to RDF. It should be mentioned that Triplify even provides some predifined mappings to wrap Drupal sites backend databases into RDF. Again, our approach is significantly different. Due to the flexible nature of Drupal the underlying database used to store the data underlying a CMS based site does not nesessarily reflect the sites content model and its constraints. Thus, the relational schema might vary between different versions of Drupal on the one hand, and on the other hand, the effects of changing the content model to the database schema underneath by the site administrator are not always obvious. Actually, a good part of the success of content management systems is grounded precisely in the fact that site administrators do not need to delve into details of the underlying database system or schema. Our approach works on a more abstract level (Drupal s CCK) directly in the API the site administrator is used to, where all the information about the content model is available, which we believe to model the information structure more adequately than the underlying database schema. The site administrator does not need to know anything about the underlying database to create mappings to existing RDF vocabularies or expose RDFa on her site. 3 Drupal: An Popular CMS Before we start explaining our work, let us briefly introduce the main concepts behind Drupal. Drupal 1 is a popular open-source content management system. It is among the top three open-source CMS products in terms of market share [17]. Drupal facilitates the creation of web sites by handling many aspects of site maintenance, such as data workflow, access control, user accounts, and the encoding and storage of data in the database. As typical for CMS, a site administrator initially sets up a site by installing the core Drupal web application and choosing from a large collection of modules that add specific functionality to the site, such as 1
7 DERI TR improved user interface, enhanced search, various export formats, extended statistics and so on. Site administrators need a fair bit of technical knowledge to choose and configure modules, but usually do not write code; this is done by module developers instead. After the site has been set up, Drupal allows non-technical users to add content and handle routine maintenance of the site. Each item of content in Drupal is called a node. Nodes usually correspond more or less directly to the pages of a site. Nodes can be created, edited and deleted by content authors. Some modules extend the nodes, for example a taxonomy module allows assignment of nodes to categories, and a comment module adds blog-style comment boxes to each node. The Content Construction Kit (CCK) is one of the most popular and powerful modules used on Drupal sites. It allows the site administrator to define types of nodes, called content types, and to define fields for each content type. Fields can be of different kinds such as plain text fields, dates, addresses, file uploads, or references to other nodes. Additional kinds of fields can be added via modules. When defining content types and fields, the site administrator has to provide the following information: ID, 2 label, and description for content types and fields. Additionally, CCK allows to specify the following constraints on fields: Cardinality: fields can be optional or required, and may have a maximum cardinality. Domain: fields can be shared among one or more content types. Range: fields can be of type text, integer, decimal, float, date, file attachment, or node reference; fields of type node reference can be restricted to nodes of specific content types; fields of type text can be restricted to a fixed list of text values. As a running example, we choose a cheese review web site, 3 the site administrator might define content types such as Cheese, Review, and Country of Origin. The Cheese type might have fields such as description, picture, or source of milk. Figure 1 shows the Adminsitrator interface to edit a content type Cheese in Drupal along with some constraints defined on the source of milk field. Figure 1: Administrating a content type in Drupal s CCK (left) and defining constraints on a field (right). Thus, site administrators use CCK to define a site-specific content model, which is then used by content authors to populate the site. The focus of our work is to expose (i) the CCK site content model as an OWL ontology that reflects the site structure which the designer had in mind and (ii) the site content as RDF data using this ontology. 2 a string of lower case alphanumeric characters. 3 Demo-site available at
8 4 DERI TR From Content Models to Site Vocabularies We have implemented a Drupal module that enhances Drupal s CCK with the ability to auto-generate RDF classes and properties for all content types and fields. We build a so-called site vocabulary, i.e., an RDFS/OWL ontology which describes the content types and fields used in the data model as classes and properties. The field and type names are extracted from field and type IDs from CCK, such that following common conventions fields are assigned a property name starting with a lower case character, and content types are assigned a class name starting with an upper case character. Field and content type labels and descriptions are likewise exported as rdfs:labels and rdfs:comments. Here goes a typical content type and field definition extracted from CCK into RDFS: site:mytype a rdfs:class; rdfs:label "My Type"; rdfs:comment "My Type description"; site:myfield1 a rdf:property; rdfs:label "my field"; rdfs:comment "My field description"; Likewise, field constraints on from CCK are reflected in the site vocabulary: Cardinalities are mapped to cardinality restrictions in OWL, i.e. required fields are restricted to owl:mincardinality 1. whereas fields with a maximum cardinality n are restricted to owl:maxcardinality n. For instance, if we assume that each Cheese is required to have name field, and at most 5 reviews, these constraints in CCK would be exported to OWL as follows. site:cheese a rdfs:class; rdfs:subclassof [ a owl:restriction; owl:onproperty site:name; owl:mincardinality 1], [ a owl:restriction; owl:onproperty site:review; owl:maxcardinality 5]. Domains are reflected by rdfs:domain constraints. Here, fields used by a single type can be modeled by a simple rdfs:domain triple. For instance, assuming that the review field for Cheeses is not shared with any other content type in the current content model, we can simply write: site:review rdfs:domain site:cheese. CCK fields shared among several types have the union of all types sharing the field as their domain. E.g., since Cheese, Country and Region share the name field, the site vocabulary contains site:englishname rdfs:domain [owl:unionof (site:cheese site:country site:region)]. Ranges of fields are analogously encoded by rdfs:range triples. Additionally, we distinguish between fields of range text, integer, decimal, float, or date, and those referring to file attachments, or node references. We declare the former as owl:datatypeproperty and assign the datatypes supported in Drupal with their respective XSD datatypes, i.e. text xs:string, integer xs:integer, decimal xs:decimal, float xs:float, or date xs:date. For instance, the text field name is reflected in the site vocabulary as: site:name rdfs:range xs:string; a owl:datatypeproperty.
9 DERI TR site: < < rdf:type site:cheese; site:origin "Germany"; site:name "Handkäse"; site:source_milk "cow"; site:description "Handkäse (literally: hand cheeses...)"; site:review < Figure 2: Drupal page with embedded RDF auto-generated from CCK content model. Fields that range over texts restricted to a fixed list of text values will be assigned a respective enumerated class of values using owl:dataranges, e.g. source of milk is modeled as site:sourceofmilk a owl:datatypeproperty; rdfs:range [ a owl:datarange; owl:oneof ( "cow", "goat", "sheep" ) ]. Finally, fields that range over file attachments (which get a URI in Drupal) or node reference, are declared of type owl:objectproperty. Fields ranging over more than one content type are reflected in the site vocabulary by owl:unionof. E.g., origin may be a Country or a Region, respectively. site:origin a owl:objectproperty; rdfs:range [ owl:unionof ( site:country site:region ) ]. 5 Adhering to linked data principles Following the conventions mentioned in the previous section, the site vocabulary is generated and published automatically at the site URL under the default namespace which we denoted by the namespace prefix site: in the examples before. Likewise, any Drupal page on a site will by our module be annotated with RDFa triples that dereference terms of this site vocabulary as classes and properties linking Drupal content nodes as subjects and objects. We are inline with the Linked data principles and best practices [5, 4] as we provide resolvable HTTP URIs for all resources: Each of the pages also represents a node of a certain content type in the CCK content model. That is, in our model, each node becomes an RDF resource, and the HTML Web page describing the node is enriched with RDFa [1] that reflect the links in the content model. By this design, any Drupal site usign our module is off-the-shelf amenable to any existing tool that can consume Linked Data style RDF content. Figure 2 shows a page in our cheese example site with embedded RDF dereferencing the site ontology along with the embedded auto-generated RDF data. 6 Mapping to Existing Ontologies While the functionality we have descirbed so far fits Drupal sites well into the Linked data world, so far, we have created nothing more than an isolated ontology from th existing site content model. However, this benefits of this exercise remain limited, unless we additionally allow linking the site vocabulary to existing vocabularies and ontologies populating the Semantic Web. For instance, instead of just exposing the Cheese type as a class in the site vocabulary, you might want to reuse a class in an existing ontology, such as for instance ov:cheese from the OpenVocab 4 4
10 6 DERI TR vocabulary which some other publishers on the web already used. Or likewise, we may wish to state that a Review is actually a foaf:document from the widely used FOAF 5 ontology, or that a cheese linked to its Description by the widely used dc:description property, from Dublin Core 6, etc. To this end, our module adds a new tab Manage RDF mappings to the content type administration panel of CCK for managing such mappings to existing ontologies, cf. Figure 3. After importing ontologies via a simple dialog, the terms of these ontologies can be mapped to the Cheese type and its fields. The mapped terms will result in rdfs:subclassof and rdfs:subpropertyof mappings being added to the site vocabulary as well as extra triples using the mapped terms being exposed in the RDF embedded in the page. Step 1: Import ontology Step 2: Define mappings Mappings added to the site vocabulary: site:cheese rdfs:subclassof ov:cheese. site:name rdfs:subpropertyof rdfs:label. site:description rdfs:subpropertyof dc:description. Triples added to the exposed RDF of Fig. 2 (italic): < rdf:type site:cheese; rdf:type ov:cheese; site:origin "Germany"; site:name "Handkäse"; rdfs:label "Handkäse"; site:source milk "cow"; site:description "Handkäse (literally: hand cheeses...)"; dc:description "Handkäse (literally: hand cheeses...)"; site:review < Figure 3: RDF mappings management through the Drupal interface. As mentioned before, the interface checks certain features to avoid inconsistent site data/vocabularies, such as checking consistency of cardinalities of reused properties with the field cardinalities. To map the site data model to existing ontologies, the site administrator first imports the ontology or vocabulary. We assume that it has been created using a tool such as Protégé 7, OpenVocab 8, or Neologism 9, and published somewhere on the Web in RDF Schema or OWL format. For every content type and field, the site administrator can choose a class or property it should be mapped to, see Figure 3. Such mappings are expressed as rdfs:subclass and rdfs:subproperty relationships. For instance, if the field englishname that was used on types Cheese and Country is mapped to FOAF s foaf:name, then a triple site:englishname rdfs:subpropertyof foaf:name. will be added to the site ontology. Additionally, we allow inverse reuse of existing properties. E.g., assume the site administrator imports a vocabulary ex: that defines a relation between Countries/Regions and goods that this region/coutry produces via the property ex:produces. Our user interface also allows to relate fields to the inverse of imported properties. For instance, the origin field could be related to ex:produces in such an inverse manner, resulting in site:origin rdfs:subpropertyof [ owl:inverseof ex:produces ]
11 DERI TR being added to the site vocabulary. Likewise, for assigning an existing class to a content type we use subclassing, e.g. site:review rdfs:subclassof foaf:document. Subclassing and the use of subproperties is a simple way of minimizing unintended conflicts between the semantics of local vocabulary and public terms. Per OWL semantics, constraints imposed on the local term by the content model/site vocabulary will thus not apply to the public term. This ensures safe vocabulary re-use, i.e. avoids what is sometimes referred to as Ontology Hijacking [2]. Intuitively, safe reuse means that a vocabulary importing another one does not modify the meaning of the imported vocabulary or hijack instance data of the imported vocabulary. Let us assume that we would, on the contrary, directly use the imported properties and classes in the site vocabulary. That would cause problems. For instance, the export of the content model as described in the previous section contains the triple site:englishname a owl:datatypeproperty. Would we have used foaf:name directly here, we would have changed the FOAF vocabulary, which, by itself doesn t explicitly declare foaf:name a owldatatypeliteral. Direct reuse of existing classes in the site vocabulary would raise similar issues. We emphasize, however, that reuse of rdfs:subclass, rdfs:subproperty as well as inverse rdfs:subproperty relations alone is not sufficient to guarantee consistency of the site vocabulary. While the following proposition intuitively holds, this is no longer true as soon as external vocabularies are imported. Proposition 1 A site vocabulary that does not import any external ontologies is always consistent. Nevertheless, in case of importing external properties and classes, consistency can no longer be guaranteed without further restrictions, even if both the site vocabularies and the imported ontology were consistent. This is easily illustrated by some examples. Example 1 Assume that one would inversely assign the foaf:homepage property to site:review. Given that now foaf:homepage is an inversefunctional property, this would imply functionality of site:review, i.e. that each cheese would have at most one review. This would, possibly result in strange side effects on the site instance data such as yielding reviews for cheeses with two or more reviews equal. In other cases, contradicting cardinalities could even make site instance data inconsistent. Likewise the inverse reuse of Datatype Properties is problematic. To address this problem, when displaying external properties assignable to fields in the content model (see Figure 3) our tool extending CCK could make several restrictions such as not displaying properties with cardinality restrictions attached that do not comply with those specified in CCK for the respective field. We emphasize that we do not aim to deploy a full OWL reasoner to detect all inconsistencies possibly introduced by ontology reuse in our system. The identification of syntactic fragments of OWL, which are safely usable in our tool without the need to deploy a fully-fledged OWL (DL) reasoner (i.e., which features used in the imported ontologies require which level of expressiveness for detecting possbible inconsistencies by reuse) is on our agenda. Our ultimate goal is a tool which allows the site administrator to import classes and properties from existing ontologies in a fail-safe way such that no possible inconsistencies may be possibly introduced by the extended CCK editor and later filling of the site with content.
12 8 DERI TR The reason why we are reluctant of deploying full reasoning services is that we want our tool to fit in the Drupal ecosystem as a normal module that is installable on a site without the need to install separate external software components such as a DL reasoner. Wherefore we restrict ourselves to lightweight reasoning by applying heuristics such as the detection of cardinality vialations mentioned above. An alternative path would be the invocation of a reasoning service deployed as a web-accessible service. It must be stressed that the whole mapping step is optional, and the main benefit of the Web of Data exposing site data for re-use by third parties is realized by the default mapping. 7 Evaluation We exposed our hypothesis and general rationale that ease-of-use and a one-click solution to export Linked data from CMSs will boost the Semantic Web to a small user evaluation. What we aimed to proof is that linking a site to existing vocabularies by use of our Drupal module does not impose a significant burden to site administrators and the benefits of exposing Semantic Web data such as searchability may outweigh this negligible extra effort. To this end, In this section we evaluate the usage of the implementation we created and the extra effort required by our approach. 7.1 Acceptance In order to make this implementation available to as many developers as possible, we have released the RDF CCK module on drupal.org. Since its release November 2008, the RDF CCK module has reached a number of 45 installations 10 as shown in the Figure 4. As a conrete example, the module is currently being tested and will be deployed in the next version of the Science Collaboration Framework (SCF) platform by Harvard Med School [9]. Figure 4: Evolution of the number of installations of RDF CCK since its release. 7.2 Publishing effort A formal evaluation was carried out on 10 students who were more or less familiar with the Semantic Web, but not with the Drupal User Interface. They were asked to set up a predefined content model using a guide. Each major step of the process was timed in order to be exploited in a statistical analysis. The content model used in this evaluation was composed of 2 content types Article and Editor which included 5 and 4 fields, respectively. 10 according to the Usage statistics for RDF CCK page at
13 Sheet1 A Vinh Tudor Sheila Aidan Knud Laura initial :00 03:00 02:15 03:00 01:49 02:00 initial :00 06:00 06:05 05:45 06:01 05:00 total initial 13:00 09:00 08:20 08:45 07:50 07:00 DERI TR import 1 voc 01:00 01:00 00:30 00:30 00:40 01:00 map article map editor 07:00 06:00 04:20 04:20 05:30 03:00 total mapping 07:00 06:00 04:20 04:20 05:30 03: % 66.67% 52.00% 49.52% 70.21% 42.86% mean 58.45% 39.85% Initial setup of the content model The first major step of the evaluation was to setup the content model the way any site administrator would on a typical Drupal site. The CCK User Interface was used here. The time each student took to execute this step is materialized in orange in the Figure 5. RDF mappings setup The second step we evaluated is the RDF mappings. The pool of 10 students was divided into two 5 people groups: the group A (user 1 to 5) had to decide which mappings the most appropriate and the group B (users 6user to1 10) was user 2 given user a3 list user of 4predefined user 5 mappings. user 6 Initial setup 13:00 09:00:00 08:20:00 08:45:00 07:50:00 07:00:00 According to our measures RDF mappings and as07:00:00 shown 06:00:00 on Figure 04:20:00 5, the04:20:00 extra step 05:30:00required 03:00:00 to have a fully mapped content model represent less than half of the time of the initial setup. Main Title B add ranking in the list of suggested terms and suggest for the one matching the id first and RDF mappings Initial setup 00 user 1 user 2 user 3 user 4 user 5 user 6 user 7 user 8 user 9 user 10 Figure 5: Comparison between Page initial 1 setup and RDF mappings. On average, the extra time spent on specifying the mappings took 58% of the initial setup for the group A and 39% for the group B. 7.3 Motivation and Benefits In the recent years, website administrators have been encouraged to publish sitemaps in order to promote their site on the web. Similarly, semantic sitemaps[8] allow sites to describe semantic web datasets and how to best consume it. Our approach requires an extra effort which has similar benefits. Moreover, most of this extra effort is eased by the User Interface and can be done automatically. Yahoo! SearchMonkey, Exhibit, Visinav, Sindice[16]. Summarizing, our findings show that whereas linking to exernal vocabularies was experienced easy by our test group, the main effort and time consumed was actually in deciding to which properties and classes to link, that is which ontologies should be reused how. Inspired by this finding we put on our agenda more investigation on how we can support non-semantic-web-savvy users in finding the right classes and properties for their needs. To this end, we aim to extend search functionalities of Semantic Web serach engines such as Sindice [16] or SWSE [11] by search and ranking functionalities to find and link widely used classes and ontologies for linking them to your content model, based on keywords or field names. 8 Conclusions and Outlook In this paper we have presented an extension to Drupal s CCK that allows to link existing and newly deployed Drupal sites to the Web of data by a few clicks. It auto-exports the site content model of a Drupal
14 10 DERI TR site to an ontology that is published following common best practices for ontology publication and enables the exposure of Drupal site content as RDFa out-of-the-box. The site vocabulary uses a fragment of the available OWL constructs which is sufficient to reflect the site content model s structure in a machine-readable manner and is guaranteed to be consistent. In order to link to existing vocabularies from the Semantic Web, we allow ontology import and linkage of existing properties and classes by subclass/subproperty relations. The system we have implemented is a working prototype 11. The module used for the prototype and discussed in this paper is available on drupal.org 12. The infection of promising power-user communities such as the rapidly growing Drupal site administrator and developer groups is in our opinion a key in boosing Semantic Web technologies. Next steps include the extension of RDF/OWL export for Drupal to other modules such as for instance the Taxonomy module for tag hierarchies usable in Drupal, which we plan to expose as RDF using SKOS[15]. Our solution shall provide easy-to-use, unobtrusive RDF exposure in a way general enough for a variety of Drupal sites, thus potentially contributing significantly to the further population of the Web of data with high-quality RDF data. Acknowledgments This work was supported by SFI grant No. SFI/08/CE/I1380 (Lion-2) References [1] B. Adida, M. Birbeck, S. McCarron, S. Pemberton (eds.), RDFa in XTML: Syntax and processing, W3C Rec., (Oct. 2008). [2] A. Hogan, A. Harth, A. Polleres, SAOR: Authoritative reasoning for the web, ASWC [3] S. Auer, S. Dietzold, J. Lehmann, S. Hellmann, D. Aumüller, Triplify - lightweight linked data publication from relational databases, WWW [4] D. Berrueta, J. P. (eds.), Best practice recipes for publishing rdf vocabularies, W3C Working Group Note, (Aug. 2008). [5] C. Bizer, T. Heath, K. Idehen, T. Berners-Lee (eds.), Linked Data on the Web (LDOW) [6] D. Brickley, R. Guha (eds.), RDF vocabulary description language 1.0: RDF Schema, W3C Rec., (Feb. 2004). [7] D. Brickley, L. Miller, FOAF Vocabulary Specification, (Nov. 2007). [8] R. Cyganiak, H. Stenzhorn, R. Delbru, S. Decker, G. Tummarello, Semantic sitemaps: Efficient and flexible access to datasets on the semantic web, ESWC [9] S. Das, L. Girard, T. Green, L. Weitzman, A. Lewis-Bowen, T. Clark, Building biomedical web communities using a semantically aware content management system, Briefings in Bioinformatics (2009) 10(2): A demo site is online at
15 DERI TR [10] M. Dean, G. Schreiber, S. Bechhofer, F. van Harmelen, J. Hendler, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider, L. A. Stein, OWL Web Ontology Language Reference, W3C Rec. (Feb. 2004). [11] A. Harth, J. Umbrich, S. Decker, Multicrawler: A pipelined architecture for crawling and indexing semantic web data, ISWC [12] Y. Jin, S. Decker, G. Wiederhold, Ontowebber: Model-driven ontology-based web site management, SWWS [13] Y. Jin, S. Xu, S. Decker, G. Wiederhold, Managing web sites with ontowebber, in: C. S. Jensen, K. G. Jeffery, J. Pokorný, S. Saltenis, E. Bertino, K. Böhm, M. Jarke (eds.), EDBT [14] O. Lassila, R. Swick (eds.), Resource Description Framework (RDF) Model and Syntax Specification, W3C Rec., (Feb. 1999). [15] A. Miles, S. Bechhofer (eds.), SKOS Simple knowledge organization system reference, W3C Cand. Rec., (Mar. 2009). [16] E. Oren, R. Delbru, M. Catasta, R. Cyganiak, H. Stenzhorn, G. Tummarello, Sindice.com: A document-oriented lookup index for open linked data, Int l Journal of Metadata, Semantics & Ontologies 3 (1). [17] R. Shreves, Open source cms market share, White paper, Water & Stone, waterandstone.com/downloads/2008opensourcecmsmarketsurvey.pdf. [18] S. Staab, J. Angele, S. Decker, M. Erdmann, A. Hotho, A. Maedche, H. P. Schnurr, R. Studer, Y. Sure, Semantic community web portals, Computer Networks 33 (1-6) (2000) [19] L. Stojanovic, N. Stojanovic, R. Volz, Migrating data-intensive web sites into the semantic web, ACM SAC 2002x.
How To Write A Drupal 5.5.2.2 Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post
RDFa in Drupal: Bringing Cheese to the Web of Data Stéphane Corlosquet, Richard Cyganiak, Axel Polleres and Stefan Decker Digital Enterprise Research Institute National University of Ireland, Galway Galway,
Produce and Consume Linked Data with Drupal!
Produce and Consume Linked Data with Drupal! Stéphane Corlosquet 1,2, Renaud Delbru 1, Tim Clark 2,3, Axel Polleres 1, and Stefan Decker 1 1 Digital Enterprise Research Institute, National University of
Towards the Integration of a Research Group Website into the Web of Data
Towards the Integration of a Research Group Website into the Web of Data Mikel Emaldi, David Buján, and Diego López-de-Ipiña Deusto Institute of Technology - DeustoTech, University of Deusto Avda. Universidades
A Semantic web approach for e-learning platforms
A Semantic web approach for e-learning platforms Miguel B. Alves 1 1 Laboratório de Sistemas de Informação, ESTG-IPVC 4900-348 Viana do Castelo. [email protected] Abstract. When lecturers publish contents
Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo
DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF
Ampersand and the Semantic Web
Ampersand and the Semantic Web The Ampersand Conference 2015 Lloyd Rutledge The Semantic Web Billions and billions of data units Triples (subject-predicate-object) of URI s Your data readily integrated
Getting Started Guide
TopBraid Composer Getting Started Guide Version 2.0 July 21, 2007 TopBraid Composer, Copyright 2006 TopQuadrant, Inc. 1 of 58 Revision History Date Version Revision August 1, 2006 1.0 Initial version September
Developing Web 3.0. Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/
Developing Web 3.0 Nova Spivak & Lew Tucker http://radarnetworks.com/ Tim Boudreau http://weblogs.java.net/blog/timboudreau/ Henry Story http://blogs.sun.com/bblfish 2007 JavaOne SM Conference Session
Towards a reference architecture for Semantic Web applications
Towards a reference architecture for Semantic Web applications Benjamin Heitmann 1, Conor Hayes 1, and Eyal Oren 2 1 [email protected] Digital Enterprise Research Institute National University
Drupal. http://www.flickr.com/photos/funkyah/2400889778
Drupal 7 and RDF Stéphane Corlosquet, - Software engineer, MGH - Drupal 7 core RDF maintainer - SemWeb geek Linked Data Ventures, MIT, Oct 2010 This work is licensed under a Creative
Characterizing Knowledge on the Semantic Web with Watson
Characterizing Knowledge on the Semantic Web with Watson Mathieu d Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, and Enrico Motta Knowledge Media Institute (KMi), The Open
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
LinkZoo: A linked data platform for collaborative management of heterogeneous resources
LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research
Leveraging existing Web frameworks for a SIOC explorer to browse online social communities
Leveraging existing Web frameworks for a SIOC explorer to browse online social communities Benjamin Heitmann and Eyal Oren Digital Enterprise Research Institute National University of Ireland, Galway Galway,
13 RDFS and SPARQL. Internet Technology. MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication.
MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication Davide Eynard nternet Technology 13 RDFS and SPARQL 2 RDF - Summary Main characteristics of RDF: Abstract syntax based
Chapter 2 AN INTRODUCTION TO THE OWL WEB ONTOLOGY LANGUAGE 1. INTRODUCTION. Jeff Heflin Lehigh University
Chapter 2 AN INTRODUCTION TO THE OWL WEB ONTOLOGY LANGUAGE Jeff Heflin Lehigh University Abstract: Key words: 1. INTRODUCTION The OWL Web Ontology Language is an international standard for encoding and
Linked Open Data Infrastructure for Public Sector Information: Example from Serbia
Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.
DISCOVERING RESUME INFORMATION USING LINKED DATA
DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India [email protected] 2 Department of Computer
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many
Defining a benchmark suite for evaluating the import of OWL Lite ontologies
UNIVERSIDAD POLITÉCNICA DE MADRID FACULTAD DE INFORMÁTICA FREE UNIVERSITY OF BOLZANO FACULTY OF COMPUTER SCIENCE EUROPEAN MASTER IN COMPUTATIONAL LOGIC MASTER THESIS Defining a benchmark suite for evaluating
RDF Resource Description Framework
RDF Resource Description Framework Fulvio Corno, Laura Farinetti Politecnico di Torino Dipartimento di Automatica e Informatica e-lite Research Group http://elite.polito.it Outline RDF Design objectives
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
The Ontology and Architecture for an Academic Social Network
www.ijcsi.org 22 The Ontology and Architecture for an Academic Social Network Moharram Challenger Computer Engineering Department, Islamic Azad University Shabestar Branch, Shabestar, East Azerbaijan,
Powl A Web Based Platform for Collaborative Semantic Web Development
Powl A Web Based Platform for Collaborative Semantic Web Development Sören Auer University of Leipzig [email protected] Abstract: We outline Powl, an opensource, web-based semantic web development
Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology
Semantic Knowledge Management System Paripati Lohith Kumar School of Information Technology Vellore Institute of Technology University, Vellore, India. [email protected] Abstract The scholarly activities
Logic and Reasoning in the Semantic Web (part I RDF/RDFS)
Logic and Reasoning in the Semantic Web (part I RDF/RDFS) Fulvio Corno, Laura Farinetti Politecnico di Torino Dipartimento di Automatica e Informatica e-lite Research Group http://elite.polito.it Outline
Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution
Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany [email protected]
LDIF - Linked Data Integration Framework
LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],
City Data Pipeline. A System for Making Open Data Useful for Cities. [email protected]
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
Extending Java Web Applications for Semantic Web
Extending Java Web Applications for Semantic Web Lidia Rovan, Ivana Nižeti Department of applied computing University of Zagreb Faculty of Electrical Engineering and Computing Unska 3, Zagreb 10000, CROATIA
Annotea and Semantic Web Supported Collaboration
Annotea and Semantic Web Supported Collaboration Marja-Riitta Koivunen, Ph.D. Annotea project Abstract Like any other technology, the Semantic Web cannot succeed if the applications using it do not serve
technische universiteit eindhoven WIS & Engineering Geert-Jan Houben
WIS & Engineering Geert-Jan Houben Contents Web Information System (WIS) Evolution in Web data WIS Engineering Languages for Web data XML (context only!) RDF XML Querying: XQuery (context only!) RDFS SPARQL
Ontologies for elearning
Chapter 4 Applications and Impacts Ontologies for elearning B.Polakowski and K.Amborski Institute of Control and Industrial Electronics, Warsaw University of Technology. Warsaw, Poland e-mail: [email protected],
LD2SD: Linked Data Driven Software Development
LD2SD: Linked Data Driven Software Development Aftab Iqbal, Oana Ureche, Michael Hausenblas, Giovanni Tummarello Digital Enterprise Research Institute (DERI), National University of Ireland, Galway IDA
Layering the Semantic Web: Problems and Directions
First International Semantic Web Conference (ISWC2002), Sardinia, Italy, June 2002. Layering the Semantic Web: Problems and Directions Peter F. Patel-Schneider and Dieter Fensel Bell Labs Research Murray
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
Extending the Linked Data API with RDFa
Extending the Linked Data API with RDFa Steve Battle 1, James Leigh 2, David Wood 2 1 Gloze Ltd, UK [email protected] 2 3 Round Stones, USA James, [email protected] Linked data is about connecting
Lift your data hands on session
Lift your data hands on session Duration: 40mn Foreword Publishing data as linked data requires several procedures like converting initial data into RDF, polishing URIs, possibly finding a commonly used
SmartLink: a Web-based editor and search environment for Linked Services
SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,
daq, an Ontology for Dataset Quality Information
daq, an Ontology for Dataset Quality Information Jeremy Debattista University of Bonn / Fraunhofer IAIS, Germany [email protected] Christoph Lange University of Bonn / Fraunhofer IAIS,
OntoWebML: A Knowledge Base Management System for WSML Ontologies
OntoWebML: A Knowledge Base Management System for WSML Ontologies Christina Tziviskou and Marco Brambilla Politecnico di Milano, Dipartimento di Elettronica e Informazione Piazza L. Da Vinci 32, 20133
Linking Data in the Insurance Sector: A case study
Linking Data in the Insurance Sector: A case study Katerina Kalou and Dimitrios Koutsomitropoulos HPCLab, Computer Engineering and Informatics Dpt., University of Patas, Building B, 26500 Patras-Rio, Greece
Ontologies, Semantic Web and Virtual Enterprises. Marek Obitko [email protected] Rockwell Automation Research Center, Prague, Czech Republic 1
Ontologies, Semantic Web and Virtual Enterprises Marek Obitko [email protected] Rockwell Automation Research Center, Prague, Czech Republic 1 Agenda Motivation Ontologies Semantic Web Selected Applications
RDF Dataset Management Framework for Data.go.th
RDF Dataset Management Framework for Data.go.th Pattama Krataithong 1,2, Marut Buranarach 1, and Thepchai Supnithi 1 1 Language and Semantic Technology Laboratory National Electronics and Computer Technology
Use of OWL and SWRL for Semantic Relational Database Translation
Use of OWL and SWRL for Semantic Relational Database Translation Matthew Fisher, Mike Dean, Greg Joiner BBN Technologies, 1300 N. 17th Street, Suite 400, Arlington, VA 22209 {mfisher, mdean, gjoiner}@bbn.com
SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks
SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks Melike Şah, Wendy Hall and David C De Roure Intelligence, Agents and Multimedia Group,
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister
Linked Medieval Data: Semantic Enrichment and Contextualisation to Enhance Understanding and Collaboration
: Semantic Enrichment and Contextualisation to Enhance Understanding and Collaboration Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin / School of Library and Information Science [email protected]
Annotation: An Approach for Building Semantic Web Library
Appl. Math. Inf. Sci. 6 No. 1 pp. 133-143 (2012) Applied Mathematics & Information Sciences @ 2012 NSP Natural Sciences Publishing Cor. Annotation: An Approach for Building Semantic Web Library Hadeel
REPRESENTATION AND CERTIFICATION
REPRESENTATION AND CERTIFICATION OF DATA QUALITY ON THE WEB (Research Paper) Cinzia Cappiello Chiara Francalanci Barbara Pernici Politecnico di Milano, Milano, Italy {cappiell, francala, pernici}@elet.polimi.it
Grids, Logs, and the Resource Description Framework
Grids, Logs, and the Resource Description Framework Mark A. Holliday Department of Mathematics and Computer Science Western Carolina University Cullowhee, NC 28723, USA [email protected] Mark A. Baker,
OWL based XML Data Integration
OWL based XML Data Integration Manjula Shenoy K Manipal University CSE MIT Manipal, India K.C.Shet, PhD. N.I.T.K. CSE, Suratkal Karnataka, India U. Dinesh Acharya, PhD. ManipalUniversity CSE MIT, Manipal,
Publishing Relational Databases as Linked Data
Publishing Relational Databases as Linked Data Oktie Hassanzadeh University of Toronto March 2011 CS 443: Database Management Systems - Winter 2011 Outline 2 Part 1: How to Publish Linked Data on the Web
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA J.RAVI RAJESH PG Scholar Rajalakshmi engineering college Thandalam, Chennai. [email protected] Mrs.
Semantically Enhanced Web Personalization Approaches and Techniques
Semantically Enhanced Web Personalization Approaches and Techniques Dario Vuljani, Lidia Rovan, Mirta Baranovi Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, HR-10000 Zagreb,
New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability
New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability Liana Razmerita 1, Martynas Jusevičius 2, Rokas Firantas 2 Copenhagen Business School, Denmark
THE SEMANTIC WEB AND IT`S APPLICATIONS
15-16 September 2011, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2011) 15-16 September 2011, Bulgaria THE SEMANTIC WEB AND IT`S APPLICATIONS Dimitar Vuldzhev
OSLC Primer Learning the concepts of OSLC
OSLC Primer Learning the concepts of OSLC It has become commonplace that specifications are precise in their details but difficult to read and understand unless you already know the basic concepts. A good
Linked Data Publishing with Drupal
Linked Data Publishing with Drupal Joachim Neubert ZBW German National Library of Economics Leibniz Information Centre for Economics SWIB13 Workshop Hamburg, Germany 25.11.2013 ZBW is member of the Leibniz
powl Features and Usage Overview
powl Features and Usage Overview Live demonstrations and further information is available from: http://powl.sourceforge.net/swc Sören Auer University of Leipzig [email protected] Norman Beck
María Elena Alvarado gnoss.com* [email protected] Susana López-Sola gnoss.com* [email protected]
Linked Data based applications for Learning Analytics Research: faceted searches, enriched contexts, graph browsing and dynamic graphic visualisation of data Ricardo Alonso Maturana gnoss.com *Piqueras
Evaluating Semantic Web Service Tools using the SEALS platform
Evaluating Semantic Web Service Tools using the SEALS platform Liliana Cabral 1, Ioan Toma 2 1 Knowledge Media Institute, The Open University, Milton Keynes, UK 2 STI Innsbruck, University of Innsbruck,
Managing enterprise applications as dynamic resources in corporate semantic webs an application scenario for semantic web services.
Managing enterprise applications as dynamic resources in corporate semantic webs an application scenario for semantic web services. Fabien Gandon, Moussa Lo, Olivier Corby, Rose Dieng-Kuntz ACACIA in short
CitationBase: A social tagging management portal for references
CitationBase: A social tagging management portal for references Martin Hofmann Department of Computer Science, University of Innsbruck, Austria [email protected] Ying Ding School of Library and Information Science,
How to Publish Linked Data on the Web
How to Publish Linked Data on the Web Tom Heath, Platform Division, Talis, UK Chris Bizer, FU Berlin, Germany Richard Cyganiak, DERI Galway, Ireland http://sites.wiwiss.fu-berlin.de/suhl/bizer/pub/linkeddatatutorial/
Semantic Lookup in Service-Oriented Architectures
Semantic Lookup in Service-Oriented Architectures Uwe Zdun Department of Information Systems, Vienna University of Economics, Austria [email protected] Abstract Lookup of services is an important issues in
An Efficient and Scalable Management of Ontology
An Efficient and Scalable Management of Ontology Myung-Jae Park 1, Jihyun Lee 1, Chun-Hee Lee 1, Jiexi Lin 1, Olivier Serres 2, and Chin-Wan Chung 1 1 Korea Advanced Institute of Science and Technology,
I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION
Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University
An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials
ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity
Administrator s Guide
SEO Toolkit 1.3.0 for Sitecore CMS 6.5 Administrator s Guide Rev: 2011-06-07 SEO Toolkit 1.3.0 for Sitecore CMS 6.5 Administrator s Guide How to use the Search Engine Optimization Toolkit to optimize your
