Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy)
|
|
- Hope Dorsey
- 2 years ago
- Views:
Transcription
1 Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy) Multilingual Word Sense Disambiguation and Entity Linking on the Web based on BabelNet Roberto Navigli, Tiziano Flati Sapienza 20/10/2014 Presenter name 1
2 The instructors Roberto Navigli, associate professor, Department of Computer Science, Sapienza Tiziano Flati, PhD student, Department of Computer Science, Sapienza 2
3 And, if you resist until the end you will receive a prize!!! A BabelNet t-shirt!!! [model is not included] 3
4 Part 1: Identifying multilingual concepts and entities in text 4
5 The driving force Web content is available in many languages Information should be extracted and processed independently of the source/target language This could be done automatically by means of high-performance multilingual text understanding 5
6 Word Sense Disambiguation and Entity Linking «Thomas and Mario are strikers playing in Munich» Entity Linking: The task of discovering mentions of entities within a text and linking them in a knowledge base. WSD: The task aimed at assigning meanings to word occurrences within text. 6
7 The general problem POLYSEMY Natural language is ambiguous The most frequent words have several meanings! Our job: model meaning from a computational perspective 11
8 Monosemous vs. Polysemous words Monosemous words: only one meaning Examples: plant life internet Polysemous words: more than one meaning Example: bar a room or establishment where alcoholic drinks are served a counter where you can obtain food or drink a rigid piece of metal or wood musical notation for a repeating pattern of musical beats 12
9 Thesauri Groups words according to similar meaning Relations between groups (e.g., narrower meanings) Roget s Thesaurus (1911) Machine Readable Dictionaries Enumerates all meanings of a word Includes definitions, morphology, example usages, etc. Oxford Dictionary of English, LDOCE, Collins, etc. But how do we represent and encode semantics? Computation Lexicons Repositories of structured knowledge about a word semantics and syntax Include relations like hypernymy, meronymy, or entailment WordNet 15
10 What if we choose BabelNet as our sense inventory? 16
11 BabelNet 17
12 BabelNet [Navigli and Ponzetto, AIJ 2012] A wide-coverage multilingual semantic network including both encyclopedic (from Wikipedia) and lexicographic (from WordNet) entries NEs and specialized concepts from Wikipedia Concepts from WordNet Concepts integrated from both resources 18
13 50 languages covered (including Latin!) Anatomy of BabelNet 2.5 List of languages at 19
14 Anatomy of BabelNet languages covered (including Latin!) 9.3M Babel synsets (concepts and named entities) 67M word senses 262M semantic relations (28 edges per synset on avg.) 7.7M synset-associated images 21M textual definitions 20
15 New 2.5 version out! Seamless integration of: WordNet 3.0 Wikipedia Wikidata Wiktionary OmegaWiki Open Multilingual WordNet [Bond and Foster, 2013] Translations for all open-class parts of speech 1.1B RDF triples available via SPARQL endpoint 21
16 WordNet+OpenMultilingualWordNet+ Wikipedia+ 22
17 +OmegaWiki+automatic translations 23
18 +textual definitions 24
19 More definitions+wikipedia categories+ 25
20 +images 26
21 1. From six to 50 languages; 2. From two resources to six; BabelNet From 5 million to 9.3 million synsets; 4. From 50 million to 68 million word senses; 5. From 140 million semantic relations to 262 million semantic relations 27
22 BabelNet 3.0 available from November 2014!!! 28
23 BabelNet 3.0 available from November 2014!!! 29
24 New Babelfy interface available from November 2014!!! 30
25 Part 2: BabelNet-lemon: BabelNet as multilingual linked data! 31
26 babelnet.org#lemonrepresentation - the RDF resource consists of a set of Lexicons, one per language. lemmas - Lexicons gather Lexical Entries which comprise the forms of an entry; in our case: words of the Babel lexicon. - Lexical Forms encode the surface realisation(s) of Lexical Entries; in our case: lemmas of Babel words. - Lexical Senses represent the usage of a word as reference to a specific concept; in our case: Babel senses. - Skos Concepts represent units of thought ; in our case: Babel synsets. SKOS concepts Babel words Babel senses Babel synsets 32
27 babelnet.org#interlinking Links towards encyclopedic datasets: Wikipedia pages (36M): bn-lemon:wikipediapage on lemon:lexicalsense Wikipedia categories (46M): bn-lemon:wikipediacategory on skos:concept DBpedia pages (4M): skos:exactmatch on skos:concept DBpedia categories (15M): bn-lemon:dbpediacategory on skos:concept Links towards lexical datasets: lemon-wordnet (117k): skos:exactmatch on skos:concept lemon-uby, OmegaWiki (en) (15k): skos:exactmatch on skos:concept 33
28 babelnet.org#publicationontheweb RDF dumps format: n-triples We produced dumps for both URIs and IRIs SPARQL endpoint virtuoso universal server Dereferencing babelnet.org/2.0/ Pubby Linked Data Frontend (http://wifo5-03.informatik.uni-mannheim.de/pubby/) 34
29 Hands-on Session: BabelNet 35
30 Go to: 36
31 1. Open directory babelnet-lemon 2. Launch launch_fuseki.sh or launch_fuseki.bat 3. Open 4. Click on Control Panel 5. Click on Select Fuseki 6. Within the SPARQL query text area put your SPARQL queries 7. [List of possible words in file freqs.top10k.words.txt] 37
32 Some useful prefixes PREFIX bn: <http://babelnet.org/2.0/> PREFIX bn-lemon: <http://babelnet.org/model/babelnet#> PREFIX lemon: <http://www.lemon-model.net/lemon#> PREFIX skos: <http://www.w3.org/2004/02/skos/core#> PREFIX lexinfo: <http://www.lexinfo.net/ontology/2.0/lexinfo#> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX dc: <http://purl.org/dc/elements/1.1/> PREFIX dcterms: <http://purl.org/dc/terms/> 38
33 Exercise 1: Retrieve the senses of a given lemma Given a word, e.g. home, retrieve all its senses and corresponding synsets in all supported languages: SELECT DISTINCT?sense?synset WHERE {?entries a lemon:lexicalentry.?entries lemon:sense?sense.?sense lemon:reference?synset.?entries rdfs:label?term. FILTER (str(?term)="home") } LIMIT 10 39
34 Exercise 2: Retrieve the senses of a lemma for a certain language We can restrict to a given language, e.g. English: SELECT DISTINCT?sense?synset WHERE {?entries a lemon:lexicalentry.?entries lemon:language "EN".?entries lemon:sense?sense.?sense lemon:reference?synset.?entries rdfs:label?term. FILTER (str(?term)="home") } LIMIT 10 40
35 Exercise 3: Retrieve the translations of a given sense For instance, given the sense SELECT?translation WHERE { }?entry a lemon:lexicalsense.?entry lexinfo:translation?translation. FILTER (str(?entry)="http://babelnet.org/2.0/home_en/s n") 41
36 Exercise 4: Retrieve license information about a sense For instance, given the sense: SELECT?license WHERE { }?entry a lemon:lexicalsense.?entry dcterms:license?license. FILTER (str(?entry)="http://babelnet.org/2.0/home_en/s n") 42
37 Exercise 5: Retrieve textual definitions in all languages For instance, given the synset SELECT DISTINCT?language?gloss?license?sourceurl WHERE { }?url a skos:concept.?url bn-lemon:synsetid?synsetid. OPTIONAL { }?url bn-lemon:definition?definition.?definition lemon:language?language.?definition bn-lemon:gloss?gloss.?definition dcterms:license?license.?definition dc:source?sourceurl. FILTER (str(?url)="http://babelnet.org/2.0/s n") 43
38 Exercise 6: Retrieve a synset s hypernyms For instance, given the synset: SELECT?broader WHERE {?entry a skos:concept. OPTIONAL {?entry skos:broader?broader } FILTER (str(?entry)="http://babelnet.org/2.0/s n") } 44
39 Part 2: WSD and Entity Linking Together! 45
40 Back to the original problem 46
41 strikers (target word) Word Sense Disambiguation in a Nutshell Thomas and Mario are strikers playing in Munich (context) knowledge WSD system sense of target word 47
42 Entity Linking in a Nutshell Thomas (target mention) Thomas and Mario are strikers playing in Munich (context) EL system knowledge Named Entity 48
43 Entity Linking EL encompasses a set of similar tasks: Named Entity Disambiguation, that is the task of linking entity mentions in a text to a knowledge base Wikification, that is the automatic annotation of text by linking its relevant fragments of text to the appropriate Wikipedia articles. 49
44 The multilingual aspect of disambiguation In both tasks, WSD and EL, knowledge-based approaches have been shown to perform well. What about multilinguality? Which kind of resources are available out there? Open Multilingual WordNet 51
45 A Joint approach to WSD and EL The main difference between WSD and EL is the kind of inventory used 52
46 But BabelNet can be used as a multilingual inventory for both: 1. Concepts Calcio in Italian can denote different concepts: 2. Named Entities The text Mario can be used to represent different things such as the video game character or a soccer player (Gomez) or even a music album 53
47 Calcio/Kick in BabelNet
48 Calcio/Calcium in BabelNet
49 Calcio/Soccer in BabelNet
50 Disambiguation and Entity Linking together! BabelNet is a huge multilingual inventory for both word senses and named entities! 57
51 So what? 58
52 Babelfy: A Joint approach to WSD and EL [Moro et al., TACL 2014] Based on Personalized PageRank, the state-of-the-art method for graph-based WSD. However, it cannot be run for each new input on huge graphs. Idea: Precompute semantic signatures for the nodes! Semantic signatures are the most relevant nodes for a given node in the graph computed by using random walk with restart Andrea Moro and Alessandro Raganato and Roberto Navigli Entity Linking meets Word Sense Disambiguation: a Unified Approach. Transactions of the Association for Computational Linguistics (TACL), 2. 59
53 Demo tomorrow afternoon! 60
54 Babelfying ISWC! 61
55 62
56 Multilingual Word Sense Disambiguation and Entity Linking COLING 2014 Tutorial 20/10/2014 Roberto Navigli and Andrea Moro Roberto Navigli e Tiziano Flati La Sapienza 63
57 64
58 65
59 66
60 67
61 68
62 Annotating with BabelNet: all in one! Key fact! Annotating with BabelNet implies annotating with WordNet and Wikipedia (now also OmegaWiki, Open Multilingual WordNet, Wiktionary and WikiData!) 7 BabelNet 90 90
63 Open Problems: grammar-agnostic All current approaches exploit: POS tagging Lemmatization Noisy (>90% for English, but much less on morphologically rich languages). How to improve? Waiting for better POS taggers Character-based analysis of text 92
64 Open Problems: language-agnostic All current approaches exploit: Knowledge of the input language Automatic language recognition How to improve? Waiting for better language recognition systems Unify the lexicalizations of different languages Noisy (>90% for English, but much less on resource poor languages). Moreover, text which consists of text in multiple languages will be wrongly analyzed for sure! 93
65 Open Problems: fragment recognition Most of the current approaches exploit: Named Entity Recognition Not overlapping text assumption How to improve? Waiting for better NER system Overlap and match everything Noisy (>80% for English, but much less on resource poor languages). Moreover, when assuming that entities and word senses should not overlap you lose information! 94
66 Hands-on Session: Babelfy 99
67 Exercise 1 Go to babelfy.org Type in or copy/paste your favourite text in your favourite language in the text area Select the text language Click on «Babelfy!» Understand the difference between green and yellow balloons 100
68 Part 3: Producing multilingual linked data 101
69 Babelfy 2 NIF (hackathon at SEMANTiCS 2014) Free text Text annotated with Babelfy Annotated text in RDF 102
70 From the previous talk you should already know everything about NIF, don t you? But just in case 103
71 NIF The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. NIF consists of specifications, ontologies and software 104
72 NIF Reuse of existing standards (such as RDF, OWL 2, the PROV Ontology, etc.) NIF identifiers are used in the Internationalization Tag Set (ITS) Version 2.0 Royalty-free and published under an open license. Driven by its open community project NLP2RDF good uptake by industry, open-source projects and developers. 105
73 106
74 Annotate text with a few lines of code! Set the language Set the text Obtain the annotations Convert into RDF 107
75 Babelfy2Nif: an example Toy sentence: the semantic web is a collaborative movement led by the international standards body world wide web consortium 108
76 Babelfy2Nif: an example the semantic web is a collaborative movement led by the international standards body world wide web consortium nif:context defines the overall text Text is modelled by fragments Fragments are identified by left and right indices The BabelNet synset (i.e., the annotation of the fragment) 109
77 Hands-on Session: Babelfy 2 NIF 110
78 1. Open directory babelfy 2. Open file babelfy2nif.properties under config/ directory Select your language (language variable) Type in your favourite text (text variable) Overview Select the algorithm for handling overlapping annotations (algorithm variable) Select the output format (rdf_format variable) Select the output stream (output_stream variable) 3. Launch sh run_babelfy2nif-demo.sh 111
79 Exercise 1 Take the first paragraph of the English ISWC Wikipedia page Web_Conference Feed it in the property file and produce 2 outputs: 1. With LONGEST_ANNOTATION_GREEDY_ALGORITHM algorithm and NTRIPLE rdf format 2. With FIRST_COME_FIRST_SERVED_ALGORITHM algorithm and TURTLE rdf format 112
80 To summarize 114
81 To summarize We have taken you through a tour of: A very large multilingual semantic network: BabelNet A state-of-the-art WSD and EL system: Babelfy 115
82 Acknowledgements European Research Council and the EU Commission for funding our research Maud Ehrmann and Andrea Moro for their help with slides 116
83 117
84 Google group: babelnet-group 118
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
How much does word sense disambiguation help in sentiment analysis of micropost data?
How much does word sense disambiguation help in sentiment analysis of micropost data? Chiraag Sumanth PES Institute of Technology India Diana Inkpen University of Ottawa Canada 6th Workshop on Computational
Multilingual Word Sense Disambiguation and Entity Linking Roberto Navigli. http://lcl.uniroma1.it ERC Starting Grant n. 259234
Multilingual Word Sense Disambiguation and Entity Linking http://lcl.uniroma1.it ERC Starting Grant n. 259234 Lisbon, 22 nd July 2015 Multilingual Word Sense Disambiguation and Entity Linking [with BabelNet
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
Mapping a Traditional Dialectal Dictionary with Linked Open Data
Mapping a Traditional Dialectal Dictionary with Linked Open Data Eveline Wandl-Vogt 1, Thierry Declerck 1,2 1 Institute for Corpus Linguistics and Text Technology, Austrian Academy of Sciences Sonnenfelsgasse
Domain Adaptive Relation Extraction for Big Text Data Analytics. Feiyu Xu
Domain Adaptive Relation Extraction for Big Text Data Analytics Feiyu Xu Outline! Introduction to relation extraction and its applications! Motivation of domain adaptation in big text data analytics! Solutions!
DBpedia German: Extensions and Applications
DBpedia German: Extensions and Applications Alexandru-Aurelian Todor FU-Berlin, Innovationsforum Semantic Media Web, 7. Oktober 2014 Overview Why DBpedia? New Developments in DBpedia German Problems in
RDF y SPARQL: Dos componentes básicos para la Web de datos
RDF y SPARQL: Dos componentes básicos para la Web de datos Marcelo Arenas PUC Chile & University of Oxford M. Arenas RDF y SPARQL: Dos componentes básicos para la Web de datos Valladolid 2013 1 / 61 Semantic
Towards the Integration of a Research Group Website into the Web of Data
Towards the Integration of a Research Group Website into the Web of Data Mikel Emaldi, David Buján, and Diego López-de-Ipiña Deusto Institute of Technology - DeustoTech, University of Deusto Avda. Universidades
Linked Data for sharing, discovery and re-use of Language Resources at a Web scale
Linked Data for sharing, discovery and re-use of Language Resources at a Web scale A. Gómez-Pérez Universidad Politécnica de Madrid asun@fi.upm.es Acknowledgement: J. Gracia, V. Rodriguez, P. Cimiano 10/06/2015
Natural Language Processing. Part 4: lexical semantics
Natural Language Processing Part 4: lexical semantics 2 Lexical semantics A lexicon generally has a highly structured form It stores the meanings and uses of each word It encodes the relations between
STAR Semantic Technologies for Archaeological Resources. http://hypermedia.research.glam.ac.uk/kos/star/
STAR Semantic Technologies for Archaeological Resources http://hypermedia.research.glam.ac.uk/kos/star/ Project Outline 3 year AHRC funded project Started January 2007, finish December 2009 Collaborators
Publishing Linked Data from relational databases
Publishing Linked Data from relational databases Iván Ruiz Rube Departamento de Lenguajes y Sistemas Informáticos Universidad de Cádiz 09/11/2011 Jornadas de Software Libre y Web 2.0 1 Roadmap The evolution
LinkZoo: A linked data platform for collaborative management of heterogeneous resources
LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research
CMPT-825 Natural Language Processing
CMPT-825 Natural Language Processing Anoop Sarkar http://www.cs.sfu.ca/ anoop February 11, 2008 1 / 19 Lexical Semantics So far, we have listed words in our lexicon or vocabulary assuming a single meaning
ELIS Multimedia Lab. Linked Open Data. Sam Coppens MMLab IBBT - UGent
Linked Open Data Sam Coppens MMLab IBBT - UGent Overview: Linked Open Data: Principles Interlinking Data LOD Server Tools Linked Open Data: Principles Term Linked Data was first coined by Tim Berners Lee
We have big data, but we need big knowledge
We have big data, but we need big knowledge Weaving surveys into the semantic web ASC Big Data Conference September 26 th 2014 So much knowledge, so little time 1 3 takeaways What are linked data and the
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study
Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister
Lift your data hands on session
Lift your data hands on session Duration: 40mn Foreword Publishing data as linked data requires several procedures like converting initial data into RDF, polishing URIs, possibly finding a commonly used
Phrases. Topics for Today. Phrases. POS Tagging. ! Text transformation. ! Text processing issues
Topics for Today! Text transformation Word occurrence statistics Tokenizing Stopping and stemming Phrases Document structure Link analysis Information extraction Internationalization Phrases! Many queries
Using NLP and Ontologies for Notary Document Management Systems
Outline Using NLP and Ontologies for Notary Document Management Systems Flora Amato, Antonino Mazzeo, Antonio Penta and Antonio Picariello Dipartimento di Informatica e Sistemistica Universitá di Napoli
Semantic Advertising for Web 3.0
Semantic Advertising for Web 3.0 Edward Thomas, Jeff Z. Pan, Stuart Taylor, Yuan Ren, Nophadol Jekjantuk, and Yuting Zhao Department of Computer Science University of Aberdeen Aberdeen, Scotland Abstract.
Knowledge-Based WSD on Specific Domains: Performing Better than Generic Supervised WSD
Knowledge-Based WSD on Specific Domains: Performing Better than Generic Supervised WSD Eneko Agirre and Oier Lopez de Lacalle and Aitor Soroa Informatika Fakultatea, University of the Basque Country 20018,
Enriching Subtitled YouTube Media Fragments via Utilization of the Web-Based Natural Language Processors and Efficient Semantic Video Annotations
Enriching Subtitled YouTube Media Fragments via Utilization of the Web-Based Natural Language Processors and Efficient Semantic Video Annotations Babak Farhadi Department of Computer engineering University
ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU
ONTOLOGIES p. 1/40 ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU Unlocking the Secrets of the Past: Text Mining for Historical Documents Blockseminar, 21.2.-11.3.2011 ONTOLOGIES
Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network
1/12 Task Force on Enrichment and Evaluation Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana 29/10/2015 Project Name Type of enrichments Tool for manual
Chapter 8. Final Results on Dutch Senseval-2 Test Data
Chapter 8 Final Results on Dutch Senseval-2 Test Data The general idea of testing is to assess how well a given model works and that can only be done properly on data that has not been seen before. Supervised
DATA MANAGEMENT PLAN DELIVERABLE NUMBER RESPONSIBLE AUTHOR. Co- funded by the Horizon 2020 Framework Programme of the European Union
DATA MANAGEMENT PLAN Co- funded by the Horizon 2020 Framework Programme of the European Union DELIVERABLE NUMBER DELIVERABLE TITLE D7.4 Data Management Plan RESPONSIBLE AUTHOR DFKI GRANT AGREEMENT N. PROJECT
Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words
, pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan
Drupal and the Media Industry. Stéphane Corlosquet EMWRT IX, Sept 2013, Amsterdam
Drupal and the Media Industry Stéphane Corlosquet EMWRT IX, Sept 2013, Amsterdam 1 Agenda 1. 2. 3. 4. 5. 2 Introduction The case for Drupal in Media Drupal and Acquia in the Enterprise Drupal and Semantic
María Elena Alvarado gnoss.com* elenaalvarado@gnoss.com Susana López-Sola gnoss.com* susanalopez@gnoss.com
Linked Data based applications for Learning Analytics Research: faceted searches, enriched contexts, graph browsing and dynamic graphic visualisation of data Ricardo Alonso Maturana gnoss.com *Piqueras
STAR Semantic Technologies for Archaeological Resources. http://hypermedia.research.glam.ac.uk/kos/star/
STAR Semantic Technologies for Archaeological Resources http://hypermedia.research.glam.ac.uk/kos/star/ Project Outline 3 year AHRC funded project Started January 2007, finish December 2009 Collaborators
The Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs OUSocial2: a platform for gathering students feedback from social media Conference Item How to
NATURAL LANGUAGE PROCESSING WORD SENSE DISAMBIGUATION
NATURAL LANGUAGE PROCESSING (COM4513/6513) WORD SENSE DISAMBIGUATION Andreas Vlachos a.vlachos@sheffield.ac.uk Department of Computer Science University of Sheffield 1 SO FAR part-of-speech tagging syntactic
Semantic tagging for crowd computing
Semantic tagging for crowd computing Roberto Mirizzi 1, Azzurra Ragone 1,2, Tommaso Di Noia 1, and Eugenio Di Sciascio 1 1 Politecnico di Bari Via Orabona, 4, 70125 Bari, Italy mirizzi@deemail.poliba.it,
Semantic Lifting of Unstructured Data Based on NLP Inference of Annotations 1
Semantic Lifting of Unstructured Data Based on NLP Inference of Annotations 1 Ivo Marinchev Abstract: The paper introduces approach to semantic lifting of unstructured data with the help of natural language
Semantics-aware Software Project Repositories
Semantics-aware Software Project Repositories Jonas Tappolet Department of Informatics, University of Zurich, Switzerland tappolet@ifi.uzh.ch Abstract. This proposal explores a general framework to solve
Presente e futuro del Web Semantico
Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu
GR4PHP: A Programming API for Consuming E-Commerce Data from the Semantic Web
GR4PHP: A Programming API for Consuming E-Commerce Data from the Semantic Web First Workshop on Programming the Semantic Web, ISWC 2012 November 11-15, 2012, Boston, USA Alex Stolz, Mouzhi Ge and {firstname.lastname}@unibw.de
PoS-tagging Italian texts with CORISTagger
PoS-tagging Italian texts with CORISTagger Fabio Tamburini DSLO, University of Bologna, Italy fabio.tamburini@unibo.it Abstract. This paper presents an evolution of CORISTagger [1], an high-performance
Comparing Ontology-based and Corpusbased Domain Annotations in WordNet.
Comparing Ontology-based and Corpusbased Domain Annotations in WordNet. A paper by: Bernardo Magnini Carlo Strapparava Giovanni Pezzulo Alfio Glozzo Presented by: rabee ali alshemali Motive. Domain information
Linked Data Publishing with Drupal
Linked Data Publishing with Drupal Joachim Neubert ZBW German National Library of Economics Leibniz Information Centre for Economics SWIB13 Workshop Hamburg, Germany 25.11.2013 ZBW is member of the Leibniz
XML, Seman9c Web and Content Analy9cs
XML, Seman9c Web and Content Analy9cs XML Prague Pre- conference 2014 Felix Sasaki DFKI / W3C Fellow 1 What do you need to follow this session? Ideal: a computer with internet access, to be able to provide
Data-Gov Wiki: Towards Linked Government Data
Data-Gov Wiki: Towards Linked Government Data Li Ding 1, Dominic DiFranzo 1, Sarah Magidson 2, Deborah L. McGuinness 1, and Jim Hendler 1 1 Tetherless World Constellation Rensselaer Polytechnic Institute
Word Meaning & Word Sense Disambiguation
Word Meaning & Word Sense Disambiguation CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Today Representing word meaning Word sense disambiguation as supervised classification Word sense
How semantic technology can help you do more with production data. Doing more with production data
How semantic technology can help you do more with production data Doing more with production data EPIM and Digital Energy Journal 2013-04-18 David Price, TopQuadrant London, UK dprice at topquadrant dot
Semantic Technologies: Linked Data and OER
Semantic Technologies: Linked Data and OER Opening and linking the data and content silos, to leverage the knowledge capital represented by our OER repositories Edmundo Tovar (UPM etovar@fi.upm.es ) Nelson
Processing: current projects and research at the IXA Group
Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga Motivation A language that seeks to survive
Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible
Taming Big Data Variety with Semantic Graph Databases Evren Sirin CTO Complexible About Complexible Semantic Tech leader since 2006 (née Clark & Parsia) software, consulting W3C leadership Offices in DC
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet Muhammad Atif Qureshi 1,2, Arjumand Younus 1,2, Colm O Riordan 1,
BabelNet: the largest multilingual encyclopedic dictionary! Roberto Navigli.
BabelNet: the largest multilingual encyclopedic dictionary! http://lcl.uniroma1.it Utrecht, the Netherlands, 25 th September 2015 Speaker Professore Associato Dipartimento di Informatica (Sapienza Università
DATA MODEL FOR STORAGE AND RETRIEVAL OF LEGISLATIVE DOCUMENTS IN DIGITAL LIBRARIES USING LINKED DATA
DATA MODEL FOR STORAGE AND RETRIEVAL OF LEGISLATIVE DOCUMENTS IN DIGITAL LIBRARIES USING LINKED DATA María Hallo 1, Sergio Luján-Mora 2, and Alejandro Mate 3 1 Department of Computer Science, National
HTML5 based Facet Browser for SPARQL Endpoints
HTML5 based Facet Browser for SPARQL Endpoints Martina Janevska, Milos Jovanovik, Dimitar Trajanov Faculty of Computer Science and Engineering Ss. Cyril and Methodius University Skopje, Republic of Macedonia
D3.3.1: Sematic tagging and open data publication tools
COMPETITIVINESS AND INNOVATION FRAMEWORK PROGRAMME CIP-ICT-PSP-2013-7 Pilot Type B WP3 Service platform integration and deployment in cloud infrastructure D3.3.1: Sematic tagging and open data publication
Il Data Model di Europeana!
EAGLE! EUROPEANA NETWORK OF! ANCIENT GREEK AND LATIN EPIGRAPHY! EAGLE & WIKI LOVES MONUMENTS! Il Data Model di Europeana! Vittore Casarosa, ISTI-CNR! Roma, 16 Maggio 2014 EDM Europeana Data Model q Europeana
Drupal. http://www.flickr.com/photos/funkyah/2400889778
Drupal 7 and RDF Stéphane Corlosquet, - Software engineer, MGH - Drupal 7 core RDF maintainer - SemWeb geek Linked Data Ventures, MIT, Oct 2010 This work is licensed under a Creative
Converging Web-Data and Database Data: Big - and Small Data via Linked Data
DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 Reutlingen University Converging Web-Data and Database Data: Big - and Small Data via Linked Data Moderation: Fritz
WordNet Structure and use in natural language processing
WordNet Structure and use in natural language processing Abstract There are several electronic dictionaries, thesauri, lexical databases, and so forth today. WordNet is one of the largest and most widely
String-Based Semantic Web Data Management Using Ternary B-Trees PhD Seminar, April 29, 2010
String-Based Semantic Web Data Management Using Ternary B-Trees PhD Seminar, April 29, 2010 Jürg Senn Department of Computer Science, University of Basel RDF Resource Description Framework (RDF) basis
TECHNICAL Reports. Discovering Links for Metadata Enrichment on Computer Science Papers. Johann Schaible, Philipp Mayr
TECHNICAL Reports 2012 10 Discovering Links for Metadata Enrichment on Computer Science Papers Johann Schaible, Philipp Mayr kölkölölk GESIS-Technical Reports 2012 10 Discovering Links for Metadata Enrichment
Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints
Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints Christian Bizer 1 and Andreas Schultz 1 1 Freie Universität Berlin, Web-based Systems Group, Garystr. 21, 14195 Berlin, Germany
A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches
J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 105 A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria
A Semantic Web Application The Resilience Knowledge Base and RKBExplorer
A Semantic Web Application The Resilience Knowledge Base and RKBExplorer Hugh Glaser and Ian Millard School of Electronics & Computer Science University of Southampton, UK {hg, icm}@ecs.soton.ac.uk Abstract.
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0 Maud Ehrmann 1, Francesco Cecconi 1, Daniele Vannella 1, John M c Crae 2, Philipp Cimiano 2, Roberto Navigli 1 1 Department of Computer
Semantic annotation of requirements for automatic UML class diagram generation
www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute
Tightly Integrated Data
From From Linked Linked Data Data to to Tightly Integrated Data Tightly Integrated Data May May 2014 2014 Tsinghua University, Beijing Tsinghua University, Beijing 25 Years of the World Wide Web: 1989
Industry 4.0 and Big Data
Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and
LabelTranslator - A Tool to Automatically Localize an Ontology
LabelTranslator - A Tool to Automatically Localize an Ontology Mauricio Espinoza 1, Asunción Gómez Pérez 1, and Eduardo Mena 2 1 UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain
Linked Statistical Data Analysis
Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,
Exploiting Comparable Corpora and Bilingual Dictionaries. the Cross Language Text Categorization
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization Alfio Gliozzo and Carlo Strapparava ITC-Irst via Sommarive, I-38050, Trento, ITALY {gliozzo,strappa}@itc.it
Walk Before You Run. Prerequisites to Linked Data. Kenning Arlitsch Dean of the Library @kenning_msu
Walk Before You Run Prerequisites to Linked Data Kenning Arlitsch Dean of the Library @kenning_msu First, Take Care of Basics Linked Data applications will not matter if search engines can t first find
LinksTo A Web2.0 System that Utilises Linked Data Principles to Link Related Resources Together
LinksTo A Web2.0 System that Utilises Linked Data Principles to Link Related Resources Together Owen Sacco 1 and Matthew Montebello 1, 1 University of Malta, Msida MSD 2080, Malta. {osac001, matthew.montebello}@um.edu.mt
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan
Semantic Technologies for Big Data. Marin Dimitrov (Ontotext)
Semantic Technologies for Big Data Marin Dimitrov (Ontotext) XML Amsterdam 2012 XML Amsterdam 2012 #2 About Ontotext Provides products and services for creating, managing and exploiting semantic data Founded
Interactive Information Access on the Web of Data
Interactive Information Access on the Web of Data Lynda Hardman, Jacco van Ossenbruggen, Alia Amin and Michiel Hildebrand Interactive Information Access http://www.cwi.nl/interactive_information_access
Building a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
Open Data Integration Using SPARQL and SPIN
Open Data Integration Using SPARQL and SPIN A Case Study for the Tourism Domain Antonino Lo Bue, Alberto Machi ICAR-CNR Sezione di Palermo, Italy Research funded by Italian PON SmartCities Dicet-InMoto-Orchestra
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction
Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Nelson Piedra, Jorge López, Janneth Chicaiza, Universidad Técnica Particular de Loja, Ecuador nopiedra@utpl.edu.ec,
Introducing a Diversity-Aware Drupal Extension
Simon Hangl, Ioan Toma, and Andreas Thalhammer University of Innsbruck, Technikerstr. 21a, A-6020 Innsbruck {simon.hangl, ioan.toma, andreas.thalhammer}@sti2.at Abstract. This demonstration paper introduces
SI485i : NLP. Set 10 Lexical Relations. slides adapted from Dan Jurafsky and Bill MacCartney
SI485i : NLP Set 10 Lexical Relations slides adapted from Dan Jurafsky and Bill MacCartney Three levels of meaning 1. Lexical Semantics (words) 2. Sentential / Compositional / Formal Semantics 3. Discourse
The use of Semantic Web Technologies in Spatial Decision Support Systems
The use of Semantic Web Technologies in Spatial Decision Support Systems Adam Iwaniak Jaromar Łukowicz Iwona Kaczmarek Marek Strzelecki The INSPIRE Conference 2013, 23-27 June Wroclaw University of Environmental
SPARQL UniProt.RDF. Get these slides! Tutorial plan. Everyone has had some introduction slash knowledge of RDF.
SPARQL UniProt.RDF Everyone has had some introduction slash knowledge of RDF. Jerven Bolleman Developer Swiss-Prot Group Swiss Institute of Bioinformatics Get these slides! https://sites.google.com/a/jerven.eu/jerven/home/
Getting Started with GRUFF
Getting Started with GRUFF Introduction Most articles in this book focus on interesting applications of Linked Open Data (LOD). But this chapter describes some simple steps on how to use a triple store,
Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo
DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF
MULTIFUNCTIONAL DICTIONARIES
In: A. Zampolli, A. Capelli (eds., 1984): The possibilities and limits of the computer in producing and publishing dictionaries. Linguistica Computationale III, Pisa: Giardini, 279-288 MULTIFUNCTIONAL
Lexical Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT.
Lexical Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Q: What s meaning? Compositional semantics answer: Q: What s meaning? We now answer from a lexical semantics perspective
Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems University of Koblenz Landau, Germany Exploiting Spatial Context in Images Using Fuzzy Constraint Reasoning Carsten Saathoff & Agenda Semantic Web: Our Context Knowledge Annotation
Using DEB Services for Knowledge Representation within the KYOTO Project
Using DEB Services for Knowledge Representation within the KYOTO Project Aleš Horák and Adam Rambousek Faculty of Informatics, Masaryk University Botanická 68a, 602 00 Brno, Czech Republic {hales,xrambous}@fi.muni.cz
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data
Submitted on: 6/20/2014 Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data Tadahiko Oshiba Library Support Division, Kansai-kan of the National Diet Library, Kyoto,
The FAO Geopolitical Ontology: a reference for country-based information
The FAO Geopolitical Ontology: a reference for country-based information Editor(s): Name Surname, University, Country Solicited review(s): Name Surname, University, Country Open review(s): Name Surname,
BUSINESS VALUE OF SEMANTIC TECHNOLOGY
BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director
Scope. Cognescent SBI Semantic Business Intelligence
Cognescent SBI Semantic Business Intelligence Scope...1 Conceptual Diagram...2 Datasources...3 Core Concepts...3 Resources...3 Occurrence (SPO)...4 Links...4 Statements...4 Rules...4 Types...4 Mappings...5
A Configurable Translation-Based Cross-Lingual Ontology Mapping System to adjust Mapping Outcome
A Configurable Translation-Based Cross-Lingual Ontology Mapping System to adjust Mapping Outcome Bo Fu *, Rob Brennan, Declan O Sullivan Knowledge and Data Engineering Group, School of Computer Science
Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project
Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Ahmet Suerdem Istanbul Bilgi University; LSE Methodology Dept. Science in the media project is funded
GetLOD - Linked Open Data and Spatial Data Infrastructures
GetLOD - Linked Open Data and Spatial Data Infrastructures W3C Linked Open Data LOD2014 Roma, 20-21 February 2014 Stefano Pezzi, Massimo Zotti, Giovanni Ciardi, Massimo Fustini Agenda Context Geoportal
SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA
SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA INDEX EXECUTIVE SUMMARY... 3 1. PREFACE... 5 1.1. Acronyms... 5
QASM: a Q&A Social Media System Based on Social Semantics
QASM: a Q&A Social Media System Based on Social Semantics Zide Meng, Fabien Gandon, Catherine Faron-Zucker To cite this version: Zide Meng, Fabien Gandon, Catherine Faron-Zucker. QASM: a Q&A Social Media
Word Polarity Detection Using a Multilingual Approach
Word Polarity Detection Using a Multilingual Approach Cüneyd Murad Özsert and Arzucan Özgür Department of Computer Engineering, Boğaziçi University, Bebek, 34342 İstanbul, Turkey muradozsert@gmail.com,
Achille Felicetti" VAST-LAB, PIN S.c.R.L., Università degli Studi di Firenze!
3D-COFORM Mapping Tool! Achille Felicetti" VAST-LAB, PIN S.c.R.L., Università degli Studi di Firenze!! The 3D-COFORM Project! Work Package 6! Tools for the semi-automatic processing of legacy information!