Discovering and Maintaining Links on the Web of Data
Discovering and Maintaining Links on the Web of Data | |
---|---|
Bibliographical Metadata | |
Subject: | Link Discovery |
Keywords: | Linked data, web of data, link discovery, link maintenance, record linkage, duplicate detection |
Year: | 2009 |
Authors: | Julius Volz, Christian Bizer, Martin Gaedke, Georgi Kobilarov |
Venue | ISWC |
Content Metadata | |
Problem: | No data available now. |
Approach: | No data available now. |
Implementation: | No data available now. |
Evaluation: | No data available now. |
Contents
Abstract
The Web of Data is built upon two simple ideas: Employ the RDF data model to publish structured data on the Web and to create explicit data links between entities within different data sources. This paper presents the Silk – Linking Framework, a toolkit for discovering and maintaining data links between Web data sources. Silk consists of three components: 1. A link discovery engine, which computes links between data sources based on a declarative specification of the conditions that entities must fulfill in order to be interlinked; 2. A tool for evaluating the generated data links in order to fine-tune the linking specification; 3. A protocol for maintaining data links between continuously changing data sources. The protocol allows data sources to exchange both linksets as well as detailed change information and enables continuous link recomputation. The interplay of all the components is demonstrated within a life science use case.
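The link discovery idea described in the abstract (compare entities from two data sources and emit a link when a stated condition holds) can be illustrated with a minimal Python sketch. This is not Silk's engine and not Silk-LSL syntax: the toy datasets, the `label_similarity` and `discover_links` helpers, the `example.org` URIs, the 0.9 threshold, and the use of `owl:sameAs` as the link type are all assumptions made for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical toy datasets; URIs and labels are made up for illustration.
SOURCE = [
    {"uri": "http://example.org/a/aspirin", "label": "Aspirin"},
    {"uri": "http://example.org/a/ibuprofen", "label": "Ibuprofen"},
]
TARGET = [
    {"uri": "http://example.org/b/1", "label": "aspirin"},
    {"uri": "http://example.org/b/2", "label": "Paracetamol"},
]


def label_similarity(a, b):
    """Case-insensitive string similarity in [0, 1] (an assumed metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def discover_links(source, target, threshold=0.9, link_type="owl:sameAs"):
    """Return (source_uri, link_type, target_uri, score) for every pair
    whose label similarity is at or above the threshold."""
    links = []
    for s in source:
        for t in target:
            score = label_similarity(s["label"], t["label"])
            if score >= threshold:
                links.append((s["uri"], link_type, t["uri"], score))
    return links


links = discover_links(SOURCE, TARGET)
for s, p, o, score in links:
    print(f"<{s}> {p} <{o}> .  # score={score:.2f}")
```

The sketch compares every source entity with every target entity, which is only practical for small inputs; the threshold plays the role of the linking condition that a user would tune after inspecting the generated links.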
Conclusion
We presented the Silk framework, a flexible tool for discovering links between entities within different Web data sources. The Silk-LSL link specification language was introduced and its applicability was demonstrated within a life science use case. We then proposed the WOD-LMP protocol for synchronizing and maintaining links between continuously changing Linked Data sources. Future work on Silk will focus on the following areas: We will implement further similarity metrics to support a broader range of linking use cases. To assist users in writing Silk-LSL specifications, machine learning techniques could be employed to adjust weightings or optimize the structure of the matching specification. Finally, we will evaluate the suitability of Silk for detecting duplicate entities within local datasets instead of using it to discover links between disparate RDF data sources. The value of the Web of Data rises and falls with the amount and the quality of links between data sources. We hope that Silk and other similar tools will help to strengthen the linkage between data sources and therefore contribute to the overall utility of the network. The complete Silk – LSL language specification, WoD Link Maintenance Protocol specification and further Silk usage examples are found on the Silk project website at http://www4.wiwiss.fu-berlin.de/bizer/silk/.
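The link maintenance idea mentioned above (a data source announces changes so that links pointing at it can be kept in sync) might be sketched as follows. This is a rough illustration only and does not reproduce the WOD-LMP message format: the `Change` record, its `kind` values, the `apply_changes` helper, and the `example.org` URIs are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    """Hypothetical change notice; NOT the WOD-LMP message format."""
    uri: str
    kind: str  # assumed values: "added", "updated", or "removed"


def apply_changes(links, changes):
    """Drop links to removed resources and collect targets to re-examine.

    links: set of (source_uri, target_uri) pairs.
    Returns (kept_links, uris_to_recompute).
    """
    removed = {c.uri for c in changes if c.kind == "removed"}
    to_recompute = {c.uri for c in changes if c.kind in ("added", "updated")}
    # Links to removed resources are dangling, so they are discarded.
    kept = {(s, t) for (s, t) in links if t not in removed}
    return kept, to_recompute


links = {
    ("http://example.org/a/1", "http://example.org/b/1"),
    ("http://example.org/a/2", "http://example.org/b/2"),
    ("http://example.org/a/3", "http://example.org/b/3"),
}
changes = [
    Change("http://example.org/b/2", "removed"),
    Change("http://example.org/b/3", "updated"),
    Change("http://example.org/b/4", "added"),
]
kept, to_recompute = apply_changes(links, changes)
print(sorted(kept))
print(sorted(to_recompute))
```

In this sketch, existing links to updated resources are kept until they are rechecked; a stricter design could suspend them until recomputation confirms they still hold.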
Future work
No data available now.
Approach
Positive Aspects: No data available now.
Negative Aspects: No data available now.
Limitations: No data available now.
Challenges: No data available now.
Proposes Algorithm: No data available now.
Methodology: No data available now.
Requirements: No data available now.
Implementations
Download-page: No data available now.
Access API: No data available now.
Information Representation: No data available now.
Data Catalogue: No data available now.
Runs on OS: No data available now.
Vendor: No data available now.
Uses Framework: No data available now.
Has Documentation URL: No data available now.
Programming Language: No data available now.
Version: No data available now.
Platform: No data available now.
Toolbox: No data available now.
GUI: No
Research Problem
Subproblem of: No data available now.
RelatedProblem: No data available now.
Motivation: No data available now.
Evaluation
Experiment Setup: No data available now.
Evaluation Method: No data available now.
Hypothesis: No data available now.
Description: No data available now.
Dimensions: No data available now.
Benchmark used: No data available now.
Results: No data available now.
Access API | No data available now. |
Event in series | ISWC |
Has Benchmark | No data available now. |
Has Challenges | No data available now. |
Has DataCatalouge | No data available now. |
Has Description | No data available now. |
Has Dimensions | No data available now. |
Has DocumentationURL | No data available now. |
Has Downloadpage | No data available now. |
Has Evaluation | No data available now. |
Has EvaluationMethod | No data available now. |
Has ExperimentSetup | No data available now. |
Has GUI | No |
Has Hypothesis | No data available now. |
Has Implementation | No data available now. |
Has InfoRepresentation | No data available now. |
Has Limitations | No data available now. |
Has NegativeAspects | No data available now. |
Has PositiveAspects | No data available now. |
Has Requirements | No data available now. |
Has Results | No data available now. |
Has Subproblem | No data available now. |
Has Version | No data available now. |
Has abstract | The Web of Data is built upon two simple ideas: Employ the RDF data model to publish structured data on the Web and to create explicit data links between entities within different data sources. This paper presents the Silk – Linking Framework, a toolkit for discovering and maintaining data links between Web data sources. Silk consists of three components: 1. A link discovery engine, which computes links between data sources based on a declarative specification of the conditions that entities must fulfill in order to be interlinked; 2. A tool for evaluating the generated data links in order to fine-tune the linking specification; 3. A protocol for maintaining data links between continuously changing data sources. The protocol allows data sources to exchange both linksets as well as detailed change information and enables continuous link recomputation. The interplay of all the components is demonstrated within a life science use case. |
Has approach | No data available now. |
Has authors | Julius Volz, Christian Bizer, Martin Gaedke and Georgi Kobilarov |
Has conclusion | We presented the Silk framework, a flexible tool for discovering links between entities within different Web data sources. The Silk-LSL link specification language was introduced and its applicability was demonstrated within a life science use case. We then proposed the WOD-LMP protocol for synchronizing and maintaining links between continuously changing Linked Data sources. Future work on Silk will focus on the following areas: We will implement further similarity metrics to support a broader range of linking use cases. To assist users in writing Silk-LSL specifications, machine learning techniques could be employed to adjust weightings or optimize the structure of the matching specification. Finally, we will evaluate the suitability of Silk for detecting duplicate entities within local datasets instead of using it to discover links between disparate RDF data sources. The value of the Web of Data rises and falls with the amount and the quality of links between data sources. We hope that Silk and other similar tools will help to strengthen the linkage between data sources and therefore contribute to the overall utility of the network. The complete Silk – LSL language specification, WoD Link Maintenance Protocol specification and further Silk usage examples are found on the Silk project website at http://www4.wiwiss.fu-berlin.de/bizer/silk/. |
Has future work | No data available now. |
Has keywords | Linked data, web of data, link discovery, link maintenance, record linkage, duplicate detection |
Has motivation | No data available now. |
Has platform | No data available now. |
Has problem | No data available now. |
Has relatedProblem | No data available now. |
Has subject | Link Discovery |
Has vendor | No data available now. |
Has year | 2009 |
ImplementedIn ProgLang | No data available now. |
Proposes Algorithm | No data available now. |
RunsOn OS | No data available now. |
Title | Discovering and Maintaining Links on the Web of Data |
Uses Framework | No data available now. |
Uses Methodology | No data available now. |
Uses Toolbox | No data available now. |