SLINT: A Schema-Independent Linked Data Interlinking System

SLINT: A Schema-Independent Linked Data Interlinking System
SLINT: A Schema-Independent Linked Data Interlinking System
Bibliographical Metadata
Subject:	Link Discovery
Keywords:	linked data, schema-independent, blocking, interlinking
Year:	2012
Authors:	Khai Nguyen, Ryutaro Ichise, Bac Le
Venue	OM
Content Metadata
Problem:	Link Discovery
Approach:	Weighted co-occurrence and adaptive filtering in blocking and instance matching
Implementation:	SLINT
Evaluation:	Accuracy Evaluation

Abstract

Linked data interlinking is the discovery of all instances that represent the same real-world object and locate in different data sources. Since different data publishers frequently use different schemas for storing resources, we aim at developing a schema-independent interlinking system. Our system automatically selects important predicates and useful predicate alignments, which are used as the key for blocking and instance matching. The key distinction of our system is the use of weighted co-occurrence and adaptive filtering in blocking and instance matching. Experimental results show that the system highly improves the precision and recall over some recent ones. The performance of the system and the efficiency of main steps are also discussed.

Conclusion

In this paper, we present SLINT, an efficient schema-independent linked data interlinking system. We select important predicates by predicate’s coverage and discriminability. The predicate alignments are constructed and filtered for obtaining key alignments.We implement an adaptive filtering technique to produce candidates and identities. Compare with the most recent systems, SLINT highly outperforms the precision and recall in interlinking. The performance of SLINT is also very high when it takes around 1 minute to detect more than 13,000 identity pairs.

Future work

Although SLINT has good result on tested datasets, it is not sufficient to evaluate the scalability of our system, which we consider as the current limiting point because of the used of weighted co-occurrence matrix. We will investigate about a solution for this issue in our next work. Besides, we also interested in automatic configuration for every threshold used in SLINT and improving SLINT into a novel cross-domain interlinking system.

Approach

Positive Aspects: No data available now.

Negative Aspects: No data available now.

Limitations: No data available now.

Challenges: No data available now.

Proposes Algorithm: No data available now.

Methodology: No data available now.

Requirements: No data available now.

Limitations: No data available now.

Implementations

Download-page: http://ri-www.nii.ac.jp/SLINT/index.html

Access API: No data available now.

Information Representation: No data available now.

Data Catalogue: {{{Catalogue}}}

Runs on OS: No data available now.

Vendor: No data available now.

Uses Framework: No data available now.

Has Documentation URL: No data available now.

Programming Language: No data available now.

Version: No data available now.

Platform: No data available now.

Toolbox: No data available now.

GUI: No

Research Problem

Subproblem of: No data available now.

RelatedProblem: No data available now.

Motivation: No data available now.

Evaluation

Experiment Setup: 2.66Ghz quad-core CPU and 4GB of memory

Evaluation Method : Compare the system with AgreementMaker, SERIMI, and Zhishi.Links

Hypothesis: No data available now.

Description: No data available now.

Dimensions: Accuracy

Benchmark used: LinkedMDB, GeoNames

Results: SLINT system totally outperforms the others on both precision and recall. AgreementMaker has a competitive precision with SLINT on dataset D3 but this system is much lower in recall. Zhishi.Links results on dataset D3 are very high, but the F1 score of SLINT is still 0.05 higher in overall.

Access API	No data available now. +
Event in series	OM +
Has Benchmark	LinkedMDB + and GeoNames +
Has Challenges	No data available now. +
Has DataCatalouge	{{{Catalogue}}} +
Has Description	No data available now. +
Has Dimensions	Accuracy +
Has DocumentationURL	http://No data available now. +
Has Downloadpage	http://ri-www.nii.ac.jp/SLINT/index.html +
Has Evaluation	Accuracy Evaluation +
Has EvaluationMethod	Compare the system with AgreementMaker, SERIMI, and Zhishi.Links +
Has ExperimentSetup	2.66Ghz quad-core CPU and 4GB of memory +
Has GUI	No +
Has Hypothesis	No data available now. +
Has Implementation	SLINT +
Has InfoRepresentation	No data available now. +
Has Limitations	No data available now. +
Has NegativeAspects	No data available now. +
Has PositiveAspects	No data available now. +
Has Requirements	No data available now. +
Has Results	SLINT system totally outperforms the other … SLINT system totally outperforms the others on both precision and recall. AgreementMaker has a competitive precision with SLINT on dataset D3 but this system is much lower in recall. Zhishi.Links results on dataset D3 are very high, but the F1 score of SLINT is still 0.05 higher in overall. of SLINT is still 0.05 higher in overall. +
Has Subproblem	No data available now. +
Has Version	No data available now. +
Has abstract	Linked data interlinking is the discovery … Linked data interlinking is the discovery of all instances that represent the same real-world object and locate in different data sources. Since different data publishers frequently use different schemas for storing resources, we aim at developing a schema-independent interlinking system. Our system automatically selects important predicates and useful predicate alignments, which are used as the key for blocking and instance matching. The key distinction of our system is the use of weighted co-occurrence and adaptive filtering in blocking and instance matching. Experimental results show that the system highly improves the precision and recall over some recent ones. The performance of the system and the efficiency of main steps are also discussed. ficiency of main steps are also discussed. +
Has approach	Weighted co-occurrence and adaptive filtering in blocking and instance matching +
Has authors	Khai Nguyen +, Ryutaro Ichise + and Bac Le +
Has conclusion	In this paper, we present SLINT, an effici … In this paper, we present SLINT, an efficient schema-independent linked data interlinking system. We select important predicates by predicate’s coverage and discriminability. The predicate alignments are constructed and filtered for obtaining key alignments.We implement an adaptive filtering technique to produce candidates and identities. Compare with the most recent systems, SLINT highly outperforms the precision and recall in interlinking. The performance of SLINT is also very high when it takes around 1 minute to detect more than 13,000 identity pairs. to detect more than 13,000 identity pairs. +
Has future work	Although SLINT has good result on tested d … Although SLINT has good result on tested datasets, it is not sufficient to evaluate the scalability of our system, which we consider as the current limiting point because of the used of weighted co-occurrence matrix. We will investigate about a solution for this issue in our next work. Besides, we also interested in automatic configuration for every threshold used in SLINT and improving SLINT into a novel cross-domain interlinking system. a novel cross-domain interlinking system. +
Has keywords	linked data, schema-independent, blocking, interlinking +
Has motivation	No data available now. +
Has platform	No data available now. +
Has problem	Link Discovery +
Has relatedProblem	No data available now. +
Has subject	Link Discovery +
Has vendor	No data available now. +
Has year	2012 +
ImplementedIn ProgLang	No data available now. +
Proposes Algorithm	No data available now. +
RunsOn OS	No data available now. +
Title	SLINT: A Schema-Independent Linked Data Interlinking System +
Uses Framework	No data available now. +
Uses Methodology	No data available now. +
Uses Toolbox	No data available now. +

SLINT: A Schema-Independent Linked Data Interlinking System

Contents

Abstract

Conclusion

Future work

Approach

Implementations

Research Problem

Evaluation

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Search

Create

Data

Kuratierung

Tools