Distributed Query Processing and Reasoning Over Linked Big Data

Mohammed H.H.; Doğdu E.; Choupani R.; Zarbega T.S.A.

Scopus:
Distributed Query Processing and Reasoning Over Linked Big Data

dc.contributor.author	Mohammed H.H.
dc.contributor.author	Doğdu E.
dc.contributor.author	Choupani R.
dc.contributor.author	Zarbega T.S.A.
dc.date.accessioned	2023-04-11T22:34:47Z
dc.date.accessioned	2023-04-12T00:30:53Z
dc.date.available	2023-04-11T22:34:47Z
dc.date.available	2023-04-12T00:30:53Z
dc.date.issued	2022-01-01
dc.description.abstract	The enormous amount of structured and unstructured data on the web and the need to extract and derive useful knowledge from this big data make Semantic Web and Big Data Technology explorations of paramount importance. Open semantic web data created using standard protocols (RDF, RDFS, OWL) consists of billions of records in the form of data collections called “linked data”. With the ever-increasing linked big data on the Web, it is imperative to process this data with powerful and scalable techniques in distributed processing environments such as MapReduce. There are several distributed RDF processing systems, including SemaGrow, FedX, SPLENDID, PigSPARQL, SHARD, SPARQLGX, that are developed over the years. However, there is a need for computational and qualitative comparison of the differences and similarities among these systems. In this paper, we extend a previous comparative analysis to a diverse study with respect to qualitative and quantitative analysis views, through an experimental approach for these distributed RDF systems. We examine each of the selected RDF query systems with respect to the implementation setup, system architecture, underlying framework, and data storage. We use two widely used RDF benchmark datasets, FedBench and LUBM. Furthermore, we evaluate and examine their performances in terms of query execution time, thus, analyzing how those different types of large-scale distributed query engines, support long-running queries over federated data sources and the query processing times for different queries. The results of the experiments in this study show that SemaGrow distributed system performs more efficiently compared to FedX and Splendid, even though in smaller queries the former performs slower.
dc.identifier.doi	10.1007/978-3-031-23387-6_11
dc.identifier.isbn	9783031233869
dc.identifier.issn	18650929
dc.identifier.scopus	2-s2.0-85148003280
dc.identifier.uri	https://hdl.handle.net/20.500.12597/4208
dc.relation.ispartof	Communications in Computer and Information Science
dc.rights	false
dc.subject	Big Data \| Distributed RDF Query Processing \| Linked Data \| Resource Description Framework (RDF) \| Semantic Web \| SPARQL Protocol and RDF Query Language \| Triple Pattern (TP)
dc.title	Distributed Query Processing and Reasoning Over Linked Big Data
dc.type	Conference Paper
dspace.entity.type	Scopus
local.indexed.at	Scopus
oaire.citation.volume	1725 CCIS
person.affiliation.name	Norges Teknisk-Naturvitenskapelige Universitet
person.affiliation.name	Angelo State University
person.affiliation.name	Angelo State University
person.affiliation.name	Kastamonu University
person.identifier.orcid	0000-0001-7110-0154
person.identifier.orcid	0000-0001-5987-0164
person.identifier.orcid	0000-0003-3271-5054
person.identifier.scopus-author-id	57222721182
person.identifier.scopus-author-id	6603501593
person.identifier.scopus-author-id	8662600400
person.identifier.scopus-author-id	58102616000
relation.isPublicationOfScopus	b5f7bd03-54e1-4c07-8cd2-b78566ca4a5d
relation.isPublicationOfScopus.latestForDiscovery	b5f7bd03-54e1-4c07-8cd2-b78566ca4a5d

Collections

Scopus İndeksli Yayınlar

Scopus: Distributed Query Processing and Reasoning Over Linked Big Data

Files

Collections

Scopus:
Distributed Query Processing and Reasoning Over Linked Big Data