OpenBiodiv: an Implementaion of a Semantic System Running on top of the Biodiversity Knowledge Graph

We presentOpenBiodiv- an implementation of the Open Biodiversity Knowledge Management System. The need for an integrated information system serving the needs of the biodiversity community can be dated at least as far back as the sanctioning of theBouchout declarationin 2007. The Bouchout declaration...

Full description

Saved in:
Bibliographic Details
Published in:Biodiversity Information Science and Standards 2017-08, Vol.1, p.e20084
Main Authors: Senderov, Viktor, Georgiev, Teodor, Agosti, Donat, Catapano, Terry, Sautter, Guido, Ó Tuama, Éamonn, Franz, Nico, Simov, Kiril, Stoev, Pavel, Penev, Lyubomir
Format: Article
Language:eng
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We presentOpenBiodiv- an implementation of the Open Biodiversity Knowledge Management System. The need for an integrated information system serving the needs of the biodiversity community can be dated at least as far back as the sanctioning of theBouchout declarationin 2007. The Bouchout declaration proposes to make biodiversity knowledge freely available as Linked Open Data (LOD)*1. At TDWG2016 Fig.1) we presented the prototype of the sytem - then called Open Biodiversity Knolwedge Management Sysyttem (OBKMS). The specification and design of OpenBiodiv was outlined bySenderov and Penev (2016)and in this talk we would like to showcase its pilot. We believe OpenBiodiv is possibly the first pilot-stage implenatation of a semantic system running on top of the biodiversity knowledge graph. OpenBiodiv has several components: OpenBiodiv ontology: general data model allowing the extraction of biodiversity knowledge from taxonomic articles or from databases such as GBIF. The ontology (in preparation, Journal of Biomedical Semantics, available on GitHub) incorporates several pre-existing models: Darwin-SW (Baskauf and Webb 2016), SPAR (Peroni 2014), Treatment Ontology, and several others. It defines classes, properties, and rules allowing to interlink these disparate ontologies and to create a LOD of biodiversity knowledge. New is the Taxonomic Name Usage class, accompanied by a Vocabulary of Taxonomic Statuses (created via an analysis of 4, 000 Pensoft articles) allowing for the automated inference of the taxonomic status of Latinized scientific names. The ontology allows for multiple backbone taxonomies via the introduction of a Taxon Concept class (equivalent to DarwinCore Taxon) and Taxon Concept Labels as a subclass of biological name. The Biodiversity Knowledge Graph - a LOD dataset of information extracted from taxonomic literature and databases. In practice, it has realized part of what has been proposed duringpro-iBiosphereand later discussed byPage (2016). Its main resources are articles, sub-article componets (tables, figures, treatents, references), author names, institution names, geographical locations, biological names, taxon concepts, and occurrences. Authors have been disambiguated via their affiliation with the use of fuzzy-logic based on theGraphDB Lucene connector. The graph interlinks: (1) Prospectively published literature viaPensoft Publishers.(2) Legacy literature viaPlazi. (3) Well-known resources such as geographical places or institutions
ISSN:2535-0897
2535-0897