The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

FAIR::Accessor - all this does is assign the HTTP call to the correct routine

VERSION

version 1.0

SYNOPSIS

The following code is a complete implementation of a 'Hello, World!' FAIR Accessor

 C<##!/usr/local/bin/perl -w

 package Metadata;

 use strict;
 use warnings;
 use JSON;
 use FAIR::Accessor;

 #-----------------------------------------------------------------
 # Configuration and Daemon
 #-----------------------------------------------------------------

 use base 'FAIR::Accessor';

 my $config = {
   title => 'Semantic PHI Base Metadata Server',
   ETAG_Base => "TopLevelMetadata_Accessor_For_SemanticPHIBase", # this is a unique identificaiton string for the service (required by the LDP specification)
   localNamespaces => {
        ontology => 'http://example.org/ontologies/MyOntology#', 
        },  # add a few new namespaces to the list of known namespaces....
   basePATH => 'Nameof/ThisScript', # REQUIRED regexp to match the RESTful PATH part of the URL, before the ID number

 };

 my $service = Metadata->new(%$config);

 # start daemon
 $service->handle_requests;


 #-----------------------------------------------------------------
 # Accessor Implementation
 #-----------------------------------------------------------------

 
 =head2 Container
  Function: REQUIRED SUBROUTINE - returns the first-stage LD Platform list of contained URIs and the dataset metadata.
  Args    : $starting_at_record : this will be passed-in to tell you what record to start with (for paginated responses)
  $path : the webserver's PATH_INFO environment value (used to modify the behaviour of REST services)
  Returns : JSON encoded listref of 'meta URIs' representing individual records
  Note    :  meta URIs are generally URIs that point back to this same server; calling GET on a meta URI will
            return an RDF description of the set of DCAT distributions for that record.
            The format of the JSON response is as follows:
            
            {"metadata:element1" : "some value",
             "external:metadatatype":  "some other value",
             "void:entities" : "3",
             "ldp:contains" : ["http://myserver.org/ThisScript/record/479-467-29",
                               "http://myserver.org/ThisScript/record/479-467-32",
                               "http://myserver.org/ThisScript/record/479-467-434"
                               ]
            }
            
            Recommended metadata elements include dc:title, dcat:description,dcat:identifier,
            dcat:keyword,dcat:landingPage,dcat:publisher,dcat:theme
            
            note #1:  Using dcat:theme requires you to create a SKOS concept scheme of the various ontology
            terms that describe the data in your repository... this isn't hard, but it's not entirely trivial either...
            
            note #2:  if you return URLs in the ldp:contains, then you must also return the count of those URLs in void:entities
 =cut
 
 
 sub Container {

   my ($self, %ARGS) = @_;

   # this is how you would manage "RESTful" references to different subsets of your data repository
   #if ($ENV{'REQUEST_URI'} =~ /DataSliceX/) {
   #    # some behavior for Data Slice X
   #} elsif ($ENV{'REQUEST_URI'} =~ /DataSliceY/) {
   #    # some behavior for Data Slice Y
   #}
 
   my $metadata =  $self->getRepositoryMetadata();

  # you may chose to return no record IDs at all, if you only want to serve repository-level metadata
   
   my $BASE_URL = "http://" . $ENV{'SERVER_NAME'} . $ENV{'REQUEST_URI'};

   my @known_records = ('http://example.org/Nameof/ThisScript/123');  # generate a list of HTTP records
   # note that the record URLs point to this script!
   
   # these two pieces of metadata are required by the LDP specification
   $metadata->{'void:entities'} = scalar(@known_records);  #  THE TOTAL *NUMBER* OF RECORDS THAT CAN BE SERVED
   $metadata->{'ldp:contains'} = \@known_records; # the listref of record ids

   return encode_json($metadata);

 }

 


 =head2 MetaRecord

  Function: REQUIRED IF get_all_meta_URIs list of URIs point back to this script.
           returns the second-stage LD Platform metadata describing the DCAT distributions, formats, and URLs
           for a particular record
  Args    : $ID : the desired ID number, as determined by the Accessor.pm module
           $PATH_INFO : the webserver's PATH_INFO environment value (in case the $ID best-guess is wrong... then you're on your own!)
  Returns : JSON encoded hashref of 'meta URIs' representing individual DCAT distributions and their mime-type (mime-type is key)
            The format for this response is (you are always allowed to use lists as values if you wish):
            
            {"metadata":
                {"rdf:type": ["edam:data_0006","sio:SIO_000088"]
                 "my:metadatathingy":  "some value",
                 "external:metadatatype":  "some other value"
                },
            "distributions":
                {"application/rdf+xml" : "http://myserver.org/ThisScript/record/479-467-29X.rdf",
                 "text/html" : "http://myserver.org/ThisScript/record/479-467-29X.html"
                }
            }

 =cut


 sub MetaRecord {
   my ($self, %ARGS) = @_;

   my $ID = $ARGS{'ID'};
   # ID is the piece of the URL that comes after http://example.org/Nameof/ThisScript/___here____
   # you created these URLs in the MetaContainer routine above
   my %response;
   my %formats;
   my %metadata;

   # using the $ID, create the links to the actual records, in whatever format...
   my $accnumber = $ID;  # maybe transform ID to get the accession
   
   # make the links to teh text and RDF versions of this record
   $formats{'text/html'} = "http://example.org/query.php?acc=$accnumber";
   $formats{'application/rdf+xml'} = "http://example.org/Resource/$ID";

   # set the ontological type for the record  (optional)
   $metadata{'rdf:type'} = ['ontology:PHIBO_00022'];  # note the qNAME!  If you have defined 'ontology' in the $config hashref at the beginning, you can use a qname here

   # and whatever other metadata you wish (also optional)
   my $metadata = $self->getDistributionMetadata($accnumber);

   $response{distributions} = \%formats;
   $response{metadata} = $metadata if (keys %$metadata);  # only set it if you can provided something

   my $response  = encode_json(\%response);

   return $response;

 }

 sub getRepositoryMetadata {
   my %metadata = (
   'dc:title' => "Semantic PHI Base Accessor",
   'dcat:description' => "FAIR Accessor server for the Semantic PHI Base.  This server exposes the plant portion of the Pathogen Host Interaction database as Linked Data, following the FAIR Data Principles. This interface (the one you are reading) follows the W3C Linked Data Platform behaviors.   The data provided here is an RDF transformation of data kindly provided by the researchers at PHI Base (doi:10.1093/nar/gku1165)",
   'dcat:identifier' => "http://linkeddata.systems/SemanticPHI/Metadata",
   'dcat:keyword' => ["pathogenesis", "plant/pathogen interactions", "PHI Base", "semantic web", "linked data", "FAIR Data", "genetic database", "phytopathology"],
   'dcat:landingPage' => 'http://www.phi-base.org/',
   'foaf:page' => ['http://linkeddata.systems:8890/sparql','http://www.phi-base.org/'],
   'dcat:language' => 'http://id.loc.gov/vocabulary/iso639-1/en',
   'dc:language' => 'http://lexvo.org/id/iso639-3/eng', 
   'dcat:publisher' => ['http://wilkinsonlab.info',"Rothamsted Research", ' http://www.rothamsted.ac.uk', 'http://www.phi-base.org'],
   'dcat:theme'  => 'http://linkeddata.systems/ConceptSchemes/semanticphi_concept_scheme.rdf',  # this is the URI to a SKOS Concept Scheme
  'daml:has-Technical-Lead' => ["Dr. Alejandro Rodriguez Gonzalez","Alejandro Rodriguez Iglesias"],
  'daml:has-Principle-Investigator' => ["Dr. Mark Wilkinson","Dr. Kim Hammond-Kosack"],
  'dc:creator' => 'http://www.phi-base.org/',
  'pav:authoredBy' => ['http://orcid.org/0000-0002-9699-485X','http://orcid.org/0000-0002-6019-7306'],
  'dcat:contactPoint' => 'http://biordf.org/DataFairPort/MiscRDF/Wilkinson.rdf',
  'dcat:license' => 'http://purl.org/NET/rdflicense/cc-by-nd4.0',
  'dc:license' => 'http://purl.org/NET/rdflicense/cc-by-nd4.0',
  'dc:issued' => "2015-11-17", 
  'rdf:type' => ['prov:Collection', 'dctypes:Dataset'],
   );
   return \%metadata;
   
 } 

 sub getRecordMetadata {
  my ($self, $ID) = @_;
  my %metadata = (
        'dcat:description' => "RDF representation of PHI Base Interaction Record PHI:$ID",
        'dc:title' => "PHI-Base Interaction PHI:$ID",
        'dcat:modified' => "2015-11-17", 
        'dc:issued' => "2015-11-17", 
        'dcat:identifier' => "http://linkeddata.systems/SemanticPHI/Metadata",
        'dcat:keyword' => ["pathogenesis", "host/pathogen interaction", "PHI Base"],
        'dcat:landingPage' => ['http://www.phi-base.org/'],
        'foaf:page' => 'http://linkeddata.systems:8890/sparql',
        'foaf:page' => 'http://www.phi-base.org/',
        'dcat:language' => 'http://id.loc.gov/vocabulary/iso639-1/en',
        'dcat:publisher' => ['http://wilkinsonlab.info',"Rothamsted Research", 'http://www.rothamsted.ac.uk', 'http://www.phi-base.org'],
        'daml:has-Technical-Lead' => ["Dr. Alejandro Rodriguez Gonzalez", "Alejandro Rodriguez Iglesias"],
        'daml:has-Principle-Investigator' => ["Dr. Mark Wilkinson","Dr. Kim Hammond-Kosack"],
        'dcat:contactPoint' => 'http://biordf.org/DataFairPort/MiscRDF/Wilkinson.rdf',
        'void:inDataset' => 'http://linkeddata.systems/SemanticPHIBase/Metadata',
        'dcat:license' => 'http://purl.org/NET/rdflicense/cc-by-nd4.0',
        'dc:license' => 'http://purl.org/NET/rdflicense/cc-by-nd4.0',
        'dc:creator' => 'http://www.phi-base.org/',
        'pav:authoredBy' => ['http://orcid.org/0000-0002-9699-485X', 'http://orcid.org/0000-0002-6019-7306'],
        );
  return \%metadata;
 }

>

DESCRIPTION

FAIR Accessors are an implementation of the W3Cs Linked Data Platform.

FAIR Accessors follow a two-stage interaction, where the first stage retrieves metadata about the repository, and (optionally) a series of URLs representing 'meta-records' for every record in that repository (or whatever slice of the repository is being served). This is accomplished by the MetaContainer subroutine. These URLs will generally point back at this same Accessor script (e.g. with the record number appended to the URL: http://this.host/thisscript/12345).

The second stage involves retrieving metadata about individual recoreds. The metadata is up to you, but optimally it would include the available DCAT distributions and their file formats. The second stage can be accomplished by this same Accessor script, using the Distributions subroutine.

The two subroutine names - MetaContainer and Distributions - are not flexible, as they are called by-name, by the Accessor libraries.

You MUST create the MetaContainer subroutine, at a minimum, and it should return some metadata. It does not have to return a list of known records (in which case it simply acts as a metadata descriptor of the repository in general, nothing more... which is fine!... and there will be no second stage interaction. In this case, you do not need to provide a Distributions subroutine.)

NAME

    FAIR::Accessor - Module for creating Linked Data Platform Accessors for the FAIR Data project

Command-line testing

If you wish to test your Accessor server at the command line, you can run it with the following commandline arguments (in order):

 Method (always GET, at the moment)
 Domain
 Request URI (i.e. the path to this script, including the script name)
 PATH_INFO  (anything that should appear in the PATH_INFO variable of the webserver)

  perl  myAccessorScript  GET  example.net  /this/myAccessorScript /1234567

AUTHOR

Mark Denis Wilkinson (markw [at] illuminae [dot] com)

COPYRIGHT AND LICENSE

This software is Copyright (c) 2016 by Mark Denis Wilkinson.

This is free software, licensed under:

  The Apache License, Version 2.0, January 2004