Examples from the lectures: Difference between revisions

From info216
No edit summary
No edit summary
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:


=Examples from the lectures=
''This page currently shows the examples used in the Spring of 2023.
It will be updated with examples from 2024 as the course progresses.''


==Lecture 1: Introduction to KGs==
==Lecture 1: Introduction to KGs==
Line 14: Line 15:
</syntaxhighlight>
</syntaxhighlight>


<!--
==Lecture 2: RDF==
==Lecture 2: RDF==
Blank nodes for anonymity, or when we have not decided on a URI:
Blank nodes for anonymity, or when we have not decided on a URI:
Line 139: Line 141:
</syntaxhighlight>
</syntaxhighlight>


==Lecture 7: SHACL==
==Lecture 3: SPARQL==
 
The KG4News knowledge graph can be accessed online [http://bg.newsangler.uib.no here (namespace ''kb'')] (read-only), or you can load the Turtle file into your own GraphDB repository.
(Remember to save the file with ''.ttl'' extension. You can use ''http://i2s.uib.no/kg4news/'' as base URI.)
 
===Limit===
<syntaxhighlight lang="SPARQL">
SELECT ?p WHERE {
    ?s ?p ?o .
}
LIMIT 10</syntaxhighlight>
 
===List distinct properties only (with limit)===
<syntaxhighlight lang="SPARQL">
SELECT DISTINCT ?p WHERE {
    ?s ?p ?o .
}
LIMIT 10
</syntaxhighlight>
 
===Limit with offset===
<syntaxhighlight lang="SPARQL">
SELECT DISTINCT ?p WHERE {
    ?s ?p ?o .
}
LIMIT 10 OFFSET 9
</syntaxhighlight>
 
===List rdf:types===
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
 
SELECT DISTINCT ?t WHERE {
    ?s rdf:type ?t .
}
LIMIT 50
</syntaxhighlight>
 
===URI for Tim Berners-Lee===
<syntaxhighlight lang="SPARQL">
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
 
SELECT DISTINCT ?person WHERE {
    ?person foaf:name ?name . 
    FILTER(CONTAINS(?name, "Berners-Lee"))
}
LIMIT 10  # best to use limit if something goes wrong
</syntaxhighlight>
 
===Describe Tim Berners-Lee===
<syntaxhighlight lang="SPARQL">
DESCRIBE <http://i2s.uib.no/kg4news/author/1432678629>
</syntaxhighlight>
DESCRIBE returns a new RDF graph, whereas SELECT returns a table of rows.
 
===Papers that mention "Semantic Web" in the title===
<syntaxhighlight lang="SPARQL">
PREFIX ss: <http://semanticscholar.org/>
 
SELECT DISTINCT ?paper ?title WHERE {
    ?paper ss:title ?title . 
    FILTER(CONTAINS(STR(?title), "Semantic Web"))
}
LIMIT 50
</syntaxhighlight>
 
===Alternative filter that ignores capitalisation (lower/upper case)===
<syntaxhighlight lang="SPARQL">
    FILTER(REGEX(STR(?title), "Semantic Web", "i"))
</syntaxhighlight>
 
===Authors sorted by name===
<syntaxhighlight lang="SPARQL">
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
 
SELECT DISTINCT * WHERE {
    ?author foaf:name ?name . 
}
ORDER BY ?name
LIMIT 10
</syntaxhighlight>
 
===Sorted by descending name instead===
<syntaxhighlight lang="SPARQL">
ORDER BY DESC(?name)
</syntaxhighlight>
 
===Count papers by author===
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dct: <http://purl.org/dc/terms/>
PREFIX ss: <http://semanticscholar.org/>
 
SELECT DISTINCT ?author (COUNT(?paper) AS ?count) WHERE {
    ?author rdf:type ss:Author . 
    ?paper  rdf:type ss:Paper ;
            dct:contributor ?author .
}
GROUP BY ?author
LIMIT 10
</syntaxhighlight>
 
===Only list the most prolific authors===
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dct: <http://purl.org/dc/terms/>
PREFIX ss: <http://semanticscholar.org/>
 
SELECT DISTINCT ?author (COUNT(?paper) AS ?count) WHERE {
    ?author rdf:type ss:Author . 
    ?paper  rdf:type ss:Paper ;
            dct:contributor ?author .
}
GROUP BY ?author
HAVING (?count >= 10)  # similar to a filter expression
LIMIT 10              # include limit when you test
</syntaxhighlight>
 
===Order by descending paper count===
<syntaxhighlight lang="SPARQL">
SELECT ... {
    ...
}
GROUP BY ?person
HAVING (?count > 10)
ORDER BY DESC(?count)
LIMIT 10
</syntaxhighlight>
 
===Order by descending paper count and then by author name===
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dct: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX ss: <http://semanticscholar.org/>
 
SELECT DISTINCT ?person (SAMPLE(?name) AS ?name) (COUNT(?paper) AS ?count) WHERE {
    ?person rdf:type ss:Author ;
            foaf:name ?name . 
    ?paper  rdf:type ss:Paper ;
            ss:title ?title ;
            dct:contributor ?person .
}
GROUP BY ?person
HAVING (?count > 10)
ORDER BY DESC(?count)
LIMIT 10
</syntaxhighlight>
 
===Embedded Wikidata query===
This is a toy example only. Embedded queries like these are better suited for situations where the same URIs are used in more than one triple store and you want to combine data. But Wikidata and KG4News do not use the same URIs. So instead, the example searches for similar labels, and this is something graph databases may not be optimised for. Moreover, Wikidata uses language-tagged strings whereas KG4News uses plain strings, so the labels cannot even be directly compared.
<syntaxhighlight lang="SPARQL">
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
 
SELECT DISTINCT ?enname ?person ?wdperson WHERE {
    BIND("T. Berners-Lee"@en AS ?enname)  # language-tagger label
 
    SERVICE <https://query.wikidata.org/bigdata/namespace/wdq/sparql> {
        # return a Wikidata identifier (URI) with this label as alternative
        SELECT ?wdperson ?enname WHERE {
            ?wdperson skos:altLabel ?enname .
        }
        LIMIT 10  # we use limit in case the label does not match
    }
           
    BIND(STR(?enname) AS ?name)  # the same label, but with language tag removed
    # return a KG4News identifier (URI) with this label as name
    ?person foaf:name ?name . 
 
}
LIMIT 10
</syntaxhighlight>
 
===Add one or more triples===
From now on you need a Blazegraph that allows writing, for example the [http://sandbox.i2s.uib.no I2S sandbox]. Remember to ''create a new namespace'' first and make sure you ''use'' it afterwards.
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dct: <http://purl.org/dc/terms/>
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX ss: <http://semanticscholar.org/>
 
INSERT DATA {  # note the Turtle-like syntax
    kg:paper_123 rdf:type ss:Paper ;
            ss:title "Semantic Knowledge Graphs for the News: A Review"@en ;
            kg:year 2022 ;
            dct:contributor kg:auth_456, kg:auth_789 .
}
</syntaxhighlight>
 
===Remove one or more triples===
<syntaxhighlight lang="SPARQL">
PREFIX kg: <http://i2s.uib.no/kg4news/>
 
DELETE DATA
{
    kg:paper_123 kg:year 2022  .
}
</syntaxhighlight>
 
===Pattern-based addition and or removal of triples===
<syntaxhighlight lang="SPARQL">
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dct: <http://purl.org/dc/terms/>
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX ss: <http://semanticscholar.org/>
 
DELETE DATA {   
    ?paper dct:contributor kg:auth_456
}
INSERT DATA {   
    ?paper dct:contributor kg:auth_654
}
WHERE {  # the patterns are similar to SELECT patterns
    ?paper dct:contributor kg:auth_456
}
</syntaxhighlight>
 
==Lecture 8: SHACL==
The examples are for use with the [https://shacl.org/playground/ interactive SHACL Playground].
The examples are for use with the [https://shacl.org/playground/ interactive SHACL Playground].


Line 337: Line 559:
</syntaxhighlight>
</syntaxhighlight>


==Lecture 7: RDFS==
==Lecture 8: RDFS==
Create a new Blazegraph Namespace ''with inference'' and ''Use'' it.
Create two new GraphDB Repositories, one with RDFS inference and one with No inference. Try the SPARQL statements on both, so you can compare them.


===rdfs:subClassOf===
To test ''rdfs:subClassOf'':
In the ''Update'' tab, insert the ''Final data graph'' from the SHACL example above. (Select ''Type: RDF data'' and ''Format: Turtle''.)
 
Go to the ''Query'' tab, and SELECT all papers of type ''kg:MainPaper''.
<syntaxhighlight lang="ttl">
<syntaxhighlight lang="ttl">
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX sh: <http://www.w3.org/ns/shacl#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX dcterm: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX th: <http://i2s.uib.no/kg4news/theme/>
PREFIX ss: <http://semanticscholar.org/>


SELECT ?paper WHERE {
INSERT DATA {  
    ?paper rdf:type kg:MainPaper
kg:LOD_paper rdf:type kg:MainPaper .
    kg:MainPaper rdfs:subClassOf kg:Paper .
}
}
</syntaxhighlight>
</syntaxhighlight>


Change the query to select all papers first of type ''ss:Paper'' and then of type ''foaf:Document''.
<syntaxhighlight lang="ttl">
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>


In the ''Update'' tab:
ASK {
<syntaxhighlight lang="ttl">
kg:LOD_paper rdf:type kg:Paper .
kg:MainPaper rdfs:subClassOf ss:Paper .
}
</syntaxhighlight>
</syntaxhighlight>


Back in the ''Query'' tab, rerun the query to select all papers first of type ''ss:Paper'' and then of type ''foaf:Document''.


===Transitive rdfs:subClassOf===
To test ''rdfs:domain'':
In the ''Update'' tab, add the following RDFS rule:
<syntaxhighlight lang="ttl">
<syntaxhighlight lang="ttl">
ss:Paper rdfs:subClassOf foaf:Document .
PREFIX kg: <http://i2s.uib.no/kg4news/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 
INSERT DATA {
kg:TBL kg:author kg:LOD_paper .
    kg:author rdfs:domain kg:Author .
}
</syntaxhighlight>
</syntaxhighlight>


Back in the ''Query'' tab, rerun the query to select all papers first of type ''ss:Paper'' and then of type ''foaf:Document''.
===rdfs:range===
In the ''Update'' tab, add another paper, which is cited (referenced) by the LOD paper:
<syntaxhighlight lang="ttl">
<syntaxhighlight lang="ttl">
kg:SW_Paper
PREFIX kg: <http://i2s.uib.no/kg4news/>
    dcterm:title "The Semantic Web" ;
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    dcterm:contributor kg:TBL, kg:JH, kg:OL .
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>


kg:JH
ASK {
    foaf:name "J. Hendler" .
kg:TBL rdf:type kg:Author .
 
}
kg:OL
    foaf:name "O. Lassila" .
 
kg:LOD_Paper kg:cites kg:SW_Paper  # the new SW paper is cited by the LOD paper
</syntaxhighlight>
</syntaxhighlight>
Note that we have ''not'' stated the RDF types of the new paper and new authors.


In the ''Query'' tab, rerun the SPARQL queries to check that ''kg:SW_Paper'' has no ''rdf:type''.


In the ''Update'' tab add this RDFS rule:
To see all the triples in the KG:
<syntaxhighlight lang="ttl">
<syntaxhighlight lang="ttl">
kg:cites rdfs:range ss:Paper .  # any paper that is cited by a kg:MainPaper is an ss:Paper
SELECT * WHERE {
</syntaxhighlight>
  ?s ?p ?o
 
}
Back in the ''Query'' tab, rerun the SPARQL queries to check that the new SW paper is now an ''ss:Paper'' and ''foaf:Document'', but not a ''kg:MainPaper''.
 
==Lecture 10: OWL-DL==
Here is the OWL file with the minimal Protégé / HermiT example from S10. Save it with the ''.owl'' suffix, for example ''s10-example.owl'', and ''File -> Open'' it in Protégé.
<syntaxhighlight lang='owl'>
<?xml version="1.0"?>
<rdf:RDF xmlns="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#"
    xml:base="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3"
    xmlns:owl="http://www.w3.org/2002/07/owl#"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:xml="http://www.w3.org/XML/1998/namespace"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:untitled-ontology-3="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#">
    <owl:Ontology rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3"/>
 
    <!--
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Object Properties
    //
    ///////////////////////////////////////////////////////////////////////////////////////
    -->
 
    <!-- http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#hasSource -->
    <owl:ObjectProperty rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#hasSource"/>
 
    <!--
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Classes
    //
    ///////////////////////////////////////////////////////////////////////////////////////
    -->
 
    <!-- http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Content -->
    <owl:Class rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Content"/>
 
    <!-- http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Source -->
    <owl:Class rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Source">
        <owl:equivalentClass>
            <owl:Restriction>
                <owl:onProperty>
                    <rdf:Description>
                        <owl:inverseOf rdf:resource="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#hasSource"/>
                    </rdf:Description>
                </owl:onProperty>
                <owl:someValuesFrom rdf:resource="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Content"/>
            </owl:Restriction>
        </owl:equivalentClass>
    </owl:Class>
 
    <!--
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Individuals
    //
    ///////////////////////////////////////////////////////////////////////////////////////
    -->
 
    <!-- http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#NYTimes -->
    <owl:NamedIndividual rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#NYTimes"/>
 
    <!-- http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#article_about_Trump_rally -->
    <owl:NamedIndividual rdf:about="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#article_about_Trump_rally">
        <rdf:type rdf:resource="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#Content"/>
        <hasSource rdf:resource="http://www.semanticweb.org/sinoa/ontologies/2023/2/untitled-ontology-3#NYTimes"/>
    </owl:NamedIndividual>
</rdf:RDF>
 
<!-- Generated by the OWL API (version 4.5.9.2019-02-01T07:24:44Z) https://github.com/owlcs/owlapi -->
</syntaxhighlight>
</syntaxhighlight>


Line 534: Line 675:
vector_neighbours(embeddings['oslo'] - embeddings['norway'] + embeddings['france'])
vector_neighbours(embeddings['oslo'] - embeddings['norway'] + embeddings['france'])
</syntaxhighlight>
</syntaxhighlight>
-->

Latest revision as of 10:54, 20 January 2025

This page currently shows the examples used in the Spring of 2023. It will be updated with examples from 2024 as the course progresses.

Lecture 1: Introduction to KGs

Turtle example:

@prefix ex: <http://example.org/> .
ex:Roger_Stone
    ex:name "Roger Stone" ;
    ex:occupation ex:lobbyist ;
    ex:significant_person ex:Donald_Trump .
ex:Donald_Trump
    ex:name "Donald Trump" .