Lab: SPARQL: Difference between revisions

← Older edit

Latest revision as of 12:22, 30 January 2025

Topics

Setting up GraphDB
SPARQL queries and updates

Useful materials

GraphDB documentation:

Getting Started with GraphDB

Introduction to SPARQL:

Getting Started with SPARQL

SPARQL reference:

Tasks

We recommend you download and install the free desktop version of OntoText's GraphDB to run the SPARQL exercises.

If you do not like proprietary software, it is still possible to do most of the exercises using Blazegraph, which you can download here (requires Java). Blazegraph is a powerful open-source tool, but GraphDB offers even more functionality and is what the lab leader will prepare for this semester.

Installing and running GraphDB

Follow the instructions in Getting Started with GraphDB to download and install GraphDB.

From the Desktop Installation page you can click on "GraphDB download page" and then on "Download GraphDB" to register and request to download GraphDB Free.

When GraphDB has been properly installed and is started, it should open in a web browser window at the address http://localhost:7200/ .

Setting up a repository

Follow the instructions in Getting Started with GraphDB to create a new GraphDB Repository called, for example, info216_lab2_NN, where NN are your initials. Choose No inference for now. Otherwise, the default parameters are fine.

Connect to the new repository and pin it as your default repository.

Load data

Download the Turtle file File:Russia investigation kg.txt, and save it with the correct extension, as russia_investigation_kg.ttl (not .txt). (You can also experiment with the Turtle file you saved after exercises 1 Load the Russia_investigation data through the GraphDB Workbench as described in the QuickStart guide.

You can use http://example.org/ as Base IRI.

Graph visualisation

Go to Explore -> Visual graph and create an Easy graph around the resource http://example.org#investigation_0. Double-click on nodes to expand them. Are there any more investigations related to Richard Nixon?

SPARQL tasks

Go to the SPARQL Query & Update tab.

Task: Using the data in russia_investigation_kg.ttl, write the following SPARQL SELECT queries. ( This page explains the Russian investigation KG a bit more.)

List all triples in your graph.
List the first 100 triples in your graph.
Count the number of triples in your graph.
Count the number of indictments in your graph.
List everyone who pleaded guilty, along with the name of the investigation.
List everyone who were convicted, but who had their conviction overturned by which president.
For each investigation, list the number of indictments made.
For each investigation with multiple indictments, list the number of indictments made.
For each investigation with multiple indictments, list the number of indictments made, sorted with the most indictments first.
For each president, list the numbers of convictions and of pardons made after conviction.

If you have more time

Task: Try to program some of the queries in a Python program (this will be the topic of later labs). You have two options:

Using rdflib: Read the Turtle file into an rdflib Graph and use the query() method.

g = Graph()
g.parse(..., format='ttl')
r = g.query(...your_query_string...)

The hard part is picking the results out of the object r...

Using SPARQLwrapper: You can use SPARQLwrapper (another Python API) to connect to your running GraphDB endpoint. See the Python example page for how to do this.

Task: If you want to explore more, try out the Wikidata Query Service (WDQS):

Wikidata Query Service

WDQS tutorials:

@@ Line 1: / Line 1: @@
 ==Topics==
-* Setting up the Blazegraph graph database.
+* Setting up GraphDB
-* SPARQL queries and updates.
+* SPARQL queries and updates
 ==Useful materials==
-Blazegraph:
+GraphDB documentation:
-* [https://blazegraph.com/ Welcome to Blazegraph]
+* [https://graphdb.ontotext.com/documentation/10.8/ Getting Started with GraphDB]
-SPARQL:
+Introduction to SPARQL:
+* [https://graphdb.ontotext.com/documentation/10.8/sparql.html Getting Started with SPARQL]
+SPARQL reference:
 * [https://www.w3.org/TR/sparql11-query/ SPARQL Query Documentation]
+<!--
 * [http://www.w3.org/TR/sparql11-update/ SPARQL Update Documentation]
+-->
+* [https://en.wikibooks.org/wiki/SPARQL/Expressions_and_Functions SPARQL Expressions and Functions]
 ==Tasks==
-===Running Blazegraph===
+We recommend you download and install the free desktop version of OntoText's GraphDB to run the SPARQL exercises.
-You can either run Blazegraph locally on your own machine (best) or online at a local server (also ok).
-'''Installing the Blazegraph database on your own computer:'''
-Download Blazegraph (blazegraph.jar) from here: [https://blazegraph.com/ https://blazegraph.com/]
-You can place blazegraph.jar in the same folder of your python project for the labs.
-Navigate to the folder of blazegraph.jar in your commandline/terminal using cd. (cd C:\Users\marti\info216 for me as an example). Now run this command:
- java -server -Xmx4g -jar blazegraph.jar
-You might have to [[https://www.oracle.com/technetwork/java/javase/downloads/ install Java 64-bit JDK] if you have problems running Blazegraph. If you get an "Address already in use" error, this is likely because Blazegraph has been terminated improperly. Either restart the terminal-session or try to run this command instead:
- java -server -Xmx4g -Djetty.port=19999 -jar blazegraph.jar
-This changes the port of the Blazegraph server.
-'''Running Blazegraph online:'''
+If you do not like proprietary software, it is still possible to do most of the exercises using Blazegraph, which you can [https://blazegraph.com/ download here] (requires Java). Blazegraph is a powerful open-source tool, but GraphDB offers even more functionality and is what the lab leader will prepare for this semester.
-If you have trouble installing Blazegraph, you can use [http://sandbox.i2s.uib.no/bigdata/ a shared online server] for now. It provides the same Blazegraph interface, but runs in the cloud and can only be used from inside the UiB network. (If you are outside the UiB campus, you can connect through the [https://hjelp.uib.no/tas/public/ssp/content/detail/service?unid=a566dafec92a4d35bba974f0733f3663 UiB VPN] first.)
-'''Using Blazegraph:'''
+===Installing and running GraphDB===
-* ''Create namespace:'' In the Blazegraph interface, you may go to the ''UPDATE'' tab and create a new namespace using default values and the ''Create namespace'' button. You '''must''' do this if you use the shared online server. You can also do this on your local server to keep your datasets separate. (If you do not create a namespace, the default is '''kb'''.)
+Follow the instructions in [https://graphdb.ontotext.com/documentation/10.8/ Getting Started with GraphDB] to download and install GraphDB.
-* ''Uploading data:'' In the Blazegraph interface, go to the ''UPDATE'' tab and use the ''Browse...'' and ''Update'' buttons to load the file into Blazegraph.
-** You can use the data in the Turtle file [[File:russia_investigation_kg.txt]]. Make sure you save it with the correct extension, as ''russia_investigation_kg.ttl'' (not ''.txt'').
-** You can also use the Turtle file you saved after exercises 1 and 2.
-* ''Querying and updating:'' In the Blazegraph interface, go to the ''QUERY'' and ''UPDATE'' tabs to enter queries and updates.
-===SPARQL tasks===
+From the [https://graphdb.ontotext.com/documentation/10.8/graphdb-desktop-installation.html Desktop Installation page] you can click on ''"GraphDB download page"'' and then on ''"Download GraphDB"'' to register and request to download ''GraphDB Free''.
-'''Task:'''
-Using the data in ''russia_investigation_kg.ttl'', write the following SPARQL queries:
-* SELECT all triples in your graph.
-* SELECT all the interests of Cade.
-* SELECT the city and country of where Emma lives.
-* SELECT only people who are older than 26.
-* SELECT Everyone who graduated with a Bachelor Degree.
-This page explains the [[Russian investigation KG]] a bit more.
+When GraphDB has been properly installed and is started, it should open in a web browser window at the address http://localhost:7200/ .
-'''Task:'''
+===Setting up a repository===
-Load the RDF graph you created in exercises 1 and 2. Use INSERT DATA to add these triples to your graph:
+Follow the instructions in [https://graphdb.ontotext.com/documentation/10.8/ Getting Started with GraphDB] to create a new GraphDB Repository called, for example, ''info216_lab2_NN'', where ''NN'' are your initials. Choose ''No inference'' for now. Otherwise, the default parameters are fine.
-* George Papadopoulos was adviser to the Trump campaign.
-** He pleaded guilty to lying to the FBI.
-** He was sentenced to prison.
-* Roger Stone is a Republican.
-** He was adviser to Trump.
-** He was an official in the Trump campaign.
-** He interacted with Wikileaks.
-** He was indicted for making false statements, witness tampering, and obstruction of justice.
-** He made a testimony for the House Intelligence Committee.
-Use SPARQL Update's DELETE DATA to delete that fact that Cade is interested in Photography. Run your SPARQL query again to check that the graph has changed.
+Connect to the new repository and pin it as your default repository.
-Use INSERT DATA to add information about Sergio Pastor, who lives in 4 Carrer del Serpis, 46021 Valencia, Spain. he has a M.Sc. in computer from the University of Valencia from 2008. His areas of expertise include big data, semantic technologies and machine learning.
+===Load data===
+Download the Turtle file [[File:russia_investigation_kg.txt]], and save it with the correct extension, as ''russia_investigation_kg.ttl'' (not ''.txt''). (You can also experiment with the Turtle file you saved after exercises 1 Load the Russia_investigation data through the GraphDB Workbench as described in the QuickStart guide.
+You can use ''http://example.org/'' as Base IRI.
-Write a SPARQL DELETE/INSERT update to change the name of "University of Valencia" to "Universidad de Valencia" whereever it occurs.
+===Graph visualisation===
+Go to ''Explore'' -> ''Visual graph'' and create an ''Easy graph'' around the resource ''http://example.org#investigation_0''. Double-click on nodes to expand them. Are there any more investigations related to ''Richard Nixon''?
-Write a SPARQL DESCRIBE query to get basic information about Sergio.
+===SPARQL tasks===
+Go to the ''SPARQL Query & Update'' tab.
-Write a SPARQL CONSTRUCT query that returns that: any city in an address is a cityOf the country of the same address.
+'''Task:'''
+Using the data in ''russia_investigation_kg.ttl'', write the following SPARQL SELECT queries.
+([[Russian investigation KG | This page explains]] the Russian investigation KG a bit more.)
+* List all triples in your graph.
+* List the first 100 triples in your graph.
+* Count the number of triples in your graph.
+* Count the number of indictments in your graph.
+* List everyone who pleaded guilty, along with the name of the investigation.
+* List everyone who were convicted, but who had their conviction overturned by which president.
+* For each investigation, list the number of indictments made.
+* For each investigation with multiple indictments, list the number of indictments made.
+* For each investigation with multiple indictments, list the number of indictments made, sorted with the most indictments first.
+* For each president, list the numbers of convictions and of pardons made after conviction.
 ==If you have more time==
-'''Task:''' Try to program some of the queries/updates in a Python program (this will be the topic of later labs). You have two options:
+'''Task:''' Try to program some of the queries in a Python program (this will be the topic of later labs). You have two options:
 ''Using rdflib:''
@@ Line 79: / Line 71: @@
 ''Using SPARQLwrapper:''
-You can use SPARQLwrapper (another Python API) to connect to your running Blazegraph endpoint. See the Python example page for how to do this.
+You can use SPARQLwrapper (another Python API) to connect to your running GraphDB endpoint. See the Python example page for how to do this.
-'''Task:''' If you want to explore more, try out Wikidata Query Service (WDQS)
+'''Task:''' If you want to explore more, try out the Wikidata Query Service (WDQS):
 * [https://query.wikidata.org/ Wikidata Query Service]

Anonymous

Search

Lab: SPARQL: Difference between revisions

Namespaces

More

Page actions

Latest revision as of 12:22, 30 January 2025

Contents

Topics

Useful materials

Tasks

Installing and running GraphDB

Setting up a repository

Load data

Graph visualisation

SPARQL tasks

If you have more time

Navigation

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Lab: SPARQL: Difference between revisions

Latest revision as of 12:22, 30 January 2025

Topics

Useful materials

Tasks

Installing and running GraphDB

Setting up a repository

Load data

Graph visualisation

SPARQL tasks

If you have more time

Navigation

Wiki tools

Page tools