Triple Store Service

Overview

The Triple Store Service provides a unified interface for working with RDF (Resource Description Framework) data across different storage backends. It hides the storage implementation behind adapters, so an application can switch backends without changing the code that reads and writes triples.

The service currently supports the following adapters:

- Oxigraph Adapter: Default for local development; a lightweight, high-performance SPARQL triple store
- Filesystem Adapter: Alternative local storage that saves RDF triples directly to the file system
- Naas Adapter: Uses Object Storage Service to store RDF triples in cloud storage
- AWS Neptune Adapter: Connects to AWS Neptune graph database for enterprise-scale RDF storage

This abstraction allows applications to work with semantic data in a consistent way, regardless of whether they're running in development or production environments. The service provides core operations like:

- Inserting triples into the store
- Removing triples from the store
- Querying the store using SPARQL
- Working with subject-specific graphs
- Loading and managing RDF/OWL schemas
- Event subscriptions for ontology changes

The storage backend used is determined by the environment configuration, defaulting to Oxigraph in development and using cloud storage in production environments.

Usage

Default initialization

The Triple Store Service is typically initialized through the application's service manager, which picks an adapter based on the environment. Once loaded, the service is available as an attribute of the services module:

from src import services

# Access the triple store service
triple_store = services.triple_store_service

To see how the service is loaded, check services.py.

Development Environment

In development mode (when the ENV=dev environment variable is set), the service automatically initializes with the Oxigraph adapter:

# Development initialization (happens automatically)
triple_store_service = TripleStoreFactory.TripleStoreServiceOxigraph()

Production Environment

In production, the service uses the Naas adapter with credentials from your configuration:

# Production initialization (happens automatically)
triple_store_service = TripleStoreFactory.TripleStoreServiceNaas(
    naas_api_key=secret.get('NAAS_API_KEY'),
    workspace_id=config.workspace_id,
    storage_name=config.storage_name
)

This initialization is handled automatically when the application starts, allowing you to use the triple store service directly without manual setup.
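
The selection rule described in this section can be pictured with a small sketch. The function name `select_adapter_name` and the labels it returns are hypothetical and are not part of the service; only the `ENV=dev` check comes from the text above. Treating every other value as production is an assumption.

```python
import os
from typing import Mapping, Optional


def select_adapter_name(env: Optional[Mapping[str, str]] = None) -> str:
    """Return a label for the adapter implied by the environment.

    Hypothetical helper: the real selection happens inside the
    application's service loader.
    """
    env = os.environ if env is None else env
    # ENV=dev selects the local Oxigraph adapter.
    if env.get("ENV") == "dev":
        return "oxigraph"
    # Assumption: any other value (or no value) means production,
    # which uses the Naas adapter.
    return "naas"


print(select_adapter_name({"ENV": "dev"}))
print(select_adapter_name({"ENV": "prod"}))
```

Passing the environment as a mapping keeps the rule easy to test without touching the real process environment.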

Factories

The Triple Store Service provides factory methods to create service instances based on your needs:

from abi.services.triple_store.TripleStoreFactory import TripleStoreFactory

# Create a filesystem-based triple store service
triple_store = TripleStoreFactory.TripleStoreServiceFilesystem("/path/to/triple_store")

# Create a Naas-based triple store service
triple_store = TripleStoreFactory.TripleStoreServiceNaas(
    naas_api_key="YOUR_NAAS_API_KEY",
    workspace_id="your-workspace-id",
    storage_name="your-storage-name",
    base_prefix="ontologies"  # Optional, defaults to "ontologies"
)

# Create an Oxigraph-based triple store service
triple_store = TripleStoreFactory.TripleStoreServiceOxigraph(
    oxigraph_url="http://localhost:7878"  # Optional, defaults to env var or localhost
)

# Create an AWS Neptune-based triple store service (with SSH tunnel)
triple_store = TripleStoreFactory.TripleStoreServiceAWSNeptuneSSHTunnel(
    aws_region_name="us-east-1",
    aws_access_key_id="YOUR_ACCESS_KEY",
    aws_secret_access_key="YOUR_SECRET_KEY",
    db_instance_identifier="your-neptune-instance",
    bastion_host="bastion.example.com",
    bastion_port=22,
    bastion_user="ubuntu",
    bastion_private_key="-----BEGIN RSA PRIVATE KEY-----..."
)

Each factory method returns an instance of TripleStoreService that provides a unified interface for RDF operations regardless of the underlying storage backend.

API Reference

`TripleStoreService`

The main service class implementing the ITripleStoreService interface. It provides a unified API for interacting with different triple store backends through adapters.

Methods

`insert(triples: Graph) -> None`

Inserts triples from the provided RDF graph into the store.

Parameters:
- triples: RDFLib Graph containing triples to insert
Returns: None

`remove(triples: Graph) -> None`

Removes the triples in the provided graph from the store.

Parameters:
- triples: RDFLib Graph containing triples to remove
Returns: None

`get() -> Graph`

Gets the complete RDF graph from the triple store.

Returns: RDFLib Graph containing all stored triples

`query(query: str) -> Graph`

Executes a SPARQL query against the triple store.

Parameters:
- query: SPARQL query string to execute
Returns: RDFLib Graph containing query results

`query_view(view: str, query: str) -> Graph`

Executes a SPARQL query against a specific view of the triple store.

Parameters:
- view: Name of the view to query
- query: SPARQL query string to execute
Returns: RDFLib Graph containing query results
Raises: ViewNotFoundError if the view does not exist

`get_subject_graph(subject: str) -> Graph`

Gets the RDF graph containing all triples for a specific subject.

Parameters:
- subject: Subject URI to retrieve triples for
Returns: RDFLib Graph containing all triples for the specified subject
Raises: SubjectNotFoundError if no triples exist with the specified subject

`load_schema(filepath: str) -> None`

Loads an RDF/OWL schema file into the triple store.

Parameters:
- filepath: Path to the RDF/OWL schema file to load
Returns: None

`get_schema_graph() -> Graph`

Gets the RDF graph containing just the schema/ontology triples.

Returns: RDFLib Graph containing only the schema/ontology triples

`subscribe(topic: tuple, event_type: OntologyEvent, callback: Callable, background: bool = False) -> str`

Subscribes to events for a specific topic pattern.

Parameters:
- topic: A (subject, predicate, object) tuple specifying the pattern to match. Each element can be None to match any value in that position.
- event_type: Type of event to subscribe to (INSERT or DELETE)
- callback: Function to call when matching events occur
- background: Whether the callback should be executed in background (default: False)
Returns: A unique subscription ID that can be used to unsubscribe later
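
The wildcard behaviour of `topic` can be illustrated with a minimal sketch. The helper `topic_matches` is hypothetical, and plain strings stand in for RDF terms; the service's own matching code may differ.

```python
from typing import Optional, Tuple

Triple = Tuple[str, str, str]
Topic = Tuple[Optional[str], Optional[str], Optional[str]]


def topic_matches(topic: Topic, triple: Triple) -> bool:
    """Return True when every non-None position of topic equals the triple's."""
    return all(want is None or want == got for want, got in zip(topic, triple))


alice_is_person = ("ex:alice", "rdf:type", "ex:Person")

# Matches: only the predicate is constrained.
print(topic_matches((None, "rdf:type", None), alice_is_person))
# Does not match: the subject differs.
print(topic_matches(("ex:bob", None, None), alice_is_person))
```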

`unsubscribe(subscription_id: str) -> None`

Unsubscribes from events using a subscription ID.

Parameters:
- subscription_id: The subscription ID returned from a previous subscribe() call
Returns: None
Raises: SubscriptionNotFoundError if no subscription exists with the provided ID
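
The subscribe/unsubscribe contract can be sketched as a small registry. Everything here is a stand-in: `SubscriptionRegistry` is hypothetical, the exception class only mirrors the name used above, and generating IDs with `uuid4` is an assumption about how unique IDs might be produced.

```python
import uuid
from typing import Callable, Dict


class SubscriptionNotFoundError(Exception):
    """Stand-in for the service's exception of the same name."""


class SubscriptionRegistry:
    """Hypothetical registry mapping subscription IDs to callbacks."""

    def __init__(self) -> None:
        self._callbacks: Dict[str, Callable] = {}

    def subscribe(self, callback: Callable) -> str:
        # Assumption: a random UUID is unique enough to serve as the ID.
        subscription_id = str(uuid.uuid4())
        self._callbacks[subscription_id] = callback
        return subscription_id

    def unsubscribe(self, subscription_id: str) -> None:
        if subscription_id not in self._callbacks:
            raise SubscriptionNotFoundError(subscription_id)
        del self._callbacks[subscription_id]


registry = SubscriptionRegistry()
sub_id = registry.subscribe(lambda event, triple: None)
registry.unsubscribe(sub_id)  # A second call with the same ID would raise.
```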

Storage Adapters

The service supports multiple storage backends through adapters:

Filesystem Adapter (`TripleStoreService__SecondaryAdaptor__Filesystem`)

Adapter for local file system storage of RDF triples.

Initialization: TripleStoreFactory.TripleStoreServiceFilesystem(store_path: str)

Naas Adapter (`TripleStoreService__SecondaryAdaptor__NaasStorage`)

Adapter for using Naas-managed cloud storage for RDF triples.

Initialization:

TripleStoreFactory.TripleStoreServiceNaas(
    naas_api_key: str,
    workspace_id: str,
    storage_name: str,
    base_prefix: str = "ontologies"
)

Oxigraph Adapter (`Oxigraph`)

Adapter for connecting to Oxigraph triple store instances. Oxigraph provides lightweight, high-performance RDF storage and SPARQL query capabilities with minimal resource footprint.

Features:
- Full SPARQL 1.1 support
- HTTP REST API communication
- Minimal memory footprint (< 100MB)
- Fast startup time (seconds)
- Native Apple Silicon support
- Suitable for development and production use

Initialization:

TripleStoreFactory.TripleStoreServiceOxigraph(
    oxigraph_url: str = None  # Defaults to OXIGRAPH_URL env var or http://localhost:7878
)

Environment Variables:
- OXIGRAPH_URL: Base URL of the Oxigraph instance
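
The fallback order for the Oxigraph URL (explicit argument, then `OXIGRAPH_URL`, then localhost) can be sketched as follows. The helper `resolve_oxigraph_url` is hypothetical, and treating an empty value as unset is an assumption.

```python
import os
from typing import Mapping, Optional

DEFAULT_OXIGRAPH_URL = "http://localhost:7878"


def resolve_oxigraph_url(
    oxigraph_url: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Pick the Oxigraph base URL: argument, then env var, then default."""
    env = os.environ if env is None else env
    if oxigraph_url:
        return oxigraph_url
    # Assumption: an empty OXIGRAPH_URL is treated the same as a missing one.
    return env.get("OXIGRAPH_URL") or DEFAULT_OXIGRAPH_URL


print(resolve_oxigraph_url(env={}))
print(resolve_oxigraph_url(env={"OXIGRAPH_URL": "http://oxigraph:7878"}))
```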

AWS Neptune Adapter (`AWSNeptuneSSHTunnel`)

Adapter for connecting to AWS Neptune managed graph database service. Supports SSH tunneling for secure VPC access.

Features:
- AWS IAM authentication with SigV4 signing
- SSH tunnel support for VPC-deployed instances
- Named graph management
- Enterprise-scale RDF storage

Initialization:

TripleStoreFactory.TripleStoreServiceAWSNeptuneSSHTunnel(
    aws_region_name: str,
    aws_access_key_id: str,
    aws_secret_access_key: str,
    db_instance_identifier: str,
    bastion_host: str,           # SSH bastion host for VPC access
    bastion_port: int,           # SSH port (typically 22)
    bastion_user: str,           # SSH username
    bastion_private_key: str     # SSH private key content
)

Events and Subscriptions

The Triple Store Service provides an event system for reacting to changes in the RDF data:

from abi.services.triple_store.TripleStorePorts import OntologyEvent
from rdflib.namespace import RDF

# Subscribe to all triple insertions with any predicate for a specific subject
subscription_id = triple_store.subscribe(
    topic=("http://example.org/subject1", None, None),
    event_type=OntologyEvent.INSERT,
    callback=lambda event, triple: print(f"Triple added: {triple}")
)

# Subscribe to all triple deletions with rdf:type predicate
subscription_id = triple_store.subscribe(
    topic=(None, RDF.type, None),
    event_type=OntologyEvent.DELETE,
    callback=lambda event, triple: print(f"Triple deleted: {triple}")
)

# Run callback in background for heavy processing
# (process_triple_in_background is a function you define yourself)
subscription_id = triple_store.subscribe(
    topic=(None, None, None),
    event_type=OntologyEvent.INSERT,
    callback=process_triple_in_background,
    background=True
)

# Unsubscribe when no longer needed
triple_store.unsubscribe(subscription_id)
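
To make the effect of `background=True` concrete, here is a minimal sketch of inline versus background dispatch. How the service actually schedules background callbacks is not documented here; this sketch assumes one thread per callback, and `dispatch` is a hypothetical helper.

```python
import threading
from typing import Callable, List, Optional, Tuple

Triple = Tuple[str, str, str]


def dispatch(
    callback: Callable[[str, Triple], None],
    event: str,
    triple: Triple,
    background: bool = False,
) -> Optional[threading.Thread]:
    """Call the callback inline, or on a worker thread when background is True."""
    if not background:
        callback(event, triple)  # The caller waits until the callback returns.
        return None
    worker = threading.Thread(target=callback, args=(event, triple), daemon=True)
    worker.start()  # The caller continues immediately.
    return worker


seen: List[Triple] = []


def record(event: str, triple: Triple) -> None:
    seen.append(triple)


dispatch(record, "INSERT", ("s1", "p", "o"))
worker = dispatch(record, "INSERT", ("s2", "p", "o"), background=True)
worker.join()  # Wait here only so the example finishes deterministically.
```

With an inline callback, slow processing delays the write that triggered the event, which is why heavy work is better suited to a background callback.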

Exceptions

All adapters may raise these exceptions when operations fail:

- SubjectNotFoundError: Raised when trying to access a subject that doesn't exist
- SubscriptionNotFoundError: Raised when trying to unsubscribe with an invalid subscription ID
- ViewNotFoundError: Raised when trying to query a view that doesn't exist
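
A common way to handle a missing subject is to catch the exception and fall back to an empty result. The sketch below uses stand-ins throughout: `FakeStore` and `subject_triples_or_empty` are hypothetical, the exception class only mirrors the name listed above, and sets of tuples replace RDF graphs.

```python
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]


class SubjectNotFoundError(Exception):
    """Stand-in for the service's exception of the same name."""


class FakeStore:
    """Tiny stand-in for the service, backed by a set of tuples."""

    def __init__(self, triples: Iterable[Triple]) -> None:
        self._triples = set(triples)

    def get_subject_graph(self, subject: str) -> Set[Triple]:
        found = {t for t in self._triples if t[0] == subject}
        if not found:
            raise SubjectNotFoundError(f"Subject {subject} not found")
        return found


def subject_triples_or_empty(store: FakeStore, subject: str) -> Set[Triple]:
    """Return the subject's triples, or an empty set if the subject is unknown."""
    try:
        return store.get_subject_graph(subject)
    except SubjectNotFoundError:
        return set()


store = FakeStore([("ex:alice", "ex:name", "Alice")])
print(subject_triples_or_empty(store, "ex:alice"))
print(subject_triples_or_empty(store, "ex:bob"))
```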

Usage Examples

from rdflib import Graph, URIRef, Literal, Namespace
from rdflib.namespace import RDF, RDFS
from src import services

# Access the triple store service
triple_store = services.triple_store_service

# Create a graph with triples
g = Graph()
EX = Namespace("http://example.org/")
g.bind("ex", EX)

# Add some triples
g.add((EX.Person, RDF.type, RDFS.Class))
g.add((EX.name, RDF.type, RDF.Property))
g.add((EX.name, RDFS.domain, EX.Person))
g.add((EX.alice, RDF.type, EX.Person))
g.add((EX.alice, EX.name, Literal("Alice")))

# Insert triples into the store
triple_store.insert(g)

# Get all triples
all_triples = triple_store.get()

# Query for people
query = """
    PREFIX ex: <http://example.org/>
    SELECT ?person ?name
    WHERE {
        ?person a ex:Person .
        ?person ex:name ?name .
    }
"""
result = triple_store.query(query)

# Get triples for a specific subject
alice_graph = triple_store.get_subject_graph("http://example.org/alice")

# Remove a triple
remove_g = Graph()
remove_g.add((EX.alice, EX.name, Literal("Alice")))
triple_store.remove(remove_g)

# Load a schema from file
triple_store.load_schema("path/to/schema.ttl")

How to create a new secondary adapter

Creating a new secondary adapter allows you to extend the Triple Store Service to work with additional storage backends. Here's a guide on how to create a new adapter.

Steps to Create a New Adapter

- Create a new adapter class that implements the ITripleStorePort interface
- Implement the required methods defined in the interface
- Add a factory method to TripleStoreFactory to create instances of your adapter
- Register your adapter with the application if necessary

Example: Creating an In-Memory Adapter

Here's a simplified example implementation of an in-memory adapter:

from abi.services.triple_store.TripleStorePorts import ITripleStorePort, OntologyEvent, Exceptions
from rdflib import Graph, URIRef
from typing import Tuple

class TripleStoreService__SecondaryAdaptor__InMemory(ITripleStorePort):
    """In-memory implementation of the Triple Store adapter."""

    def __init__(self):
        """Initialize in-memory adapter."""
        self.__graph = Graph()

    def insert(self, triples: Graph):
        """Insert triples into the in-memory store."""
        self.__graph += triples

    def remove(self, triples: Graph):
        """Remove triples from the in-memory store."""
        self.__graph -= triples

    def get(self) -> Graph:
        """Get all triples from the in-memory store."""
        return self.__graph

    def get_subject_graph(self, subject: str) -> Graph:
        """Get all triples for a specific subject."""
        subject_graph = Graph()

        # rdflib matches on term objects, so wrap the plain string in a URIRef
        for s, p, o in self.__graph.triples((URIRef(subject), None, None)):
            subject_graph.add((s, p, o))
            
        if len(subject_graph) == 0:
            raise Exceptions.SubjectNotFoundError(f"Subject {subject} not found")
            
        return subject_graph
        
    def query(self, query: str) -> Graph:
        """Execute a SPARQL query against the in-memory store."""
        return self.__graph.query(query)
        
    def query_view(self, view: str, query: str) -> Graph:
        """Query a specific view of the in-memory store."""
        # In this simple implementation, views are not supported
        raise Exceptions.ViewNotFoundError(f"View {view} not found in in-memory store")
        
    def handle_view_event(self, view: Tuple[str, str, str], event: OntologyEvent, triple: Tuple[str, str, str]):
        """Handle view events (not implemented for in-memory store)."""
        pass

Add a Factory Method

To make your adapter available through the factory pattern, add a new method to the TripleStoreFactory class:

# In lib/abi/services/triple_store/TripleStoreFactory.py

@staticmethod
def TripleStoreServiceInMemory() -> TripleStoreService:
    """Create a Triple Store Service using in-memory storage.
    
    Returns:
        TripleStoreService: Configured service instance using in-memory storage
    """
    from abi.services.triple_store.adaptors.secondary.TripleStoreService__SecondaryAdaptor__InMemory import TripleStoreService__SecondaryAdaptor__InMemory
    return TripleStoreService(TripleStoreService__SecondaryAdaptor__InMemory())

Usage Example

You can now use your in-memory adapter in your application:

from abi.services.triple_store.TripleStoreFactory import TripleStoreFactory

# Create an in-memory triple store service
triple_store = TripleStoreFactory.TripleStoreServiceInMemory()

# Use the service with the same interface
triple_store.insert(my_graph)
results = triple_store.query(my_query)

Key Considerations When Creating a New Adapter

- Performance: Consider optimizing for your specific storage backend, especially for large RDF datasets
- Concurrency: Ensure your adapter properly handles concurrent access if needed
- Namespaces: Preserve namespace bindings when storing and retrieving RDF data
- Testing: Create comprehensive tests for your adapter to ensure it behaves consistently
- Views: Implement view support if your storage backend can benefit from it
- Schema Management: Handle schema loading and versioning appropriately
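
For the concurrency point, one simple option is to wrap an adapter so that every call runs under a single lock. This is a sketch under assumptions, not a recommendation from the service itself: `LockedAdapter` and `SetAdapter` are hypothetical, and sets of tuples stand in for RDF graphs.

```python
import threading
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]


class SetAdapter:
    """Hypothetical in-memory adapter backed by a set of tuples."""

    def __init__(self) -> None:
        self._triples: Set[Triple] = set()

    def insert(self, triples: Iterable[Triple]) -> None:
        self._triples |= set(triples)

    def remove(self, triples: Iterable[Triple]) -> None:
        self._triples -= set(triples)

    def get(self) -> Set[Triple]:
        return set(self._triples)


class LockedAdapter:
    """Serialises calls to a wrapped adapter with one re-entrant lock."""

    def __init__(self, inner: SetAdapter) -> None:
        self._inner = inner
        self._lock = threading.RLock()

    def insert(self, triples: Iterable[Triple]) -> None:
        with self._lock:
            self._inner.insert(triples)

    def remove(self, triples: Iterable[Triple]) -> None:
        with self._lock:
            self._inner.remove(triples)

    def get(self) -> Set[Triple]:
        with self._lock:
            return self._inner.get()


adapter = LockedAdapter(SetAdapter())
threads = [
    threading.Thread(target=adapter.insert, args=([("s", "p", str(i))],))
    for i in range(8)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

A single lock is the simplest choice and may become a bottleneck under heavy write load; backends with their own transaction support may not need it.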

By following these steps, you can extend the Triple Store Service to work with any storage backend that can store RDF data.
