Glossary of Terms
Note: this definitions in this glossary are sequenced so that they build on each other, one to the next, rather than alphabetically.
triple
a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred". (A predicate in traditional grammar...is seen as a property that a subject has or is characterized by.) source and [source](https://en.wikipedia.org/wiki/Predicate_(grammar)#Predicates_in_traditional_grammar)
triplestore
a purpose-built database for the storage and retrieval of triples... source
quad
where triples have the form
{subject, predicate, object}
, quads would have a form along the lines of{subject, predicate, object, context}
sourceYou can add context or extra values to triples that identifies them and makes it easy to define subgraphs, or named properties. source
From Cayley godoc:
link
Another name for a triple, since it "links" any two nodes.
Given the triple
{A, knows, C}
you would say in graph terminology thatA
andC
are "vertices" whileknows
is an "edge". You would also say thatA
,knows
, andC
are all "nodes", and they are "linked" to one another by the triple.
IRI
IRI is an RDF Internationalized Resource Identifier. source
An IRI (Internationalized Resource Identifier) within an RDF graph is a Unicode string that conforms to the syntax defined in RFC 3987. source
IRIs are a generalization of URIs that permits a wider range of Unicode characters. Every absolute URI and URL is an IRI, but not every IRI is an URI. source
RDF
Resource Description Framework, basically a set of standards defined around quads
An RDF triple consists of three components:
the subject, which is an IRI or a blank node
the predicate, which is an IRI
the object, which is an IRI, a literal or a blank node source
RDF store, quad store, named graph, semantic graph database
...persisting RDF — storing it — became a thing, and these stores were called triple stores. Next they were called quad stores and included information about context and named graphs, then RDF stores, and most recently they call themselves “semantic graph database.” source
Adding a name to the triple makes a "quad store" or named graph. source
Cayley
Cayley is a quad store that supports multiple storage backends. It supports multiple query languages for traversing and filtering the named graphs formed by its quads, and it has associated tooling such as a CLI, HTTP server, and so on.
Gizmo
A Gremlin/TinkerPop-inspired query language for Cayley. Looks a lot like JavaScript, the syntax is documented here.
g.V()
For Gremlin/TinkerPop, g.V() returns a list of all the vertices in the graph
.v()
is for "Vertex" in Gizmo, and it is used likepathObject = graph.Vertex([nodeId],[nodeId]...)
(see [[path|#path]])
inbound/outbound predicate
Inbound/outbound refers to the direction of a relation via a predicate. In the case of the triple "A follows B", "follows" is an outbound predicate for
A
and an inbound predicate forB
.In/out predicates can be expressed in a query language, for example using the format
resultSet = subject.out(predicate)
to discover matchingObject
s. In the case of the triple "A follows B",A.out(“follows”)
would return a set of nodes which containsB
. An excellent example of this sort of query format is given in the Gremlin/TinkerPop homepage example:
direction
Direction specifies a node's position within a quad. source
Direction is passed to the
Get
method of a quad to access one of its four parts, see quad.Get(d Direction) ValueThe term "Direction" comes about from the concept of traversing a graph. Take for example the triple
{A, follows, B}
and supposing you "select" the predicatefollows
. Now you want to traverse the graph, so you move in theObject
direction, and you now haveB
selected. Whereas the high-level path abstraction for queries uses inbound/outbound predicates to represent movement on the graph, the bottom-level iterator mechanic uses Direction.
path
Path represents either a morphism (a pre-defined path stored for later use), or a concrete path, consisting of a morphism and an underlying QuadStore. source
morphism
Morphism is basically a path that is not attached to any particular quadstore or a particular starting point in the graph. Morphisms are meant to be used as a query part that can be applied to other queries to follow a path specified in the Morphism.
A good example will be a
FollowRecursive
function that will apply a single morphism multiple times to get to all nodes that can be traversed recursively. source
iterator
So a graph query is roughly represented as a tree of iterators – things
that implement graph.Iterator. An iterator is (loosely) a stand-in for a
set of things that match a particular portion of the graph. source
subiterator
So a graph query is roughly represented as a tree of iterators...Evaluation is merely calling Next() repeatedly on the iterator at the top of the tree. Subiterators, then, are the branches and leaves of the tree. source
Example of converting the Cayley-Gremlin-Go-API query
g.V(“B”).In(“follows”).All()
into an iterator tree:HasA (subject) – gets the things in the subject field for:
And – the intersection of:
LinksTo (predicate) links that have the predicate of…:
Fixed iterator containing “follows” – … just the node “follows”.
LinksTo (object) links that have the object field of:
Fixed iterator containing “B” – … just the node “B”
LinkTo iterator
A LinksTo takes a subiterator of nodes, and contains an iteration of links which "link to" those nodes in a given direction. ... Can be seen as the dual of the HasA iterator. source
Next()ing a LinksTo is straightforward -- iterate through all links to things in the subiterator, and then advance the subiterator, and do it again.
To restate in pseudo-code;
results
is what would be returned in successiveNext()
calls:
Contains()ing a LinksTo means, given a link, take the direction we care about and check if it's in our subiterator.
To restate in pseudo-code:
HasA iterator
The HasA takes a subiterator of links, and acts as an iterator of nodes in the given direction. The name comes from the idea that a "link HasA subject" or a "link HasA predicate". source
Contains()
shape
Shape represent a query tree shape. source
This is the most interesting part of the query system - it describes how exactly the query looks like. ... This package also describes different query optimizations that are not specific to a backend. ... You can write a query using either Paths, Shapes or raw Iterators... source
A Shape seems to be an abstract representation of a query, a level above Iterators and a level below Paths. You can perform various operations on it (traverse inbound/outbound predicates, find unions and intersections, etc.) and most importantly build a tree of Iterators from it, which will do the mechanical act of processing quads to find results.
token
In the context of a quad store, a graph.Value. However the backend wishes to implement it, a Value is merely a token to a quad or a node that the backing store itself understands, and the base iterators pass around.
For example, in a very traditional, graphd-style graph, these are int64s (guids of the primitives). In a very direct sort of graph, these could be pointers to structs, or merely quads, or whatever works best for the backing store.
reification
“With reification, we create a metagraph on top of our graph that represents the statement that we have here. We create a new node that represents a statement and points at the subject...” source
Reifying a relationship means viewing it as an entity. The purpose of reifying a relationship is to make it explicit, when additional information needs to be added to it.
Viewing a relationship as an entity, one can say that the entity reifies the relationship. This is called reification of a relationship. Like any other entity, it must be an instance of an entity type. source
Last updated