Benjamin N. Grosof
MIT Sloan School of Management
Description Logic Programs: Combining Logic Programs with Description LogicCambridge,
MA, USA
bgrosof@mit.edu
Ian Horrocks
University of Manchester
Manchester, United Kingdom
horrocks@cs.man.ac.uk
Raphael Volz
University of Karlsruhe
Karlsruhe, Germany
volz@fzi.de
Stefan Decker
USC ISI
Los Angeles, CA, USA
stefan@isi.edu
Copyright is held by the
author/owner(s).
WWW2003, May 20-24, 2003, Budapest, Hungary.
ACM 1-58113-680-3/03/0005.
We show how to perform DLP-fusion: the bidirectional translation of premises and inferences (including typical kinds of queries) from the DLP fragment of DL to LP, and vice versa from the DLP fragment of LP to DL. In particular, this translation enables one to ``build rules on top of ontologies'': it enables the rule KR to have access to DL ontological definitions for vocabulary primitives (e.g., predicates and individual constants) used by the rules. Conversely, the DLP-fusion technique likewise enables one to ``build ontologies on top of rules'': it enables ontological definitions to be supplemented by rules, or imported into DL from rules. It also enables available efficient LP inferencing algorithms/implementations to be exploited for reasoning over large-scale DL ontologies.
The challenge we address in this paper is how and why to combine rules with ontologies for the Semantic Web (SW). In this paper, we focus on meeting key requirements for such a combination of rules and ontologies by establishing the basis for a combined logical knowledge representation (KR) formalism. We start from the current draft standards for ontologies (DAML+OIL) [6] and for rules (RuleML) [3] in the Semantic Web context, and show how aspects of each language can be translated to the other. Both standards correspond with established KR formalisms: Description Logic (DL) in the case of DAML+OIL, and (declarative) logic programs (LP) in the case of RuleML.1 This correspondence allows us to exploit results w.r.t. the mapping of each KR to classical First Order Logic (FOL).
A mapping between ontology and rule languages is important for many aspects of the Semantic Web:
Language layering The Semantic Web can be viewed as largely about ``KR meets the Web''. Over the last two years or so, a broad consensus has evolved in the Semantic Web community that the vision of the Semantic Web includes, specifically, rules as well as ontologies. A key requirement for the Semantic Web's architecture overall, then, is to be able to layer rules on top of ontologies--in particular to create and reason with rule-bases that mention vocabulary specified by ontology-based knowledge bases--and to do so in a semantically coherent and powerful manner.
Querying The capabilities of ontology languages with respect to instances can be rather low, and even conjunctive queries--the least expressive query language usually considered in database research--are often not supported [4]. This area is a stronghold of rules, which offer extensive facilities for instance reasoning. Hence, it is interesting to consider combining DLs with the rule paradigm in order to state expressive instance queries w.r.t. terminological knowledge bases.
Data integration The majority of today's data resides in relational databases. As the Semantic Web grows in importance, people will probably start exporting their data according to some chosen ontology. This essentially leads to data that is replicated in order to enable ontology-based processing, e.g., by reading the exported files into a classifier such as FaCT [12] or RACER [10]. Logic programming systems such as XSB [16], however, can access databases directly through built-in predicates. Furthermore, restricted variants of logic programs, such as the ones established in this paper, can be directly implemented on top of SQL99-compliant relational databases. Hence, an LP-based implementation of an ontology language allows a closer interaction with live data.
Semantic Web Services A task-oriented motivation for combining rules with ontologies arises from the efforts to design and build Semantic Web Services (SWS). Semantic Web Services attempt to describe services in a knowledge-based manner in order to use them for a variety of purposes, including: discovery and search; selection, evaluation, negotiation, and contracting; composition and planning; execution; and monitoring. Both rules and ontologies are necessary for such service descriptions and play complementary roles: while ontologies are useful for representing hierarchical categorisation of services overall and of their inputs and outputs, rules are useful for representing contingent features such as business policies, or the relationship between preconditions and postconditions.
The established correspondence is used to define a new intermediate KR called Description Horn Logic (DHL), which is contained within the intersection, and the closely related Description Logic Programs (DLP), which can be viewed as DHL with a moderate weakening as to the kinds of conclusion can be drawn.
Figure 1 illustrates the relationship between the various KRs and their expressive classes. DL and Horn are strict (decidable) subsets of FOL. LP, on the other hand, intersects with FOL but neither includes nor is fully included by FOL. For example, FOL can express (positive) disjunctions, which are inexpressible in LP. On the other hand, several expressive features of LP, which are frequently used in practical rule-based applications, are inexpressible in FOL (and consequently also outside of def-Horn). One example is negation-as-failure, a basic kind of logical non-monotonicity. Another example is procedural attachments, e.g., the association of action-performing procedural invocations with the drawing of conclusions about particular predicates.
Description Logic Programs, our newly defined intermediate KR, is contained within the intersection of DL and LP. ``Full'' LP, including non-monotonicity and procedural attachments, can thus be viewed as including an ``ontology sub-language'', namely the DLP subset of DL.
Rather than working from the intersection as we do in this paper, one may instead directly address the expressive union of DL and LP by studying the expressive union of DL and LP within the overall framework of FOL. This is certainly an interesting thing to do. However, to our knowledge, this has not yet been well characterised theoretically, e.g., it is unclear how, if at all, such a union differs from full FOL.
Full FOL has some significant practical and expressive drawbacks as a KR in which to combine DL and rules. First, full FOL has severe computational complexity: it is undecidable in the general case, and intractable even under the Datalog restriction (see Section 3.2). Second, it is not understood even at a basic research level how to expressively extend full FOL to provide non-monotonicity and procedural attachments; yet these are crucial expressive features in many (perhaps most) practical usages of rules. Third, full FOL and its inferencing techniques have severe practicable limitations since it is unfamiliar to the great majority of mainstream software engineers, whereas rules (e.g., in the form of SQL-type queries, or Prolog) are familiar conceptually to many of them.
Via the DLP KR, we give a new technique to combine DL and LP. We show how to perform DLP-fusion: the bidirectional mapping of premises and inferences (including typical kinds of queries) from the DLP fragment of DL to LP, and from the DLP fragment of LP to DL. DLP-fusion allows us to fuse the two logical KRs so that information from each can be used in the other. The DLP-fusion technique promises several benefits. In particular, DLP-fusion enables one to ``build rules on top of ontologies'': it enables the rule KR to have access to DL ontological definitions for vocabulary primitives (e.g., predicates and individual constants) used by the rules. Conversely, the technique enables one to ``build ontologies on top of rules'': it enables ontological definitions to be supplemented by rules, or imported into DL from rules. It also enables efficient LP inferencing algorithms/implementations, e.g., rule or relational database engines, to be exploited for reasoning over large-scale DL ontologies.
In this section we will introduce Horn Logic, Description Logic (DL) and the DL based ontology language DAML+OIL. In particular, we will describe their syntax and formalise their meaning in terms of classical First Order Logic (FOL).
DAML+OIL is an ontology language designed for use on the (semantic) web. Although DAML+OIL is syntactically ``layered'' on top of RDFS, semantically it is layered on a subset of RDFS. This subset does not include RDFS's recursive meta model (i.e., the unrestricted use of the type relation), but instead treats RDFS as a very simple DL supporting only atomic class names. Like other DLs, this ``DAML+OIL subset'' of RDFS corresponds to a fragment of classical FOL, making it much easier to develop mappings to rule languages as well as to DLs. From now on, when we talk about RDFS, we will be referring to the DAML+OIL subset of RDFS.
DAML+OIL is equivalent to a very expressive DL--in fact it is
equivalent to the
DL
[13,
11]. In addition to ``abstract'' classes and individuals,
DAML+OIL also supports the use of ``concrete'' datatypes and data
values (the
in
).
In this paper, however, we will restrict our attention to the
abstract part of the language, which corresponds to the
DL.
Figure 2 shows
how DAML+OIL statements correspond to
axioms, where
(possibly subscripted) is a class,
(possibly subscripted) is a property,
is the inverse of
,
is the transitive closure of
,
(possibly
subscripted) is an individual and
is an
abbreviation for
for some class
(i.e., the most general class, called
``Thing'' in DAML+OIL).
It can be seen that all DAML+OIL statements can be reduced to
class/property inclusion axioms and ground facts (asserted
class-instance and instance-property-instance
relationships).2 In the case of
transitiveProperty, however, the axiom
is taken to be
equivalent to asserting that
is a
transitive property (like DAML+OIL,
does not
include the transitive closure operator).
As in any DL, DAML+OIL classes can be names (URIs) or
expressions, and a variety of constructors are
provided for building class expressions.
Figure 3
summarises the available constructors and their correspondence with
class
expressions.
The meaning of
is usually
given by a model theory [13].
However,
can also be
seen in terms of a correspondence to FOL, where classes correspond
to unary predicates, properties correspond to binary predicates and
subclass/property axioms correspond to
implication [7,
4].
To be more precise, individuals are equivalent to FOL constants,
classes and class expressions are equivalent to FOL formulae with
one free variable, and properties (and property expressions when
supported by the DL) are equivalent to FOL formulae with two free
variables. Class and property inclusion axioms are equivalent to
FOL sentences consisting of an implication between two formulae
with the free variables universally quantified at the outer level.
E.g., a DL axiom of the form
is equivalent to a FOL
sentence of the form
. DL
axioms of the form
and
correspond to
ground atoms
and
.
Finally, DL axioms asserting the transitivity of a property
, the functionality of a property
and that property
is the inverse of property
are
equivalent to FOL sentences of the form
,
and
respectively.
Figure 4
summarises the above equivalences and shows the FOL formulae
corresponding to the DL class expressions described in
Figure 3,
where are constants, and
is the free variable. These formulae can be composed in the obvious
way, e.g.,
.
As a notational convention we will, throughout the paper, use
and
for
constants and
,
,
and
for
variables.
Declarative logic programs (LP) is the KR whose semantics underlies in a large part the four families of rule systems that are currently most commercially important--SQL relational databases, OPS5-heritage production rules, Prolog, and Event-Condition-Action rules--as well as the proposals for rules in context of the Semantic Web.
As mentioned before, the commonly used expressiveness of full LP includes features, notably negation-as-failure/priorities and procedural attachments, that are not expressible in FOL, much less in DL. We thus concentrate on only an expressive portion of LP.
An ordinary (a.k.a. ``normal''3) LP is a set of
rules each having the form:
where ,
are
atoms (atomic formulae), and
. Note that no
restriction is placed on the arity of the predicates appearing in
these atoms. Logical variables, and logical functions (with any
arity), may appear unrestrictedly in these atoms.
is called the head (a.k.a.
consequent) of the rule;
is called the body (a.k.a. antecedent) of the
rule.
is to be read as ``if'', so that the overall rule should be read
as ``[head] if [body]'', i.e., ``if [body] then [head]''. If
, then the body is empty, i.e.,
, and notationally the ``
'' is often omitted. A fact is a rule whose body is empty
and whose head is a ground atom.
stands for negation-as-failure, a logically non-monotonic form of
negation whose semantics differs, in general, significantly from
the semantics of classical negation (
). Intuitively,
means
``
is not believed'' (i.e., is
unknown or false), whereas
means ``
is false''. Intuitively, each rule
can be viewed as universally quantified at the outer level. More
precisely, each rule can be viewed as standing for the set of all
its ground instantiations.
A definite LP is an ordinary LP in which
negation-as-failure does not appear, i.e., a set of rules each
having the form:
where ,
are
atoms, and
.
Definite LP is closely related syntactically and semantically to
the Horn fragment of FOL, a.k.a. Horn-clause logic. A clause in FOL
has the form:
where each is a (classical) literal. A
literal
has either the form (1)
or (2)
, where
is an atom. The literal is said to be positive in case
(1), or to be negative in case (2). A clause is said to be
Horn when at most one of its literals is
positive. A Horn clause is said to be definite when
exactly one of its literals is positive. A definite Horn
clause is also known as a Horn rule. A definite Horn
clause, a.k.a. Horn rule, can thus be written in the
form:
where ,
are
atoms, and
. We say that this Horn rule
corresponds to the definite LP rule that has the same
syntactic form, and vice versa. Likewise, we say that a Horn
ruleset
and a definite LP ruleset
correspond to each other when
their rules do (isomorphically). We then also say that
is the
LP-correspondent of
, and
conversely that
is the
Horn-correspondent of
.
As mentioned above, it is implicit in this notation that all
logical variables are universally quantified at the outer level,
i.e., over the scope of the whole clause. E.g., the rule
can be written equivalently as:
.
Note the similarity with the FOL equivalent of a DL inclusion
(subClassOf) axiom given in
Figure 4.
An LP rule or Horn clause is said to be equality-free when the equality predicate does not appear in it. Likewise, each is said to be Datalog when no logical functions (of arity greater than zero) appear in it.4
The semantics of an ordinary LP is defined to be a
conclusion set, where each conclusion is a ground atom,
i.e., fact, entailed by the LP. Formally, the semantics of a
definite LP is defined as follows. Let
stand for the Herbrand base of
. The conclusion set
is the smallest (w.r.t. set
inclusion) subset
of
such that for any rule
,
if
then
.
The relationship of LP semantics to FOL semantics is relatively
simple to describe for the case of definite equality-free Datalog
LP, which we call def-LP. The syntactically corresponding
fragment of FOL is definite equality-free Datalog Horn FOL, which
we call def-Horn. Let be a
def-LP. Let
stand for the corresponding
def-Horn ruleset. The conclusion set of
then
coincides with the minimal (w.r.t. set inclusion) Herbrand model of
.
Hence, the def-LP and the def-Horn ruleset entail exactly the
same set of facts. Every conclusion of the def-LP is also a
conclusion of the def-Horn ruleset. Relative to the def-Horn
ruleset, the def-LP is thus sound; moreover, it is complete for
fact-form conclusions, i.e., for queries whose answers amount to
conjunctions of facts. However, the def-LP is a mildly
weaker version of the def-Horn ruleset, in the following
sense. Every conclusion of the def-LP must have the form of a fact.
By contrast, the entailments, i.e., conclusions, of the def-Horn
ruleset are not restricted to be facts. E.g., suppose
consists of the two
rules
and
.
Then it entails
(a non-unit derived clause) whereas does
not. In practical applications, however, quite often only the
fact-form conclusions are desired, e.g., an application might be
interested above only in whether or not
is
entailed. The def-LP has the virtue of conceptual and computational
simplicity. Thinking in terms of expressive classes, we will view
def-LP as an expressive subset of def-Horn--we will call
it the expressive f-subset. def-LP is a mild weakening of
def-Horn along the dimension of entailment power, permitting only
fact-form conclusions--we will call this f-weakening.
In return for this f-weakening, def-LP has some quite attractive
computational characteristics (as well as being expressively
extensible in directions that FOL is not, as discussed earlier).
For the propositional case of def-LP, exhaustive inferencing is
where
-- i.e.,
worst-case linear time [8].
For the general case with logical variables, the entire conclusion
set of a def-LP
can be computed in
time
, when there is a constant
bound
on the number of logical variables per
rule (this restriction, which we will call VB, is
typically met in practise). Inferencing in def-LP is thus tractable
(worst-case polynomial time) given VB. In contrast, DLs are
generally not tractable (typically ExpTime or even NExpTime
complexity for key inference problems), and full FOL is not
decidable.
In this section we will discuss how DL languages (e.g., DAML+OIL) can be mapped to def-Horn, and vice versa.
We will first discuss the expressive restrictions of DL and def-Horn as these will constrain the subset of DL and def-Horn for which a complete mapping can be defined.
DLs are decidable subsets of FOL where the decidability is due
in large part to their having (a form of) the tree model property
[19].5 This property says
that a DL class has a model (an
interpretation
in which
is non-empty) iff
has a tree-shaped model, i.e., one in
which the interpretation of properties defines a tree shaped
directed graph.
This requirement severely restricts the way variables and
quantifiers can be used. In particular, quantifiers must be
relativised via atomic formulae (as in the guarded fragment
of FOL [9]), i.e., the
quantified variable must occur in a property predicate along with
the free variable (recall that DL classes correspond to formulae
with one free variable). For example, the DL class corresponds to the FOL formula
, where
the property predicate
acts as a guard. One obvious
consequence of this restriction is that it is impossible to
describe classes whose instances are related to another anonymous
individual via different property paths. For example, it is
impossible to assert that individuals who live and work at the same
location are ``HomeWorkers''. This is easy with a Horn rule,
e.g.:
Definite Horn FOL requires that all variables are universally
quantified (at the outer level of the rule), and restricts logical
connectives in certain ways. One obvious consequence of the
restriction on quantifiers is that it is impossible to assert the
existence of individuals whose identity might not be known. For
example, it is impossible to assert that all persons have a father
(known or unknown). This is easy with a DL axiom, e.g.:
RDFS provides a subset of the DL statements described in Section 3.1: subclass, subproperty, range, and domain statements (which in a DL setting are often called Tbox axioms); and asserted class-instance (type) and instance-property-instance relationships (which in a DL setting are often called Abox axioms).
As we saw in Section 3.1, a DL inclusion axiom corresponds to an FOL implication. This leads to a straightforward mapping from class and property inclusion axioms to def-Horn rules as follows:
As shown in
Figure 2, RDFS range
and domain statements correspond to DL axioms of the form
(range of
is
) and
(domain
of
is
). From
Figure 4, we can see
that these are equivalent to the FOL sentences
and
, which can be simplified to
and
respectively. These FOL sentences are already in def-Horn form,
which gives us the following mappings for range and domain:
Finally, asserted class-instance (type) and
instance-property-instance relationships, which correspond to DL
axioms of the form and
respectively
(Abox axioms), are equivalent to FOL sentences of the form
and
, where
and
are
constants. These are already in def-Horn form: they are simply
rules with empty bodies (which are normally omitted):
DAML+OIL extends RDF with additional statements about classes and properties (Tbox axioms). In particular, it adds explicit statements about class, property and individual equality and inequality, as well as statements asserting property inverses, transitivity, functionality (unique) and inverse functionality (unambiguous).
As discussed in Section 3.1, class and property equivalence axioms can be replaced with a symmetrical pair of inclusion axioms, so they can be mapped to a symmetrical pair of def-Horn rules as follows:
As we saw in
Section 3.1, the
semantics of inverse axioms of the form
are captured by FOL sentences of the form
, and
the semantics of transitivity axioms of the form
are captured by FOL
sentences of the form
.
This leads to a direct mapping into def-Horn as follows:
As we saw in
Section 3.1, DL
axioms asserting the functionality of properties correspond to FOL
sentences with equality. E.g., a DL axiom
(
is a functional property) corresponds
to the FOL sentence
.7 This kind of
axiom cannot be dealt with in our current framework (see
Section 4.1)
as it would require def-Horn rules with equality in the head,
i.e., rules of the form
.
In the previous section we showed how DL axioms correspond with def-Horn rules, and how these can be used to make statements about classes and properties. In DLs, the classes appearing in such axioms need not be atomic, but can be complex compound expressions built up from atomic classes and properties using a variety of constructors. A great deal of the power of DLs derives from this feature, and in particular from the set of constructors provided.8 In the following section we will show how these DL expressions correspond to expressions in the body of def-Horn rules.
In the following we will, as usual, use to denote classes,
to denote
properties and
to denote an integer.
A DL class can be formed by conjoining existing classes, e.g.,
. From
Figure 4 it can be
seen that this corresponds to a conjunction of unary predicates.
Conjunction can be directly expressed in the body of a def-Horn
rule. E.g., when a conjunction occurs on the l.h.s. of a subclass
axiom, it simply becomes conjunction in the body of the
corresponding rule
Similarly, when a conjunction occurs on the r.h.s. of a subclass
axiom, it becomes conjunction in the head of the corresponding
rule:
A DL class can be formed from a disjunction of existing classes,
e.g., . From
Figure 4 it can be
seen that this corresponds to a disjunction of unary predicates.
When a disjunction occurs on the l.h.s. of a subclass axiom it
simply becomes disjunction in the body of the corresponding
rule:
When a disjunction occurs on the r.h.s. of a subclass axiom it becomes a disjunction in the head of the corresponding rule, and this cannot be handled within the def-Horn framework.
In a DL, the universal quantifier can only be used in
restrictions--expressions of the form (see
Section 4.1).
This is equivalent to an FOL clause of the form
(see Figure 4).
must be a single primitive property,
but
may be a compound expression.
Therefore, when a universal restriction occurs on the r.h.s. of a
subclass axiom it becomes an implication in the head of the
corresponding rule:
When a universal restriction occurs on the l.h.s. of a subclass axiom it becomes an implication in the body of the corresponding rule. This cannot, in general, be mapped into def-Horn as it would require negation in a rule body.
In a DL, the existential quantifier (like the universal
quantifier) can only be used in restrictions of the form
. This is equivalent to an
FOL clause of the form
(see
Figure 4).
must be a single primitive property,
but
may be a compound expression.
When an existential restriction occurs on the l.h.s. of a
subclass axiom, it becomes a conjunction in the body of a standard
def-Horn rule:
When an existential restriction occurs on the r.h.s. of a subclass axiom, it becomes a conjunction in the head of the corresponding rule, with a variable that is existentially quantified. This cannot be handled within the def-Horn framework.
These constructors cannot, in general, be mapped into def-Horn. The case of negation is obvious as negation is not allowed in either the head or body of a def-Horn rule. As can be seen in Figure 4, cardinality restrictions correspond to assertions of variable equality and inequality in FOL, and this is again outside of the def-Horn framework.
In some cases, however, it would be possible to simplify the DL
expression using the usual rewriting tautologies of FOL in order to
eliminate the offending operator(s). For example, negation can
always be pushed inward by using a combination of De Morgan's laws
and equivalences such as
and
[1]. Further
simplifications are also possible, e.g., using the equivalences
, and
. For the
sake of simplicity, however, we will assume that DL expressions are
in a canonical form where all relevant simplifications have been
carried out.
As we saw in
Section 4.3,
some DL constructors (conjunction and universal restriction) can be
mapped to the heads of rules whenever they occur on the r.h.s. of
an inclusion axiom, while some DL constructors (conjunction,
disjunction and existential restriction) can be mapped to the
bodies of rules whenever they occur on the l.h.s. of an inclusion
axiom. This naturally leads to the definition of two DL languages,
classes from which can be mapped into the head or body of LP rules;
we will refer to these two languages as
and
respectively.
The syntax of the two languages is defined as follows. In both
languages an atomic name is a class, and if
and
are classes,
then
is also a class. In
, if
is a class and
is a property, then
is also a class, while in
, if
are classes and
is a
property, then
and
are also classes.
Using the mappings from
Section 4.3, we
can now follow the approach of
[4] and define a recursive
mapping function which takes a DL
axiom of the form
, where
is an
-class and
is an
-class, and maps it into
an LP rule of the form
. The mapping is defined as follows:
As we saw in
Section 4.3,
rules of the form
are rewritten as two rules
and
; rules of the form
are rewritten as
; and rules of the form
are rewritten as two rules
and
.
For example, would map the DL
axiom
We call the intersection of
and
, i.e., the language where an atomic name
is a class, and if
and
are classes, then
is also a class. We then extend
to deal with axioms of the
form
, where
and
are both
-classes:
As we saw in
Section 4.2.1,
range and domain axioms
and
are
mapped into def-Horn rules of the form
and
respectively. Moreover, class-instance and
instance-property-instance axioms
and
are mapped into
def-Horn facts (i.e., rules with empty bodies) of the form
and
respectively. We therefore extend
to
deal with these axioms in the case that
is an
-class:
Finally, we extend to deal with the
property axioms discussed in
Section 4.2:
Using the relationships of (full) DL to FOL discussed in Section 3.1, especially Figure 4, it is straightforward to show the following.
DHL can, therefore, be viewed alternatively and precisely as an
expressive fragment of def-Horn -- i.e., as the range of
.
Although the asymmetry of DHL (w.r.t. classes on different sides of axioms) makes it rather unusual by DL standards, it is easy to see that it includes (the DAML+OIL subset of) RDFS, as well as that part of DAML+OIL which corresponds to a simple frame language.
As far as RDFS is concerned, we saw in
Section 4.2.1
that RDFS statements are equivalent to DL axioms of the form
,
,
,
,
and
, where
are classes,
are properties and
are
individuals. Given that all RDFS classes are
-classes (they are atomic class names), a set
of DL axioms corresponding to RDFS statements would clearly satisfy
the above definition of a DHL ontology.
DHL also includes the subset of DAML+OIL corresponding to simple
frame language axioms, i.e., axioms defining a primitive hierarchy
of classes, where each class is defined by a frame. A frame
specifies the set of subsuming classes and a set of slot
constraints. This corresponds very neatly to a set of DL axioms of
the form
.
Moreover, DHL supports the extension of this language to include equivalence of conjunctions of atomic classes, and axioms corresponding to DAML+OIL transitive property, and inverse property statements.
A DLP is directly defined as the LP-correspondent of a def-Horn
ruleset that results from applying the mapping . Semantically, a DLP is thus the f-weakening of
that DHL ruleset (recall
subsection 3.2).
The DLP expressive class is thus the expressive f-subset of DHL. By
Theorem 1,
DLP can, therefore, be viewed alternatively and precisely as an
expressive subset of DL, not just of def-Horn.
In summary, expressively DLP is contained in DHL which in turn is contained in the expressive intersection of DL and Horn.
As our discussion of expressive relationships has made clear, there is a bi-directional semantic equivalence of (1) the DHL fragment of DL and (2) the DHL fragment of def-Horn. Likewise, there is a bi-directional semantic equivalence of the DLP fragment of DL and the DLP fragment of def-Horn. So far, however, we have mostly concentrated on only one direction of syntactic mapping: from DL syntax to def-Horn syntax (and to the corresponding def-LP), rather than from def-Horn (or def-LP) to DL. Next, we elucidate our reasons for this emphasis.
First, a prime immediate goal for the Semantic Web is to enable rules (in LP / Horn) on top of ontologies (in DL) -- more than vice versa to enable DL ontologies on top of LP or Horn rules. Second, it is desirable to exploit the relatively numerous, mature, efficient, scalable algorithms and implementations (i.e., engines) already available for LP inferencing so as to perform some fragment of DL inferencing -- more than vice versa to perform LP via the fewer available DL engines, which are designed to handle more expressive languages (than DLP) and may not be optimised for DLP ontologies. Third, as compared to def-Horn, DL has a relatively detailed set of quite specific syntactic expressive constructs; it was easier to go through these one by one to define a translation mapping than to do so in the reverse direction where one has to invent more structure/forms.
We do not have space here to give detailed algorithms and
computational complexity analyses of the syntactic translations. We
will limit ourselves to some relatively high-level observations;
these are straightforward to show. The
mapping, from DL syntax to def-Horn/def-LP syntax, corresponds
immediately to an algorithm whose computational complexity is
tractable. This mapping is invertible (e.g., in the usual manner of
parsers) from def-Horn/def-LP syntax to DL syntax, again,
tractably.
As discussed in the previous section, one of the prime goals of this work is to enable some fragment of DL inferencing to be performed by LP engines. In this section we will discuss the kinds of inference typically of interest in DL and LP, and how they can be represented in each other, i.e., in LP and DL respectively. Although the emphasis is on performing DL inferencing, via our mapping translation, using an LP reasoning engine, the reverse mapping can be used in order to perform LP inferencing using a DL reasoning engine. In particular, we will show how inferencing in (the DHL fragment of) DL can be reduced, via our translation, to inferencing in LP; and how vice versa, inferencing in (the DLP fragment of) LP can be reduced to inferencing in DL.
In a DL reasoning system, several different kinds of query are
typically supported w.r.t. a knowledge base . These include queries about classes:
In addition, there are similar queries about properties:
property-instance membership, property subsumption, property
hierarchy, and property satisfiability. We will call
the
language defined by the above kinds of DL queries.
In LP reasoning engines, there is one basic kind of query
supported w.r.t. a ruleset :
atom queries. These include:
We call
the
language defined by the above kinds of LP queries.
Next, we discuss how to reduce
querying in (the DHL fragment of) DL to
querying in (the DLP fragment of) LP using the mapping
. We will assume that
is a ruleset derived from a
DL knowledge base
via
, and that all
queries are w.r.t.
.
(ground or open) atom queries can be used to answer
(ground or open) class-instance membership queries when the class
is an
-class, i.e.,
is an instance of
iff
entails
. When
is an atomic
class name, the mapping leads directly to a
atom
query. When
is a conjunction, the result is a
conjunction of
atom
queries, i.e.,
is an instance of
iff
entails
and
entails
. When
is a
universal restriction, the mapping
gives
. This can be transformed into a
atom
query using a simple kind of skolemisation, i.e.,
is replaced with a constant
, where
is new in
,
and we have
is an instance of
iff
entails
.
The case of property-instance membership queries is trivial as
all properties are atomic:
is an instance of
iff
entails
.
Complete information about class-instance relationships, to
answer open or ``all-classes'' class-instance queries, can then be
obtained via class-instance queries about all possible combinations
of individuals and classes in .9 (Note that the set of named
individuals and classes is known, and its size is worst-case linear
in the size of the knowledge/rule base.)
For -classes,
class subsumption queries can be reduced to
using a similar technique to class-instance membership queries,
i.e.,
is a subclass of
iff
entails
, for
new in
. For
property subsumption queries,
is a
subproperty of
iff
entails
, for
new in
.
Complete information about the class hierarchy can be obtained
by computing the partial ordering of classes in based on the subsumption relationship.
In the DHL (and DLP) fragment, determining class/property satisfiability is a non-issue as, with the expressive power at our disposal in def-Horn, it is impossible to make a class or a property unsatisfiable.
Now let us consider the reverse direction from
to
. In
the DLP fragment of LP, every predicate is either unary or binary.
Every atom query can thus be viewed as about either a named class
or a property. Also, generally in LP, any open atom query is
formally reducible to a set of ground atom queries--one for each of
its instantiations. Thus
is
reducible to class-instance and property-instance membership
queries in DL.
To recap, we have shown the following.
To compute the answer for user queries the translated views are used. This realises a form of bottom up processing, since the queries involved in view definitions are performed on the extensional data and intermediate results are propagated up to a final query, which is the user query. This results in many irrelevant facts being computed in the intermediate steps; more efficient procedures based on sideways information passing, however, have been developed in the deductive database literature.
The above mentioned strategy is, however, not possible for recursively defined rules. Here additional processing is required.
Usually, binary recursive rules such as transitivity can be
rewritten into a linear form. E.g. the mapping for transitive
properties (see 1)
can be rewritten into
The usual strategy to compute the remaining forms of recursive rules in relational databases is in-memory processing using some iterative strategy, e.g. the magic template procedure [15].
We can, however, translate this case into the database by exploiting the observation that this form of recursion decomposes into unions, since no join processing of intermediate results (such as involved in computing the transitive closure of transitive properties) is necessary. This is immediately clear for classes, since they are monadic predicates. A closer look at all axioms where binary predicates (properties) are in the head reveals the same. Hence, these cyclic references can be implemented via an algorithm that detects equivalence classes (each constituted by a cycle) in graphs. All incoming edges to an equivalence class must be duplicated to all members of the equivalence class; this may done by using a new intermediate predicate to collect the incoming edges and deriving the members of the equivalence class from this intermediate predicate. Afterwards, all rules that constitute the cyclic references within the equivalence class may safely be removed. The reader may note that this can also be performed (with appropriate adaptions) on the cyclic references imposed by inverse properties.
We have used the above techniques to realise a prototypical implementation of Description Horn Logic based on the Datalog engine written by Boris Motik in the KAON project. The implementation, called Bubo (after the Latin name of the biological genus of eagle owls), is freely available at http://kaon.semanticweb.org/owl/.
Initial tests of Bubo have been encouraging, but much more work needs to be done in order to determine if the benefits promised by the DLP-fusion approach are delivered by this implementation.
In this paper we have shown how to interoperate, semantically and inferentially, between the leading Semantic Web approaches to rules (RuleML Logic Programs) and ontologies (OWL/DAML+OIL Description Logic). We have begun by studying two new KRs, Description Logic Programs (DLP), which is defined by the expressive intersection of the two approaches, and the closely related Description Horn Logic (DHL).
We have shown that DLP (or DHL) can capture a significant fragment of DAML+OIL, including the whole of the DAML+OIL fragment of RDFS, simple frame axioms and more expressive property axioms. The RDFS fragment of DL permits: stating that a class D is a Subclass of a class E; stating that the Domain of a property P is a class C; stating that the Range of a property P is a class C; stating that a property P is a Subproperty of a property Q; stating that an individual b is an Instance of a class C; and stating that a pair of individuals (a,b) is an Instance of a property P. Additional DLP expressively permits (within DL): using the Intersection connective (conjunction) within class descriptions (i.e., in C, D, or E above); using the Union connective (disjunction) within subclass descriptions (i.e., in D above); using (a restricted form of) Universal quantification within superclass descriptions (i.e., in E above); using (a restricted form of) Existential quantification within subclass descriptions (i.e., in D above); stating that a property P is Transitive; stating that a property P is Symmetric; and stating that a property P is the Inverse of a property Q.
Many of the ontologies in the DAML ontology library are inside this fragment of DAML+OIL. An immediate result of this work is that LP engines could be used for reasoning with these ontologies and for reasoning with (possibly very large numbers of) facts, such as web page annotations, that use vocabulary from these ontologies.
This work represents only a first step in realising a more complete interoperability between rules and ontologies, and the layering of rules on top of ontology languages in the Semantic Web ``stack''. We were able to illustrate its utility both theoretically and within our prototypical implementation. We believe, however, that our study of the expressive intersection will provide a firm foundation for future investigations of more expressive languages up to and including the expressive union of rules and ontologies.
Future work will include extending the mapping to additional DL primitives, in particular those which require the ability to state and derive the equality of individuals, such as cardinality restrictions (including functional properties) and nominals (extensionally defined classes).
Thanks to Tim Berners-Lee, Harold Boley, Dan Connolly, Michael Dean, Richard Fikes, Patrick Hayes, Jim Hendler, Deborah McGuinness, Boris Motik, Daniel Oberle, Peter Patel-Schneider, Jos De Roo, Steffen Staab and members of the DAML+OIL Joint Committee for helpful comments and discussions.
This document was generated using the LaTeX2HTML translator Version 2K.1beta (1.50)
Copyright © 1993, 1994, 1995, 1996,
Nikos
Drakos, Computer Based Learning Unit, University of
Leeds.
Copyright © 1997, 1998, 1999,
Ross Moore,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html -numbered_footnotes -split 0
p117-grosof
The translation was initiated by on 2003-03-28