Lifted Graphical Models: A Survey
This article presents a survey of work on lifted graphical models. We review a general form for a lifted graphical model, a par-factor graph, and show how a number of existing statistical relational representations map to this formalism. We discuss i…
Authors: Lilyana Mihalkova, Lise Getoor
Lifted Graphical Models: A Surv e y Lilyana Mihalko v a & Lise Getoor Uni versity of Maryland Colle ge P ark December 25, 2013 1 Motiv ation and Scope Multi-relational data, in which entities of different types engage in a rich set of relations, is ubiquitous in many domains of current interest. For example, in social network analysis the entities are indi viduals who relate to one another via friendships, family ties, or collaborations; in molecular biology , one is frequently interested in modeling ho w a set of chemical substances, the entities, interact with, inhibit, or catalyze one another; in web and social media applications, a set of users interact with each other and with a set of web pages or other online resources, which may themselves be related via hyperlinks; in natural language processing tasks, it is often necessary to reason about the relationships between documents, or words within a sentence or a document. By incorporating such relational information into learning and reasoning, rather than relying solely on entity-specific attributes, it is usually possible to achiev e higher predictiv e accuracy for an unobserv ed entity attrib ute, e.g., [SNB + 08]. For e xample, by exploiting h yperlinks between web pages, one can improve categorization accuracy [CSN98]. Dev eloping algorithms and representations that can effecti vely deal with relational information is important also because in many cases it is necessary to predict the existence of a relation between the entities. For e xample, in an online social network application, one may be interested in predicting friendship relations between people in order to suggest new friends to the users; in molecular biology domains, researchers may be interested in predicting how ne wly-de veloped substances interact. Gi ven the di versity of applications that in volve learning from and reasoning about multi-relational in- formation, it is not surprising that the field of statistical relational learning (SRL)[DGM04, FGM06, DK09, GT07, KRK + 10] has recently experienced significant growth. This survey provides a detailed overvie w of de velopments in the field. W e limit our discussion to representations that can be seen as defining a graphical model using a relational language, or alternati vely as “lifted” analogs of graphical models. Although in this way we omit discussions of several important representations, such as stochastic logic programs [Mug96] and ProbLog [DKT07], which are based on imposing a probabilistic interpretation on logical reasoning, by limiting the scope of the survey we are able to provide a more focused and unified discussion of the rep- resentations that we do cov er . F or these and other models, we refer the reader to [DK03]. Because of the great variety of e xisting SRL applications, we cannot possibly do justice to all of them; therefore, the focus is on representations and techniques, and applications are mentioned in passing where they help illustrate our point. The survey is structured as follows. In Section 2, we define SRL and introduce preliminaries. In Sec- tion 3, we describe sev eral recently introduced SRL representations that are based on lifting a graphical model. Our goal in this section is to establish a unified view on the av ailable representations by defining a generic, or template, SRL model and discussing ho w particular models implement its various aspects. 1 Concept Representation Example Random v ariable (R V) Upper-case letters X , Y Set of R Vs Bold upper-case letters X , Y V alue assigned to R V Lo wer-case letters x , y Set of v alues assigned to R Vs Bold lo wer-case letters x , y Logical v ariable T ypewriter upper -case letters X , Y Entity/constant T ypewriter lo wer-case letters x , y Set of items other than R Vs Calligraphic upper-case letters X , Y T able 1: Notation used throughout this survey In this way , we establish not just criteria for comparisons of the models, but also a common frame work in which to discuss inference (Section 4), parameter learning (Section 5.1), and structure learning (Section 5.2) algorithms. 2 Pr eliminaries 2.1 What is SRL? Statistical relational learning (SRL) studies kno wledge representations and their accompanying learning and inference techniques that allow for efficient modeling and reasoning in noisy and uncertain multi-relational domains. In classical machine learning settings, the data consists of a single table of feature vectors, one for each entity in the data. A crucial assumption made is that the entities in the data represent independent and identically distributed (IID) samples from the general population. In contrast, multi-relational domains contain entities of potentially different types that engage in a v ariety of relations. Thus, a multi-relational domain can be seen as consisting of sev eral tables: a set of attribute tables, one for each entity type, that con- tain feature-vector descriptions for each corresponding entity , and a set of relationship tables that establish relationships among tw o or more of the entities in the domain. As a consequence of the relationships among the entities, they are no longer independent, and the IID assumption is violated. A further characteristic of multi-relational domains is that they are typically noisy or uncertain. For example, there frequently is uncertainty regarding the presence or absence of a relation between a particular pair of entities. T o summarize, an ef fecti ve SRL representation needs to support the following two essential aspects: a) it needs to provide a language for e xpressing dependencies between dif ferent types of entities and their di verse relations; and b) it needs to allo w for probabilistic reasoning in a potentially noisy en vironment. 2.2 Background and Notation Here we establish the notation and terminology to be used in the rest of this survey . SRL draws on both probability theory and on logic programming, which sometimes use the same term to describe dif ferent concepts. For example, the word “variable” could mean random variable (R V), or a logical variable. T o av oid confusion, we distinguish between dif ferent meanings using different fonts, as summarized in T able 1. 2 2.2.1 T erminology of Relational Languages This section provides an ov ervie w of several commonly-used relational languages with a focus on the aspects that are most important to the rest of our discussion. First-order Logic First-order logic (FOL) provides a flexible and expressiv e language for describing typed objects and relations. FOL distinguishes among four kinds of symbols: constants, variables, pred- icates, and functions [RN03]. Constants describe the objects in the domain, and we will alternati vely call them entities. For e xample, in the notation of T able 1, x and y are entities. Entities are typically typed. Log- ical variables act as placeholders and allo w for quantification, e.g., X and Y . Predicates represent attributes or relationships and ev aluate to true or false , e.g., Publication ( paper , person ) , which establishes a relation between a paper and an author , and Category ( paper , category ) , which provides the category of a paper , are predicates, and the strings in the parentheses specify the types of entities on which these predicates operate. Functions ev aluate to an entity in the domain, e.g., MotherOf . W e will adopt the conv ention that the names of predicates and functions will start with a capital letter . The number of arguments of a predicate or a function is called its arity . A term is a constant, a v ariable, or a function on terms. A predicate applied to terms is called an atom, e.g., Publication ( X , Y ) . A positi ve literal is an atom and a negati ve literal is a negated atom. A formula consists of a set of positiv e or negati ve literals connected by conjunction ( ∧ ) or disjunction ( ∨ ) operators, e.g., ¬ Friends ( X , Y ) ∨ Friends ( Y , X ) . The variables in formulas are quantified, either by an existential quantifier ( ∃ ) or by a univ ersal quantifier ( ∀ ) . Here we follo w the typical assumption that when no quantifier is specified for a variable, ∀ is understood by default. A formula expressed as a disjunction with at most one positive literal is called a Horn clause ; if a Horn clause contains exactly one positi ve literal, then it is a definite clause. The positi ve literal in a definite clause is called the head, whereas the remaining literals constitute the body . Definite clauses can alternativ ely be re-written as an implication as Body ⇒ Head . T erms, literals, and formulas are called grounded if the y contain no v ariables. Otherwise, they are ungr ounded. Grounding, also called instantiation, is carried out by replacing variables with con- stants in all possible type-consistent ways. The set of all groundings of f will be denoted with G C f , where C is a set of constraints that specify which groundings are allowed. In general, if predicate and function arguments are typed, type constraints are present by def ault. T o connect this FOL terminology to probability theory , we note that when the value of a ground atom is gov erned by a random process, it becomes a random variable with values in the set { true , false } . For example, let x and y represent two entities, i.e., specific indi viduals; then the grounded atom Friends ( x , y ) represents the assertion that x and y are friends. If we are giv en a probability distribution that gov erns the v alue of Friends ( x , y ) , we can reason about it in the same way in which we reason about ordinary random v ariables. In addition, it is helpful to treat unground atoms as parameterized R Vs [Poo03], in the sense that once their variables, or parameters, are replaced by constants, the y become R Vs. For example, if X and Y are logical variables, Friends ( X , Y ) is a parameterized R V because once we ground it by replacing the parameters X and Y with actual entities, we obtain R Vs. W e will refer to parameterized R Vs as par-R Vs for short, e.g. Friends ( X , Y ) is a par -R V . Object-Oriented Representations As an alternati ve to FOL, the attributes and relations of entities can be described using an object-oriented representation (OOR). Here again, x and y represent specific entities in the domain, whereas X and Y are v ariables, or entity placeholders. As in FOL, entities are typed. Attributes and relations are e xpressed using a notation analogous to that commonly used in object-oriented languages. For example, x . Category refers to the category of paper x , whereas x . Author refers to its authors. In- verse relations are also allowed, e.g., y . Author − 1 refers to the papers of which y is an author . Using this 3 notation, chains of relations can be conv eniently specified, e.g. x . Author . Author − 1 . Category giv es the set of categories of all papers written by the authors of x . Note that because the Author relation is typi- cally one-to-many , in general, x . Author refers to a set of entities, i.e., the set of all authors of the paper . Because of this, object-oriented languages allow for aggregation functions, such as mean , mode , max , or sum . For example, we can write mode ( x . Author . Author − 1 . Category ) . As in FOL, OOR statements can be grounded, or instantiated, by replacing variables with entities from the domain. Analogous to FOL, we will vie w ungrounded relation/attribute chains, as well as aggre gations thereof, as par-R Vs. Structured Query Language (SQL) It is natural to manipulate relational data, which is often stored in a relational database, using SQL. Thus, not surprisingly , SQL has been used as a representation in some of the SRL models discussed in the surve y . For self-sufficienc y , we provide a brief overvie w . The attributes and relations of objects can be vie wed as defining a relational schema, in which an attribute table corresponds to each entity type and a relationship table corresponds to each relation type in which entities can engage. It is therefore natural to manipulate such data using SQL. Here we revie w the select statement, which has been used to represent relational dependencies in SRL models. For our purposes, the most useful form of the select statement is expressed as follo ws: SELECT FROM WHERE
Comments & Academic Discussion
Loading comments...
Leave a Comment