We use probabilistic graphical models to model some probability distribution 𝐏. This section introduces ways to compare the probability independencies induced by the graphical model and the modeled probability distribution. Or even between 2 different probabilistic graphical models.

Terminology for Comparing Independencies

model 𝐴 is a(n) (I-Map | D-Map | P-Map | Minimal I-Map | Maximal D-Map) of model 𝐵

we use the statement above to explain the terms below:

Term	Description
Independence Map (I-Map)	set of independencies induced by 𝐴 is a SUBSET of the independencies induced by 𝐵
Dependence Map (D-Map)	set of independencies induced by 𝐴 is a SUPERSET of the independencies induced by 𝐵
Perfect Map (P-Map)	set of independencies induced by 𝐴 is EQUAL to the set of independencies induced by 𝐵 a P-Map is both an I-Map and D-Map Independence Equivalence (I-Equivalence) - two models are I-Equivalent if they express the EXACT same set of independencies
Minimal I-Map	an I-Map in which the removal of ANY edge, no longer makes it not an I-Map (i.e. removal of edge in 𝐴 would introduce an independence not in 𝐵) most of the time a Minimal I-Map is a P-Map For Example suppose we have a graph below that: is a Minimal I-Map of 𝐏 is not a P-Map of 𝐏 removing the red edge would add several independencies such as: independence between B and D independence between A and D independence between E and C etc there are cases where a Minimal I-Map is not a P-Map: where it could not capture anymore of 𝐏‘s probability INDEPENDENCIES (due to reasons such as: the model was built poorly, limitations of the model type used, etc)
Maximal D-Map	a D-Map model in which an addition of ANY edge, no longer makes it not a D-Map (i.e. addition of edge in 𝐴 would remove an independence that exists in 𝐵, thus 𝐴 no longer contains a SUPERSET of 𝐵‘s independencies) most of the time a Maximal D-Map is a P-Map For Example suppose we have a graph below that: is a Maximal D-Map of 𝐏 is not a P-Map of 𝐏 adding an edge between B and D would remove several independencies such as: B and D A and D E and C etc there can be a case where a Maximal D-Map is not a P-Map: such when an addition of ANY edge removes 2 independencies, so jumping for a proper-superset to a proper-subset (i.e. D-Map to I-Map)

In most models:

the REMOVAL of edges typically INCREASES the number of independencies
the ADDITION of edges typically REDUCES the number of independencies

Formal Definitions

Click here to expand...

Term

Description

Independence Map (I-Map)

if a graphical model 𝐆 is an I-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐆 also exists in 𝐏 (denoted as 𝐈(𝐆) ⊆ 𝐈(𝐏))

a fully connected DAG 𝐆 is an I-Map for any distribution 𝐏, since 𝐈(𝐆) = ∅ ⊆ 𝐈(𝐏) for all 𝐏

Dependence Map (D-Map)

if a graphical model 𝐆 is a D-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐏 also exists in 𝐆 (denoted as 𝐈(𝐏) ⊆ 𝐈(𝐆))

an unconnected DAG 𝐆 is a D-Map for any distribution 𝐏, since 𝐈(𝐏) ⊆ 𝕌 = 𝐈(𝐆) for all 𝐏

Minimal I-Map

graphical model 𝐆 is a Minimal I-Map for 𝐏 if the removal of any single edge makes it not an I-Map

a distribution 𝐏 may have several Minimal I-Maps, each corresponding to a specific node-ordering

Maximal D-Map

graphical model 𝐆 is a Maximal D-Map for 𝐏 if an addition of any single edge makes it not a D-Map

Perfect Map (P-Map)

graphical model 𝐆 is a P-Map for distribution 𝐏 if 𝐈(𝐆) = 𝐈(𝐏)

in other words:

if 𝐆 is a P-Map, then 𝐆 is also an I-Map and D-Map

if 𝐆 is both an I-Map and a D-Map, then 𝐆 is also a P-Map

Independence Equivalence

(I-Equivalence)

two graphical models 𝐆𝟏 and 𝐆𝟐 are I-Equivalent if 𝐈(𝐆𝟏) = 𝐈(𝐆𝟐)

Term	Description
Independence Map (I-Map)	if a graphical model 𝐆 is an I-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐆 also exists in 𝐏 (denoted as 𝐈(𝐆) ⊆ 𝐈(𝐏)) a fully connected DAG 𝐆 is an I-Map for any distribution 𝐏, since 𝐈(𝐆) = ∅ ⊆ 𝐈(𝐏) for all 𝐏
Dependence Map (D-Map)	if a graphical model 𝐆 is a D-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐏 also exists in 𝐆 (denoted as 𝐈(𝐏) ⊆ 𝐈(𝐆)) an unconnected DAG 𝐆 is a D-Map for any distribution 𝐏, since 𝐈(𝐏) ⊆ 𝕌 = 𝐈(𝐆) for all 𝐏
Minimal I-Map	graphical model 𝐆 is a Minimal I-Map for 𝐏 if the removal of any single edge makes it not an I-Map a distribution 𝐏 may have several Minimal I-Maps, each corresponding to a specific node-ordering
Maximal D-Map	graphical model 𝐆 is a Maximal D-Map for 𝐏 if an addition of any single edge makes it not a D-Map
Perfect Map (P-Map)	graphical model 𝐆 is a P-Map for distribution 𝐏 if 𝐈(𝐆) = 𝐈(𝐏) in other words: if 𝐆 is a P-Map, then 𝐆 is also an I-Map and D-Map if 𝐆 is both an I-Map and a D-Map, then 𝐆 is also a P-Map
Independence Equivalence (I-Equivalence)	two graphical models 𝐆𝟏 and 𝐆𝟐 are I-Equivalent if 𝐈(𝐆𝟏) = 𝐈(𝐆𝟐)

Examples

Click here to expand...

suppose, we are given 2 distributions: 𝐏𝟏 and 𝐏𝟐

distribution 𝐏𝟏 has independence, 𝐈(𝐏𝟏) = {(𝐼⊥𝐷)}

distribution 𝐏𝟐 has no independence, 𝐈(𝐏𝟐) = {∅}

suppose, we are given 2 graphical models (DAGs in this case): 𝐆_𝑎 and 𝐆_𝑏

from the DAGs, we can observe:

𝐆_𝑎- 𝐈(𝐆_𝑎) = {(𝐼⊥𝐷)}

𝐆_𝑏 - 𝐈(𝐆_𝑏) = {∅}

therefore, we can say the following:

𝐆_𝑎 is an I-Map of 𝐏𝟏 because 𝐼 and 𝐷 are independent in both 𝐆_𝑎 and 𝐏𝟏

𝐆_𝑎 is not the I-Map of 𝐏𝟐 because 𝐏𝟐 fails to satisfy the independence between I and D

𝐆_𝑎 is a P-Map of 𝐏𝟏 because 𝐈(𝐆_𝑎) = 𝐈(𝐏𝟏)

𝐆_𝑎 is not a P-Map of 𝐏𝟐 because 𝐈(𝐆_𝑎) ≠ 𝐈(𝐏𝟐)

𝐆_𝑏 is an I-Map of both 𝐏𝟏 and 𝐏𝟐 because the independence in 𝐆_𝑏 is ∅. Since ∅ is a subset of every set, both 𝐏𝟏 and 𝐏𝟐 satisfy the independence implied in 𝐆_𝑏

𝐆_𝑏 is not a P-Map of 𝐏𝟏 because 𝐈(𝐆_𝑏) ≠ 𝐈(𝐏𝟏)

𝐆_𝑏 is a P-Map of 𝐏𝟐 because 𝐈(𝐆_𝑏) = 𝐈(𝐏𝟐)

𝐆_𝑎 and 𝐆_𝑏 are not I-Equivalent because their set independencies are not equivalent (i.e. 𝐆_𝑎‘s independencies {(𝐼⊥𝐷)} does not equal 𝐆_𝑏‘s independencies {∅})

Relation to Gibbs Distribution

Click here to expand...

Graphical Models

a probability distribution 𝐏(𝐗) is a Gibbs Distribution over a graphical model 𝐆 if it can be written as

𝐏(𝐗) = (1/𝘡) * ∏_𝑐∊_𝐶[ 𝜙_𝑐(𝐗_𝑐) ]

where the variables in each potential function 𝜙_𝑐form a clique in 𝐆

Theorem 1: Factorization Implies Conditional Independencies

If 𝐏(𝐗) is a Gibbs Distribution for 𝐆, then 𝐆 is an I-Map for probability distribution 𝐏(𝐗), i.e. 𝐈(𝐆) ⊆ 𝐈(𝐏)

proof:

suppose:

A, B, and C are disjoint sets of variables

A is connected to B

C is connected to B

B separates A from C

then we can write

𝐏(A, B, C) = (1/𝘡) * 𝜙_𝑐1(A,B) * 𝜙_𝑐2(B,C)

Theorem 2: Conditional Independencies Implies Factorization

If 𝐏(𝐗) is a positive distribution and 𝐆 is an I-Map for 𝐏(𝐗), then 𝐏(𝐗) is a Gibbs Distribution that factorizes over graphical model 𝐆

／var／log marcus chiu

Explorer

I-Map - D-Map - Minimal I-Map - Maximal D-Map - P-Map - I-Equivalence

Terminology for Comparing Independencies

For Example

For Example

Formal Definitions

Examples

Relation to Gibbs Distribution

Graphical Models

Theorem 1: Factorization Implies Conditional Independencies

Theorem 2: Conditional Independencies Implies Factorization

／var／logmarcus chiu

Explorer

I-Map - D-Map - Minimal I-Map - Maximal D-Map - P-Map - I-Equivalence

Terminology for Comparing Independencies

For Example

For Example

Formal Definitions

Examples

Relation to Gibbs Distribution

Graphical Models

Theorem 1: Factorization Implies Conditional Independencies

Theorem 2: Conditional Independencies Implies Factorization

／var／log marcus chiu