We use probabilistic graphical models to model some probability distribution 𝐏. This section introduces ways to compare the probability independencies induced by the graphical model and the modeled probability distribution. Or even between 2 different probabilistic graphical models.
Terminology for Comparing Independencies
- model 𝐴 is a(n) (I-Map | D-Map | P-Map | Minimal I-Map | Maximal D-Map) of model 𝐵
we use the statement above to explain the terms below:
|
Term |
Description |
|---|---|
|
Independence Map (I-Map) |
set of independencies induced by 𝐴 is a SUBSET of the independencies induced by 𝐵 |
|
Dependence Map (D-Map) |
set of independencies induced by 𝐴 is a SUPERSET of the independencies induced by 𝐵 |
|
Perfect Map (P-Map) |
set of independencies induced by 𝐴 is EQUAL to the set of independencies induced by 𝐵
|
|
Minimal I-Map |
an I-Map in which the removal of ANY edge, no longer makes it not an I-Map (i.e. removal of edge in 𝐴 would introduce an independence not in 𝐵)
|
|
Maximal D-Map |
a D-Map model in which an addition of ANY edge, no longer makes it not a D-Map (i.e. addition of edge in 𝐴 would remove an independence that exists in 𝐵, thus 𝐴 no longer contains a SUPERSET of 𝐵‘s independencies)
|
In most models:
- the REMOVAL of edges typically INCREASES the number of independencies
- the ADDITION of edges typically REDUCES the number of independencies
---structured-probabilistic-models-(spm)/pgm---model-properties/i-map---d-map---minimal-i-map---maximal-d-map---p-map---i-equivalence/i-map-and-d-map.png)
Formal Definitions
Click here to expand...
Term
Description
Independence Map (I-Map)
if a graphical model 𝐆 is an I-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐆 also exists in 𝐏 (denoted as 𝐈(𝐆) ⊆ 𝐈(𝐏))
a fully connected DAG 𝐆 is an I-Map for any distribution 𝐏, since 𝐈(𝐆) = ∅ ⊆ 𝐈(𝐏) for all 𝐏
Dependence Map (D-Map)
if a graphical model 𝐆 is a D-Map of distribution 𝐏, then every independence (conditional and marginal) in 𝐏 also exists in 𝐆 (denoted as 𝐈(𝐏) ⊆ 𝐈(𝐆))
an unconnected DAG 𝐆 is a D-Map for any distribution 𝐏, since 𝐈(𝐏) ⊆ 𝕌 = 𝐈(𝐆) for all 𝐏
Minimal I-Map
graphical model 𝐆 is a Minimal I-Map for 𝐏 if the removal of any single edge makes it not an I-Map
a distribution 𝐏 may have several Minimal I-Maps, each corresponding to a specific node-ordering
Maximal D-Map
graphical model 𝐆 is a Maximal D-Map for 𝐏 if an addition of any single edge makes it not a D-Map
Perfect Map (P-Map)
graphical model 𝐆 is a P-Map for distribution 𝐏 if 𝐈(𝐆) = 𝐈(𝐏)
in other words:
- if 𝐆 is a P-Map, then 𝐆 is also an I-Map and D-Map
- if 𝐆 is both an I-Map and a D-Map, then 𝐆 is also a P-Map
Independence Equivalence
(I-Equivalence)
two graphical models 𝐆𝟏 and 𝐆𝟐 are I-Equivalent if 𝐈(𝐆𝟏) = 𝐈(𝐆𝟐)
Examples
Click here to expand...
suppose, we are given 2 distributions: 𝐏𝟏 and 𝐏𝟐
- distribution 𝐏𝟏 has independence, 𝐈(𝐏𝟏) = {(𝐼⊥𝐷)}
- distribution 𝐏𝟐 has no independence, 𝐈(𝐏𝟐) = {∅}
suppose, we are given 2 graphical models (DAGs in this case): 𝐆𝑎 and 𝐆𝑏
from the DAGs, we can observe:
- 𝐆𝑎- 𝐈(𝐆𝑎) = {(𝐼⊥𝐷)}
- 𝐆𝑏 - 𝐈(𝐆𝑏) = {∅}
therefore, we can say the following:
𝐆𝑎 is an I-Map of 𝐏𝟏 because 𝐼 and 𝐷 are independent in both 𝐆𝑎 and 𝐏𝟏
𝐆𝑎 is not the I-Map of 𝐏𝟐 because 𝐏𝟐 fails to satisfy the independence between I and D
𝐆𝑎 is a P-Map of 𝐏𝟏 because 𝐈(𝐆𝑎) = 𝐈(𝐏𝟏)
𝐆𝑎 is not a P-Map of 𝐏𝟐 because 𝐈(𝐆𝑎) ≠ 𝐈(𝐏𝟐)
𝐆𝑏 is an I-Map of both 𝐏𝟏 and 𝐏𝟐 because the independence in 𝐆𝑏 is ∅. Since ∅ is a subset of every set, both 𝐏𝟏 and 𝐏𝟐 satisfy the independence implied in 𝐆𝑏
𝐆𝑏 is not a P-Map of 𝐏𝟏 because 𝐈(𝐆𝑏) ≠ 𝐈(𝐏𝟏)
𝐆𝑏 is a P-Map of 𝐏𝟐 because 𝐈(𝐆𝑏) = 𝐈(𝐏𝟐)
𝐆𝑎 and 𝐆𝑏 are not I-Equivalent because their set independencies are not equivalent (i.e. 𝐆𝑎‘s independencies {(𝐼⊥𝐷)} does not equal 𝐆𝑏‘s independencies {∅})
Relation to Gibbs Distribution
Click here to expand...
Graphical Models
a probability distribution 𝐏(𝐗) is a Gibbs Distribution over a graphical model 𝐆 if it can be written as
- 𝐏(𝐗) = (1/𝘡) * ∏𝑐∊𝐶 [ 𝜙𝑐(𝐗𝑐) ]
where the variables in each potential function 𝜙𝑐form a clique in 𝐆
Theorem 1: Factorization Implies Conditional Independencies
If 𝐏(𝐗) is a Gibbs Distribution for 𝐆, then 𝐆 is an I-Map for probability distribution 𝐏(𝐗), i.e. 𝐈(𝐆) ⊆ 𝐈(𝐏)
proof:
suppose:
- A, B, and C are disjoint sets of variables
- A is connected to B
- C is connected to B
- B separates A from C
then we can write
𝐏(A, B, C) = (1/𝘡) * 𝜙𝑐1(A,B) * 𝜙𝑐2(B,C)
Theorem 2: Conditional Independencies Implies Factorization
If 𝐏(𝐗) is a positive distribution and 𝐆 is an I-Map for 𝐏(𝐗), then 𝐏(𝐗) is a Gibbs Distribution that factorizes over graphical model 𝐆
---structured-probabilistic-models-(spm)/pgm---model-properties/i-map---d-map---minimal-i-map---maximal-d-map---p-map---i-equivalence/maximal-d-map.png)
---structured-probabilistic-models-(spm)/pgm---model-properties/i-map---d-map---minimal-i-map---maximal-d-map---p-map---i-equivalence/minimal-i-map.png)
---structured-probabilistic-models-(spm)/pgm---model-properties/i-map---d-map---minimal-i-map---maximal-d-map---p-map---i-equivalence/i-map.png)