source: arxiv machine learning: information lattice learning as probabilistic graphical model structure learning

level: research

information lattice learning (ill) learns interpretable rules from a signal by projecting it onto a partition lattice that encodes a hierarchy of abstractions, then lifting selected rules back to the signal domain. when the signal is a probability mass function, the learned probabilistic rules can be viewed as a probabilistic graphical model (pgm). a partition in ill creates a deterministic quotient variable, and a rule is the marginal distribution of that variable. a set of rules thus forms a collection of marginal constraints over interpretable abstractions.

general lifting in ill defines the family of all joint distributions that satisfy the given marginal constraints. special lifting picks a maximum-ignorance reconstruction, implemented via an l2 uniformity principle that is closely related to maximum entropy. under a shannon-entropy lifting, the same constraints lead to a maximum-entropy distribution. this connection shows that ill can be seen as a structure learning algorithm for pgms, where the learned rule set corresponds to the graph structure and the lifting step determines the joint distribution.

the pgm interpretation provides a principled way to understand ill's rule learning process. it clarifies how ill balances interpretability and statistical fidelity by selecting a set of marginal constraints and then reconstructing a joint distribution that respects those constraints while assuming as little additional structure as possible. this perspective may help in applying ill to tasks like density estimation, causal discovery, and feature engineering, where understanding the dependencies among variables is crucial.

why it matters: this interpretation connects rule-based learning with probabilistic graphical models, offering a new way to learn interpretable dependencies in data for ai and data science applications.


source: arxiv machine learning: information lattice learning as probabilistic graphical model structure learning