| |
Intelligent Software
Agents
Probabilistic Data Mining
Methods
for processing data have developed more slowly then methods for gathering
and storing it. Hence, automated and semiautomated data processing tools are
urgently needed. A promising new technique is based on learning Bayesian
networks from data. Bayesian networks are graphical representations of
the joint probability distributions for sets of variables. Currently,
they play crucial roles in expert systems, diagnosis engines, and decision
support systems. Many off-the-shelf tools can apply these learned networks;
also, adaptive Bayesian networks offer semantic clarity and understandability
by humans, ease of acquisition and incorporation of prior knowledge, ease
of integration with optimal decision-making methods, the possibility of
causal interpretation of learned models, and automatic handling of noisy
and missing data.
SRI tested and assessed network classification algorithms for implementation,
investigated applications of the algorithms and identified areas where
research is needed. We concluded that the main bottleneck in the current
implementations is in data structures for recording sufficient statistics,
particularly in databases containing more than 30,000 data instances for
training.
|
|