| |
Software downloads: Expert system, KBS shell kits |
|
GeNIe / SMILE
|
GeNIe: Graphical Network Interface to
SMILE: Structural Modeling, Inference and Learning Engine
|
|
| keywords |
clinical domains |
| Decision-analytic decision support, Probabilistic decision support systems, Bayesian Systems, graphical probabilistic models, Bayesian networks, decision
analysis, learning, diagnosis |
(Diagnosis) |
|
| developed by |
Decision Systems Laboratory, University of Pittsburgh |
| released |
SMILE: 1998 GeNIe: 1998 |
| status |
Available for use / under continuing development |
| download |
"The programs are made available in a compiled form, free of charge for any use (this permission includes explicitly potential commercial use).
... Even though we are making them available to the community, the programs and all accompanying graphics and manuals are copyrighted by the Decision Systems Laboratory, University of Pittsburgh and cannot be copied or
distributed without our explicit permission...".
[From License information, GeNIe/SMILE.]
|
|
| description |
Introduction
"Probabilistic DSSs, applicable to problems involving classification, prediction, and diagnosis,
are a new
generation of systems that are capable of modeling any real-world decision problem using
theoretically sound
and practically invaluable methods of probability theory and decision theory. Based on
graphical representation
of the problem structure, these systems allow for combining expert opinions with
frequency data, gather, manage,
and process information to arrive at intelligent solutions."
GeNIe environment for the creation of decision theoretic models
Main features:
- "Graphical editor to create and modify network models
- Uses the SMILE Engine. You may develop models in GeNIe and create a custom interface for them using SMILE
- Supports chance nodes with General, Noisy OR/MAX and Noisy AND distribution
- Open multiple networks and cut and paste sections of models between them
- Complete integration with MS. Excel, cut and paste data into internal spreadsheet view of GeNIe
- Cross compatibility with other software. Supports all major file types (e.g. Hugin, Netica, Ergo)
- Support for handling observation costs of nodes
- Support for diagnostic case management."
- "Supports causal discovery and learning
of models from data."
SMILE: fully portable Bayesian inference engine
SMILE is a fully platform independent
library of C++ classes implementing graphical probabilistic and decision-theoretic models,
such as Bayesian networks, influence diagrams, and structural equation models.
Its individual classes, defined in the SMILE API (Application Programming Interface),
allow you to create, edit, save, and load graphical models, and use them for probabilistic
reasoning and decision making under uncertainty. SMILE supports directly object-oriented
methodology.
SMILE can be embedded in programs that use graphical probabilistic models as their reasoning
engines. Models developed in GeNIe can be equipped with a user interface which utilizes
SMILE as the backend engine.
SMILE is released as a dynamic link library (DLL).
There are also several SMILE wrappers, such as SMILE.NET (.NET interface), SMILEX (Active X),
jSMILE (Java interface), etc.
Main features:
- "Graphical editor to create and modify network models.
- Platform independent, versions available for Windows, Unix (Solaris), Linux, Mac, Pocket PC, etc.
- SMILE.NET available for use with .NET framework. Compatible with all .NET languages, including C# and VB.NET. May be used to create web-based applications of Bayesian networks.
- Thorough and complete documentation.
- Responsive development team support, we will compile SMILE for your platform on demand.
- Tested in the field since 1998"
"Probabilistic DSSs are applicable in many domains" - in medicine, for example, for diagnosis and therapy planning.
A medical DSS developed using GeNIe and SMILE is the medical diagnostic system, Hepar II ...
- "a Bayesian network model that ...
aids physicians in the diagnosis of liver disorders.
The structure of the model, currently consisting of
almost 100 variables, has been elicited from physician experts, while its numerical parameters have been learned
from a database of patient cases."
|
|
| references |
Marek J. Druzdzel. Intelligent decision support systems based on SMILE.
Software 2.0, 2(February):12-33, 2005.
[]
[University of Pittsburgh]
|
"
In this article I share with the readers the basics of and the principles behind intelligent decision support systems based on the theoretically sound principles of probability theory and decision theory. Recent advances in probability theory have led to the development of graphical models that are capable of modeling the causal structure of systems, well understood by human experts, and at the same time give such structure a sound, probabilistic interpretation. Graphical models can serve as a convenient basis for modeling domains that involve high degree of uncertainty and also reasoning with them. Examples of problems that are addressed by systems based on graphical models include computer vision, robotics, pattern matching, medical diagnosis and therapy planning, machine diagnosis, and even on-line help. Microsoft Corporation uses graphical models inside the Windows operating system, in troubleshooting, and in user interfaces, such as on-line Office help and junk mail filtering in Outlook. The graphical models methodology is implemented in a general purpose decision modeling system SMILE and its Windows user interface, GeNIe, developed at the Decision Systems Laboratory. I try to give the readers a flavor of GeNIe models and building systems based on the SMILE library.
"
|
Marek J. Druzdzel. GeNIe: A development environment for graphical
decision-analytic models. In Proceedings of the 1999 Annual Symposium of
the American Medical Informatics Association (AMIA-1999), page 1206,
Washington, D.C., November 6-10, 1999.
[]
[University of Pittsburgh]
|
("One-page extended abstract of the conference presentation of GeNIe and SMILE").
|
Marek J. Druzdzel. SMILE: Structural Modeling, Inference, and Learning
Engine and GeNIe: A development environment for graphical
decision-theoretic models (Intelligent Systems Demonstration). In
Proceedings of the Sixteenth National Conference on Artificial
Intelligence (AAAI-99), pages 902-903, AAAI Press/The MIT Press, Menlo
Park, CA, 1999.
[]
[University of Pittsburgh]
|
(Two-page extended abstract of conference presentation of GeNIe and SMILE.)
|
Pieter Kraaijeveld and Marek J. Druzdzel. GeNIeRate: An interactive
generator of diagnostic Bayesian network models. In Working Notes of the
16th International Workshop on Principles of Diagnosis (DX-05), pages
175-180, Monterey, CA, USA, June 1-3, 2005.
[]
[University of Pittsburgh]
|
"
Constructing diagnostic Bayesian network models is a complex and time consuming task. In this paper, we propose a methodology to simplify and speed up the design of such models. The models are based on two simplifying assumptions: (1) the structure of the model has three levels of variables and (2) the interaction among the variables can be modelled by noisy-MAX gates. The methodology is implemented in an application named: GeNIeRate, which aims at supporting construction of diagnostic Bayesian network models consisting of hundreds or even thousands of variables. Preliminary qualitative evaluation of this application shows great promise. We are planning to conduct a systematic study to compare GeNIeRate to traditional techniques for building Bayesian network models and we hope to be able to present the results at the workshop.
"
|
Marek J. Druzdzel and F. Javier Diez.
Combining knowledge from different sources in probabilistic models. Journal of Machine Learning Research,
4(July):295-316, 2003.
[]
[University of Pittsburgh]
|
"
Building probabilistic and decision-theoretic models requires a considerable knowledge engineering effort in which the most daunting task is obtaining the numerical parameters. Authors of Bayesian networks usually combine various sources of information, such as textbooks, statistical reports, databases, and expert judgement. In this paper, we demonstrate the risks of such a combination, even when this knowledge encompasses such seemingly population-independent characteristics as sensitivity and specificity of medical symptoms. We show that the criteria "do not combine knowledge from different sources" or "use only data from the setting in which the model will be used" are neither necessary nor sufficient to guarantee the correctness of the model. Instead, we offer graphical criteria for determining when knowledge from different sources can be safely combined into the general population model. We also offer a method for building subpopulation models. The analysis performed in this paper and the criteria we propose may be useful in such fields as knowledge engineering, epidemiology, machine learning, and statistical meta-analysis.
"
|
|
| references: in clinical use |
Wasyluk H, Oniśko A, Druzdzel MJ.
Support of diagnosis of liver disorders based on a causal Bayesian network model.
Med Sci Monit. 2001 May;7 Suppl 1:327-32.
[PubMed]
[] |
"
We describe our work on HEPAR II, a probabilistic causal model for diagnosis of liver disorders. The model, a Bayesian network capturing the causal interactions among various risk factors, diseases, symptoms, and test results, is based on expert knowledge combined with clinical data captured in medical records. The main applications of HEPAR II are assistance is diagnosis and training of beginning diagnosticians. We outline the principles of the applied approach, present a brief description of the model, and report its diagnostic performance.
" |
Agnieszka Onisko, Marek J. Druzdzel and Hanna Wasyluk. An experimental
comparison of methods for handling incomplete data in learning
parameters of Bayesian networks. In Intelligent Information Systems
2002: Proceedings of the IIS'2002 Symposium, M. Klopotek, S.T.
Wierzchon, M. Michalewicz (eds.), pages 351-360, Advances in Soft
Computing Series, Physica-Verlag (A Springer-Verlag Company),
Heidelberg, 2002.
[]
[University of Pittsburgh]
|
"
Missing values of attributes in data sets, also referred to as incomplete data, pose difficulties in learning tasks, such as classification, data mining, or learning Bayesian network structure and its numerical parameters. Because of the predominance of incomplete data in practice, many methods have been proposed to deal with them while there are few studies that compare their performance. The HEPAR II project presents an excellent opportunity to test experimentally how these methods perform on a real data set. We briefly review several popular methods for handling incomplete data and then compare them on the task of learning conditional probability distributions of a Bayesian network model, where the comparison criterion is the resulting diagnostic accuracy. While substitution of "normal" values of missing attributes seemed to perform best, we observed only a small difference in performance among the studied methods.
"
|
Agnieszka Onisko and Marek J. Druzdzel.
Effect of imprecision in probabilities on the quality of results in Bayesian networks: An
empirical study. In Working Notes of the European Conference on
Artificial Intelligence in Medicine (AIME-03) Workshop on Qualitative
and Model-based Reasoning in Biomedicine, pages 45-49, Protaras, Cyprus,
19 October 2003.
[]
[University of Pittsburgh]
|
"
While most knowledge engineers believe that the quality of results obtained from Bayesian networks is not too sensitive to imprecision in probabilities, this remains a conjecture with only modest empirical support. Our work on a Bayesian network model for diagnosis of liver disorders, Hepar II, presented us with an opportunity to test this conjecture in a practical setting. We present the results of an empirical study in which we systematically introduce noise in Hepar II's probabilities and test the diagnostic accuracy of the resulting model. We replicate an experiment conducted by Pradhan et al. [13] and show that Hepar II is more sensitive to noise in parameters than the CPCS network that they examined. Our data show that the diagnostic accuracy of the model deteriorates almost linearly with noise. While our result is merely a single data point that sheds light on the hypothesis in question, we suggest that Bayesian networks are more sensitive to the quality of their numerical parameters than popularly believed.
"
|
Agnieszka Onisko, Peter Lucas and Marek J. Druzdzel.
Comparison of rule-based and Bayesian network approaches in medical diagnostic systems. Proceedings of the Eighth Annual Conference on Artificial
Intelligence in Medicine (AIME-2001), S. Quaglini, P. Barahona, S.
Andreassen (eds.) Artificial Intelligence in Medicine, Lecture Notes in
Computer Science Subseries, Springer Verlag, pages 281-292, 2001.
[]
[University of Pittsburgh]
|
"
Almost two decades after the introduction of probabilistic expert systems, their theoretical status, practical use, and experiences are matching those of rule-based expert systems. Since both types of systems are in wide use, it is more than ever important to understand their advantages and drawbacks. We describe a study in which we compare rule-based systems to systems based on Bayesian networks. We present two expert systems for diagnosis of liver disorders that served as the inspiration and vehicle of our study and discuss problems related to knowledge engineering using the two approaches. We finally present the results of a simple experiment comparing the diagnostic performance of each of the systems on a subset of their domain.
"
|
Hanna Wasyluk, Agnieszka Onisko and Marek J. Druzdzel.
Application of a computer-based diagnostic tool to training general practitioners. In
Fifth International Seminar on Statistics and Clinical Practice (68th
Seminar of the International Centre of Biocybernetics), Warsaw, Poland,
3-5 June 2002.
[]
[University of Pittsburgh]
|
"
The last two decades have brought considerable advances in the field of computer-based medical systems. These advances have resulted in noticeable improvements in medical care, starting from ease of storage and access of digital imaging through gathering of computerized medical data, accessing on-line literature, patient monitoring, and therapy planning. Systems addressing the task of diagnosis, however, have rarely been adopted in clinical practice, which has raised questions about their usefulness and feasibility. We show in this paper a practical application of a computer-based diagnostic system, HEPAR II, to training beginning diagnosticians.
"
|
Y-L. O.
Model-based guideline development for symptom-based indications.
To appear in: Proc. AI techniques in healthcare: evidence-based guidelines and protocols, 2006.
[]
[OC]
|
"
This work proposes a model-based guideline development
method for complex diagnostic and treatment policies, for instance
for symptom-based indications.
An influence diagram has advantages for the development of
outcome-based guidelines, because it provides an explicit link between
the policies and these outcomes. A Bayesian network have
a strong inference capability, suitable for complex symptom-based
medical diagnostic modelling incorporating many diseases. However,
neither supports temporal sequences.
The modelling of symptom-based indications for guideline distillation
can be supported by imposing a strict structure and a strict
order. A further extension is the user interface for evaluation and
ranking of the probabilities, policies, and the resulting outcomes.
Instead of seeking for the best solution, the guideline is distilled
by ranking these outcomes with respect to the treatment, anatomic
region and pathology, and the generally expected outcome if treated
properly. The proposed guideline distillation method is expected to
improve the development, and therefore the quality of the guidelines.
"
|
|
| contact |
Decision Systems Laboratory
School of Information Sciences University of Pittsburgh
135 North Bellefield Avenue, #B212
PA 15260, Pittsburgh USA
T: 412-624-7378
F: 412-624-2788
E: genie sis.pitt.edu
|
| links |
|
|
| acknowledgements |
| Marek J. Druzdzel, Decision Systems Laboratory, University of Pittsburgh, USA |
| page history |
Entry on OpenClinical: 04 April 2006
Last main update: 09 May 2006 |
|
|
|
|
|