EFTA02444384
EFTA02444385 DataSet-11
EFTA02444392

EFTA02444385.pdf

DataSet-11 7 pages 1,892 words document
P17 V11 V16 P24 D3
Open PDF directly ↗ View extracted text
👁 1 💬 0
📄 Extracted Text (1,892 words)
Simplifying Bayesian Inference Stefan KrauB, Laura Martignon & Ulrich Hoffrage Max Planck Institute For Human Development Lentzeallee 94, 14195 Berlin-Dahlem Probability theory can be used to model inference under uncertainty. The particular way in which Bayes'formula is stated, which is of only minor importance in standard probability textbooks, becomes central in this context. When events can be interpreted as evidences and hypotheses, Bayes'formula allows one to update one's belief in a hypothesis in light of new data. Is unaided human reasoning Bayesian? Kahneman and "Iversky (1972) affirmed: "In his evaluation of evidence, man is not Bayesian at all." In their book Judgment under uncertainty (1982), they attempted to prove that human judgment is riddled with systematic deviations from the logical and probabilistic norm. In chapter 18 of the same book David M. Eddy stressed that medical doctors do not follow Bayes'formula when solving the following task: Theprobability that a woman at age 40 has breast cancer (B) is I% (P(B) = prevalence = 1%) According to the literature, theprobability that the disease is detected by a mammography (M) is 80%. (P(M+ IB) = sensitivity = 80%) Theprobability that the test misdetects the disease although the patient does not have it is 9.6%. (P(M+ I0B) = 1- specificity = 9.6%) If a woman at age 40 is tested as positive, what is theprobability that she indeed has breast cancer (POW)? Bayes'formula yields the following result: P(M+ BrP(B) 80%?I% P(BI M+) - - 0.078 P(M+ I M?P(B)+ P(M+ - B) 9.11(- B) 80%?1%+ 9.6% ?99% Thus, the probability of breast cancer is only 7.8%, while Eddy reports that 95 out of 100 doctors estimated this probability to be between 70% and 80%. Gigerenzer and Hoffrage (1995) focused on another aspect of the problem: the representation of uncertainty. In Eddy's task, quantitative information was given in probabilities. Gigerenzer and Hoffrage presented Eddy's problem to medical doctors replacing probabilities with a different representation of uncertainty, namely natural frequencies. In their formulation the task was: 100 out ofevery 10000 women at age 40 who participate in routine screening have breast cancer. 80 ofevery 100 women with breast cancer willget a positive mammography. 950 out ofevery 9900 women without breast cancer will also get a positive mammography. Here is a new representative sample of women at ageforty who get a positive mammography in routine screening. How many ofthese women do you expect to actually have breast cancer? Now nearly half (46%) of all doctors gave the Bayesian answer: 80 out of 1030 (7.8%). EFTA_R1_01521127 EFTA02444385 Probabilities Natural Frequencies 10.000 breast N_ no breast cancer cancer p(B) = .01 p(T+ I B) = .80 9.900 p(T+ B) = .096 0 e 8.950 Test Test Test Test posifiv negativ post*/ ) neoafiv (T+) (T-) (T+ (T-) p(B T+) 80T+) • .01 x .80 .01 x .80 + .99 x .096 • p(B T+) 80 + 950 0 0 OO OOO Figure 1 What is the crucial property that helps one to find the Bayesian solution? To answer this question, it is helpful to consider a more general case. In real-life situations, decisions are usually based on several cues. A medical doctor, for instance, seldom diagnoses a disease based on a single test. The usual procedure after a mammography is to perform an ultrasound test (U). For an ultrasound test, sensitivity and specificity are usually given in the instructions: P(U+ IB) = 95% P(U+ I0B) = 4% In an empirical study, we presented this information together with P(B), P(M+ IB) and P(M+ IfaB) to a group of participants. They were asked: What is the probablity that a woman at age 40 has breast cancer, given that she has a positive mammography anda positive ultrasound test? When given this probability format, only 12.2% of our participants reached the correct solution (» 3/3 ). D. Massaro (1998) gave an example describing the same situation with frequencies': EFTA_R1_01521128 EFTA02444386 Wt..* I M./AU- I M•AU. ev1+8.t.l- M-11U+ 11 4/4-8,U• Figure 2 Massaro writes that in the case of two cues „a frequency algorithm will not work" and „it might not be reasonable to assume that people can maintain exemplars of all possible symptom configurations." However, his statements are not based on experimental evidence, and his frequency configuration is not really equivalent to the probability format because he works with combined sensitivity P(M+ & U+ IB) and combined specificity 1-P(M- & U- I0B). One possible frequency format, which does correspond to our probability format, is': re — 1 1100001 women (0000) women d breast cancer/ no breast cancer breast cancer no breast cancer r. /Th 100 99001 IWO) (9900i \_/ \ _/ M+ (80 \ M-(20) 1 950) M+ 8950 M- U U-( 5 e 396)U+ 950:1) U- Figure 3 EFTA_R1_01521129 EFTA02444387 In words: 100 out ofevery /0000 woman at age 40 who participate in routine screening have breast cancer. 80 ofevery 100 women with breast cancer willget a positive mammography. 950 out ofevery 9900 woman without breast cancer will also get a positive mammography. 95 out of100 women with cancer willget a positive ultrasound test 396 out of9900 women, although they do not have cancer, nevertheless obtain a positive ultrasound test. How many ofthe women who get a positive mammography and a positive ultrasound test do you expect to actually have breast cancer? 14.6% of our participants solved this version correctly. Another possibility is to consider the tests sequentially. This is possible because the ultrasound test and the mammography are conditionally independent, i.e. P(U+ IB) = P(U+ IB & M+). Now we have: —• (I,000) women breast cancer no breast cancer ( 100 I N M-1 20 ) 950 M+ 8950 M- fiTh c m % 1Th \-1 76 ( 4 19 ) 1 (38 / U+ U- U+ U- U+ U- U+ U- Figure 4 In words: 100 out ofevery 10000 women at age 40 who participate in routine screening have breast cancer. 80 ofevery 100 women with breast cancer willget a positive mammography. 950 out ofevery 9900 women without breast cancer will also get a positive mammography. 76 out of80 women who had a positive mammography and have cancer also have a positive ultrasound test. 38 out of950 women who had a positive mammography, although they do not have cancer, also have a positive ultrasound test. How many ofthe women who get a positive mammography and a positive ultrasound test do you expect to actually have breast cancer? 53.7% of our participants solved this task correctly. EFTA_R1_01521130 EFTA02444388 Not all frequencies in the tree were actually used. The next step is to eliminate all frequencies irrelevant to the task. Thus we obtain: 11000u women breast cancer no breast cancer 7 11)?) 9900 M+ 80 ) U+ /—• 76 Figure 5 These frequencies, namely those that really foster insight, deserve a special name. We decided to call them Markovfrequencies because of the natural analogy with Markov chains. In fact: 1) Our tree consists of two chains which are joined at the root. 2) Each node corresponds to the reference class that determines the next node. As in a Markov chain, the frequency in each node depends only upon its predecessor, not upon previous nodes. Being able to "think in chains" seems crucial for human insight and fits the modern view that problem solving, unlike perception, is sequential rather than parallel. Markov frequencies are task-oriented, i.e., only information that is relevant for the task appears in the tree. Gigerenzer and Hoffrage (1995) also used a tree (see Figure 1). Their tree contains the information (P(T- IB) and P(T- I0B)), which is not relevant to the question "P(BIT+) =?". In our chains, the odds of the problem can be read directly from the last two nodes. This is because the tree with Markov frequencies corresponds to the well-known likelihood-combination rule (see, for instance, Spies, 1993): prior odds • product of the likelihood ratios = posterior odds 100 The prior odds for breast cancer are 9900 Multiplying this with the likelihood ratio for the mammography', we obtain 985°0 . Again multiplying this with the likelihood ratio of the ultrasound test, we finally . get38 By using Markov frequencies, it is not only clear which information should be given to experts, but also which information should be omitteds. Appropriately deleting useless information is part of the overall computation, as we know from information theory. EFTA_R1_01521131 EFTA02444389 References Eddy, D. M. (1982). Probabilistic reasoning in clinical medicine: Problems and opportunities. In D. Kahneman, P. Slovic & A. Tversky (Eds.), Judgment under uncertainty: Heuristics and biases (pp. 249-267). Cambridge, England: Cambridge University Press. Gigerenzer, G. & Hoffrage, U. (1995). How to improve bayesian reasoning without instruction: Frequency formats. Psychological Review, 102, 684-704. Kahneman, D. & Tversky, A. (1972). Subjective probability: A judgement of representativeness. Cognitive Psychology, 3, 430454. Massaro, D. (1998). Perceiving talkingfaces (pp.174-179). Boston. MIT Press. Spies, M. (1993). Unsicheres Wissen: Wahrscheinlichkeit, Fuzzy-Logik, neuronale Netze and menschliches Denken (pp.51-54). Heidelberg, Berlin, Oxford: Spektrum Akademischer Verlag. Footnotes I) To integrate the research on this topic, we borrowed concepts from various sources and explored them in the breast cancer example. In fact, Gigerenzer and Hoffrage used a sample of 1.000 (not 10.000) women, Massaro speaks of symptoms instead of tests and we tested our subjects with ,tuberculosis tasks" instead of „breast cancer tasks." 2) Gigerenzer and Hoffrage stressed that only frequencies work that can be sampled „naturally". A doctor would get information of this kind when he samples instructions for different tests and translates the information therein into frequencies. 3) A doctor would get information of this kind when he samples patients with respect to their state of illness. (M + I B) 8 4) The likelihood ratio L(B, M+) is defined by , which is 0% » 8.3 P P(M+ B) The likelihood ratio L(B, U+) therefore is 5% = 23.75 4% 5) Because Baycs'formula can be used to model inference under uncertainty, it is also a tool in scientific reasoning. Klaus Hasselmann from the Max Planck Institute for Meteorology in Hamburg is presently applying a Bayesian analysis to hypotheses about changes in climate. The Society for Mathematics and Data Analysis in St. Augustin is investigating various methods for EFTA_R1_01521132 EFTA02444390 estimating credit risks, such as analysis of discriminance, fuzzy-pattern classification, and neural networks with the help ofBayes'theorem. The „Krebsatlas" (almanac of cancer patients) for Germany is being reviewed at the Ludwig Maximilian University in Munich by means of Bayesian methods. The task is to detect and eliminate spurious correlations. Even the Microsoft Office Assistant uses Bayesian procedures. The mathematician Anthony 0' Hagan „elicits" on behalf of the Britsh government hydrological conductivity of the rock at Sellafield from experts. He uses their beliefs to determine a prior distribution, with which the appropriateness of the area as a permanent diposal site for nuclear waste can be estimated (Neue Ziircher Zeitung, May 13, 1998, 5.39.). Even the most expert systems are based on Bayes'formula. A famous example is MUNIN (Muscle and Nerve Inference Network) from Lauritzen and Spiegelhalter (1988), which is used for making diagnoses on the basis of measurements of muscular electrical impulses („electromyography"). Maybe Markov frequencies can also help to facilitate programming those expert systems. Acknowledgments We thank Valerie Chase, Martin Lages, Donna Alexander and Matthias Licha for helpful comments and Ursula Dohme for running the experiments. (Submission to the 1998 Conference on „Model-Based Reasoning in Scientific Discovery") EFTA_R1_01521133 EFTA02444391
ℹ️ Document Details
SHA-256
5a9db6bfd26a9dc7f07ae9cc289bbe27786303fb476a6c141a257e257f22cf2e
Bates Number
EFTA02444385
Dataset
DataSet-11
Document Type
document
Pages
7

Comments 0

Loading comments…
Link copied!