1 / 57

Big questions in Science…

Big questions in Science…. What do I need to know about statistics to succeed in IB Biology?. Statisticians…. ‘..people who like figures, but don ’ t have the personality skills to become accountants…’ d o uncertainty, randomness and chance have a place in science?

zazu
Télécharger la présentation

Big questions in Science…

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Big questions in Science… What do I need to know about statistics to succeed in IB Biology?

  2. Statisticians… ‘..people who like figures, but don’t have the personality skills to become accountants…’ • do uncertainty, randomness and chance have a place in science? • How should we react to them?...

  3. Hummingbirds are nectarivores(herbivores that feed on the nectar of some species of flower). In return for food, they pollinate the flower. This is an example of mutualism – benefit for all. As a result of natural selection, hummingbird bills have evolved. Birds with a bill best suited to their preferred food source have the greater chance of survival. Photo: Archilochus colubris, from wikimedia commons, by Dick Daniels.

  4. Researchers studying comparative anatomy collect data on bill-length in two species of hummingbirds: Archilochus colubris (red-throated hummingbird) and Cynanthus latirostris (broadbilled hummingbird). To do this, they need to collect sufficient relevant, reliable data so they can test the Null hypothesis (H0) that: “there is no significant difference in bill length between the two species.” Photo: Archilochus colubris (male), wikimedia commons, by Joe Schneid

  5. The sample size must be large enough to provide sufficient reliable data and for us to carry out relevant statistical tests for significance. We must also be mindful of uncertainty in our measuring tools and error in our results. Photo: Broadbilled hummingbird (wikimedia commons).

  6. The mean is a measure of the central tendency of a set of data. • Calculate the mean using: • Your calculator • (sum of values / n) • Excel n = sample size. The bigger the better. In this case n=10 for each group. All values should be centred in the cell, with decimal places consistent with the measuring tool uncertainty. =AVERAGE(highlight raw data)

  7. The mean is a measure of the central tendency of a set of data. Descriptive table title and number. Uncertainties must be included. Raw data and the mean need to have consistent decimal places (in line with uncertainty of the measuring tool)

  8. Descriptive title, with graph number(graph number must match table number) Labeled point Y-axis clearly labeled, with uncertainty. Make sure that the y-axis begins at zero. x-axis labeled

  9. From the means alone you might conclude that C. latirostris has a longer bill than A. colubris. But the mean only tells part of the story.

  10. How can I find the mean and the standard deviation on my calculator? Head over to click4biology for instructions on how to calculate the mean and standard deviation using the TI 83 plus and the TI 84 plus calculator http://click4biology.info/c4b/1/gcStat.htm

  11. Standard deviationis a measure of the spread of most of the data. Which of the two sets of data has: The longest mean bill length? The greatest variability in the data? Standard deviation can have one more decimal place. =STDEV (highlight RAW data).

  12. Standard deviationis a measure of the spread of most of the data. Which of the two sets of data has: The longest mean bill length? The greatest variability in the data? C. latirostris A. colubris Standard deviation can have one more decimal place. =STDEV (highlight RAW data).

  13. Standard deviation is a measure of the spread of most of the data. Error bars are agraphical representationof the variabilityof data. Error bars could represent standard deviation, range or confidence intervals. Which of the two sets of data has: The highest mean? The greatest variability in the data?

  14. Standard deviation is a measure of the spread of most of the data. Error bars are agraphical representationof the variabilityof data. Error bars could represent standard deviation, range or confidence intervals. Which of the two sets of data has: The highest mean? A The greatest variability in the data? B

  15. Title is adjusted to show the source of the error bars. This is very important. You can see the clear difference in the size of the error bars. Variability has been visualised. The error bars overlap somewhat. What does this mean?

  16. The overlap of a set of error bars gives a clue as to the significance of the difference between two sets of data. Large overlap No overlap Lots of shared data points within each data set. Results are not likely to be significantly different from each other. Any difference is most likely due to chance. No (or very few) shared data points within each data set. Results are more likely to be significantly different from each other. The difference is more likely to be ‘real’.

  17. Our results show a very small overlap between the two sets of data. So how do we know if the difference is significant or not? We need to use a statistical test. Thet-test is a statistical test thathelps us determine the significance of the differencebetween the means of two sets of data.

  18. InferentialStatisticsComparing two data sets: The T-test… • Used to compare two normally distributed data sets (ideally with similar variances) • A t-test is a statistic that checks if the means of 2 groups are reliably different • Just looking at the means may show you that they are different, but doesn’t show if the difference is reliable • We always test the NULL Hypothesis (H0) • T-test…the movie…

  19. So what are degrees of freedom? Degrees of freedom represent sample size. For only one group, df = n-1, wheren = number of samples Usually we are looking at 2 groups, so df = (n1 + n2) -2

  20. We can calculate the value of ‘t’ for a given set of data and compare it to critical values that depend on the size of our sample and the level of confidence we need. Example two-tailed t-table. What happens to the value of P as the confidence in the results increases? What happens to the critical value as the confidence level increases? “critical values”

  21. We can calculate the value of ‘t’ for a given set of data and compare it to critical values that depend on the size of our sample and the level of confidence we need. Example two-tailed t-table. We usually use P<0.05 (95% confidence) in Biology, as our data can be highly variable “critical values”

  22. 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  23. t was calculated as 2.15 (this is done for you) t cv 2.15 If t < cv, accept H0(there is no significant difference) If t > cv, rejectH0(there is asignificant difference) 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  24. 0.05 t was calculated as 2.15 (this is done for you) t cv 2.15 If t < cv, accept H0(there is no significant difference) If t > cv, accept H0(there is asignificant difference) 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  25. 0.05 2.069 t was calculated as 2.15 (this is done for you) t cv 2.15 > 2.069 If t < cv, accept H0(there is no significant difference) If t > cv, reject H0(there is asignificant difference) 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  26. 0.05 2.069 t was calculated as 2.15 (this is done for you) t cv 2.15 > 2.069 If t < cv, accept H0(there is no significant difference) If t > cv, accept H0(there is asignificant difference) Conclusion: “There is a significant difference in the wing spans of the two populations of birds.” 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  27. 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  28. 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  29. 2.045 2.045 “There is no significant difference in the size of shells between north-side and south-side snail populations.” 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  30. 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  31. 2.086 2.086 “There is a significant difference in the resting heart rates between the two groups of swimmers.” 2-tailed t-table source: http://www.medcalc.org/manual/t-distribution.php

  32. Correlation does not imply causation, but it does waggle its eyebrows suggestively and gesture furtively while mouthing "look over there." Cartoon from: http://www.xkcd.com/552/

More Related