Difference between revisions of "Flaws in Richard Lenski Study"

Revision as of 19:35, December 16, 2008

Richard Lenski turned down an offer to ship frozen specimens of bacteria to an unspecified facility so that his bacteria mutation data could be made more readily available to the public,^[1] but the following serious flaws are emerging about his work^[2] even without a full disclosure of the data. Note that the peer review on Lenski's paper took somewhere between zero (non-existent) and fourteen days (including administrative time), and Lenski himself does not have any obvious expertise in statistics. In fact, Richard Lenski admits in his paper that he based his statistical conclusions on the use of Monte Carlo resampling techniques, using giftware from the website called "statistics101.net".

Lenski's "historical contingency" hypothesis, as specifically depicted in Figure 3, is apparently contradicted by the data presented in the Third Experiment in Table 1 of his paper. The historical contingency hypothesis in Figure 3 suggests that the rate at which bacteria mutated to Cit⁺ increased at some point due to another mutation, the third (and largest) experiment in Table 1 shows Cit⁺ arising only in generations revived from the 20,000th generation onwards, suggesting this other mutation took place in the colony around that time.
Lenski's rare mutation hypothesis suggests a fixed mutation rate, but the failure of the mutations in his experiments to increase based on scale (number of samples) tends to disprove this hypothesis and hence back up the historical contingency hypothesis.
Richard Lenski included generations of the E. coli already known to contain Cit⁺ variants in his experiments.^[3] Once these generations are removed from the analysis, the data from the first (smallest) and third (largest) replay experiments still support Lenski's conclusions, whilst the second replay experiment fails to provide any evidence to that end.
The paper applies a Monte Carlo resampling test to test the data against the null hypothesis - that is the rare mutation hypothesis.
Lenski's third experiment lent the least support to his hypothesis with statistical significance.
Lenski claims that if the Third Experiment is erroneously combined with the other two experiments based on outcome rather than sample size, then a claim of overall statistical significance is achieved. He also points out that if the experiments are combined with respect to their relative sample sizes then this overall statistical significance is retained.
Lenski's paper points out that the results of his largest experiment (Third Experiment) have the highest Monte Carlo P value, and hence the least statistical significance. This is owing to the development of Cit⁺ variants in the 20,000th generation in this experiment. If the prior mutation necessary for the historical contingency occurred around the 20,000th generation then this result is the least significant in explaining it. All works published in PNAS are clear in defining statistical significance in the traditional way.^[4]
Lenski's paper claims that "During [30,000 generations], each population experienced billions of mutations,^[5] far more than the number of possible point mutations in the [approximately] 4.6-million-bp genome. This ratio implies, to a first approximation, that each population tried every typical one-step mutation many times." Lenski concludes that it the mutation required to evolve the Cit⁺phenotype must be 'difficult' in some sense, the sense being made clear by the historical contingency hypothesis.

References

↑ See Conservapedia:Lenski dialog.
↑ Blount et al., "Historical contingency and the evolution of a key innovation in an experimental population of Escherichia coli, 105 PNAS 7899-7906 (June 10, 2008).
↑ Richard Lenski included generations 31,500, 32,000 and 32,500.
↑ See, e.g., Cholera toxin induces malignant glioma cell differentiation
↑ Lenski cites one of his own prior articles for this.

@@ Line 1: / Line 1: @@
-[[Richard Lenski]] rejected a request to release his bacteria mutation data to the public,<ref>See [[Conservapedia:Lenski dialog]].</ref> but the following serious flaws are emerging about his work<ref>Blount et al., "Historical contingency and the [[evolution]] of a key innovation in an experimental population of ''Escherichia coli'', 105 PNAS 7899-7906 (June 10, 2008).</ref> even without a full disclosure of the data.  Note that the peer review on Lenski's paper took somewhere between 0 (non-existent) and at most 14 days (including administrative time), and Lenski himself does not have any obvious expertise in statistics.  In fact, Richard Lenski admits in his paper that he based his statistical conclusions on use of a website called "statistics101".
+[[Richard Lenski]] turned down an offer to ship frozen specimens of bacteria to an unspecified facility so that his bacteria mutation data could be made more readily available to the public,<ref>See [[Conservapedia:Lenski dialog]].</ref> but the following serious flaws are emerging about his work<ref>Blount et al., "Historical contingency and the [[evolution]] of a key innovation in an experimental population of ''Escherichia coli'', 105 PNAS 7899-7906 (June 10, 2008).</ref> even without a full disclosure of the data.  Note that the peer review on Lenski's paper took somewhere between zero (non-existent) and fourteen days (including administrative time), and Lenski himself does not have any obvious expertise in statistics.  In fact, Richard Lenski admits in his paper that he based his statistical conclusions on the use of Monte Carlo resampling techniques, using giftware from the website called "statistics101.net".
-.  Lenski's "historical contingency" hypothesis, as specifically depicted in Figure 3, is contradicted by the data presented in the Third Experiment in Table 1 of his paper.  Figure 3 proposes a step-up in mutation rate to Cit<sup>+</sup> due to a historical contingency (potentiating mutation) occurring at about the 31,000th generation, yet the Third (and largest) Experiment in Table 1 shows Cit<sup>+</sup> arising just as often before the 31,000th generation as after.  The abstract, in further contradiction with Figure 3, suggests that the historical contingency (potentiating mutation) occurred prior to the 20,000th generation.
+# Lenski's "historical contingency" hypothesis, as specifically depicted in Figure 3, is apparently contradicted by the data presented in the Third Experiment in Table 1 of his paper.  The historical contingency hypothesis in Figure 3 suggests that the rate at which bacteria mutated to Cit<sup>+</sup> increased at some point due to another mutation, the third (and largest) experiment in Table 1 shows Cit<sup>+</sup> arising only in generations revived from the 20,000th generation onwards, suggesting this other mutation took place in the colony around that time.
+# Lenski's rare mutation hypothesis suggests a fixed mutation rate, but the failure of the mutations in his experiments to increase based on scale (number of samples) tends to disprove this hypothesis and hence back up the historical contingency hypothesis.
-.  Lenski's two alternative hypotheses suggest a fixed mutation rate, but the failure of the mutations in his experiments to increase based on scale (number of samples) tends to disprove both of Lenski's alternative hypotheses.  Yet Lenski's paper fails to address adequately this obvious flaw in the paper.
+# Richard Lenski included generations of the ''E. coli'' already known to contain Cit<sup>+</sup> variants in his experiments.<ref>Richard Lenski included generations 31,500, 32,000 and 32,500.</ref>  Once these generations are removed from the analysis, the data from the first (smallest) and third (largest) replay experiments still support Lenski's conclusions, whilst the second replay experiment fails to provide any evidence to that end.
+# The paper applies a Monte Carlo resampling test to test the data against the null hypothesis - that is the rare mutation hypothesis.
-.  Richard Lenski incorrectly included generations of the ''E. coli'' already known to contain Cit<sup>+</sup> variants in his experiments.<ref>Richard Lenski incorrectly included generations 31,500, 32,000 and 32,500.</ref>  Once these generations are removed from the analysis, the data disprove Lenski's hypothesis.
+# Lenski's third experiment lent the least support to his hypothesis with statistical significance.
+# Lenski claims that if the Third Experiment is erroneously combined with the other two experiments based on outcome rather than sample size, then a claim of overall statistical significance is achieved. He also points out that if the experiments are combined with respect to their relative sample sizes then this overall statistical significance is retained.
-. The paper incorrectly applied a Monte Carlo resampling test to exclude the null hypothesis for rarely occurring events. The Third Experiment results are consistent with the null hypothesis, contrary to the paper's claim.
+# Lenski's paper points out that the results of his largest experiment (Third Experiment) have the highest Monte Carlo ''P'' value, and hence the least statistical significance.  This is owing to the development of Cit<sup>+</sup> variants in the 20,000th generation in this experiment.  If the prior mutation necessary for the historical contingency occurred around the 20,000th generation then this result is the least significant in explaining it.  All works published in PNAS are clear in defining statistical significance in the traditional way.<ref>See, e.g., [http://www.pnas.org/cgi/content/full/0701990104 Cholera toxin induces malignant glioma cell differentiation]</ref>
+# Lenski's paper claims that "During [30,000 generations], each population experienced billions of mutations,<ref>Lenski cites one of his own prior articles for this.</ref> far more than the number of possible point mutations in the [approximately] 4.6-million-bp genome.  This ratio implies, to a first approximation, that each population tried every typical one-step mutation many times."  Lenski concludes that it the mutation required to evolve the Cit<sup>+</sup>phenotype must be 'difficult' in some sense, the sense being made clear by the historical contingency hypothesis.
-.  Lenski's largest experiment (Third Experiment) failed to support his hypothesis with statistical significance.  Even though this largest experiment was nearly ten times the size of his other experiments, Richard Lenski did not weight this largest experiment correctly in combining his results.
-. It was error to include generations of the E. coli already known to contain trace Cit+ variants. The highly improbable occurrence of four Cit+ variants from the 32,000th generation in the Second Experiment suggests an origin from undetected, pre-existing Cit+ variants.
-. The Third Experiment was erroneously combined with the other two experiments based on outcome rather than sample size, thereby yielding a false claim of overall statistical significance.
-.  Lenski's paper is not clear in explaining how the results of his largest experiment (Third Experiment) failed to confirm his hypothesis with statistical significance, even with the incorrect inclusion of the Cit<sup>+</sup> variant generations.  Instead, his paper refers to his largest experiment as "marginally ... significant," which serves to obscure its statistical insignificance.  Other works published in PNAS are clear in defining statistical significance in the traditional way, which Lenski's Third Experiment (even with incorrect inclusion of the above-referenced generations) failed to satisfy.<ref>See, e.g., [http://www.pnas.org/cgi/content/full/0701990104 Cholera toxin induces malignant glioma cell differentiation]</ref>
-.  The long lag time (over 12,000 generations) between the historical contingency (potentiating mutation) in the largest experiment disproves Lenski's implicit assumption that the potentiating mutation likely occurred in proximity with the occurrence of the Cit<sup>+</sup> variant, and that the first occurrence of the Cit<sup>+</sup> variant in the Third Experiment at the 20,000th generation somehow implies that a potentiating mutation occurred in its proximity.
-.  Lenski's paper claims that "During [30,000 generations], each population experienced billions of mutations,<ref>Lenski cites one of his own prior articles for this.</ref> far more than the number of possible point mutations in the [approximately] 4.6-million-bp genome.  This ratio implies, to a first approximation, that each population tried every typical one-step mutation many times."  Lenski's conclusion is nonsensical because it assumes that the mutations are completely random '''and''' that each mutation has a roughly equal probability.
 == References ==

Difference between revisions of "Flaws in Richard Lenski Study"

Revision as of 19:35, December 16, 2008

References

See also

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Popular Links

donate

Edit Console