Hi!
I have this R output from a Welch t-test (to compare means of two samples)
data: sampleA and sampleB
t = 11.8184, df = 49705.89, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
3.162714 4.420318
sample estimates:
mean of x mean of y
6.128647 2.337131
Looking at the output, can someone tell me if the means of my samples are significantly different? I'm confused with the "alternative hypothesis" line...
Ok, in consequence, sometimes it's good to battle it out. Closing a question as of off-topic is always very subjective, and please don't take it personally. I am not always sure if it is correct, and I'd prefer if closing a question would be a voting process as well. Therefore, it is good to have strong advocacy from the community as feedback.
sorry, off topic, this is very basic statistics not bioinformatics
I think "off topic" should mean topics that cannot possibly have relevance to the field. While it is true that strictly speaking this is a question about statistics and a the way R displays a quantity we should allow more leeway in applying the rules - especially if the question garners a number of good answers - if anything that is proof that the question is of interest to many here.
Sure, I wouldn't want to be overly strict either, and one or more borderline questions don't hurt us, BUT: There are places where the op might get a more correct answer. In fact none of the answers touched upon the reason for the confusion about the alternative hypotheses yet. I believe that a question that can be formulated as a pure statistics question (even when it has -possibly- a biological background behind it) is better asked on a statistics site: e.g. http://stats.stackexchange.com/ , we have seen the same with pure programming questions and they have been pretty much handled that way.
I think that if the first or second answer to this were what you state above then I think it would be fine to also close the question and move on. But once there are a good number of answers and comments I think we should let the question take its course and allow people to edit/change their answers and overall sort it out. Closing the question at that point freezes everything in place and does not allow any kind of improvements to take place. In the end I think there are lot of interesting discussions below. To be honest with you I think p-values are a very subtle concept and perhaps statisticians are not always the right people to explain it.
I have to disagree with you on the last point on p-values. Most biologists, if not all, tend to allow for instincts in deciding which
candidate gene
is better than the other according to their p-value, especially in multiple testing cases (GWAS, candidate gene approaches etc..). At the end of the day, its a number. It doesn't have a biological meaning. But people tend to give it too much importance, 10^-5 or 10^-25? 10^-25 is a "strong" candidate. Numerically, yes. Biologically, both are. And I think understanding this point and accepting that intuition can lead to a conclusion otherwise rarely happens if you are NOT a statistician.What I meant is that the proper statistical terminology is not easy to understand. Biologists misusing p-value is also a major pet-peeve of mine and don't really have a disagreement there. I like to make the joke that the chance that all authors on a paper being habitual liars is bigger than E^-25 therefore no matter what context you see that reported is beyond the expected accuracy anyhow ;-). The problem is further compounded by not accounting for multiple comparisons when trying to come up with something "interesting".
Thanks for the link for the stats site Michael. Didn't even know it existed!
Since I'm working in bioinformatics, biostars.org was the only place it came to mind for asking such a question.
I'll have in mind the other site, however, for my next question on stats!
Not a rebuttal, but I wonder how one would construe a question on "error using samtools", for example. It is, strictly speaking, a software bug or misuse, even though it is a "bioinformatics" software. I am just saying its very hard to draw the line. I agree with Istvan's definition of "off-topic". Maybe this must be cited somewhere in the website where people can take easy notice?
Yes I never use basic statistics in any of my bioinformatics work, no one I've met has ever been confused by it, and I agree that people doing bioinformatics shouldn't discuss such topics. ;)
You can spare your irony. But in fact I want to give an example, you and most people in bioinformatics use computers, though it's most likely not the right place to discuss the choice of your computer (or telephone or car or whatever) brand here, unless you can make a connection to bioinformatics very clear. IMO it is the responsibility of the poster to prove the connection to bioinformatics if it is not obvious initially.
Also, questions that ask only for explanations or definitions of basic concepts and terms that could be easily read up elsewhere or textbook, googled should be discouraged (imho), especially when they belong to a different discipline.
When questions like this get closed, I fear sparing the irony would jeopardize my sanity. I tend to agree with Istvan that question closing is for varieties that "...cannot possibly have relevance to the field." Historically, off-topic is touchy because it is SO subjective. In the context of Istvan's comment, I simply prefer a hierarchy as follows. I agree with you that it is the op's job to connect their question as clearly as possible to bioinformatics. Failing that, people who answer can choose to provide some connection in their answers. Most of the controversial closed questions have a connection to bioinformatics that simply hasn't been explicitly stated, yet for many of us is as plain as day. Faced with a choice to close a question or provide a connection, I wish those who assumed moderation power would take the benevolent shepherd approach and choose to reconstruct the limb rather than amputate and cauterize the wound, as it were. As for asking only in the best place? Agreement on "best" will be tough.