# I’m a med student and I really need some quick help with basic statistics for a research project

Hey guys first post here, I wasn’t sure if I should post here or in homework help. I’m working on a medical research project involving cost analysis for patients coming to our hospital and have been burdened with the stats (of which I know little about, but I have been doing a good bit of research). I really just want to verify that I did things correctly before presenting the project a small poster conference at my school. ​ I have roughly 300 patients in my project and each one has an associated cost. The project tries to find differences in the workups of each of the patients and tries to identify any factors that cause the patients to cost significantly to be different on average. ​ For example I would group the costs into 2 columns, one for a patients with a normal ekg and the other for patients with an abnormal ekg. So there would be 160 patients with a normal ekg and and 140 with an abnormal. ​ I used T.tests to see if there was any significant difference the excel formula was =T.Test(Array1,Array2,2,3) 2 for 2 tailed and 3 for unequal variances. It my understanding that this formula will put out the p value, and if my p value is less that .05 then it is considered a significant difference. The Array1 would be a column with 160 rows each being the cost of a patient with a normal ekg and Array2 would be a column with 140 rows each being the cost of a patient with an abnormal ekg. ​ I used Chi square tests as well to confirm the data was not generated randomly, since abnormal ekg is categorical meaning its either a yes or no. But I’m not sure I need to include in my presentation the chi square results? Unfortunately one of my variables I found to have a significant difference in average cost from my T tests did not have a significant Chi square. ​ I am also really interested in the variance of each dataset, like how the variance in price of a patient with an abnormal ekg differs from a patient with a normal ekg. Because it seems that if certain variables are true then they can have use effects on the variance of the cost. For example if the patient has diabetes then the variability(variance) of their total cost is 2x that of a patient without diabetes. I have researched that determining the statistical power from variance can be very difficult, can I just compare two variances within my datasets like the variability increases 2 or 3 times depending on these factors? Also does variance essentially = variability for my uses here? ​ Can anyone please help validate that it looks like I am approaching this correctly? I’m really kind of flying blind here and the statistics department can’t help me since I’m just a student. ​ Disclaimer: most of the variables and amount of patients are made up since I don’t know how much of the project I can share and want to err on the side of caution. submitted by Nevin Manimala /u/thisispluto2 [link] [comments]