Hi every one, I’ve applied an two samples independent t -test on my data, but I’m not sure if what I did is statiscally correct.
The data consists out of 40,000 customer transactions over a period of 7 months. It contains the variables:
- Product the customer bought (A or B)
- Date (Day and month)
- Unique customer ID
- Money spend
Now I wanted/need to know if there is a difference if product B is bought more in the weekend (Saturday – Sunday) compared to a weekday (Monday – Friday). What I did is as follows:
- Calculated the total number of date days (213 days)
- Caculated the total number of transactions per date and what percentage of those consisted out of product B (called this the chance on product B)
- Created a dummy variable (1 for weekend day, 0 for a normal weekday)
Now I applied a Welch T-test with as indepedent variable the dummy for weekend/weekday and depedent the chance on product B being bought.
Outcome was siginificant at any level of alpha, therefore means are different for weekend. (Weekday 30% chance on average of product B being bougt, in the weekend 36%). I’m hesitating because somehow the raw data consists only out of categorical variables and I turned one into a continuous.
submitted by Nevin Manimala Nevin Manimala /u/Gunnen-Haney