Data mining problem
Posted by:
J.J.
Date: September 13, 2014 12:30PM
Hello. I recently entered the world of corporate data mining, and I'm a bit stumped with a request from my boss. I'm pretty well versed on a variety of statistical methods, but for the life of me, I can't think of a solution. Here's my problem:
We have sales data for various products our client sells. We have the date that each product was purchased (entered as a separate row). My boss is asking me to do something with this sales data.
First thing I need to do as flatten the dataset so that each row is a unique purchaser. Then, I'm thinking each product should become a column, and every person that purchases the product would have the date the purchase was made in that respective column. Obviously, if a person did not make that purchase, then that column would be blank (or null).
Anyone have any thoughts on a procedure I can run? I've run simple binary logistic regressions and decision trees based only on product ownership (either owned or un-owned) to see what products predict ownership of other products. But my boss wants the "time" a product was purchased element to see if product ownership at time A predicts product ownership at time B. I can also append the data with age, gender, and some other very basic demographics (by going through a data provider service).
Any help anyone can provide would be greatly appreciated – what procedure can I do with this data (I have SPSS with several module add-ons). And if something isn't clear, I'd be happy to further explain myself. Thanks!
J.J.