Re: Apriori algorithm in C# or Java
Posted by:
Pooja jardosh
Date: December 16, 2014 02:33AM
I have 7 XML documents, T1, T2, ..., T7.
Each XML document has keywords; some keywords are shared between documents and others are different.
XML DOCUMENT Keywords
---------------------------------------------------------
Doc1 Student,Teach,School
Doc2 Student,School
Doc3 Teach,School,City,Game
Doc4 Baseball,Basketball
Doc5 Basketball,Player,Spectator
Doc6 Baseball,Coach,Game,Team
Doc7 Basketball,Team,City,Game
The attributes will be student(1), teach(2), school(3), city(4), game(5), baseball(6), basketball(7), player(8), spectator(9), coach(10), team(11).
So the boolean matrix will be 7*11 (7 documents as rows, 11 attributes as columns):
       1  2  3  4  5  6  7  8  9 10 11
--------------------------------------
doc1   1  1  1  0  0  0  0  0  0  0  0
doc2   1  0  1  0  0  0  0  0  0  0  0
doc3   0  1  1  1  1  0  0  0  0  0  0
doc4   0  0  0  0  0  1  1  0  0  0  0
doc5   0  0  0  0  0  0  1  1  1  0  0
doc6   0  0  0  0  1  1  0  0  0  1  1
doc7   0  0  0  1  1  0  1  0  0  0  1
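To make the matrix construction concrete, here is a minimal Java sketch of how it could be built once the keywords of each document are available as lists. The class name KeywordMatrix and the methods sampleDocs, indexKeywords and buildMatrix are hypothetical names chosen for illustration. It assumes keywords are compared case-insensitively and numbered in order of first appearance, which happens to give the same numbering as above (0-based in the code).

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class KeywordMatrix {

    // The seven keyword lists from the table above (hard-coded for the example).
    static List<List<String>> sampleDocs() {
        return Arrays.asList(
            Arrays.asList("Student", "Teach", "School"),
            Arrays.asList("Student", "School"),
            Arrays.asList("Teach", "School", "City", "Game"),
            Arrays.asList("Baseball", "Basketball"),
            Arrays.asList("Basketball", "Player", "Spectator"),
            Arrays.asList("Baseball", "Coach", "Game", "Team"),
            Arrays.asList("Basketball", "Team", "City", "Game"));
    }

    // Assumption: keywords are matched case-insensitively after trimming.
    static String normalize(String keyword) {
        return keyword.trim().toLowerCase(Locale.ROOT);
    }

    // Gives each distinct keyword a 0-based column index in order of first appearance.
    static Map<String, Integer> indexKeywords(List<List<String>> docs) {
        Map<String, Integer> index = new LinkedHashMap<>();
        for (List<String> doc : docs) {
            for (String keyword : doc) {
                index.putIfAbsent(normalize(keyword), index.size());
            }
        }
        return index;
    }

    // One row per document, one column per attribute; true means the keyword occurs.
    static boolean[][] buildMatrix(List<List<String>> docs, Map<String, Integer> index) {
        boolean[][] matrix = new boolean[docs.size()][index.size()];
        for (int row = 0; row < docs.size(); row++) {
            for (String keyword : docs.get(row)) {
                matrix[row][index.get(normalize(keyword))] = true;
            }
        }
        return matrix;
    }

    public static void main(String[] args) {
        List<List<String>> docs = sampleDocs();
        Map<String, Integer> index = indexKeywords(docs);
        boolean[][] matrix = buildMatrix(docs, index);

        System.out.println("Attributes: " + index.keySet());
        for (int row = 0; row < matrix.length; row++) {
            StringBuilder line = new StringBuilder("doc" + (row + 1));
            for (boolean cell : matrix[row]) {
                line.append(' ').append(cell ? 1 : 0);
            }
            System.out.println(line);
        }
    }
}
```

Each row of the resulting matrix is one transaction, so it can be handed to whatever frequent-itemset step comes next.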
I want to extract this data from the XML documents so that I can proceed with the mining.
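One possible way to do the extraction in Java is the standard DOM parser (javax.xml.parsers). The sketch below is only an illustration: the post does not show the XML layout, so it assumes each keyword sits in its own <keyword> element, and the class name XmlKeywordExtractor, its methods and the sample XML string are all hypothetical. The tag name is passed as a parameter so it can be changed to match the real files.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

class XmlKeywordExtractor {

    // Collects the trimmed text of every element named tagName in the XML stream.
    // Assumption: one keyword per element, e.g. <keyword>Student</keyword>.
    static List<String> extractKeywords(InputStream xml, String tagName) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(xml);
            NodeList nodes = doc.getElementsByTagName(tagName);
            List<String> keywords = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                String text = nodes.item(i).getTextContent().trim();
                if (!text.isEmpty()) {
                    keywords.add(text);
                }
            }
            return keywords;
        } catch (Exception e) {
            // Parsing errors are rethrown unchecked to keep the example short.
            throw new IllegalStateException("Could not parse XML", e);
        }
    }

    // Convenience wrapper for XML held in a string (used by the demo below).
    static List<String> extractKeywordsFromString(String xml, String tagName) {
        return extractKeywords(
            new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), tagName);
    }

    public static void main(String[] args) {
        // Hypothetical layout standing in for Doc1; the real files may differ.
        String doc1 = "<document>"
            + "<keyword>Student</keyword>"
            + "<keyword>Teach</keyword>"
            + "<keyword>School</keyword>"
            + "</document>";
        System.out.println(extractKeywordsFromString(doc1, "keyword"));
    }
}
```

For real files, a java.io.FileInputStream for each of the 7 documents could be passed to extractKeywords, and the resulting lists used as the rows of the boolean matrix.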