FTC algorithm

iralala · September 14th, 2008, 11:42 PM

Hi..does anyone know about FTC algorithm (Frequent Term-Based Text Clustering) ?
i have a difficulty to cluster the terms..
thanks so much

this is the algorithm :

FTC(database D, float minsup)
SelectedTermSets:= {};
n:= |D|;

RemainingTermSets:= DetermineFrequentTermsets(D, minsup);

while |cov(SelectedTermSets)| Ââ n do
  for each set in RemainingTermSets do
    Calculate overlap for set;

  BestCandidate:=element of RemainingTermSets with minimum overlap;
  SelectedTermSets:= SelectedTermSets ÂÂ¾ {BestCandidate};
  RemainingTermSets:= RemainingTermSets -{BestCandidate};
  Remove all documents in cov(BestCandidate) from D and from the coverage of all of the
  RemainingTermSets;

return SelectedTermSets and the cover of the elements of SelectedTermSets;

furlong · February 3rd, 2009, 07:57 AM

Well... I dunno. I'm amateur in Java programming