Frequent sequence mining on longitudinaldata : Segregation of Swedish employees

Detta är en Master-uppsats från Linköpings universitet/Statistik; Linköpings universitet/Tekniska fakulteten

Sammanfattning: This thesis is based on longitudinal data of the Swedish population provided byStatistics Sweden and is conducted on behalf of the Institute for Analytical Sociology.The focus is on investigating the effectiveness of a frequent sequence miningmethod called constrained Sequential PAttern Discovery using Equivalence classes(cSPADE). The method is applied to data on segregation within workplaces, specificallyreasons for Swedish employees moving to more segregated workplaces. Thethesis found that no unique pattern of age, gender, education, unemployment, income,workplace size or foreignness index explain why a Swedish employee movesto a more segregated workplace. Evaluating the algorithm, it was found that thenumber of observations need to be smaller or an alteration of the algorithm needsto be done to reduce the process time for this specific data set.

  HÄR KAN DU HÄMTA UPPSATSEN I FULLTEXT. (följ länken till nästa sida)