Protection des Items dans l’Algorithme de Cluster K-means

Fatima MESKINE

Accueil de DSpace
→
Mémoires de Magister
→
Mathématiques et Informatique - رياضيات والاعلام الآلي
→
Voir le document

dc.contributor.author	Fatima MESKINE
dc.date.accessioned	2018-11-13T17:36:58Z
dc.date.available	2018-11-13T17:36:58Z
dc.date.issued	2010-04-25
dc.identifier.uri	http://e-biblio.univ-mosta.dz/handle/123456789/1534
dc.description.abstract	The interest of collecting and exploring large amounts of data to extract valuable knowledge has became critical to commercial companies and governmental organizations, which is the motivation of data mining. The current tendency is that two or several organizations share their datasets and give them as inputs to the process of data mining in order to have more effective results. This raised a real problem of privacy that most of these data relate to individuals and their personal information. The very active research area of privacy preserving data mining aims to extract useful information from data coming from multiple sources, while preserving these data against disclosure or loss. Clustering is also a more exploratory data mining task which the aim is to classify items described by features into groups, according to some similarities in a given context of application and poses the same problem of privacy when data come from different sources. K-means is one of the algorithms of clustering and the most widely used. Most of works in privacy preserving clustering are developed on the k-means algorithm by applying the model of secure multi- party computation. The ways in whish data are shared or distributed on these parties may be different. The first solution of preserving privacy in k-means algorithm was proposed by Jaideep Vaidya and Chris Clifton in 2003 on vertically partitioned dataset. Thus, approaches allowing solving the problem on a vertical, horizontal and even arbitrary partitioned dataset were proposed, but the preservation of privacy is still not complete. The major problem is to reveal the minimum of information during the execution of the algorithm, especially in kmeans iterations, which poses a real challenge for secure multi party computation. This paper consists in drawing up a panorama of all works of preserving privacy in k-means clustering algorithm, classifies the various approaches according to the used distribution dataset, while presenting the weaknesses and strengths of each solution. The permanent growth of the data and the need emerging to explore them requires a real thinking to effectively protect them, especially that these data become increasingly individualized.	en_US
dc.language.iso	fr	en_US
dc.subject	privacy preservation, k-means clustering algorithm, secure multi-party computation, data distribution.	en_US
dc.title	Protection des Items dans l’Algorithme de Cluster K-means	en_US
dc.type	Thesis	en_US

Fichier(s) constituant ce document

Nom: CD5.rar

Taille: 656.7Ko

Format: Inconnu

Voir/Ouvrir

Ce document figure dans la(les) collection(s) suivante(s)

Mathématiques et Informatique - رياضيات والاعلام الآلي [27]

Protection des Items dans l’Algorithme de Cluster K-means

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)

Chercher dans le dépôt

Parcourir

Tout DSpace

Cette collection

Mon compte