Objective

The Web in its current form presents the following challenges:
- Web data collection, storage and management are carried out mainly by a few large-scale commercial organizations, leading to an oligopoly, if not a monopoly, in web data management and searching.
- Web search is predominantly based on keyword search, which reduces this important procedure to a simple string-matching problem and disregards broader aspects of similarity such as semantics, link structures, etc.
- Web data search and organization are currently dominated by authority ranking [2], disregarding other dimensions that could validly affect a page's ranking, such as users' bookmarks, click streams, personalized ontologies, etc.

The main objective of the proposed project is to produce design guidelines and develop prototypes for next-generation web mining and searching techniques. In this context the project will contribute advances in the following areas:
- Collection of web data (crawling): study and adoption of a completely distributed and decentralized Peer-to-Peer (P2P) model for crawling.
- Web data characterization and semantics extraction: the characterization of web pages will take into account, in addition to the authority rank, other dimensions such as users' bookmarks, click streams (web logs), semantic similarity based on ontologies in conjunction with link structures, etc.
- Organization and searching of such collections.
  Here the objective will be the computationally and semantically efficient organization of web documents into semantically coherent clusters.

In order to achieve this, the following constituent objectives have to be met:
- the design of similarity measures that take into account aggregate similarity among sets

Field of science
natural sciences > computer and information sciences > knowledge engineering > ontology

Keywords
clustering distributed knowledge management peer 2 peer semantics similarity measures

Programme(s)
FP6-MOBILITY - Human resources and Mobility in the specific programme for research, technological development and demonstration "Structuring the European Research Area" under the Sixth Framework Programme 2002-2006

Topic(s)
MOBILITY-2.1 - Marie Curie Intra-European Fellowships (EIF)

Call for proposal
FP6-2002-MOBILITY-5

Funding scheme
EIF - Marie Curie actions-Intra-European Fellowships

Coordinator
INRIA - THE FRENCH NATIONAL INSTITUTE FOR RESEARCH IN COMPUTER SCIENCE AND CONTROL

EU contribution
No data

Address
Parc Club Orsay Universite ZAC des vignes 4, rue Jacques Monod - Bât G
ORSAY
France

Total cost
No data