Objective
The Web in its current form presents the following challenges:
- Web data collection, storage and management are carried out mainly by a few large-scale commercial organizations, leading to an oligopoly, if not a monopoly, in web data management and searching.
- Web search is predominantly based on keyword search, which reduces this important procedure to a simple string-matching problem and disregards broader similarity aspects such as semantics, link structures, etc.
- Web data search and organization are currently dominated by authority ranking [2], disregarding other dimensions that could validly affect a page's ranking, such as users' bookmarks, click streams, personalized ontologies, etc.
The main objective of the proposed project is to produce design guidelines and develop prototypes for next-generation web mining and searching techniques. In this context the project will contribute advances in the following areas:
- collection of web data (crawling): study and adoption of a completely distributed and decentralized Peer-to-Peer (P2P) model for crawling.
- web data characterization and semantics extraction. The characterization of web pages will take into account, in addition to the authority rank, other dimensions such as users' bookmarks, click streams (web logs), semantic similarity based on ontologies in conjunction with link structures, etc.
- organization and searching of such collections. Here the objective will be the computationally and semantically efficient organization of web documents into semantically coherent clusters.
In order to achieve this, the following constituent objectives have to be met:
- the design of similarity measures that enable taking into account aggregate similarity among sets
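As an illustration only, a similarity measure that aggregates several of the dimensions named above (text content, link structure, user bookmarks) could be sketched as follows. This is a minimal sketch and not the project's method: the Page record, its field names, the choice of cosine and Jaccard measures, and the weights are all assumptions introduced here for the example.

```python
from dataclasses import dataclass, field
from math import sqrt


@dataclass
class Page:
    # Hypothetical page profile; every field name is an illustrative assumption.
    terms: dict = field(default_factory=dict)         # term -> weight
    out_links: set = field(default_factory=set)       # URLs the page links to
    bookmarked_by: set = field(default_factory=set)   # ids of users who bookmarked it


def jaccard(a, b):
    """Set overlap in [0, 1]; two empty sets are treated as dissimilar (0.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity of two sparse term-weight vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = sqrt(sum(w * w for w in u.values())) * sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0


def aggregate_similarity(p, q, weights=(0.5, 0.25, 0.25)):
    """Weighted sum of text, link and bookmark similarity (weights are assumed)."""
    w_text, w_link, w_book = weights
    return (w_text * cosine(p.terms, q.terms)
            + w_link * jaccard(p.out_links, q.out_links)
            + w_book * jaccard(p.bookmarked_by, q.bookmarked_by))


# Example with made-up data: two pages sharing one term, one link, one bookmarker.
a = Page({"p2p": 1.0, "crawler": 1.0}, {"x", "y"}, {"u1", "u2"})
b = Page({"p2p": 1.0, "search": 1.0}, {"y", "z"}, {"u2"})
print(round(aggregate_similarity(a, b), 3))
```

A pairwise score of this kind could then feed a clustering step; how the dimensions should actually be weighted or combined is left open by the text above.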
Fields of science (EuroSciVoc)
CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.
Keywords
Project’s keywords as indicated by the project coordinator. Not to be confused with the EuroSciVoc taxonomy (Fields of science)
Programme(s)
Multi-annual funding programmes that define the EU’s priorities for research and innovation.
Topic(s)
Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.
Call for proposal
Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.
FP6-2002-MOBILITY-5
Funding Scheme
Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.
Coordinator
ORSAY
France
The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.