Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS
Content archived on 2024-06-25

Processing Large XML Data Sets: Algorithms and Limitations

Objective

During the past few years, XML has become the dominant format for storing and exchanging information on the Internet. XML is often used to represent large text data sets, such as scientific corpora, repositories of Web pages, or streams of stock quotes. Processing large XML data sets efficiently has thus become one of the major challenges that researchers at the database, information retrieval, and WWW communities face today.

This proposal focuses on three issues at the forefront of the XML research at the database community:
1) Evaluation of queries over XML streams.
2) Evaluation of queries over indexed XML data sets.
3) Fast approximate evaluation of queries over XML data sets.

The goals of the proposed project are three fold:
1) Develop the first theoretical and systematic framework of lower bounds on the amount of resources needed to accomplish the above tasks.
2) Exploit insights gained from the theoretical study to design more efficient and comprehensive algorithms that solve the above problems.
3) Build an experimental system to test the proposed algorithms on real and artificial data.

During the course of working on the project, I plan to continue existing collaborations in the area with researchers from the IBM Research Centre in California as well as to bring along new collaborators from the Technion whose areas of interest overlap the subject of the project. I plan to leverage on the expertise of my colleagues at the Technion in the areas of communication complexity, database, and information theory in order to obtain high quality results in this project.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

You need to log in or register to use this function

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

FP6-2002-MOBILITY-12
See other projects for this call

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

IRG - Marie Curie actions-International re-integration grants

Coordinator

TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY
EU contribution
No data
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data
My booklet 0 0