The Development and Evaluation of New Methods for Editing and Imputation

Project Information

EUREDIT

Grant agreement ID: IST-1999-10226

Project website

Project closed

Start date 1 March 2000

End date 28 February 2003

Funded under

Programme for research, technological development and demonstration on a "User-friendly information society, 1998-2002"

Total cost

€ 3 627 844,00

EU contribution

€ 2 100 000,00

2 100 000,00

1 527 844,00

Coordinated by

OFFICE OF NATIONAL STATISTICS
United Kingdom

Objective

The project will evaluate and compare the current methods for editing and imputation to establish current best practice methods. In addition, new methods for editing and imputation based on neural networks, support vector machine, fuzzy logic methodology and robust statistical methods will be developed and compared with the current best practice methods. The evaluation of different methods will require, (a) the creation of common data sets with known type of errors to be used by all the methods, and (b) the establishment of sound statistical criteria for the objective evaluation of the methods. Based on our evaluation, recommendations for the use of different methods for editing and imputation for different kinds of data sets will be made. A CD ROM containing the algorithms for different selected methods will be produced and widely disseminated for use by the NSIs and other private and public sector organisations interested in editing and imputation.

Objectives:
1. To establish a standard collection of data sets for EUREDIT
2. To develop a methodological evaluation framework and develop evaluation criteria
3. To establish a baseline by evaluating currently used methods for data editing and imputation.
4. To develop and evaluate a selected range of new techniques for data editing and imputation.
5. To evaluate different methods for edit and imputation and establish best methods for different types of data.
6. To disseminate the best methods via a single package for wider dissemination, and in a conference proceedings.

Work description:
In order to evaluate the editing and imputation methods, a set of representative data sets arising in social sciences (household surveys, business surveys, censuses, panel data) with known types of errors will be produced. The criteria for the evaluation of the methods in terms of Fellegi-Holt (1976) principles and operational efficiency will be established and agreed among the participants. Based on a review of currently used methods, a selection will be evaluated. This will establish the current best practice methods and also provide benchmark for the later phase of the project. Alongside the investigation of traditional methods, new methods for editing and imputation based upon advanced statistical and information technology techniques will be developed. Specifically, methods based upon: outlier robust methods and non-parametric regression, MLP neural networks, Radial Basis Function (RBF) neural networks, Correlation Matrix Memory (CMM) neural networks, Self-Organising Map (SOM) neural networks and Support Vector Machines (SVM), will be developed. This will involve the establishment of appropriate methodology, development of algorithms and application of methods to the selected data sets. All the methods (new and old) will be comparatively evaluated. This will form the basis for detailed recommendations about the optimal choice of methods in a wide range of common situations. The "best methods" will be selected for wider dissemination. This will be achieved through the development of portable software for the selected (best) methods. The CD containing the software will be produced and made available to the organisations interested in editing and imputation.

Milestones:
1. Selection and compilation of datasets for evaluating methods
2. Determination objective quality criteria for evaluating methodsL%3. Development and testing of selected new methods for error localisation
4. Development and testing of selected new methods for imputation
5. Evaluation of all (new and old) editing and imputation methods
6. Integration of the individual edit and imputation methods into a single

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP5-IST - Programme for research, technological development and demonstration on a "User-friendly information society, 1998-2002"

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

1.1.2.-5.1.4 - CPA4: New indicators and statistical methods

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

CSC - Cost-sharing contracts

Coordinator

OFFICE OF NATIONAL STATISTICS

EU contribution

No data

Address

1, DRUMMOND GATE
SW1V 2QQ LONDON
United Kingdom

Total cost

No data

Participants (12)

CENTRAAL BUREAU VOOR DE STATISTIEK

Netherlands

EU contribution

No data

Address

PRINSES BEATRIXLAAN 428
2270 JM VOORBURG

Total cost

No data

CREDIT SUISSE FINANCIAL PLANNING SOLUTIONS GMBH

Germany

EU contribution

No data

Address

WILHELM-THEODOR-ROEMHELD-STRASSE 18
55130 MAINZ

Total cost

No data

ISTITUTO NAZIONALE DI STATISTICA

Italy

EU contribution

No data

Address

VIA CESARE BALBO 16
00184 ROMA

Total cost

No data

JYVAESKYLAEN YLIOPISTO

Finland

EU contribution

No data

Address

SEMINAARINKATU 15
40100 JYVASKYLA

Total cost

No data

QANTARIS GMBH

Germany

EU contribution

No data

Address

BAHNHOFSTRASSE 7
61476 KRONBERG

Total cost

No data

ROYAL HOLLOWAY AND BEDFORD NEW COLLEGE

United Kingdom

EU contribution

No data

Address

EGHAM HILL
TW20 0EX EGHAM, SURREY

Total cost

No data

STATISTICS DENMARK

Denmark

EU contribution

No data

Address

SEJROEGADE 11
2100 COPENHAGEN

Total cost

No data

STATISTICS FINLAND

Finland

EU contribution

No data

Address

TYOPAJAKATU 13
00022 HELSINKI

Total cost

No data

SWISS FEDERAL STATISTICAL OFFICE

Switzerland

EU contribution

No data

Address

ESPACE DE L'EUROPE 10
2010 NEUCHATEL

Total cost

No data

THE NUMERICAL ALGORITHMS GROUP LIMITED

United Kingdom

EU contribution

No data

Address

WILKINSON HOUSE, JORDAN HILL ROAD
OX2 8DR OXFORD

Total cost

No data

UNIVERSITY OF SOUTHAMPTON

United Kingdom

EU contribution

No data

Address

HIGHFIELD
SO17 1BJ SOUTHAMPTON

Total cost

No data

UNIVERSITY OF YORK

United Kingdom

EU contribution

No data

Address

HESLINGTON HALL
YO10 5DD YORK

Total cost

No data

Objective

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (12)

Share this page Share this page on social networks

Download Download the content of the page

The Development and Evaluation of New Methods for Editing and Imputation

Objective

Fields of science (EuroSciVoc) CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s) Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s) Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (12)

Share this page Share this page on social networks

Download Download the content of the page

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.