Skip to main content
Go to the home page of the European Commission (opens in new window)
English en
CORDIS - EU research results
CORDIS
Content archived on 2024-04-30

The creation of an integrated resource of protein domains and functional sites and its application to accelerate protein functional analysis

Objective



An avalanche of data has been gathered by the various genome mapping and sequencing projects. This data combined with the biological information in databases has an inestimable potential value. The realization of that potential is heavily dependent on investments in the development of tools to make use of the underlying data in databases and to make this knowledge available to the scientific community.
Protein sequence databases form a key computational resource for modern molecular biology. Searches against such a database frequently allow transfer of important functional information to a new sequence. However, transferring information from pairwise hits to other sequences is prone to error, particularly when this must be done on a large scale, for instance when annotating predicted genes from genome projects at the genome centres or when annotating sequences for inclusion in databases like SWISS-PROT + TREMBL. In recognition of these limitations, it is now essential to extend search strategies to include a range of "secondary" databases when analysing novel protein sequences. Databases on functional sites and domains like PROSITE, PRINTS, PFAM, PRODOM, BLOCKS, etc., have become vital resources for identifying distant relationships in novel sequences, and hence for predicting protein function and structure.
Unfortunately, these secondary databases do not share the same formats and nomenclature, which makes the use of all of them in an automated way difficult. In response to this the partners in this proposal (EBI, University of Geneva, ISREC, UCL, Sanger Centre, INRIA, CNRS/INRA, Compugen, LION, Pfizer and University of Bergen) want to address these problems by establishing an Integrated resource of Protein domains and functional sites (InterPro), and to apply this resource to accelerate protein functional analysis. This will create the pre-eminent world-wide secondary database for protein annotation.
The main objectives and goals of this proposal are:
- Creation of InterPro, allowing users to access a wider, complementary range of site and domain recognition methods in a single package.
- Establishment of WWW servers which allow the scientific community to useInterPro in protein sequence analysis.
- Incorporation of InterPro in the computer-generation of annotation in TREMBL,the protein database supplementing SWISS-PROT.
- Use of InterPro in the annotation of predicted genes from genome projects.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

You need to log in or register to use this function

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

CSC - Cost-sharing contracts

Coordinator

EUROPEAN MOLECULAR BIOLOGY LABORATORY
EU contribution
No data
Address
Wellcome Trust Genome Campus, Hinxton Hall
CB10 1SD SAFFRON WALDEN
United Kingdom

See on map

Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data

Participants (7)

My booklet 0 0