CORDIS - EU research results
Content archived on 2024-06-18

Hierarchical Motif Vectors for Protein Alignment and Functional Classification

Objective

This proposal introduces hierarchical motif vectors for the numerical analysis of sequence motifs and develops a novel framework for the alignment and functional classification of proteins. Hierarchical motif vectors will be computed from multi-scale decompositions of property sequences, which are obtained by converting an amino acid sequence into numeric sequences of various amino acid properties. Each vector captures the variation of amino acid properties in the vicinity of one amino acid in the sequence of a given protein. We will develop alignment algorithms that align amino acid sequences by matching their hierarchical motif vectors. We will also use unsupervised statistical learning algorithms to identify hierarchical motif vectors specific to functional protein groups, notably antigen-binding proteins, transcription factors, growth factors, and glycosylation proteins. We will then apply these methods to protein classification, using both the overlap scores from hierarchical motif vector-based sequence alignment and the presence and extent of hierarchical motif vectors specific to the protein group under consideration. We will validate all methods developed in this project against existing sequence alignment, motif detection, and protein classification algorithms from the literature.

The project's innovations include the following. First, hierarchical motif vectors characterize local physico-chemical variations along an amino acid sequence; because sequence motifs are thereby embedded in a vector space, they can be analyzed with general machine learning methods. Second, sequence alignment can be tuned to different amino acid properties at various scales, improving the potential of alignment-based protein similarity for functional classification. Third, group-specific hierarchical motif vectors will be identified as those that occur exclusively among the members of a protein group, increasing their likelihood of bearing functional specificity.
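The pipeline described above (property sequence, multi-scale decomposition, vector-based alignment) can be illustrated with a minimal Python sketch. The proposal does not specify the property scales, the decomposition, or the alignment scoring, so everything concrete here is an assumption made for illustration: the Kyte-Doolittle hydropathy index as the single property, centered moving averages at a few window sizes as the multi-scale decomposition, and a Needleman-Wunsch global alignment whose match score is a hypothetical `offset` minus the Euclidean distance between vectors. The function names (`motif_vectors`, `align_score`) and all parameter values are likewise hypothetical.

```python
import math

# Kyte-Doolittle hydropathy index, used here as one example property scale.
# The proposal mentions "various amino acid properties"; this choice is an assumption.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def property_sequence(seq, scale=KD):
    """Convert an amino acid sequence into a numeric property sequence."""
    return [scale[aa] for aa in seq.upper()]


def window_mean(values, center, half_width):
    """Mean of the values in a window around `center`, truncated at the ends."""
    lo = max(0, center - half_width)
    hi = min(len(values), center + half_width + 1)
    window = values[lo:hi]
    return sum(window) / len(window)


def motif_vectors(seq, half_widths=(1, 2, 4)):
    """One vector per residue: detail terms between successive scales,
    followed by the coarsest local average (an assumed decomposition)."""
    values = property_sequence(seq)
    vectors = []
    for i in range(len(values)):
        # Approximations from finest (the residue itself) to coarsest window.
        means = [values[i]] + [window_mean(values, i, h) for h in half_widths]
        # Detail at each scale = finer approximation minus coarser approximation.
        vec = [means[k] - means[k + 1] for k in range(len(half_widths))]
        vec.append(means[-1])
        vectors.append(vec)
    return vectors


def similarity(u, v, offset=2.0):
    """Assumed match score: identical vectors score `offset`, distant ones go negative."""
    return offset - math.dist(u, v)


def align_score(seq_a, seq_b, gap=-1.0):
    """Needleman-Wunsch global alignment score over motif vectors."""
    va, vb = motif_vectors(seq_a), motif_vectors(seq_b)
    n, m = len(va), len(vb)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + similarity(va[i - 1], vb[j - 1]),
                score[i - 1][j] + gap,   # gap in seq_b
                score[i][j - 1] + gap,   # gap in seq_a
            )
    return score[n][m]
```

For example, `motif_vectors("ACDEFGHIK")` returns nine four-component vectors, and `align_score` gives a sequence its highest score when it is aligned with itself. Because the match score depends on the neighborhood of each residue and not only on residue identity, a single substitution lowers the score at several nearby positions, which is one plausible reading of how alignment could be "tuned to different amino acid properties at various scales".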

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: https://op.europa.eu/en/web/eu-vocabularies/euroscivoc.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

FP7-PEOPLE-IRG-2008

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MC-IRG - International Re-integration Grants (IRG)

Coordinator

IZMIR INSTITUTE OF TECHNOLOGY
EU contribution
€ 75 000,00
Address
GULBAHCE URLA
35430 İzmir
Türkiye

Region
Ege İzmir İzmir
Activity type
Higher or Secondary Education Establishments
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data