CORDIS - EU research results
Content archived on 2024-04-15

ARTIFICIAL INTELLIGENCE APPROACH TO PROTEIN STRUCTURE PREDICTION BY DEVELOPMENT OF A KNOWLEDGE BASE

Objective

AVAILABILITY OF A TOOL FOR THE SOLUTION OF PRACTICAL PROTEIN DESIGN PROBLEMS IN MOLECULAR BIOLOGY, MOLECULAR MEDICINE AND BIOTECHNOLOGY, WITH LONG TERM ECONOMIC BENEFITS IN THESE FIELDS.
Data on known protein structures and amino acid sequences have proven very useful for deriving empirical rules for protein folding and design. With the growing volume of these data, more sophisticated systems for storing and handling knowledge on macromolecular structures are urgently needed. Progress should be made by improving ways to exploit sequence homology to infer structural information for the more than 10000 proteins for which only sequence data are available.
Research was carried out in order to develop a database of protein knowledge containing structure and sequence information and extend the database to include information on inferred 3-dimensional structures of proteins for which only sequence data are available.

SESAM, a high-performance relational database for protein structure and sequence data, capable of holding data from public and private sources, was developed; it features powerful procedures for validating and cleaning up input data, and rapid data retrieval. It has been interfaced with a graphics package, BRUGEL, and specialized user-friendly interfaces have been implemented. The database of known protein structures was extended to include inferred 3-dimensional structures, grouped into structural families, by exploiting the correlation between structural homology and sequence similarity when the latter exceeds a certain threshold. A limited number of short sequence patterns that characterise local structure motifs in proteins with high accuracy can be found. However, this does not improve protein structure prediction methods, owing to the limited size of the structural database and to the influence of spatial interactions between residues that are distant in the sequence. Object-oriented methods and logic programming (Prolog) yield important benefits in speeding up the design, development and debugging stages.
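
The threshold-based inference of structural families described above can be illustrated with a minimal sketch. This is not SESAM's actual procedure, which the record does not detail. The function names, the use of percent identity over pre-aligned sequences as the similarity measure, and the 30% cut-off are all assumptions chosen for illustration; the record only speaks of "a certain threshold".

```python
from typing import Dict, Optional, Tuple


def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences ('-' marks a gap)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Only compare columns where neither sequence has a gap.
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def infer_family(
    query: str,
    known: Dict[str, str],
    threshold: float = 30.0,  # hypothetical cut-off, not taken from the project
) -> Optional[Tuple[str, float]]:
    """Return (family, identity) for the best match at or above the threshold, else None.

    `known` maps a structural family name to an aligned representative sequence
    whose 3-dimensional structure is known.
    """
    best: Optional[Tuple[str, float]] = None
    for family, sequence in known.items():
        score = percent_identity(query, sequence)
        if score >= threshold and (best is None or score > best[1]):
            best = (family, score)
    return best


# Toy data: two invented families with ten-residue representatives.
known_structures = {"family_A": "ACDEFGHIKL", "family_B": "MNPQRSTVWY"}
print(infer_family("ACDEFGHIKV", known_structures))
print(infer_family("WWWWWWWWWW", known_structures))
```

A sequence that stays below the threshold for every family is left unassigned rather than forced into the closest one, reflecting the idea that structural similarity can only be inferred reliably above a given level of sequence similarity.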
DEVELOPMENT OF A SYSTEM FOR PROTEIN STRUCTURE PREDICTION, MERGING STATISTICAL AND INFORMATION ANALYSIS TECHNIQUES WITH ADVANCED TOOLS FROM COMPUTER SCIENCE (A.I.) AS WELL AS EXISTING METHODS IN MOLECULAR MODELLING.
IN THIS COLLABORATIVE EFFORT, THE SPECIFIC ROLE OF THIS PART OF THE PROJECT IS THE DEVELOPMENT OF THE REQUIRED COMPUTER TOOLS: DEFINITION AND IMPLEMENTATION OF LOGIC-BASED REPRESENTATION SCHEMES (IN COLLABORATION WITH GROUPS A - C), OF RESOURCE-EFFICIENT META-INTERPRETERS AND OF USER INTERFACES.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Data not available

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

CSC - Cost-sharing contracts

Coordinator

BELGIAN INSTITUTE OF MANAGEMENT
EU contribution
No data
Address
KWIKSTRAAT,4
3078 EVERBERG
Belgium

Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data

Participants (4)
