Fast and flexible query response systems

Advances in biotechnology, particularly in genomic techniques, have produced a wealth of sequence data. Retrieving relevant sequences from such vast databases requires a principled, formalised model, which EU-funded researchers have now developed.

Fundamental Research

Health

Scientists generally retrieve data through sequence similarity searches. However, public databases such as GenBank and UniProt/SwissProt contain several hundred thousand sequences, and existing bioinformatics techniques cannot deliver good retrieval quality on data of this size. The PROMOS (Probabilistic models in pseudo-Euclidean spaces) team addressed this gap in bioinformatics approaches. Their goal was to devise algorithms that rapidly return accurate sequence data from large-scale databases. To begin with, the researchers used generic non-metric similarity scores to derive and implement data-specific probabilistic relational models, and they developed a probabilistic framework for relational methods in pseudo-Euclidean spaces. To enhance model learning and enable fast data retrieval, they developed approximation schemes for relational data as well as a hierarchical model and retrieval schema. The approach is effective because it converts large-scale dissimilarity matrices into approximate positive semi-definite kernel matrices at linear cost.

PROMOS technology was tested on several large-scale protein databases, where it showed better run-time performance than classical retrieval systems while keeping model accuracy competitive. The methods have appeared in numerous highly ranked publications, with several more in preparation. Project activities and outcomes should considerably speed up research and development in the biotechnology and pharma sectors.
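The idea of turning a large, possibly non-metric (dis)similarity matrix into an approximate positive semi-definite kernel at linear cost can be illustrated with a generic landmark-based (Nyström-style) sketch. This is not taken from the PROMOS publications; it is a minimal example under stated assumptions. The function name `nystroem_psd_features`, the input layout (an n x m array of similarities to m landmark items) and the toy tanh similarity are all hypothetical choices made for illustration. Only the positive eigen-directions of the small landmark block are kept, so the resulting low-rank surrogate is positive semi-definite by construction, and the full n x n matrix is never needed by the function itself.

```python
import numpy as np


def nystroem_psd_features(similarity_columns, landmark_idx, tol=1e-10):
    """Return Phi (n x r) such that Phi @ Phi.T is a PSD low-rank surrogate.

    similarity_columns: (n x m) similarities of all n items to m landmarks
        (hypothetical input layout, chosen for this sketch).
    landmark_idx: positions of the m landmarks among the n items.
    """
    C = np.asarray(similarity_columns, dtype=float)
    W = C[landmark_idx, :]          # m x m block between landmarks
    W = (W + W.T) / 2.0             # enforce exact symmetry
    evals, evecs = np.linalg.eigh(W)
    # Keep only clearly positive eigen-directions; negative ones, which
    # make the similarity indefinite, are discarded.
    keep = evals > tol * max(1.0, np.abs(evals).max())
    scaled = evecs[:, keep] / np.sqrt(evals[keep])
    # Cost is O(n * m * r): linear in the number of items n.
    return C @ scaled


# Toy data: a tanh similarity is generally indefinite (assumption for demo).
rng = np.random.default_rng(0)
n, m = 200, 20
X = rng.normal(size=(n, 5))
S = np.tanh(X @ X.T - 1.0)          # built in full here only for the demo
landmarks = rng.choice(n, size=m, replace=False)

Phi = nystroem_psd_features(S[:, landmarks], landmarks)
K_approx = Phi @ Phi.T              # formed only to inspect the result
eigs = np.linalg.eigvalsh(K_approx)
print("feature matrix shape:", Phi.shape)
print("smallest eigenvalue of surrogate kernel:", eigs.min())
```

In a retrieval setting one would typically work with `Phi` directly (for example, comparing a query's feature vector with stored rows) instead of forming `K_approx`, which is computed here only to check that no significantly negative eigenvalues remain.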

Project Information

PROMOS

Grant agreement ID: 327791

Project closed

Start date 2 January 2014

End date 2 April 2016

Funded under

Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)

Total cost

€ 221 606,40

EU contribution

€ 221 606,40

Coordinated by

THE UNIVERSITY OF BIRMINGHAM
United Kingdom
