Fast and flexible query response systems

Advances in biotechnology, particularly in genomic techniques, have produced a wealth of sequence data. Retrieving relevant sequences from such vast databases requires a principled, formalised model, a challenge that EU-funded researchers have successfully addressed.

Fundamental Research

Health

Scientists generally rely on sequence similarity searches to retrieve data from databases. However, public databases such as GenBank and UniProt/SwissProt contain several hundred thousand sequences, and existing bioinformatics techniques cannot achieve good data retrieval quality. The PROMOS (Probabilistic models in pseudo-Euclidean spaces) team addressed this gap in bioinformatics approaches. Their goal was to devise algorithms that rapidly provide accurate sequence data from large-scale databases.

To begin with, researchers used generic non-metric similarity scores to derive and implement data-specific probabilistic relational models. They successfully developed a probabilistic framework for relational methods in pseudo-Euclidean spaces. To enhance model learning and enable fast data retrieval, they developed approximation schemes for relational data as well as a hierarchical model and retrieval schema. This domain-specific approach is effective because it converts large-scale dissimilarity matrices into approximated positive semi-definite kernel matrices at linear cost.

PROMOS technology was tested on several large-scale protein databases and demonstrated better run-time performance than classical retrieval systems while maintaining competitive model accuracy. The methods have been published in numerous highly ranked publications, with several more in preparation. Project activities and outcomes should considerably speed up research and development in the biotechnology and pharma sectors.
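The idea of turning a large dissimilarity matrix into an approximated positive semi-definite kernel at a cost that grows linearly with the number of sequences can be illustrated with a landmark-based, Nyström-style approximation. The sketch below is not taken from the PROMOS publications: the function name `landmark_psd_features`, the tolerance `tol`, and the use of squared dissimilarities to a small set of landmark items are all assumptions made for illustration. Only the n x m block of dissimilarities to the m landmarks is needed, negative eigenvalues arising from non-metric scores are discarded, and the resulting feature matrix `Phi` defines a kernel `Phi @ Phi.T` that is positive semi-definite by construction.

```python
import numpy as np


def landmark_psd_features(D_nm, landmark_idx, tol=1e-10):
    """Illustrative sketch (hypothetical helper, not the PROMOS code).

    D_nm         : (n, m) squared dissimilarities from all n items to m landmarks.
    landmark_idx : indices (into the n items) of the m landmark items.
    Returns Phi of shape (n, r) such that Phi @ Phi.T is a PSD kernel
    approximating the similarities implied by the dissimilarities.
    Cost is O(n * m^2 + m^3), i.e. linear in n for a fixed number of landmarks.
    """
    D_nm = np.asarray(D_nm, dtype=float)
    D_mm = D_nm[landmark_idx, :]                  # landmark-to-landmark block

    # Double centering with respect to the landmark centroid turns
    # dissimilarities into inner-product-like similarities.
    row_mean = D_nm.mean(axis=1, keepdims=True)   # mean over landmarks, per item
    col_mean = D_mm.mean(axis=0)                  # mean over landmarks, per landmark
    total_mean = D_mm.mean()
    K_nm = -0.5 * (D_nm - row_mean - col_mean[None, :] + total_mean)

    # Small m x m similarity block; symmetrise in case the scores are not symmetric.
    K_mm = K_nm[landmark_idx, :]
    K_mm = 0.5 * (K_mm + K_mm.T)

    # Eigendecomposition of the small block only; drop zero and negative
    # eigenvalues so the resulting kernel is positive semi-definite.
    w, U = np.linalg.eigh(K_mm)
    keep = w > tol * np.abs(w).max()

    # Nystroem-style feature map: Phi @ Phi.T = K_nm @ pinv(K_mm clipped) @ K_nm.T
    return K_nm @ U[:, keep] / np.sqrt(w[keep])
```

In use, `D_nm` would be filled by scoring each sequence against the chosen landmarks only, so the full n x n matrix is never formed. How many landmarks are needed, and how they should be selected, depends on the data and is left open here.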

Project Information

PROMOS

Grant agreement ID: 327791

Project closed

Start date 2 January 2014

End date 2 April 2016

Funded under

Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)

Total cost

€ 221 606,40

EU contribution

€ 221 606,40

Coordinated by

THE UNIVERSITY OF BIRMINGHAM
United Kingdom
