CORDIS - EU research results
Content archived on 2024-06-18

Jumbled Strings: Theory and Applications

Objective

"We propose to investigate a number of algorithmic problems on jumbled strings. We call a string t a jumbled version of a string s if the characters of t can be permuted to obtain s. In other words, the two strings have the same Parikh vector, where the Parikh vector counts the number of occurrences of each character. For example, over the alphabet (A, C, G, T), the strings AAGACGT and AAACGGT both have Parikh vector (3,1,2,1). All strings with the same Parikh vector form an equivalence class, which we refer to as a 'jumbled string.' We want to develop algorithms and dedicated data structures for searching, storing, comparing, and identifying jumbled strings.
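
The definition above can be illustrated with a minimal Python sketch. The function names `parikh_vector` and `is_jumbled_version` and the fixed alphabet order A, C, G, T are assumptions made for this illustration; they do not come from the project itself.

```python
from collections import Counter

def parikh_vector(s, alphabet="ACGT"):
    """Count the occurrences of each alphabet character in s.

    The alphabet and its order are illustrative assumptions;
    characters outside the alphabet are not counted.
    """
    counts = Counter(s)
    return tuple(counts[c] for c in alphabet)

def is_jumbled_version(s, t):
    """Return True if t is a permutation of s (same character multiset)."""
    return Counter(s) == Counter(t)

print(parikh_vector("AAGACGT"))                  # (3, 1, 2, 1)
print(parikh_vector("AAACGGT"))                  # (3, 1, 2, 1)
print(is_jumbled_version("AAGACGT", "AAACGGT"))  # True
```

Comparing character multisets directly, rather than Parikh vectors over a fixed alphabet, keeps the check correct even if a string contains characters outside the assumed alphabet.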

Jumbled strings have important applications in bioinformatics, above all in the interpretation of mass spectrometry data; they have also been applied to alignment, pattern discovery in biological strings, and SNP detection. Searching for a jumbled pattern in a text constitutes a special case of approximate string matching, and is thus of particular interest in the pattern matching field. Similar problems regarding unique reconstruction of strings have been investigated in the area of formal languages.
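
As a rough illustration of searching for a jumbled pattern in a text, the following Python sketch uses a simple sliding window of character counts. This is a textbook baseline chosen for illustration, not the algorithms or data structures the project proposes; the function name `jumbled_occurrences` is hypothetical.

```python
from collections import Counter

def jumbled_occurrences(text, pattern):
    """Return the start positions of all windows of text that are
    permutations of pattern (i.e. share its Parikh vector).

    Baseline sliding-window sketch: each step updates two counters,
    then compares the window's counts with the pattern's.
    """
    m = len(pattern)
    if m == 0 or m > len(text):
        return []
    target = Counter(pattern)
    window = Counter(text[:m])
    hits = [0] if window == target else []
    for i in range(m, len(text)):
        window[text[i]] += 1          # character entering the window
        out = text[i - m]             # character leaving the window
        window[out] -= 1
        if window[out] == 0:
            del window[out]           # keep counters free of zero entries
        if window == target:
            hits.append(i - m + 1)
    return hits

print(jumbled_occurrences("AAGACGT", "CAG"))  # [2, 3]
```

In the example, the windows GAC (position 2) and ACG (position 3) are both permutations of CAG, so both are reported even though CAG itself never occurs in the text.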

The project involves both theoretical and practical parts. Besides searching for asymptotically optimal procedures for different models of the source that generates the text, we will also test on real instances of biological and textual data. We are interested not only in theoretically optimal algorithms but, above all, in algorithms that work well in practice. We therefore also consider heuristics and ad hoc methods to improve the practical implementation of our methods.

The project will enable the fellow to greatly enhance her competencies in algorithms development and formal languages, while training in information theory and extremal combinatorics, benefitting from the expertise at the host institution. This will constitute a major step in her career towards a professorship in algorithmic bioinformatics."

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

FP7-PEOPLE-2010-IEF

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MC-IEF - Intra-European Fellowships (IEF)

Coordinator

UNIVERSITA DEGLI STUDI DI SALERNO
EU contribution
€ 244 075,00
Address
VIA GIOVANNI PAOLO II 132
84084 Fisciano Sa
Italy

Region
Sud Campania Salerno
Activity type
Higher or Secondary Education Establishments
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data