Project description
Universal data compression algorithm for arbitrary alphabets
The ITUL project, funded by an ERC Consolidator Grant, introduced a universal data compression algorithm based on circular context trees. The project reports that this algorithm outperforms commercial algorithms such as ZIP and BZIP, as well as more advanced ones such as PPM and CTW, by at least 82% on average while keeping decoding complexity linear. Building on this result, the ERC-funded UNIDACT project will extend the current binary implementation to arbitrary alphabets. It focuses on developing efficient encoder and decoder implementations and on specialised applications for compressing satellite observation data and genomic information. An indexed version of the compression algorithm will also be developed, allowing access to specific data without decompressing the entire dataset.
Objective
Within the ITUL project, funded by an ERC Consolidator Grant, we have developed a data compression algorithm based on circular context trees. The algorithm is universal in that it can compress any type of data, and it has linear decoding complexity. It offers unprecedented compression performance, with an average improvement of at least 82% over commercial algorithms such as Lempel-Ziv (ZIP) and Burrows-Wheeler transform compression (BZIP), and over more complex state-of-the-art algorithms such as prediction by partial matching (PPM) and context tree weighting (CTW). The proposed proof of concept (PoC) will extend the current binary implementation to arbitrary alphabets, provide fast encoder and decoder implementations, develop specific applications for compressing satellite observation data and genomic data, develop an indexed version of the compression algorithm that gives access to part of the data without full decompression, and investigate software licensing options and opportunities.
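The idea of indexed access without full decompression can be illustrated with a minimal sketch. It is not the project's algorithm: it uses zlib from the Python standard library as a stand-in compressor, and the block size and the function names `compress_indexed` and `read_range` are hypothetical, chosen only for illustration. Data is compressed in independent fixed-size blocks, and an index records where each compressed block starts, so a read only decompresses the blocks that overlap the requested range.

```python
import zlib

BLOCK_SIZE = 4096  # hypothetical block size, chosen only for illustration


def compress_indexed(data: bytes, block_size: int = BLOCK_SIZE):
    """Compress data in independent blocks; return (payload, index)."""
    payload = bytearray()
    index = []  # (offset, length) of each compressed block inside payload
    for start in range(0, len(data), block_size):
        block = zlib.compress(data[start:start + block_size])
        index.append((len(payload), len(block)))
        payload += block
    return bytes(payload), index


def read_range(payload: bytes, index, start: int, length: int,
               block_size: int = BLOCK_SIZE) -> bytes:
    """Return data[start:start + length], decompressing only overlapping blocks."""
    if length <= 0:
        return b""
    first = start // block_size
    last = min((start + length - 1) // block_size, len(index) - 1)
    out = bytearray()
    for i in range(first, last + 1):
        offset, size = index[i]
        out += zlib.decompress(payload[offset:offset + size])
    skip = start - first * block_size  # bytes to drop from the first block
    return bytes(out[skip:skip + length])


data = bytes(range(256)) * 100
payload, index = compress_indexed(data)
chunk = read_range(payload, index, 5000, 300)  # touches 2 of the 7 blocks
```

Compressing blocks independently usually costs some compression ratio, because each block starts without the context of the previous ones. Smaller blocks give finer-grained access at a higher cost in ratio.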
Fields of science (EuroSciVoc)
CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.
- natural sciences > computer and information sciences > software
- engineering and technology > mechanical engineering > vehicle engineering > aerospace engineering > satellite technology
Programme(s)
Multi-annual funding programmes that define the EU’s priorities for research and innovation.
- HORIZON.1.1 - European Research Council (ERC)
MAIN PROGRAMME
Topic(s)
Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.
Funding Scheme
Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.
HORIZON-ERC-POC - HORIZON ERC Proof of Concept Grants
Call for proposal
Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.
ERC-2023-POC
Host institution
Net EU financial contribution. The sum of money that the participant receives, deducted by the EU contribution to its linked third party. It considers the distribution of the EU financial contribution between direct beneficiaries of the project and other types of participants, like third-party participants.
08034 BARCELONA
Spain
The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.