CORDIS - EU research results
Content archived on 2024-05-27

Biological text mining

Objective

Genome research has spawned unprecedented volumes of data, but characterisation of DNA and protein sequences has not kept pace with the rate of data acquisition. For anyone trying to learn more about a given sequence, the worldwide collection of abstracts and papers remains the ultimate information source. The goal of the BioMinT project is to develop a generic text mining tool that:
1) interprets diverse types of query;
2) retrieves relevant documents from the biological literature;
3) extracts the required information; and
4) outputs the result as a database slot filler or as a structured report.
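
The four steps above can be pictured as a small pipeline. The following Python sketch is purely illustrative and is not the BioMinT implementation: the names (`Document`, `interpret_query`, `retrieve`, `extract`, `fill_slot`, `run_pipeline`), the stopword list, and the keyword-overlap matching are all assumptions made for this example, standing in for the far richer linguistic and mining techniques the project describes.

```python
import re
from dataclasses import dataclass


@dataclass
class Document:
    """Hypothetical stand-in for an abstract or paper."""
    doc_id: str
    text: str


# Tiny illustrative stopword list (an assumption, not from the project).
STOPWORDS = {"a", "an", "the", "of", "in", "is", "what", "does", "do"}


def tokens(text):
    """Lowercase alphanumeric tokens of a string."""
    return re.findall(r"[a-z0-9]+", text.lower())


def interpret_query(query):
    """Step 1: reduce a free-text query to a set of keywords."""
    return {t for t in tokens(query) if t not in STOPWORDS}


def retrieve(keywords, corpus):
    """Step 2: keep documents sharing a keyword, best overlap first."""
    scored = [(len(keywords & set(tokens(d.text))), d) for d in corpus]
    hits = [(score, d) for score, d in scored if score > 0]
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in hits]


def extract(keywords, docs):
    """Step 3: pull out sentences that mention at least one keyword."""
    found = []
    for d in docs:
        for sentence in re.split(r"(?<=[.!?])\s+", d.text):
            if keywords & set(tokens(sentence)):
                found.append((d.doc_id, sentence.strip()))
    return found


def fill_slot(query, evidence):
    """Step 4: package the result as a simple structured record."""
    return {
        "query": query,
        "evidence": [{"source": i, "sentence": s} for i, s in evidence],
    }


def run_pipeline(query, corpus):
    """Chain the four steps for one query over a small corpus."""
    keywords = interpret_query(query)
    docs = retrieve(keywords, corpus)
    return fill_slot(query, extract(keywords, docs))
```

Called with a query string and a list of `Document` objects, `run_pipeline` returns a dictionary that could serve either as a slot filler or as the basis of a structured report. Exact keyword overlap is used here only to keep the sketch short; it does no stemming, synonym handling, or entity recognition.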

The BioMinT tool will thus operate in two modes. As a curator's assistant, it will be validated on SWISS-PROT and PRINTS; as a researcher's assistant, its reports will be submitted to the scrutiny of biologists in academia and industry. The project will be conducted by an interdisciplinary team from biology, computational linguistics, and data/text mining.

Fields of science

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.

Call for proposal

Data not available

Coordinator

UNIVERSITY OF MANCHESTER
EU contribution
No data
Total cost
No data

Participants (5)