Decoding the language of DNA
Genomic sequences resemble the human language in the sense that nucleotides and codons transmit information in a similar way to how phonemes and syllables comprise the units of the spoken language. A genomic sequence can encode a protein, or it could relay a regulatory or structural message. Scientists of the EU-funded LanguageOfDNA project will employ algorithms designed for the processing of human languages to classify RNA transcripts and untranslated genomic regions. Through the establishment of DNA/RNA language models, they will be able to interpret any genomic sequence and contribute to the functional delineation of the human genome.
Fields of science
- humanitieslanguages and literaturegeneral language studies
- natural sciencesbiological sciencesgeneticsDNA
- natural sciencesbiological sciencesbiochemistrybiomoleculesproteins
- natural sciencesbiological sciencesgeneticsRNA
- natural sciencescomputer and information sciencesartificial intelligencemachine learning