Authors: Christoforos Nikolaou Yannis Almirantis
Publish Date: 2005/07/19
Volume: 61, Issue: 1, Pages: 23-35
Abstract
Extensive work on ntuplet occurrence in genomic sequences has revealed the correlation of their usage with sequence origin Parallel to that there exist different restrictions in the nucleotide composition of coding and noncoding sequences that may result in distinct modes of usage of ntuplets The relatively simple approaches described herein focus on such differences They are based on simple summation measures of ntuplet frequencies computed after filtering the background nucleotide composition Among the main targets of this work is to draw some conclusions on the qualitative differences in the composition of genomic sequences depending on their functionality Moreover an evolutionary model is formulated including simple forms of ubiquitous events of genome dynamics genomic fusions genome shuffling due to transpositions replication slippage and point mutations This model is shown to be able to reproduce all the statistical features of genomic sequences discussed herein
Keywords: