H. Akaike, A new look at the statistical model identification, IEEE Transactions on Automatic Control, vol.19, pp.716-723, 1974.

H. Almagor, A Markov analysis of DNA sequences, J. Theor. Biol, vol.104, pp.633-645, 1983.

G. Bernardi, The vertebrate Genome: Isochores and Evolution, Mol. Biol. Evol, vol.10, pp.186-204, 1993.

B. Blaisdell, Markov chain analysis finds a significant influence of neighboring bases on the occurrence of a base in eucaryotic nuclear DNA sequences both protein-coding and noncoding, J. Mol. Evol, vol.21, pp.278-288, 1985.

F. Blattner, G. Plunkett, C. Bloch, N. Perna, V. Burland et al., The complete genome sequence of Escherichia coli K-12, Science, vol.277, pp.1453-74, 1997.

G. Churchill, Stochastic models for heterogeneous DNA sequences, Bull. Math. Biol, vol.51, pp.79-94, 1989.

M. El Karoui, V. Biaudet, S. Schbath, and A. Gruss, Characteristics of Chi distribution on different bacterial genomes, Res. Microbiol, vol.150, pp.579-587, 1999.

J. W. Fickett, D. C. Torney, and D. R. Wolf, Base compositional structure of genomes, Genomics, vol.13, pp.1056-1064, 1992.

R. Fleischmann, M. Adams, O. White, R. Clayton, E. Kirkness et al., Whole-genome random sequencing and assembly of Haemophilus influenzae Rd, Science, vol.269, pp.496-512, 1995.

M. Gelfand, C. Kozhukhin, and P. Pevzner, Extendable words in nucleotide sequences, Bioinformatics, vol.8, pp.129-135, 1992.

S. Karlin, C. Burge, and A. Campbell, Statistical analyses of counts and distributions of restriction sites in DNA sequences, Nucl. Acids Res, vol.20, pp.1363-1370, 1992.

A. Krogh, I. Mian, and D. Haussler, A hidden Markov model that finds genes in Escherichia coli DNA, Nucl. Acids Res, vol.22, pp.4768-4778, 1994.

J. Lobry, Genomic landscapes, Microbiol. Today, vol.26, pp.164-165, 1999.

J. Lobry, Oriloc: prediction of replication boundaries in unannotated bacterial chromosomes, Bioinformatics, vol.16, pp.560-561, 2000.
URL : https://hal.archives-ouvertes.fr/hal-00427080

W. Lou, On runs and longest run tests: A method of finite Markov chain imbedding, J. Am. Statist. Assoc, vol.91, pp.373-380, 1996.

V. Miele, P. Bourguignon, D. Robelin, G. Nuel, and H. Richard, seq++ : analyzing biological sequences with a range of Markov-related models, Bioinformatics, vol.21, pp.2783-2784, 2005.

E. Miller, E. Kutter, G. Mosig, F. Arisaka, T. Kunisawa et al., Bacteriophage T4 genome, Microbiology and Molecular Biology Reviews, vol.67, issue.1, pp.86-156, 2003.

F. Muri, Comparaisons d'algorithmes d'identification de chaînes de Markov cachées et application à la détection de régions homogènes dans les séquences d'ADN, pp.156-194, 1997.

P. Nicodème, T. Doerks, and M. Vingron, Proteome analysis based on motif statistics, Bioinformatics, vol.18, pp.S161-S171, 2002.

P. Nicolas, L. Bize, F. Muri, M. Hoebeke, F. Rodolphe et al., Mining Bacillus subtilis chromosome heterogeneity using hidden Markov models, Nucl. Acids Res, vol.30, pp.1418-1426, 2002.

G. Nuel, Grandes déviations et chaînes de Markov pour l'étude des occurrences de mots dans les séquences biologiques, 2001.

G. Nuel, Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics, Journal of Computational Biology, vol.11, pp.1023-1033, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00271494

G. Nuel, Numerical Solutions for Patterns Statistics on Markov Chains, Statistical Applications in Genetics and Molecular Biology, vol.5, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00271482

G. Nuel and B. Prum, Analyse statistique des séquences biologiques: modélisation markovienne, alignements et motifs, 2007.

J. Oliver, P. Bernaola-Galván, P. Carpena, and R. Román-Roldán, Isochore chromosome maps of eukaryotic genomes, Gene, vol.276, pp.47-56, 2001.

G. Phillips, J. Arnold, and R. Ivarie, The effect of codon usage on the oligonucleotide composition of the E. coli genome and identification of over- and underrepresented sequences by Markov chain analysis, Nucl. Acids Res, vol.15, pp.2627-2638, 1987.

G. Reinert and S. Schbath, Compound Poisson and Poisson process approximations for occurrences of multiple words in Markov chains, J. Comput. Biol, vol.5, pp.223-253, 1998.

S. Schbath, B. Prum, and E. de Turckheim, Exceptional motifs in different Markov chain models for a statistical analysis of DNA sequences, Journal of Computational Biology, vol.2, pp.417-437, 1995.

G. Schwarz, Estimating the dimension of a model, Ann. Statist, vol.6, pp.461-464, 1978.

G. Simons, Y. Yao, and G. Morton, Global Markov models for eukaryote nucleotide data, J. Statist. Plann. Inference, vol.130, pp.251-275, 2005.

G. Smith, S. Kunes, D. Schultz, A. Taylor, and K. Triman, Structure of chi hotspots of generalized recombination, Cell, vol.24, pp.429-436, 1981.

H. Smith, M. Gwinn, and S. Salzberg, DNA uptake signal sequences in naturally transformable bacteria, Res. Microbiol, vol.150, pp.603-616, 1999.

M. Stanke and S. Waack, Gene prediction with a hidden Markov model and a new intron submodel, Bioinformatics, vol.19, pp.215-225, 2003.

R. Stephens, S. Kalman, C. Lammel, J. Fan, R. Marathe et al., Genome sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis, Science, vol.282, pp.754-759, 1998.

J. van Helden, M. del Olmo, and J. Pérez-Ortín, Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals, Nucl. Acids Res, vol.28, pp.1000-1010, 2000.

R. Wu and E. Taylor, Nucleotide sequence analysis of DNA. II. Complete nucleotide sequence of the cohesive ends of bacteriophage lambda DNA, J Mol Biol, vol.57, pp.491-511, 1971.

S. Zoubak, O. Clay, and G. Bernardi, The gene distribution of the human genome, Gene, vol.174, pp.95-102, 1996.