How we got the sequences

We visited the Pfam entry for Phage tail repeat like. From this page we obtained the SEED alignment and curated it, producing this new SEED:

_images/SEEDHxH.png

This SEED gave us different domain architectures:

_images/domains1.png _images/domains2.png _images/domains3.png _images/domains4.png _images/domains5.png

The taxonomy distribution is shown here:

_images/taxonomi.png

We obtained all 1071 complete sequences gathered with the previous profile and removed redundant sequences at the 80% level, which left 162 sequences.
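The redundancy step can be illustrated with a small sketch. The tool actually used is not stated here, so everything below is an assumption: a real pipeline would normally rely on a dedicated clustering program, while this example uses the similarity ratio from Python's `difflib` as a crude proxy for sequence identity, a greedy keep-or-drop rule, and made-up toy sequences (`seqA`, `seqB`, `seqC`).

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Crude similarity in [0, 1]; a stand-in for alignment-based identity."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def filter_redundant(seqs: dict[str, str], threshold: float = 0.8) -> dict[str, str]:
    """Greedy filter: visit sequences longest first and keep one only if its
    similarity to every sequence already kept is below `threshold`."""
    kept: dict[str, str] = {}
    for name, seq in sorted(seqs.items(), key=lambda kv: -len(kv[1])):
        if all(similarity(seq, rep) < threshold for rep in kept.values()):
            kept[name] = seq
    return kept


# Hypothetical toy input: seqB differs from seqA by a single residue.
toy = {
    "seqA": "MKTAYIAKQRHAHGLSDE",
    "seqB": "MKTAYIAKQRHAHGLSDD",
    "seqC": "GGSPLWWNNCCTTRRVVY",
}
nonredundant = filter_redundant(toy)
print(sorted(nonredundant))
```

With the toy input, `seqB` is dropped as a near-duplicate of `seqA`, while `seqC` is kept because it shares almost nothing with it.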

These 162 sequences were then analyzed, and it was confirmed that only 149 contain the motif. The following table shows how the motifs are represented on average:

_images/averagecount.png

The patterns of motif occurrence, by number of motifs per sequence, are shown in the following tables:

_images/table.png _images/table2.png _images/table3.png _images/table4.png
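The motif check and the counts summarized in the tables above could be reproduced along these lines. This is only a sketch under stated assumptions: the motif is taken to be an HxH pattern (suggested by the SEED image file name, not confirmed in the text), written as the regular expression `H.H`, overlapping occurrences are counted, and the sequences are invented toy data.

```python
import re
from collections import Counter
from statistics import mean

# Assumed motif: H, any residue, H. The lookahead lets overlapping hits be counted.
MOTIF = re.compile(r"(?=H.H)")


def count_motifs(seq: str) -> int:
    """Number of (possibly overlapping) motif occurrences in one sequence."""
    return len(MOTIF.findall(seq))


# Hypothetical toy sequences, not taken from the real data set.
toy = {
    "seq1": "MAHGHTTKL",  # one occurrence
    "seq2": "MKHAHAHQ",   # two overlapping occurrences
    "seq3": "MKTLLPQ",    # no occurrence
    "seq4": "HSHAAHTH",   # two separate occurrences
}

counts = {name: count_motifs(seq) for name, seq in toy.items()}

# Keep only sequences that contain the motif at least once.
with_motif = {name: n for name, n in counts.items() if n > 0}

# Average number of motifs per motif-containing sequence.
average = mean(with_motif.values())

# How many sequences carry 1, 2, 3, ... motifs.
distribution = Counter(with_motif.values())

print(len(with_motif), average, dict(distribution))
```

Whether overlapping occurrences should count separately depends on how the motif is defined; dropping the lookahead and using `H.H` directly would count non-overlapping hits only.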