Eukaryotic Taxonomic Distribution
With the results of Hmmer, we divided the analisis in 3, Eukaryot 808 sequences, Virus 178 sequences and Bacteria 85 sequences, in this section we work on the 808 Eukaryotic sequences.
We can observe the domains architectures for this sequences, and we will see 3 “big” clusters
This SEED gave us diferent architecture domains and we analized each of them by getting all the complete sequences of the cluster, align them and eliminate the redundacy considering a 75% . The results are the following:
Cluster 379
After aligning and cleaning the redundancy we had 16 complete sequences that were trimmed
The motif HxH is present in 3 different sections
Position 330 of the alignment HTH or HIH in position 148 of ost of the sequences. There are some particular cases: -A2H129 has the motif in position 17 of the sequence. -Four of the sequences ( A2Fw47,A2E7R7, A2GBQ3,A2H129) present a second motif after aprox 20 amino acids (HTH).
Position 480 of the alignment HTH or HKH in position 170-190 of the sequences. There are some particular cases: -A2GLX5 has the motif in position 8 of the sequence. -Again in some cases we can find a second Motif HXH and even a third and fourth motif in a 20 amino acid distance each all HTH motifs. -Some sequences have more motifs in them, but there is no conservation with the others.
Position 890 of the alignment HTH or HSH in the tail of the sequences. There are some particular cases: -After 11 amino acids there is a second motif HSH or HKH in almost all of the sequences. -After 36 amino acids there is a third motif HSH or HTH or HQH or HIH or HAH or HVH in almost all of the sequences.
Considering the 16 complete sequences, we evaluate the type and quantity of amino acids present in the sequence: the amino acids more present in the sequence are marked in red.
Considering the 16 complete sequences, we evaluate the type and quantity of the motif HxH present in the sequence and their relative positions:
Considering the 16 complete sequences, we evaluate the type and order of the motif HxH present in the sequence and their relative positions:
All the complete sequences as trimmers were sent to AlphaFold Cluster, only A2DGZ2, A2FEE1, A2GLX5 and A2H129 retrive an output.
For A2FEE1 , position 14 of the list, the spectrum: range is 21.94000 to 96.58000.
For A2DGZ2 , position 7 of the list, the spectrum: range is 31.46000 to 98.06000.
For A2GLX5 , position 5 of the list, the spectrum: range is 20.93000 to 95.96000.
For A2H129 , position 9 of the list, the spectrum: range is 15.60000 to 97.58000.
We edit some of the non resulting sequences, elimating XX regions and in some cases even some of the N-ter or C-ter
For A2E7R7 , position 13 of the list, the spectrum: range is 23.76000 to 97.17000 after eliminating N-term aa.
For A2FAH0 , position 8 of the list, the spectrum: range is 17.58000 to 96.23000 after eliminating N-term aa.
For A2FCB6 , position 4 of the list, the spectrum: range is 16.93000 to 94.57000 after eliminating C-term aa.
For A2FI27 , position 2 of the list, the spectrum: range is 17.19000 to 97.01000 after eliminating C-term aa.
For A2FI29 , position 3 of the list, the spectrum: range is 23.37000 to 96.74000 after eliminating C-term aa.
For A2FPI4 , position 12 of the list, the spectrum: range is 20.75000 to 98.56000 after eliminating N-term aa.
For A2FW47 , position 15 of the list, the spectrum: range is 18.24000 to 90.82000 after eliminating C-term aa.
For A2GBQ3 , position 10 of the list, the spectrum: range is 18.24000 to 90.82000 after eliminating N-term aa.
For A2DF55 , position 16 of the list, the spectrum: range is 22.58000 to 91.56000 after eliminating N-term aa.
Cluster 231
After aligning and cleaning the redundancy we had 12 complete sequences that were trimmed
The motif HxH is present in 2 different sections
Position 150 of the alignment HTH or HIH in position 148 of ost of the sequences. There are some particular cases: - Position 117 or 67 an HTH motif and after 30 aa another HTH motif and a third one HTH after 20, a fourth HTH after 20aa - For one of the sequences there is a HRH motif in position 51
Position 520 of the alignment HTH or HSH in the tail of the sequences. There are some particular cases: -After 11 amino acids there is a second motif HSH or HKH in almost all of the sequences. -After 36 amino acids there is a third motif HSH or HTH or HQH or HIH or HAH in almost all of the sequences. -For some of the sequences a tail of 3 motifs, a mix of HTH HSH HIH HKH
Considering the 12 complete sequences, we evaluate the type and quantity of amino acids present in the sequence: the amino acids more present in the sequence are marked in red.
Considering the 12 complete sequences, we evaluate the type and quantity of the motif HxH present in the sequence and their relative positions:
Considering the 12 complete sequences, we evaluate the type and order of the motif HxH present in the sequence and their relative positions:
All the complete sequences as trimmers were sent to AlphaFold Cluster, only A2GSU3, A2GVQ6, A2H050, A2H5T2, A2HFA0, A2HL88, A2HP84 retrive an output.
For A2GSU3 , position 1 of the list, the spectrum: range is 30.83000 to 98.92000.
For A2GVQ6 , position 11 of the list, the spectrum: range is 18.42000 to 96.75000.
For A2H050 , position 9 of the list, the spectrum: range is 12.80000 to 98.39000.
For A2H5T2 , position 2 of the list, the spectrum: range is 29.77000 to 95.89000.
For A2HFA0 , position 5 of the list, the spectrum: range is 23.05000 to 94.38000.
For A2HL88 , position 8 of the list, the spectrum: range is 31.95000 to 96.71000.
For A2HP84 , position 6 of the list, the spectrum: range is 32.40000 to 98.01000
For A2FHF8 , position 7 of the list, the spectrum: range is 16.16000 to 95.41000
For A2G0Z7 , position 4 of the list, the spectrum: range is 50.72000 to 98.88000
For A2EZ81 , position 12 of the list, the spectrum: range is 42.22000 to 97.00000
For A2E9U7 , position 3 of the list, the spectrum: range is 28.36000 to 98.59000
For A2E7D1 , position 10 of the list, the spectrum: range is 23.44000 to 94.72000
Cluster 122
After aligning and cleaning the redundancy we had 15 complete sequences that were trimmed
The motif HxH is present in 2 different sections
Position 150 of the alignment HTH or HIH in position 148 of ost of the sequences. There are some particular cases: - Position 117 or 67 an HTH motif and after 30 aa another HTH motif and a third one HTH after 20, a fourth HTH after 20aa - For one of the sequences there is a HRH motif in position 51
Position 520 of the alignment HTH or HSH in the tail of the sequences. There are some particular cases: -After 11 amino acids there is a second motif HSH or HKH in almost all of the sequences. -After 36 amino acids there is a third motif HSH or HTH or HQH or HIH or HAH in almost all of the sequences. -For some of the sequences a tail of 3 motifs, a mix of HTH HSH HIH HKH
Position 520 of the alignment HTH or HSH in the tail of the sequences. There are some particular cases: -After 11 amino acids there is a second motif HSH or HKH in almost all of the sequences. -After 36 amino acids there is a third motif HSH or HTH or HQH or HIH or HAH in almost all of the sequences. -For some of the sequences a tail of 3 motifs, a mix of HTH HSH HIH HKH
Position 520 of the alignment HTH or HSH in the tail of the sequences. There are some particular cases: -After 11 amino acids there is a second motif HSH or HKH in almost all of the sequences. -After 36 amino acids there is a third motif HSH or HTH or HQH or HIH or HAH in almost all of the sequences. -For some of the sequences a tail of 3 motifs, a mix of HTH HSH HIH HKH
Considering the 12 complete sequences, we evaluate the type and quantity of amino acids present in the sequence: the amino acids more present in the sequence are marked in red.
Considering the 12 complete sequences, we evaluate the type and quantity of the motif HxH present in the sequence and their relative positions:
Considering the 12 complete sequences, we evaluate the type and order of the motif HxH present in the sequence and their relative positions:
All the complete sequences as trimmers were sent to AlphaFold Cluster, only retrive an output.
For A2H8J0 , position 6 of the list, the spectrum: range is 13.97000 to 96.50000
For A2HVD0 , position 6 of the list, the spectrum: range is 19.90000 to 98.34000
For A2EM49 , position 6 of the list, the spectrum: range is 16.30000 to 95.08000
For A2EZZ0 , position 6 of the list, the spectrum: range is 11.01000 to 96.72000
For A2EVF3 , position 6 of the list, the spectrum: range is 13.71000 to 96.46000
For A2FAV7 , position 6 of the list, the spectrum: range is 23.80000 to 96.32000
For A2F762 , position 12 of the list, the spectrum: range is 16.71000 to 94.09000
For A2FDY2 , position 7 of the list, the spectrum: range is 13.67000 to 97.50000
For A2GPM0 , position 10 of the list, the spectrum: range is 1.99000 to 94.60000
For A2GRE0 , position 4 of the list, the spectrum: range is 17.13000 to 89.36000
For A2GRF3 , position 15 of the list, the spectrum: range is 16.07000 to 96.35000